0% found this document useful (0 votes)

296 views885 pages

VGAM Ajah

This document provides information about the VGAM package in R. It includes the package description, author, dependencies, and topics covered in the documentation. The VGAM package implements over 150 statistical regression models, including vector generalized linear and additive models. It allows for maximum likelihood and penalized maximum likelihood estimation of these models.

Uploaded by

Sela

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

296 views885 pages

VGAM Ajah

Uploaded by

Sela

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 885

Package VGAM

January 11, 2017

Version 1.0-3
Date 2017-01-11
Title Vector Generalized Linear and Additive Models
Author Thomas W. Yee <[email protected]>
Maintainer Thomas Yee <[email protected]>
Depends R (>= 3.1.0), methods, stats, stats4, splines
Suggests VGAMdata, MASS, mgcv
Description An implementation of about 6 major classes of
statistical regression models. At the heart of it are the
vector generalized linear and additive model (VGLM/VGAM)
classes, and the book ``Vector Generalized Linear and
Additive Models: With an Implementation in R'' (Yee, 2015)
<DOI:10.1007/978-1-4939-2818-7>
gives details of the statistical framework and VGAM package.
Currently only fixed-effects models are implemented,
i.e., no random-effects models. Many (150+) models and
distributions are estimated by maximum likelihood estimation
(MLE) or penalized MLE, using Fisher scoring. VGLMs can be
loosely thought of as multivariate GLMs. VGAMs are data-driven
VGLMs (i.e., with smoothing). The other classes are RR-VGLMs
(reduced-rank VGLMs), quadratic RR-VGLMs, reduced-rank VGAMs,
RCIMs (row-column interaction models)---these classes perform
constrained and unconstrained quadratic ordination (CQO/UQO)
models in ecology, as well as constrained additive ordination
(CAO). Note that these functions are subject to change;
see the NEWS and ChangeLog files for latest changes.
License GPL-2 | GPL-3
URL https://www.stat.auckland.ac.nz/~yee/VGAM
NeedsCompilation yes
BuildVignettes yes
LazyLoad yes
LazyData yes

1
2 R topics documented:

Repository CRAN
Date/Publication 2017-01-11 08:59:31

R topics documented:
VGAM-package . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12
A1A2A3 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15
AA.Aa.aa . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16
AB.Ab.aB.ab . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 18
ABO . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 19
acat . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 20
AICvlm . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 22
alaplace . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 24
alaplaceUC . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 28
amlbinomial . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 30
amlexponential . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 32
amlnormal . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 34
amlpoisson . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 37
AR1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 39
AR1EIM . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 42
auuc . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 46
aux.posbernoulli.t . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 47
backPain . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 48
beggs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 49
Benford . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 50
Benini . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 51
benini1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 53
Betabinom . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 54
betabinomial . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 57
betabinomialff . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 60
betaff . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 63
Betageom . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 65
betageometric . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 66
betaII . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 68
Betanorm . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 69
betaprime . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 71
betaR . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 72
Biamhcop . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 74
biamhcop . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 75
Biclaytoncop . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 77
biclaytoncop . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 78
BICvlm . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 80
Bifgmcop . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 81
bifgmcop . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 83
bifgmexp . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 84
bifrankcop . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 86
bigamma.mckay . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 87
R topics documented: 3

bigumbelIexp . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 89
bilogis . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 90
bilogistic . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 92
Binom2.or . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 94
binom2.or . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 95
Binom2.rho . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 99
binom2.rho . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 100
binomialff . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 103
Binorm . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 106
binormal . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 108
binormalcop . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 109
Binormcop . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 111
Biplackett . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 112
biplackettcop . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 114
biplot-methods . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 115
Bisa . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 116
bisa . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 117
Bistudentt . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 119
bistudentt . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 120
bmi.nz . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 122
borel.tanner . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 123
Bort . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 124
Brat . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 125
brat . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 127
bratt . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 129
calibrate . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 131
calibrate-methods . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 132
calibrate.qrrvglm . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 132
calibrate.qrrvglm.control . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 135
cao . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 136
cao.control . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 140
Card . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 144
cardioid . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 145
cauchit . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 147
cauchy . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 149
cdf.lmscreg . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 151
cens.gumbel . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 152
cens.normal . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 154
cens.poisson . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 156
cfibrosis . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 158
cgo . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 159
chest.nz . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 160
chinese.nz . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 161
chisq . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 162
clo . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 163
cloglog . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 164
coalminers . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 166
Coef . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 167
4 R topics documented:

Coef.qrrvglm . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 168
Coef.qrrvglm-class . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 170
Coef.rrvglm . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 172
Coef.rrvglm-class . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 173
Coef.vlm . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 174
coefvgam . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 175
coefvlm . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 176
CommonVGAMffArguments . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 177
concoef . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 184
concoef-methods . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 186
confintvglm . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 186
constraints . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 188
corbet . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 190
cqo . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 191
crashes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 198
cratio . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 200
cumulative . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 201
Dagum . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 205
dagum . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 206
dAR1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 208
deermice . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 210
deplot.lmscreg . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 211
depvar . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 212
df.residual . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 213
dgenpois . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 214
dhuber . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 216
Diffzeta . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 217
diffzeta . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 219
dirichlet . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 220
dirmul.old . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 221
dirmultinomial . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 223
dlogF . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 226
double.cens.normal . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 227
double.expbinomial . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 228
ducklings . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 231
enzyme . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 232
erf . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 233
erlang . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 234
Expectiles-Exponential . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 235
Expectiles-Normal . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 237
Expectiles-sc.t2 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 238
Expectiles-Uniform . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 240
expexpff . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 242
expexpff1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 244
expgeom . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 246
expgeometric . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 248
expint . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 249
explink . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 250
R topics documented: 5

explog . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 252
explogff . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 253
exponential . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 254
exppois . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 256
exppoisson . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 257
Felix . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 259
felix . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 260
fff . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 261
fill . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 262
finney44 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 266
fisherz . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 267
Fisk . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 268
fisk . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 269
fittedvlm . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 271
flourbeetle . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 273
Foldnorm . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 274
foldnormal . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 275
foldsqrt . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 277
formulavlm . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 279
Frank . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 280
Frechet . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 282
frechet . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 283
freund61 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 284
gamma1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 287
gamma2 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 288
gammahyperbola . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 290
gammaR . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 291
garma . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 293
gaussianff . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 295
GenbetaII . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 297
genbetaII . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 298
gengamma.stacy . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 301
gengammaUC . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 303
genpoisson . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 304
genray . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 306
genrayleigh . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 307
geometric . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 309
get.smart . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 310
get.smart.prediction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 311
gev . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 312
gevUC . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 315
gew . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 317
golf . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 318
Gompertz . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 320
gompertz . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 322
gpd . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 323
gpdUC . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 326
grain.us . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 328
6 R topics documented:

grc . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 329
gumbel . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 334
Gumbel-II . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 337
gumbelII . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 339
gumbelUC . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 340
guplot . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 342
has.interceptvlm . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 344
hatvalues . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 345
hormone . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 347
hspider . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 349
huber2 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 350
Huggins89.t1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 352
hunua . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 354
hyperg . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 356
hypersecant . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 357
Hzeta . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 359
hzeta . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 360
iam . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 361
identitylink . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 363
inv.binomial . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 364
Inv.gaussian . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 366
inv.gaussianff . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 367
Inv.lomax . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 369
inv.lomax . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 370
Inv.paralogistic . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 372
inv.paralogistic . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 373
is.buggy . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 375
is.parallel . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 376
is.smart . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 377
is.zero . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 378
kendall.tau . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 379
Kumar . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 380
kumar . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 382
lakeO . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 383
lambertW . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 384
laplace . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 385
laplaceUC . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 387
latvar . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 389
leipnik . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 390
lerch . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 391
leukemia . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 393
levy . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 394
lgamma1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 395
lgammaUC . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 397
Lindley . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 398
lindley . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 400
linkfun . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 401
linkfun.vglm . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 402
R topics documented: 7

Links . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 403
Lino . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 406
lino . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 407
lirat . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 409
lms.bcg . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 410
lms.bcn . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 412
lms.yjn . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 415
Log . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 418
log1mexp . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 419
logc . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 420
loge . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 422
logF . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 423
logff . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 424
logistic . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 426
logit . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 428
logitoffsetlink . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 430
loglaplace . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 432
loglapUC . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 435
logLik.vlm . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 437
loglinb2 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 438
loglinb3 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 440
loglog . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 441
lognormal . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 442
logoff . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 444
Lomax . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 445
lomax . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 446
lqnorm . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 448
lrtest . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 450
lvplot . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 452
lvplot.qrrvglm . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 453
lvplot.rrvglm . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 457
machinists . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 460
Makeham . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 461
makeham . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 462
margeff . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 464
marital.nz . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 466
Max . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 467
Maxwell . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 468
maxwell . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 470
mccullagh89 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 471
melbmaxtemp . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 472
meplot . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 473
micmen . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 475
mix2exp . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 477
mix2normal . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 479
mix2poisson . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 481
MNSs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 483
model.framevlm . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 485
8 R topics documented:

model.matrixvlm . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 486
moffset . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 488
multilogit . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 489
multinomial . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 491
Nakagami . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 495
nakagami . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 496
nbcanlink . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 498
nbolf . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 500
negbinomial . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 502
negbinomial.size . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 509
normal.vcm . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 510
nparam.vlm . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 514
Oalog . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 515
oalog . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 516
Oapospois . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 518
oapospoisson . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 519
Oazeta . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 521
oazeta . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 522
Oilog . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 523
oilog . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 525
Oiposbinom . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 526
oiposbinomial . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 528
Oipospois . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 530
oipospoisson . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 532
Oizeta . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 533
oizeta . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 535
Oizipf . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 536
oizipf . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 537
olympics . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 539
Opt . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 540
ordpoisson . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 542
Otlog . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 544
otlog . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 545
Otpospois . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 546
otpospoisson . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 548
Otzeta . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 549
otzeta . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 550
oxtemp . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 551
Paralogistic . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 552
paralogistic . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 553
Pareto . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 555
paretoff . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 556
ParetoIV . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 558
paretoIV . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 560
Perks . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 563
perks . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 564
perspqrrvglm . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 566
pgamma.deriv . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 569
R topics documented: 9

pgamma.deriv.unscaled . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 570
plotdeplot.lmscreg . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 571
plotqrrvglm . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 573
plotqtplot.lmscreg . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 574
plotrcim0 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 576
plotvgam . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 578
plotvgam.control . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 581
plotvglm . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 582
pneumo . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 583
poisson.points . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 584
poissonff . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 586
PoissonPoints . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 588
polf . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 589
Polono . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 591
posbernoulli.b . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 593
posbernoulli.t . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 596
posbernoulli.tb . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 599
posbernUC . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 602
Posbinom . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 604
posbinomial . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 605
Posgeom . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 608
Posnegbin . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 609
posnegbinomial . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 611
Posnorm . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 614
posnormal . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 615
Pospois . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 617
pospoisson . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 619
powerlink . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 620
prats . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 622
predictqrrvglm . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 623
predictvglm . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 624
prentice74 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 627
prinia . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 628
probit . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 629
propodds . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 631
prplot . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 632
put.smart . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 634
qrrvglm.control . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 635
qtplot.gumbel . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 640
qtplot.lmscreg . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 642
quasibinomialff . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 644
quasipoissonff . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 646
Qvar . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 648
qvar . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 651
Rayleigh . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 652
rayleigh . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 653
Rcim . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 655
rcqo . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 657
10 R topics documented:

rdiric . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 661
rec.exp1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 663
rec.normal . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 664
reciprocal . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 665
rhobit . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 667
Rice . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 668
riceff . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 669
rigff . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 671
rlplot.gevff . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 672
rrar . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 674
rrvglm . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 676
rrvglm-class . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 679
rrvglm.control . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 682
rrvglm.optim.control . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 685
ruge . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 686
s . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 687
sc.studentt2 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 689
Select . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 690
seq2binomial . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 692
setup.smart . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 694
Simplex . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 695
simplex . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 697
simulate.vlm . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 698
Sinmad . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 700
sinmad . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 701
Skellam . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 703
skellam . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 704
skewnorm . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 706
skewnormal . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 707
Slash . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 709
slash . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 710
sm.os . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 712
sm.ps . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 716
smart.expression . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 718
smart.mode.is . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 719
smartpred . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 720
sratio . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 722
studentt . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 724
summarypvgam . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 726
summaryvgam . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 727
summaryvglm . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 728
SURff . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 730
SurvS4 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 732
SurvS4-class . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 734
Tikuv . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 735
tikuv . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 736
Tobit . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 738
tobit . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 739
R topics documented: 11

Tol . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 743
Topple . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 745
topple . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 746
toxop . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 748
Triangle . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 749
triangle . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 750
trplot . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 752
trplot.qrrvglm . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 753
Truncpareto . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 755
truncweibull . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 757
ucberk . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 759
uninormal . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 760
UtilitiesVGAM . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 761
V1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 763
vcovvlm . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 764
venice . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 765
vgam . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 767
vgam-class . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 771
vgam.control . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 774
vglm . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 776
vglm-class . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 782
vglm.control . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 784
vglmff-class . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 788
vonmises . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 791
vsmooth.spline . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 793
waitakere . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 795
waldff . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 797
weibull.mean . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 798
weibullR . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 799
weightsvglm . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 802
wine . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 804
wrapup.smart . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 805
yeo.johnson . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 805
yip88 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 807
Yules . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 809
yulesimon . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 810
Zabinom . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 811
zabinomial . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 813
Zageom . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 815
zageometric . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 816
Zanegbin . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 818
zanegbinomial . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 819
Zapois . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 822
zapoisson . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 824
zero . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 826
Zeta . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 827
zeta . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 829
zetaff . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 831
12 VGAM-package

Zibinom . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 833
zibinomial . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 834
Zigeom . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 837
zigeometric . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 838
Zinegbin . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 840
zinegbinomial . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 842
zipebcom . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 845
Zipf . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 848
zipf . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 849
Zipois . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 850
zipoisson . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 852
Zoabeta . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 856
zoabetaR . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 858

Index 860

VGAM-package Vector Generalized Linear and Additive Models

Description
VGAM provides functions for fitting vector generalized linear and additive models (VGLMs and
VGAMs), and associated models (Reduced-rank VGLMs, Quadratic RR-VGLMs, Reduced-rank
VGAMs). This package fits many models and distributions by maximum likelihood estimation
(MLE) or penalized MLE. Also fits constrained ordination models in ecology such as constrained
quadratic ordination (CQO).

Details
This package centers on the iteratively reweighted least squares (IRLS) algorithm. Other key words
include Fisher scoring, additive models, penalized likelihood, reduced-rank regression and con-
strained ordination. The central modelling functions are vglm, vgam, rrvglm, rcim, cqo, cao.
For detailed control of fitting, each of these has its own control function, e.g., vglm.control. The
package uses S4 (see methods-package). A companion package called VGAMdata contains some
larger data sets which were shifted from VGAM.
The classes of GLMs and GAMs are special cases of VGLMs and VGAMs. The VGLM/VGAM
framework is intended to be very general so that it encompasses as many distributions and mod-
els as possible. VGLMs are limited only by the assumption that the regression coefficients enter
through a set of linear predictors. The VGLM class is very large and encompasses a wide range of
multivariate response types and models, e.g., it includes univariate and multivariate distributions,
categorical data analysis, time series, survival analysis, generalized estimating equations, extreme
values, correlated binary data, quantile and expectile regression, bioassay data and nonlinear least-
squares problems.
Crudely, VGAMs are to VGLMs what GAMs are to GLMs. Two types of VGAMs are implemented:
1st-generation VGAMs with s use vector backfitting, while 2nd-generation VGAMs with sm.os
and sm.ps use O-splines and P-splines, do not use the backfitting algorithm, and have automatic
smoothing parameter selection. The former is older and is based on Yee and Wild (1996). The
VGAM-package 13

latter is more modern (Yee, Somchit and Wild, 2017) but it requires a reasonably large number of
observations to work well.
For a complete list of this package, use library(help = "VGAM"). New VGAM family functions
are continually being written and added to the package. A monograph about VGLM and VGAMs
etc. appeared in October 2015.

Warning
This package is undergoing continual development and improvement, therefore users should treat
everything as subject to change. This includes the family function names, argument names, many
of the internals, the use of link functions, and slot names. For example, all link functions may be
renamed so that they end in "link", e.g., loglink() instead of loge(). Some future pain can be
avoided by using good programming techniques, e.g., using extractor/accessor functions such as
coef(), weights(), vcov(), predict(). Nevertheless, please expect changes in all aspects of the
package. See the NEWS file for a list of changes from version to version.

Author(s)
Thomas W. Yee, <[email protected]>.
Maintainer: Thomas Yee <[email protected]>.

References
Yee, T. W. (2015) Vector Generalized Linear and Additive Models: With an Implementation in R.
New York, USA: Springer.
Yee, T. W. and Hastie, T. J. (2003) Reduced-rank vector generalized linear models. Statistical
Modelling, 3, 1541.
Yee, T. W. and Stephenson, A. G. (2007) Vector generalized linear and additive extreme value
models. Extremes, 10, 119.
Yee, T. W. and Wild, C. J. (1996) Vector generalized additive models. Journal of the Royal Statisti-
cal Society, Series B, Methodological, 58, 481493.
Yee, T. W. (2004) A new technique for maximum-likelihood canonical Gaussian ordination. Eco-
logical Monographs, 74, 685701.
Yee, T. W. (2006) Constrained additive ordination. Ecology, 87, 203213.
Yee, T. W. (2008) The VGAM Package. R News, 8, 2839.
Yee, T. W. (2010) The VGAM package for categorical data analysis. Journal of Statistical Software,
32, 134. http://www.jstatsoft.org/v32/i10/.
Yee, T. W. (2014) Reduced-rank vector generalized linear models with two linear predictors. Com-
putational Statistics and Data Analysis, 71, 889902.
Yee, T. W. and Somchit, C. and Wild, C. J. (2017) Penalized vector generalized additive models.
Manuscript in preparation.
My website for the VGAM package and book is at https://www.stat.auckland.ac.nz/~yee
and I hope to put more resources there in the future, especially as relating to my book.
14 VGAM-package

See Also
vglm, vgam, rrvglm, rcim, cqo, TypicalVGAMfamilyFunction, CommonVGAMffArguments, Links,
https://CRAN.R-project.org/package=VGAM.

Examples
# Example 1; proportional odds model
pneumo <- transform(pneumo, let = log(exposure.time))
(fit1 <- vglm(cbind(normal, mild, severe) ~ let, propodds, data = pneumo))
depvar(fit1) # Better than using fit1@y; dependent variable (response)
weights(fit1, type = "prior") # Number of observations
coef(fit1, matrix = TRUE) # p.179, in McCullagh and Nelder (1989)
constraints(fit1) # Constraint matrices
summary(fit1)

# Example 2; zero-inflated Poisson model

zdata <- data.frame(x2 = runif(nn <- 2000))
zdata <- transform(zdata, pstr0 = logit(-0.5 + 1*x2, inverse = TRUE),
lambda = loge( 0.5 + 2*x2, inverse = TRUE))
zdata <- transform(zdata, y = rzipois(nn, lambda, pstr0 = pstr0))
with(zdata, table(y))
fit2 <- vglm(y ~ x2, zipoisson, data = zdata, trace = TRUE)
coef(fit2, matrix = TRUE) # These should agree with the above values

# Example 3; fit a two species GAM simultaneously

fit3 <- vgam(cbind(agaaus, kniexc) ~ s(altitude, df = c(2, 3)),
binomialff(multiple.responses = TRUE), data = hunua)
coef(fit3, matrix = TRUE) # Not really interpretable
## Not run: plot(fit3, se = TRUE, overlay = TRUE, lcol = 3:4, scol = 3:4)

ooo <- with(hunua, order(altitude))

with(hunua, matplot(altitude[ooo], fitted(fit3)[ooo, ], type = "l",
lwd = 2, col = 3:4,
xlab = "Altitude (m)", ylab = "Probability of presence", las = 1,
main = "Two plant species' response curves", ylim = c(0, 0.8)))
with(hunua, rug(altitude))
## End(Not run)

# Example 4; LMS quantile regression

fit4 <- vgam(BMI ~ s(age, df = c(4, 2)), lms.bcn(zero = 1),
data = bmi.nz, trace = TRUE)
head(predict(fit4))
head(fitted(fit4))
head(bmi.nz) # Person 1 is near the lower quartile among people his age
head(cdf(fit4))

## Not run: par(mfrow = c(1, 1), bty = "l", mar = c(5,4,4,3)+0.1, xpd = TRUE)
qtplot(fit4, percentiles = c(5,50,90,99), main = "Quantiles", las = 1,
xlim = c(15, 90), ylab = "BMI", lwd = 2, lcol = 4) # Quantile plot
A1A2A3 15

ygrid <- seq(15, 43, len = 100) # BMI ranges

par(mfrow = c(1, 1), lwd = 2) # Density plot
aa <- deplot(fit4, x0 = 20, y = ygrid, xlab = "BMI", col = "black",
main = "Density functions at Age = 20 (black), 42 (red) and 55 (blue)")
aa
aa <- deplot(fit4, x0 = 42, y = ygrid, add = TRUE, llty = 2, col = "red")
aa <- deplot(fit4, x0 = 55, y = ygrid, add = TRUE, llty = 4, col = "blue",
Attach = TRUE)
aa@post$deplot # Contains density function values

## End(Not run)

# Example 5; GEV distribution for extremes

(fit5 <- vglm(maxtemp ~ 1, gevff, data = oxtemp, trace = TRUE))
head(fitted(fit5))
coef(fit5, matrix = TRUE)
Coef(fit5)
vcov(fit5)
vcov(fit5, untransform = TRUE)
sqrt(diag(vcov(fit5))) # Approximate standard errors
## Not run: rlplot(fit5)

A1A2A3 The A1A2A3 Blood Group System

Description
Estimates the three independent parameters of the the A1A2A3 blood group system.

Usage
A1A2A3(link = "logit", inbreeding = FALSE, ip1 = NULL, ip2 = NULL, iF = NULL)

Arguments
link Link function applied to p1, p2 and f. See Links for more choices.
inbreeding Logical. Is there inbreeding?
ip1, ip2, iF Optional initial value for p1, p2 and f.

Details
The parameters p1 and p2 are probabilities, so that p3=1-p1-p2 is the third probability. The pa-
rameter f is the third independent parameter if inbreeding = TRUE. If inbreeding = FALSE then
f = 0 and Hardy-Weinberg Equilibrium (HWE) is assumed.
16 AA.Aa.aa

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm and vgam.

Note
The input can be a 6-column matrix of counts, with columns corresponding to A1A1, A1A2, A1A3,
A2A2, A2A3, A3A3 (in order). Alternatively, the input can be a 6-column matrix of proportions (so
each row adds to 1) and the weights argument is used to specify the total number of counts for each
row.

Author(s)
T. W. Yee

References
Lange, K. (2002) Mathematical and Statistical Methods for Genetic Analysis, 2nd ed. New York:
Springer-Verlag.

See Also
AA.Aa.aa, AB.Ab.aB.ab, ABO, MNSs.

Examples
ymat <- cbind(108, 196, 429, 143, 513, 559)
fit <- vglm(ymat ~ 1, A1A2A3(link = probit), trace = TRUE, crit = "coef")
fit <- vglm(ymat ~ 1, A1A2A3(link = logit, ip1 = 0.3, ip2 = 0.3, iF = 0.02),
trace = TRUE, crit = "coef")
Coef(fit) # Estimated p1 and p2
rbind(ymat, sum(ymat) * fitted(fit))
sqrt(diag(vcov(fit)))

AA.Aa.aa The AA-Aa-aa Blood Group System

Description
Estimates the parameter of the AA-Aa-aa blood group system, with or without Hardy Weinberg
equilibrium.

Usage
AA.Aa.aa(linkp = "logit", linkf = "logit", inbreeding = FALSE,
ipA = NULL, ifp = NULL, zero = NULL)
AA.Aa.aa 17

Arguments
linkp, linkf Link functions applied to pA and f. See Links for more choices.
ipA, ifp Optional initial values for pA and f.
inbreeding Logical. Is there inbreeding?
zero See CommonVGAMffArguments for information.

Details
This one or two parameter model involves a probability called pA. The probability of getting a
count in the first column of the input (an AA) is pA*pA. When inbreeding = TRUE, an additional
parameter f is used. If inbreeding = FALSE then f = 0 and Hardy-Weinberg Equilibrium (HWE)
is assumed. The EIM is used if inbreeding = FALSE.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm and vgam.

Warning
Setting inbreeding = FALSE makes estimation difficult with non-intercept-only models. Currently,
this code seems to work with intercept-only models.

Note
The input can be a 3-column matrix of counts, where the columns are AA, Ab and aa (in order).
Alternatively, the input can be a 3-column matrix of proportions (so each row adds to 1) and the
weights argument is used to specify the total number of counts for each row.

Author(s)
T. W. Yee

References
Weir, B. S. (1996) Genetic Data Analysis II: Methods for Discrete Population Genetic Data, Sun-
derland, MA: Sinauer Associates, Inc.

See Also
AB.Ab.aB.ab, ABO, A1A2A3, MNSs.

Examples
y <- cbind(53, 95, 38)
fit1 <- vglm(y ~ 1, AA.Aa.aa, trace = TRUE)
fit2 <- vglm(y ~ 1, AA.Aa.aa(inbreeding = TRUE), trace = TRUE)
rbind(y, sum(y) * fitted(fit1))
Coef(fit1) # Estimated pA
18 AB.Ab.aB.ab

Coef(fit2) # Estimated pA and f

summary(fit1)

AB.Ab.aB.ab The AB-Ab-aB-ab Blood Group System

Description
Estimates the parameter of the AB-Ab-aB-ab blood group system.

Usage
AB.Ab.aB.ab(link = "logit", init.p = NULL)

Arguments
link Link function applied to p. See Links for more choices.
init.p Optional initial value for p.

Details
This one parameter model involves a probability called p.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm and vgam.

Note
The input can be a 4-column matrix of counts, where the columns are AB, Ab, aB and ab (in order).
Alternatively, the input can be a 4-column matrix of proportions (so each row adds to 1) and the
weights argument is used to specify the total number of counts for each row.

Author(s)
T. W. Yee

References
Lange, K. (2002) Mathematical and Statistical Methods for Genetic Analysis, 2nd ed. New York:
Springer-Verlag.

See Also
AA.Aa.aa, ABO, A1A2A3, MNSs.
ABO 19

Examples
ymat <- cbind(AB=1997, Ab=906, aB=904, ab=32) # Data from Fisher (1925)
fit <- vglm(ymat ~ 1, AB.Ab.aB.ab(link = "identitylink"), trace = TRUE)
fit <- vglm(ymat ~ 1, AB.Ab.aB.ab, trace = TRUE)
rbind(ymat, sum(ymat)*fitted(fit))
Coef(fit) # Estimated p
p <- sqrt(4*(fitted(fit)[, 4]))
p*p
summary(fit)

ABO The ABO Blood Group System

Description
Estimates the two independent parameters of the the ABO blood group system.

Usage
ABO(link.pA = "logit", link.pB = "logit", ipA = NULL, ipB = NULL,
ipO = NULL, zero = NULL)

Arguments
link.pA, link.pB
Link functions applied to pA and pB. See Links for more choices.
ipA, ipB, ipO Optional initial value for pA and pB and pO. A NULL value means values are
computed internally.
zero Details at CommonVGAMffArguments.

Details
The parameters pA and pB are probabilities, so that pO=1-pA-pB is the third probability. The proba-
bilities pA and pB correspond to A and B respectively, so that pO is the probability for O. It is easier
to make use of initial values for pO than for pB. In documentation elsewhere I sometimes use pA=p,
pB=q, pO=r.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm and vgam.

Note
The input can be a 4-column matrix of counts, where the columns are A, B, AB, O (in order).
Alternatively, the input can be a 4-column matrix of proportions (so each row adds to 1) and the
weights argument is used to specify the total number of counts for each row.
20 acat

Author(s)
T. W. Yee

References
Lange, K. (2002) Mathematical and Statistical Methods for Genetic Analysis, 2nd ed. New York:
Springer-Verlag.

See Also
AA.Aa.aa, AB.Ab.aB.ab, A1A2A3, MNSs.

Examples
ymat <- cbind(A = 725, B = 258, AB = 72, O = 1073) # Order matters, not the name
fit <- vglm(ymat ~ 1, ABO(link.pA = identitylink,
link.pB = identitylink), trace = TRUE, cri = "coef")
coef(fit, matrix = TRUE)
Coef(fit) # Estimated pA and pB
rbind(ymat, sum(ymat) * fitted(fit))
sqrt(diag(vcov(fit)))

acat Ordinal Regression with Adjacent Categories Probabilities

Description
Fits an adjacent categories regression model to an ordered (preferably) factor response.

Usage
acat(link = "loge", parallel = FALSE, reverse = FALSE,
zero = NULL, whitespace = FALSE)

Arguments
link Link function applied to the ratios of the adjacent categories probabilities. See
Links for more choices.
parallel A logical, or formula specifying which terms have equal/unequal coefficients.
reverse Logical. By default, the linear/additive predictors used are j = log(P [Y =
j + 1]/P [Y = j]) for j = 1, . . . , M . If reverse is TRUE then j = log(P [Y =
j]/P [Y = j + 1]) will be used.
zero An integer-valued vector specifying which linear/additive predictors are mod-
elled as intercepts only. The values must be from the set {1,2,. . . ,M }.
whitespace See CommonVGAMffArguments for information.
acat 21

Details
In this help file the response Y is assumed to be a factor with ordered values 1, 2, . . . , M + 1, so
that M is the number of linear/additive predictors j .
By default, the log link is used because the ratio of two probabilities is positive.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, rrvglm and vgam.

Warning
No check is made to verify that the response is ordinal if the response is a matrix; see ordered.

Note
The response should be either a matrix of counts (with row sums that are all positive), or an ordered
factor. In both cases, the y slot returned by vglm/vgam/rrvglm is the matrix of counts.
For a nominal (unordered) factor response, the multinomial logit model (multinomial) is more
appropriate.
Here is an example of the usage of the parallel argument. If there are covariates x1, x2 and x3,
then parallel = TRUE ~ x1 + x2 -1 and parallel = FALSE ~ x3 are equivalent. This
would constrain the regression coefficients for x1 and x2 to be equal; those of the intercepts and x3
would be different.

Author(s)
Thomas W. Yee

References
Agresti, A. (2013) Categorical Data Analysis, 3rd ed. Hoboken, NJ, USA: Wiley.
Simonoff, J. S. (2003) Analyzing Categorical Data, New York: Springer-Verlag.
Yee, T. W. (2010) The VGAM package for categorical data analysis. Journal of Statistical Software,
32, 134. http://www.jstatsoft.org/v32/i10/.

See Also
cumulative, cratio, sratio, multinomial, margeff, pneumo.

Examples
pneumo <- transform(pneumo, let = log(exposure.time))
(fit <- vglm(cbind(normal, mild, severe) ~ let, acat, data = pneumo))
coef(fit, matrix = TRUE)
constraints(fit)
model.matrix(fit)
22 AICvlm

AICvlm Akaikes Information Criterion

Description
Calculates the Akaike information criterion for a fitted model object for which a log-likelihood
value has been obtained.

Usage
AICvlm(object, ..., corrected = FALSE, k = 2)
AICvgam(object, ..., k = 2)
AICrrvglm(object, ..., k = 2)
AICqrrvglm(object, ..., k = 2)
AICrrvgam(object, ..., k = 2)

Arguments
object Some VGAM object, for example, having class vglmff-class.
... Other possible arguments fed into logLik in order to compute the log-likelihood.
corrected Logical, perform the finite sample correction?
k Numeric, the penalty per parameter to be used; the default is the classical AIC.

Details
The following formula is used for VGLMs: 2log-likelihood + knpar , where npar represents the
number of parameters in the fitted model, and k = 2 for the usual AIC. One could assign k = log(n)
(n the number of observations) for the so-called BIC or SBC (Schwarzs Bayesian criterion). This
is the function AICvlm().
This code relies on the log-likelihood being defined, and computed, for the object. When comparing
fitted objects, the smaller the AIC, the better the fit. The log-likelihood and hence the AIC is only
defined up to an additive constant.
Any estimated scale parameter (in GLM parlance) is used as one parameter.
For VGAMs and CAO the nonlinear effective degrees of freedom for each smoothed component is
used. This formula is heuristic. These are the functions AICvgam() and AICcao().
The finite sample correction is usually recommended when the sample size is small or when the
number of parameters is large. When the sample size is large their difference tends to be negligible.
The correction is described in Hurvich and Tsai (1989), and is based on a (univariate) linear model
with normally distributed errors.

Value
Returns a numeric value with the corresponding AIC (or BIC, or . . . , depending on k).
AICvlm 23

Warning

This code has not been double-checked. The general applicability of AIC for the VGLM/VGAM
classes has not been developed fully. In particular, AIC should not be run on some VGAM family
functions because of violation of certain regularity conditions, etc.

Note

AIC has not been defined for QRR-VGLMs yet.

Using AIC to compare posbinomial models with, e.g., posbernoulli.tb models, requires posbinomial(omit.constant =
See posbinomial for an example. A warning is given if it suspects a wrong omit.constant value
was used.
Where defined, AICc(...) is the same as AIC(..., corrected = TRUE).

Author(s)

T. W. Yee.

References

Hurvich, C. M. and Tsai, C.-L. (1989) Regression and time series model selection in small samples,
Biometrika, 76, 297307.

See Also

VGLMs are described in vglm-class; VGAMs are described in vgam-class; RR-VGLMs are
described in rrvglm-class; AIC, BICvlm.

Examples

pneumo <- transform(pneumo, let = log(exposure.time))

(fit1 <- vglm(cbind(normal, mild, severe) ~ let,
cumulative(parallel = TRUE, reverse = TRUE), data = pneumo))
coef(fit1, matrix = TRUE)
AIC(fit1)
AICc(fit1) # Quick way
AIC(fit1, corrected = TRUE) # Slow way
(fit2 <- vglm(cbind(normal, mild, severe) ~ let,
cumulative(parallel = FALSE, reverse = TRUE), data = pneumo))
coef(fit2, matrix = TRUE)
AIC(fit2)
AICc(fit2)
AIC(fit2, corrected = TRUE)
24 alaplace

alaplace Asymmetric Laplace Distribution Family Functions

Description
Maximum likelihood estimation of the 1, 2 and 3-parameter asymmetric Laplace distributions
(ALDs). The 2-parameter ALD may, with trepidation and lots of skill, sometimes be used as an
approximation of quantile regression.

Usage
alaplace1(tau = NULL, llocation = "identitylink",
ilocation = NULL, kappa = sqrt(tau/(1 - tau)), Scale.arg = 1,
ishrinkage = 0.95, parallel.locat = TRUE ~ 0, digt = 4,
idf.mu = 3, zero = NULL, imethod = 1)

alaplace2(tau = NULL, llocation = "identitylink", lscale = "loge",

ilocation = NULL, iscale = NULL, kappa = sqrt(tau/(1 - tau)),
ishrinkage = 0.95,
parallel.locat = TRUE ~ 0,
parallel.scale = FALSE ~ 0,
digt = 4, idf.mu = 3, imethod = 1, zero = "scale")

alaplace3(llocation = "identitylink", lscale = "loge", lkappa = "loge",

ilocation = NULL, iscale = NULL, ikappa = 1,
imethod = 1, zero = c("scale", "kappa"))

Arguments
tau, kappa Numeric vectors with 0 < < 1 and > 0. Most users will only specify
tau since the estimated location parameter corresponds to the th regression
quantile, which is easier to understand. See below for details.
llocation, lscale, lkappa
Character. Parameter link functions for location parameter , scale parameter
, asymmetry parameter . See Links for more choices. For example, the
argument llocation can help handle count data by restricting the quantiles to
be positive (use llocation = "loge"). However, llocation is best left alone
since the theory only works properly with the identity link.
ilocation, iscale, ikappa
Optional initial values. If given, it must be numeric and values are recycled to
the appropriate length. The default is to choose the value internally.
parallel.locat, parallel.scale
See the parallel argument of CommonVGAMffArguments. These arguments ap-
ply to the location and scale parameters. It generally only makes sense for the
scale parameters to be equal, hence set parallel.scale = TRUE. Note that
alaplace 25

assigning parallel.locat the value TRUE circumvents the seriously embar-

rassing quantile crossing problem because all constraint matrices except for the
intercept correspond to a parallelism assumption.
imethod Initialization method. Either the value 1, 2, 3 or 4.
idf.mu Degrees of freedom for the cubic smoothing spline fit applied to get an initial
estimate of the location parameter. See vsmooth.spline. Used only when
imethod = 3.
ishrinkage How much shrinkage is used when initializing . The value must be between 0
and 1 inclusive, and a value of 0 means the individual response values are used,
and a value of 1 means the median or mean is used. This argument is used only
when imethod = 4. See CommonVGAMffArguments for more information.
Scale.arg The value of the scale parameter . This argument may be used to compute
quantiles at different values from an existing fitted alaplace2() model (prac-
tical only if it has a single value). If the model has parallel.locat = TRUE
then only the intercept need be estimated; use an offset. See below for an exam-
ple.
digt Passed into Round as the digits argument for the tau values; used cosmetically
for labelling.
zero See CommonVGAMffArguments for more information. Where possible, the de-
fault is to model all the and as an intercept-only term.

Details
These VGAM family functions implement one variant of asymmetric Laplace distributions (ALDs)
suitable for quantile regression. Kotz et al. (2001) call it the ALD. Its density function is
!
2 2
f (y; , , ) = exp |y |
1 + 2

for y , and
!
2 2
f (y; , , ) = exp |y |
1 + 2
for y > . Here, the ranges are for all real y and , positive and positive . The special case
= 1 corresponds to the (symmetric) Laplace distribution of Kotz et al. (2001). The mean is
+ (1/ )/ 2 and the variance is 2 (1 + 4 )/(22 ). The enumeration of the linear/additive
predictors used for alaplace2() is the first location parameter followed by the first scale parameter,
then the second location parameter followed by the second scale parameter, etc. For alaplace3(),
only a vector response is handled and the last (third) linear/additive predictor is for the asymmetry
parameter.
It is known that the maximum likelihood estimate of the location parameter corresponds to the
regression quantile estimate of the classical quantile regression approach of Koenker and Bassett
2 2
(1978). Anp important property of the ALD is that P (Y ) = where = /(1 + ) so
that = /(1 ). Thus alaplace2() might be used as an alternative to rq in the quantreg
package, although scoring is really an unsuitable algorithm for estimation here.
Both alaplace1() and alaplace2() can handle multiple responses, and the number of linear/additive
predictors is dictated by the length of tau or kappa. The functions alaplace1() and alaplace2()
26 alaplace

can also handle multiple responses (i.e., a matrix response) but only with a single-valued tau or
kappa.

Warning
These functions are experimental and especially subject to change or withdrawal. The MLE regu-
larity conditions do not hold for this distribution so that misleading inferences may result, e.g., in
the summary and vcov of the object.
Care is needed with tau values which are too small, e.g., for count data with llocation = "loge"
and if the sample proportion of zeros is greater than tau.

Note
These VGAM family functions use Fisher scoring. Convergence may be slow and half-stepping is
usual (although one can use trace = TRUE to see which is the best model and then use maxit to
choose that model) due to the regularity conditions not holding. Often the iterations slowly crawl
towards the solution so monitoring the convergence (set trace = TRUE) is highly recommended.
For large data sets it is a very good idea to keep the length of tau/kappa low to avoid large
memory requirements. Then for parallel.locat = FALSE one can repeatedly fit a model with
alaplace1() with one at a time; and for parallel.locat = TRUE one can refit a model with
alaplace1() with one at a time but using offsets and an intercept-only model.
A second method for solving the noncrossing quantile problem is illustrated below in Example 3.
This is called the accumulative quantile method (AQM) and details are in Yee (2015). It does not
make the strong parallelism assumption.
The functions alaplace2() and laplace differ slightly in terms of the parameterizations.

Author(s)
Thomas W. Yee

References
Koenker, R. and Bassett, G. (1978) Regression quantiles. Econometrica, 46, 3350.
Kotz, S., Kozubowski, T. J. and Podgorski, K. (2001) The Laplace distribution and generaliza-
tions: a revisit with applications to communications, economics, engineering, and finance, Boston:
Birkhauser.

See Also
ralap, laplace, CommonVGAMffArguments, lms.bcn, amlnormal, sc.studentt2, simulate.vlm.
alaplace 27

Examples

## Not run:
# Example 1: quantile regression with smoothing splines
set.seed(123); adata <- data.frame(x2 = sort(runif(n <- 500)))
mymu <- function(x) exp(-2 + 6*sin(2*x-0.2) / (x+0.5)^2)
adata <- transform(adata, y = rpois(n, lambda = mymu(x2)))
mytau <- c(0.25, 0.75); mydof <- 4

fit <- vgam(y ~ s(x2, df = mydof), data = adata, trace = TRUE, maxit = 900,
alaplace2(tau = mytau, llocat = "loge",
parallel.locat = FALSE))
fitp <- vgam(y ~ s(x2, df = mydof), data = adata, trace = TRUE, maxit = 900,
alaplace2(tau = mytau, llocat = "loge", parallel.locat = TRUE))

par(las = 1); mylwd <- 1.5

with(adata, plot(x2, jitter(y, factor = 0.5), col = "orange",
main = "Example 1; green: parallel.locat = TRUE",
ylab = "y", pch = "o", cex = 0.75))
with(adata, matlines(x2, fitted(fit ), col = "blue",
lty = "solid", lwd = mylwd))
with(adata, matlines(x2, fitted(fitp), col = "green",
lty = "solid", lwd = mylwd))
finexgrid <- seq(0, 1, len = 1001)
for (ii in 1:length(mytau))
lines(finexgrid, qpois(p = mytau[ii], lambda = mymu(finexgrid)),
col = "blue", lwd = mylwd)
fit@extra # Contains useful information

# Example 2: regression quantile at a new tau value from an existing fit

# Nb. regression splines are used here since it is easier.
fitp2 <- vglm(y ~ sm.bs(x2, df = mydof), data = adata, trace = TRUE,
alaplace1(tau = mytau, llocation = "loge",
parallel.locat = TRUE))

newtau <- 0.5 # Want to refit the model with this tau value
fitp3 <- vglm(y ~ 1 + offset(predict(fitp2)[, 1]),
alaplace1(tau = newtau, llocation = "loge"), data = adata)
with(adata, plot(x2, jitter(y, factor = 0.5), col = "orange",
pch = "o", cex = 0.75, ylab = "y",
main = "Example 2; parallel.locat = TRUE"))
with(adata, matlines(x2, fitted(fitp2), col = "blue",
lty = 1, lwd = mylwd))
with(adata, matlines(x2, fitted(fitp3), col = "black",
lty = 1, lwd = mylwd))

# Example 3: noncrossing regression quantiles using a trick: obtain

# successive solutions which are added to previous solutions; use a log
# link to ensure an increasing quantiles at any value of x.

mytau <- seq(0.2, 0.9, by = 0.1)

28 alaplaceUC

answer <- matrix(0, nrow(adata), length(mytau)) # Stores the quantiles

adata <- transform(adata, offsety = y*0)
usetau <- mytau
for (ii in 1:length(mytau)) {
# cat("\n\nii = ", ii, "\n")
adata <- transform(adata, usey = y-offsety)
iloc <- ifelse(ii == 1, with(adata, median(y)), 1.0) # Well-chosen!
mydf <- ifelse(ii == 1, 5, 3) # Maybe less smoothing will help
# lloc <- ifelse(ii == 1, "loge", "loge") # 2nd value must be "loge"
fit3 <- vglm(usey ~ sm.ns(x2, df = mydf), data = adata, trace = TRUE,
alaplace2(tau = usetau[ii], lloc = "loge", iloc = iloc))
answer[, ii] <- (if(ii == 1) 0 else answer[, ii-1]) + fitted(fit3)
adata <- transform(adata, offsety = answer[, ii])
}

# Plot the results.

with(adata, plot(x2, y, col = "blue",
main = paste("Noncrossing and nonparallel; tau = ",
paste(mytau, collapse = ", "))))
with(adata, matlines(x2, answer, col = "orange", lty = 1))

# Zoom in near the origin.

with(adata, plot(x2, y, col = "blue", xlim = c(0, 0.2), ylim = 0:1,
main = paste("Noncrossing and nonparallel; tau = ",
paste(mytau, collapse = ", "))))
with(adata, matlines(x2, answer, col = "orange", lty = 1))

## End(Not run)

alaplaceUC The Laplace Distribution

Description

Density, distribution function, quantile function and random generation for the 3-parameter asym-
metric Laplace distribution with location parameter location, scale parameter scale, and asym-
metry parameter kappa.

Usage

dalap(x, location = 0, scale = 1, tau = 0.5, kappa = sqrt(tau/(1-tau)),

log = FALSE)
palap(q, location = 0, scale = 1, tau = 0.5, kappa = sqrt(tau/(1-tau)),
lower.tail = TRUE, log.p = FALSE)
qalap(p, location = 0, scale = 1, tau = 0.5, kappa = sqrt(tau/(1-tau)),
lower.tail = TRUE, log.p = FALSE)
ralap(n, location = 0, scale = 1, tau = 0.5, kappa = sqrt(tau/(1-tau)))
alaplaceUC 29

Arguments
x, q vector of quantiles.
p vector of probabilities.
n number of observations. If length(n) > 1 then the length is taken to be the
number required.
location the location parameter .
scale the scale parameter . Must consist of positive values.
tau the quantile parameter . Must consist of values in (0, 1). This argument is used
to specify kappa and is ignored if kappa is assigned.
kappa the asymmetry parameter . Must consist of positive values.
log if TRUE, probabilities p are given as log(p).
lower.tail, log.p
Same meaning as in pnorm or qnorm.

Details
There are many variants of asymmetric Laplace distributions (ALDs) and this one is known as the
ALD by Kotz et al. (2001). See alaplace3, the VGAM family function for estimating the three
parameters by maximum likelihood estimation, for formulae and details.

Value
dalap gives the density, palap gives the distribution function, qalap gives the quantile function,
and ralap generates random deviates.

Author(s)
T. W. Yee and Kai Huang

abline(h = 0, col = "blue", lty = 2)

lines(qalap(seq(0.05, 0.95, by = 0.05), loc, sigma, kappa = kappa),
dalap(qalap(seq(0.05, 0.95, by = 0.05), loc, sigma, kappa = kappa),
loc, sigma, kappa = kappa), col = "purple", lty = 3, type = "h")
lines(x, palap(x, loc, sigma, kappa = kappa), type = "l", col = "orange")
abline(h = 0, lty = 2)
## End(Not run)

pp <- seq(0.05, 0.95, by = 0.05) # Test two functions

max(abs(palap(qalap(pp, loc, sigma, kappa = kappa),
loc, sigma, kappa = kappa) - pp)) # Should be 0

amlbinomial Binomial Logistic Regression by Asymmetric Maximum Likelihood

Estimation

Description
Binomial quantile regression estimated by maximizing an asymmetric likelihood function.

Usage
amlbinomial(w.aml = 1, parallel = FALSE, digw = 4, link = "logit")

Arguments
w.aml Numeric, a vector of positive constants controlling the percentiles. The larger
the value the larger the fitted percentile value (the proportion of points below
the w-regression plane). The default value of unity results in the ordinary
maximum likelihood (MLE) solution.
parallel If w.aml has more than one value then this argument allows the quantile curves
to differ by the same amount as a function of the covariates. Setting this to be
TRUE should force the quantile curves to not cross (although they may not cross
anyway). See CommonVGAMffArguments for more information.
digw Passed into Round as the digits argument for the w.aml values; used cosmeti-
cally for labelling.
link See binomialff.

Details
The general methodology behind this VGAM family function is given in Efron (1992) and full
details can be obtained there. This model is essentially a logistic regression model (see binomialff)
but the usual deviance is replaced by an asymmetric squared error loss function; it is multiplied by
w.aml for positive residuals. The solution is the set of regression coefficients that minimize the
sum of these deviance-type values over the data set, weighted by the weights argument (so that it
can contain frequencies). Newton-Raphson estimation is used here.
amlbinomial 31

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm and vgam.

Warning
If w.aml has more than one value then the value returned by deviance is the sum of all the
(weighted) deviances taken over all the w.aml values. See Equation (1.6) of Efron (1992).

Author(s)
Thomas W. Yee

References
Efron, B. (1992) Poisson overdispersion estimates based on the method of asymmetric maximum
likelihood. Journal of the American Statistical Association, 87, 98107.

See Also
amlpoisson, amlexponential, amlnormal, alaplace1, denorm.

Examples
# Example: binomial data with lots of trials per observation
set.seed(1234)
sizevec <- rep(100, length = (nn <- 200))
mydat <- data.frame(x = sort(runif(nn)))
mydat <- transform(mydat, prob = logit(-0 + 2.5*x + x^2, inverse = TRUE))
mydat <- transform(mydat, y = rbinom(nn, size = sizevec, prob = prob))
(fit <- vgam(cbind(y, sizevec - y) ~ s(x, df = 3),
amlbinomial(w = c(0.01, 0.2, 1, 5, 60)),
mydat, trace = TRUE))
fit@extra

## Not run:
par(mfrow = c(1,2))
# Quantile plot
with(mydat, plot(x, jitter(y), col = "blue", las = 1, main =
32 amlexponential

paste(paste(round(fit@extra$percentile, digits = 1), collapse = ", "),

"percentile-expectile curves")))
with(mydat, matlines(x, 100 * fitted(fit), lwd = 2, col = "blue", lty = 1))

# Compare the fitted expectiles with the quantiles

with(mydat, plot(x, jitter(y), col = "blue", las = 1, main =
paste(paste(round(fit@extra$percentile, digits = 1), collapse = ", "),
"percentile curves are red")))
with(mydat, matlines(x, 100 * fitted(fit), lwd = 2, col = "blue", lty = 1))

for (ii in fit@extra$percentile)

with(mydat, matlines(x, 100 *
qbinom(p = ii/100, size = sizevec, prob = prob) / sizevec,
col = "red", lwd = 2, lty = 1))

## End(Not run)

amlexponential Exponential Regression by Asymmetric Maximum Likelihood Estima-

tion

Description
Exponential expectile regression estimated by maximizing an asymmetric likelihood function.

Usage
amlexponential(w.aml = 1, parallel = FALSE, imethod = 1, digw = 4,
link = "loge")

Arguments
w.aml Numeric, a vector of positive constants controlling the expectiles. The larger the
value the larger the fitted expectile value (the proportion of points below the w-
regression plane). The default value of unity results in the ordinary maximum
likelihood (MLE) solution.
parallel If w.aml has more than one value then this argument allows the quantile curves
to differ by the same amount as a function of the covariates. Setting this to be
TRUE should force the quantile curves to not cross (although they may not cross
anyway). See CommonVGAMffArguments for more information.
imethod Integer, either 1 or 2 or 3. Initialization method. Choose another value if con-
vergence fails.
digw Passed into Round as the digits argument for the w.aml values; used cosmeti-
cally for labelling.
link See exponential and the warning below.
amlexponential 33

Details

The general methodology behind this VGAM family function is given in Efron (1992) and full
details can be obtained there.
This model is essentially an exponential regression model (see exponential) but the usual deviance
is replaced by an asymmetric squared error loss function; it is multiplied by w.aml for positive
residuals. The solution is the set of regression coefficients that minimize the sum of these deviance-
type values over the data set, weighted by the weights argument (so that it can contain frequencies).
Newton-Raphson estimation is used here.

Value

An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm and vgam.

Warning

Note that the link argument of exponential and amlexponential are currently different: one is
the rate parameter and the other is the mean (expectile) parameter.
If w.aml has more than one value then the value returned by deviance is the sum of all the
(weighted) deviances taken over all the w.aml values. See Equation (1.6) of Efron (1992).

Note

On fitting, the extra slot has list components "w.aml" and "percentile". The latter is the percent
of observations below the w-regression plane, which is the fitted values. Also, the individual
deviance values corresponding to each element of the argument w.aml is stored in the extra slot.
For amlexponential objects, methods functions for the generic functions qtplot and cdf have not
been written yet.
See amlpoisson about comments on the jargon, e.g., expectiles etc.
In this documentation the word quantile can often be interchangeably replaced by expectile (things
are informal here).

Author(s)

Thomas W. Yee

References

Efron, B. (1992) Poisson overdispersion estimates based on the method of asymmetric maximum
likelihood. Journal of the American Statistical Association, 87, 98107.

See Also

exponential, amlbinomial, amlpoisson, amlnormal, alaplace1, lms.bcg, deexp.

34 amlnormal

Examples
nn <- 2000
mydat <- data.frame(x = seq(0, 1, length = nn))
mydat <- transform(mydat, mu = loge(-0 + 1.5*x + 0.2*x^2, inverse = TRUE))
mydat <- transform(mydat, mu = loge(0 - sin(8*x), inverse = TRUE))
mydat <- transform(mydat, y = rexp(nn, rate = 1/mu))
(fit <- vgam(y ~ s(x,df = 5), amlexponential(w = c(0.001, 0.1, 0.5, 5, 60)),
mydat, trace = TRUE))
fit@extra

## Not run: # These plots are against the sqrt scale (to increase clarity)
par(mfrow = c(1,2))
# Quantile plot
with(mydat, plot(x, sqrt(y), col = "blue", las = 1, main =
paste(paste(round(fit@extra$percentile, digits = 1), collapse = ", "),
"percentile-expectile curves")))
with(mydat, matlines(x, sqrt(fitted(fit)), lwd = 2, col = "blue", lty = 1))

# Compare the fitted expectiles with the quantiles

with(mydat, plot(x, sqrt(y), col = "blue", las = 1, main =
paste(paste(round(fit@extra$percentile, digits = 1), collapse = ", "),
"percentile curves are orange")))
with(mydat, matlines(x, sqrt(fitted(fit)), lwd = 2, col = "blue", lty = 1))

for (ii in fit@extra$percentile)

with(mydat, matlines(x, sqrt(qexp(p = ii/100, rate = 1/mu)), col = "orange"))
## End(Not run)

amlnormal Asymmetric Least Squares Quantile Regression

Description
Asymmetric least squares, a special case of maximizing an asymmetric likelihood function of a
normal distribution. This allows for expectile/quantile regression using asymmetric least squares
error loss.

Usage
amlnormal(w.aml = 1, parallel = FALSE, lexpectile = "identitylink",
iexpectile = NULL, imethod = 1, digw = 4)

parallel If w.aml has more than one value then this argument allows the quantile curves
to differ by the same amount as a function of the covariates. Setting this to be
TRUE should force the quantile curves to not cross (although they may not cross
anyway). See CommonVGAMffArguments for more information.
lexpectile, iexpectile
See CommonVGAMffArguments for more information.
imethod Integer, either 1 or 2 or 3. Initialization method. Choose another value if con-
vergence fails.
digw Passed into Round as the digits argument for the w.aml values; used cosmeti-
cally for labelling.

Details
This is an implementation of Efron (1991) and full details can be obtained there. Equation numbers
below refer to that article. The model is essentially a linear model (see lm), however, the asymmetric
squared error loss function for a residual r is r2 if r 0 and wr2 if r > 0. The solution is the set of
regression coefficients that minimize the sum of these over the data set, weighted by the weights
argument (so that it can contain frequencies). Newton-Raphson estimation is used here.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm and vgam.

Note
On fitting, the extra slot has list components "w.aml" and "percentile". The latter is the percent
of observations below the w-regression plane, which is the fitted values.
One difficulty is finding the w.aml value giving a specified percentile. One solution is to fit the
model within a root finding function such as uniroot; see the example below.
For amlnormal objects, methods functions for the generic functions qtplot and cdf have not been
written yet.
See the note in amlpoisson on the jargon, including expectiles and regression quantiles.
The deviance slot computes the total asymmetric squared error loss (2.5). If w.aml has more than
one value then the value returned by the slot is the sum taken over all the w.aml values.
This VGAM family function could well be renamed amlnormal() instead, given the other function
names amlpoisson, amlbinomial, etc.
In this documentation the word quantile can often be interchangeably replaced by expectile (things
are informal here).

Author(s)
Thomas W. Yee

References
Efron, B. (1991) Regression percentiles using asymmetric squared error loss. Statistica Sinica, 1,
93125.
36 amlnormal

See Also
amlpoisson, amlbinomial, amlexponential, bmi.nz, alaplace1, denorm, lms.bcn and similar
variants are alternative methods for quantile regression.

Examples
## Not run:
# Example 1
ooo <- with(bmi.nz, order(age))
bmi.nz <- bmi.nz[ooo, ] # Sort by age
(fit <- vglm(BMI ~ sm.bs(age), amlnormal(w.aml = 0.1), data = bmi.nz))
fit@extra # Gives the w value and the percentile
coef(fit, matrix = TRUE)

# Quantile plot
with(bmi.nz, plot(age, BMI, col = "blue", main =
paste(round(fit@extra$percentile, digits = 1),
"expectile-percentile curve")))
with(bmi.nz, lines(age, c(fitted(fit)), col = "black"))

# Example 2
# Find the w values that give the 25, 50 and 75 percentiles
find.w <- function(w, percentile = 50) {
fit2 <- vglm(BMI ~ sm.bs(age), amlnormal(w = w), data = bmi.nz)
fit2@extra$percentile - percentile
}
# Quantile plot
with(bmi.nz, plot(age, BMI, col = "blue", las = 1, main =
"25, 50 and 75 expectile-percentile curves"))
for (myp in c(25, 50, 75)) {
# Note: uniroot() can only find one root at a time
bestw <- uniroot(f = find.w, interval = c(1/10^4, 10^4), percentile = myp)
fit2 <- vglm(BMI ~ sm.bs(age), amlnormal(w = bestw$root), data = bmi.nz)
with(bmi.nz, lines(age, c(fitted(fit2)), col = "orange"))
}

# Example 3; this is Example 1 but with smoothing splines and

# a vector w and a parallelism assumption.
ooo <- with(bmi.nz, order(age))
bmi.nz <- bmi.nz[ooo, ] # Sort by age
fit3 <- vgam(BMI ~ s(age, df = 4), data = bmi.nz, trace = TRUE,
amlnormal(w = c(0.1, 1, 10), parallel = TRUE))
fit3@extra # The w values, percentiles and weighted deviances

# The linear components of the fit; not for human consumption:

coef(fit3, matrix = TRUE)

# Quantile plot
with(bmi.nz, plot(age, BMI, col="blue", main =
paste(paste(round(fit3@extra$percentile, digits = 1), collapse = ", "),
"expectile-percentile curves")))
with(bmi.nz, matlines(age, fitted(fit3), col = 1:fit3@extra$M, lwd = 2))
amlpoisson 37

with(bmi.nz, lines(age, c(fitted(fit )), col = "black")) # For comparison

## End(Not run)

amlpoisson Poisson Regression by Asymmetric Maximum Likelihood Estimation

Description
Poisson quantile regression estimated by maximizing an asymmetric likelihood function.

Usage
amlpoisson(w.aml = 1, parallel = FALSE, imethod = 1, digw = 4,
link = "loge")

Arguments
w.aml Numeric, a vector of positive constants controlling the percentiles. The larger
the value the larger the fitted percentile value (the proportion of points below
the w-regression plane). The default value of unity results in the ordinary
maximum likelihood (MLE) solution.
parallel If w.aml has more than one value then this argument allows the quantile curves
to differ by the same amount as a function of the covariates. Setting this to be
TRUE should force the quantile curves to not cross (although they may not cross
anyway). See CommonVGAMffArguments for more information.
imethod Integer, either 1 or 2 or 3. Initialization method. Choose another value if con-
vergence fails.
digw Passed into Round as the digits argument for the w.aml values; used cosmeti-
cally for labelling.
link See poissonff.

Details
This method was proposed by Efron (1992) and full details can be obtained there.
The model is essentially a Poisson regression model (see poissonff) but the usual deviance is
replaced by an asymmetric squared error loss function; it is multiplied by w.aml for positive resid-
uals. The solution is the set of regression coefficients that minimize the sum of these deviance-type
values over the data set, weighted by the weights argument (so that it can contain frequencies).
Newton-Raphson estimation is used here.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm and vgam.
38 amlpoisson

Warning
If w.aml has more than one value then the value returned by deviance is the sum of all the
(weighted) deviances taken over all the w.aml values. See Equation (1.6) of Efron (1992).

Note
On fitting, the extra slot has list components "w.aml" and "percentile". The latter is the percent
of observations below the w-regression plane, which is the fitted values. Also, the individual
deviance values corresponding to each element of the argument w.aml is stored in the extra slot.
For amlpoisson objects, methods functions for the generic functions qtplot and cdf have not been
written yet.
About the jargon, Newey and Powell (1987) used the name expectiles for regression surfaces ob-
tained by asymmetric least squares. This was deliberate so as to distinguish them from the original
regression quantiles of Koenker and Bassett (1978). Efron (1991) and Efron (1992) use the general
name regression percentile to apply to all forms of asymmetric fitting. Although the asymmetric
maximum likelihood method very nearly gives regression percentiles in the strictest sense for the
normal and Poisson cases, the phrase quantile regression is used loosely in this VGAM documen-
tation.
In this documentation the word quantile can often be interchangeably replaced by expectile (things
are informal here).

Author(s)
Thomas W. Yee

References
Efron, B. (1991) Regression percentiles using asymmetric squared error loss. Statistica Sinica, 1,
93125.
Efron, B. (1992) Poisson overdispersion estimates based on the method of asymmetric maximum
likelihood. Journal of the American Statistical Association, 87, 98107.
Koenker, R. and Bassett, G. (1978) Regression quantiles. Econometrica, 46, 3350.
Newey, W. K. and Powell, J. L. (1987) Asymmetric least squares estimation and testing. Economet-
rica, 55, 819847.

See Also
amlnormal, amlbinomial, alaplace1.

Examples
set.seed(1234)
mydat <- data.frame(x = sort(runif(nn <- 200)))
mydat <- transform(mydat, y = rpois(nn, exp(0 - sin(8*x))))
(fit <- vgam(y ~ s(x), fam = amlpoisson(w.aml = c(0.02, 0.2, 1, 5, 50)),
mydat, trace = TRUE))
fit@extra
AR1 39

## Not run:
# Quantile plot
with(mydat, plot(x, jitter(y), col = "blue", las = 1, main =
paste(paste(round(fit@extra$percentile, digits = 1), collapse = ", "),
"percentile-expectile curves")))
with(mydat, matlines(x, fitted(fit), lwd = 2))
## End(Not run)

AR1 Autoregressive Process with Order-1 Family Function

Description

Maximum likelihood estimation of the three-parameter AR-1 model

Usage

AR1(ldrift = "identitylink", lsd = "loge", lvar = "loge", lrho = "rhobit",

idrift = NULL, isd = NULL, ivar = NULL, irho = NULL, imethod = 1,
ishrinkage = 0.95, type.likelihood = c("exact", "conditional"),
type.EIM = c("exact", "approximate"), var.arg = FALSE, nodrift = FALSE,
print.EIM = FALSE, zero = c(if (var.arg) "var" else "sd", "rho"))

Arguments
ldrift, lsd, lvar, lrho
Link functions applied to the scaled mean, standard deviation or variance, and
correlation parameters. The parameter drift is known as the drift, and it is a
scaled mean. See Links for more choices.
idrift, isd, ivar, irho
Optional initial values for the parameters. If failure to converge occurs then
try different values and monitor convergence by using trace = TRUE. For a
S-column response, these arguments can be of length S, and they are recycled
by the columns first. A value NULL means an initial value for each response is
computed internally.
ishrinkage, imethod, zero
See CommonVGAMffArguments for more information. The default for zero as-
sumes there is a drift parameter to be estimated (the default for that argument),
so if a drift parameter is suppressed and there are covariates, then zero will need
to be assigned the value 1 or 2 or NULL.
var.arg Same meaning as uninormal.
nodrift Logical, for determining whether to estimate the drift parameter. The default is
to estimate it. If TRUE, the drift parameter is set to 0 and not estimated.
40 AR1

type.EIM What type of expected information matrix (EIM) is used in Fisher scoring. By
default, this family function calls AR1EIM, which recursively computes the exact
EIM for the AR process with Gaussian white noise. See Porat and Friedlander
(1986) for further details on the exact EIM.
If type.EIM = "approximate" then approximate expression for the EIM of
Autoregressive processes is used; this approach holds when the number of ob-
servations is large enough. Succinct details about the approximate EIM are de-
lineated at Porat and Friedlander (1987).
print.EIM Logical. If TRUE, then the first few EIMs are printed. Here, the result shown is
the sum of each EIM.
type.likelihood
What type of likelihood function is maximized. The first choice (default) is
the sum of the marginal likelihood and the conditional likelihood. Choosing
the conditional likelihood means that the first observation is effectively ignored
(this is handled internally by setting the value of the first prior weight to be some
small positive number, e.g., 1.0e-6). See the note below.

Details
The AR-1 model implemented here has

Y1 N (, 2 /(1 2 )),

and
Yi = + Yi1 + ei ,
where the ei are i.i.d. Normal(0, sd = ) random variates.
Here are a few notes: (1). A test for weak stationarity might be to verify whether 1/ lies outside the
unit circle. (2). The mean of all the Yi is /(1 ) and these are returned as the fitted values. (3).
The correlation of all the Yi with Yi1 is . (4). The default link function ensures that 1 < < 1.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, and vgam.

Warning
Monitoring convergence is urged, i.e., set trace = TRUE.
Moreover, if the exact EIMs are used, set print.EIM = TRUE to compare the computed exact to the
approximate EIM.
Under the VGLM/VGAM approach, parameters can be modelled in terms of covariates. Particu-
larly, if the standard deviation of the white noise is modelled in this way, then type.EIM = "exact"
may certainly lead to unstable results. The reason is that white noise is a stationary process, and
consequently, its variance must remain as a constant. Consequently, the use of variates to model
this parameter contradicts the assumption of stationary random components to compute the exact
EIMs proposed by Porat and Friedlander (1987).
To prevent convergence issues in such cases, this family function internally verifies whether the vari-
ance of the white noise remains as a constant at each Fisher scoring iteration. If this assumption is
AR1 41

violated and type.EIM = "exact" is set, then AR1 automatically shifts to type.EIM = "approximate".
Also, a warning is accordingly displayed.

Note
Multiple responses are handled. The mean is returned as the fitted values.

Author(s)
Victor Miranda (exact method) and Thomas W. Yee (approximate method).

References
Porat, B. and Friedlander, B. (1987) The Exact Cramer-Rao Bond for Gaussian Autoregressive
Processes. IEEE Transactions on Aerospace and Electronic Systems, AES-23(4), 537542.

See Also
AR1EIM, vglm.control, dAR1, arima.sim.

Examples
### Example 1: using arima.sim() to generate a 0-mean stationary time series.
nn <- 500
tsdata <- data.frame(x2 = runif(nn))
ar.coef.1 <- rhobit(-1.55, inverse = TRUE) # Approx -0.65
ar.coef.2 <- rhobit( 1.0, inverse = TRUE) # Approx 0.50
set.seed(1)
tsdata <- transform(tsdata,
index = 1:nn,
TS1 = arima.sim(nn, model = list(ar = ar.coef.1),
sd = exp(1.5)),
TS2 = arima.sim(nn, model = list(ar = ar.coef.2),
sd = exp(1.0 + 1.5 * x2)))

### An autoregressive intercept--only model. ###

### Using the exact EIM, and "nodrift = TRUE" ###
fit1a <- vglm(TS1 ~ 1, data = tsdata, trace = TRUE,
AR1(var.arg = FALSE, nodrift = TRUE,
type.EIM = "exact",
print.EIM = FALSE),
crit = "coefficients")
Coef(fit1a)
summary(fit1a)

## Not run:
### Two responses. Here, the white noise standard deviation of TS2 ###
### is modelled in terms of 'x2'. Also, 'type.EIM = exact'. ###
fit1b <- vglm(cbind(TS1, TS2) ~ x2,
AR1(zero = NULL, nodrift = TRUE,
var.arg = FALSE,
type.EIM = "exact"),
42 AR1EIM

constraints = list("(Intercept)" = diag(4),

"x2" = rbind(0, 0, 1, 0)),
data = tsdata, trace = TRUE, crit = "coefficients")
coef(fit1b, matrix = TRUE)
summary(fit1b)

### Example 2: another stationary time series

nn <- 500
my.rho <- rhobit(1.0, inverse = TRUE)
my.mu <- 1.0
my.sd <- exp(1)
tsdata <- data.frame(index = 1:nn, TS3 = runif(nn))

set.seed(2)
for (ii in 2:nn)
tsdata$TS3[ii] <- my.mu/(1 - my.rho) +
my.rho * tsdata$TS3[ii-1] + rnorm(1, sd = my.sd)
tsdata <- tsdata[-(1:ceiling(nn/5)), ] # Remove the burn-in data:

### Fitting an AR(1). The exact EIMs are used.

fit2a <- vglm(TS3 ~ 1, AR1(type.likelihood = "exact", # "conditional",
type.EIM = "exact"),
data = tsdata, trace = TRUE, crit = "coefficients")

Coef(fit2a)
summary(fit2a) # SEs are useful to know

Coef(fit2a)["rho"] # Estimate of rho, for intercept-only models

my.rho # The 'truth' (rho)
Coef(fit2a)["drift"] # Estimate of drift, for intercept-only models
my.mu /(1 - my.rho) # The 'truth' (drift)

## End(Not run)

AR1EIM Computation of the Exact EIM of an Order-1 Autoregressive Process

Description

Computation of the exact Expected Information Matrix of the Autoregressive process of order-1
(AR(1)) with Gaussian white noise and stationary random components.

Usage

AR1EIM(x = NULL, var.arg = NULL, p.drift = NULL,

WNsd = NULL, ARcoeff1 = NULL, eps.porat = 1e-2)
AR1EIM 43

Arguments
x A vector of quantiles. The gaussian time series for which the EIMs are com-
puted.
If multiple time series are being analyzed, then x must be a matrix where each
column allocates a response. That is, the number of columns (denoted as N OS)
must match the number of responses.
var.arg Logical. Same as with AR1.
p.drift A numeric vector with the scaled mean(s) (commonly referred as drift) of the
AR process(es) in turn. Its length matches the number of responses.
WNsd, ARcoeff1 Matrices. The standard deviation of the white noise, and the correlation (coeffi-
cient) of the AR(1) model, for each observation.
That is, the dimension for each matrix is N N OS, where N is the number of
observations and N OS is the number of responses. Else, these arguments are
recycled.
eps.porat A very small positive number to test whether the standar deviation (WNsd) is
close enough to its value estimated in this function.
See below for further details.

Details
This function implements the algorithm of Porat and Friedlander (1986) to recursively compute the
exact expected information matrix (EIM) of Gaussian time series with stationary random compo-
nents.
By default, when the VGLM/VGAM family function AR1 is used to fit an AR(1) model via vglm,
Fisher scoring is executed using the approximate EIM for the AR process. However, this model
can also be fitted using the exact EIMs computed by AR1EIM.
Given N consecutive data points, y0 , y1 , . . . , yN 1 with probability density f (y), the Porat and
Friedlander algorithm calculates the EIMs [Jn1 ()], for all 1 n N . This is done based on
the Levinson-Durbin algorithm for computing the orthogonal polynomials of a Toeplitz matrix. In
particular, for the AR(1) model, the vector of parameters to be estimated under the VGAM/VGLM
approach is

= ( , loge( 2 ), rhobit()),

where 2 is the variance of the white noise and mu is the drift parameter (See AR1 for further
details on this).
Consequently, for each observation n = 1, . . . , N , the EIM, Jn (), has dimension 3 3, where the
diagonal elements are:

J[n,1,1] = E[ 2 log f (y)/( )2 ],

J[n,2,2] = E[ 2 log f (y)/( 2 )2 ],

and
44 AR1EIM

J[n,3,3] = E[ 2 log f (y)/()2 ].

As for the off-diagonal elements, one has the usual entries, i.e.,
J[n,1,2] = J[n,2,1] = E[ 2 log f (y)/ 2 ],
etc.
If var.arg = FALSE, then instead of 2 is estimated. Therefore, J[n,2,2] , J[n,1,2] , etc., are
correspondingly replaced.
Once these expected values are internally computed, they are returned in an array of dimension
N 1 6, of the form

J[, 1, ] = [J[,1,1] , J[,2,2] , J[,3,3] , J[,1,2] , J[,2,3] , J[,1,3] ].

AR1EIM handles multiple time series, say N OS. If this happens, then it accordingly returns an array
of dimension N N OS 6. Here, J[, k, ], for k = 1, . . . , N OS, is a matrix of dimension N 6,
which stores the EIMs for the k th th response, as above, i.e.,

J[, k, ] = [J[,1,1] , J[,2,2] , J[,3,3] , . . .],

the bandwith form, as per required by AR1.

Value
An array of dimension N N OS 6, as above.
This array stores the EIMs calculated from the joint density as a function of
= ( , 2 , ).

Nevertheless, note that, under the VGAM/VGLM approach, the EIMs must be correspondingly
calculated in terms of the linear predictors, .

Asymptotic behaviour of the algorithm

For large enough n, the EIMs, Jn (), become approximately linear in n. That is, for some n0 ,

Jn () Jn0 () + (n n0 )J(), ()

where J() is a constant matrix.
This relationsihip is internally considered if a proper value of n0 is determined. Different ways can
be adopted to find n0 . In AR1EIM, this is done by checking the difference between the internally
estimated variances and the entered ones at WNsd. If this difference is less than eps.porat at some
iteration, say at iteration n0 , then AR1EIM takes J()
as the last computed increment of Jn (), and
extraplotates Jk (), for all k n0 using (). Else, the algorithm will complete the iterations for
1 n N.
Finally, note that the rate of convergence reasonably decreases if the asymptotic relationship () is
used to compute Jk (), k n0 . Normally, the number of operations involved on this algorithm is
proportional to N 2 .
See Porat and Friedlander (1986) for full details on the asymptotic behaviour of the algorithm.
AR1EIM 45

Warning
Arguments WNsd, and ARcoeff1 are matrices of dimension N N OS. Else, these arguments are
accordingly recycled.

Note
For simplicity, one can assume that the time series analyzed has a 0-mean. Consequently, where the
family function AR1 calls AR1EIM to compute the EIMs, the argument p.drift is internally set to
zero-vector, whereas x is centered by subtracting its mean value.

Author(s)
V. Miranda and T. W. Yee.

References
Porat, B. and Friedlander, B. (1986) Computation of the Exact Information Matrix of Gaussian
Time Series with Stationary Random Components. IEEE Transactions on Acoustics, Speech, and
Signal Processing, 54(1), 118130.

### Generate two (zero-mean) AR(1) processes ###

ts1 <- p.drift[1]/(1 - ARcoeff1[1]) +
arima.sim(model = list(ar = ARcoeff1[1]), n = nn,
sd = WNsd[1])
ts2 <- p.drift[2]/(1 - ARcoeff1[2]) +
arima.sim(model = list(ar = ARcoeff1[2]), n = nn,
sd = WNsd[2])

ARdata <- matrix(cbind(ts1, ts2), ncol = 2)

### Compute the exact EIMs: TWO responses. ###

ExactEIM <- AR1EIM(x = ARdata, var.arg = FALSE, p.drift = p.drift,
WNsd = WNsd, ARcoeff1 = ARcoeff1)

### For response 1:

head(ExactEIM[, 1 ,]) # NOTICE THAT THIS IS A (nn x 6) MATRIX!

### For response 2:

46 auuc

head(ExactEIM[, 2 ,]) # NOTICE THAT THIS IS A (nn x 6) MATRIX!

auuc Auckland University Undergraduate Counts Data

Description
Undergraduate student enrolments at the University of Auckland in 1990.

Usage
data(auuc)

Format
A data frame with 4 observations on the following 5 variables.

Commerce a numeric vector of counts.

Arts a numeric vector of counts.
SciEng a numeric vector of counts.
Law a numeric vector of counts.
Medicine a numeric vector of counts.

Details
Each student is cross-classified by their colleges (Science and Engineering have been combined)
and the socio-economic status (SES) of their fathers (1 = highest, down to 4 = lowest).

Source
Dr Tony Morrison.

References
Wild, C. J. and Seber, G. A. F. (2000) Chance Encounters: A First Course in Data Analysis and
Inference, New York: Wiley.

Examples
auuc
## Not run:
round(fitted(grc(auuc)))
round(fitted(grc(auuc, Rank = 2)))

## End(Not run)
aux.posbernoulli.t 47

aux.posbernoulli.t Auxiliary Function for the Positive Bernoulli Family Function with
Time Effects

Description

Returns behavioural effects indicator variables from a capture history matrix.

Usage

aux.posbernoulli.t(y, check.y = FALSE, rename = TRUE, name = "bei")

Arguments

y Capture history matrix. Rows are animals, columns are sampling occasions, and
values should be 0s and 1s only.
check.y Logical, if TRUE then some basic checking is performed.
rename, name If rename = TRUE then the behavioural effects indicator are named using the
value of name as the prefix. If FALSE then use the same column names as y.

Details

This function can help fit certain capturerecapture models (commonly known as Mtb or Mtbh (no
prefix h means it is an intercept-only model) in the literature). See posbernoulli.t for details.

Value

A list with the following components.

cap.hist1 A matrix the same dimension as y. In any particular row there are 0s up to the first
capture. Then there are 1s thereafter.
cap1 A vector specifying which time occasion the animal was first captured.
y0i Number of noncaptures before the first capture.
yr0i Number of noncaptures after the first capture.
yr1i Number of recaptures after the first capture.

See Also

posbernoulli.t, deermice.
48 backPain

Examples
# Fit a M_tbh model to the deermice data:
(pdata <- aux.posbernoulli.t(with(deermice, cbind(y1, y2, y3, y4, y5, y6))))

deermice <- data.frame(deermice,

bei = 0, # Add this
pdata$cap.hist1) # Incorporate these
head(deermice) # Augmented with behavioural effect indicator variables
tail(deermice)

backPain Data on Back Pain Prognosis, from Anderson (1984)

Description
Data from a study of patients suffering from back pain. Prognostic variables were recorded at
presentation and progress was categorised three weeks after treatment.

Usage
data(backPain)

Format
A data frame with 101 observations on the following 4 variables.
x1 length of previous attack.
x2 pain change.
x3 lordosis.
pain an ordered factor describing the progress of each patient with levels worse < same < slight.improvement
< moderate.improvement < marked.improvement < complete.relief.

Source
http://ideas.repec.org/c/boc/bocode/s419001.html
The data set and this help file was copied from gnm so that a vignette in VGAM could be run; the
analysis is described in Yee (2010).

References
Anderson, J. A. (1984) Regression and Ordered Categorical Variables. J. R. Statist. Soc. B, 46(1),
1-30.
Yee, T. W. (2010) The VGAM package for categorical data analysis. Journal of Statistical Software,
32, 134. http://www.jstatsoft.org/v32/i10/.

Examples
summary(backPain)
beggs 49

beggs Bacon and Eggs Data

Description
Purchasing of bacon and eggs.

Usage
data(beggs)

Format
Data frame of a two way table.
b0, b1, b2, b3, b4 The b refers to bacon. The number of times bacon was purchased was 0, 1, 2, 3,
or 4.
e0, e1, e2, e3, e4 The e refers to eggs. The number of times eggs was purchased was 0, 1, 2, 3, or
4.

Details
The data is from Information Resources, Inc., a consumer panel based in a large US city [see
Bell and Lattin (1998) for further details]. Starting in June 1991, the purchases in the bacon and
fresh eggs product categories for a sample of 548 households over four consecutive store trips was
tracked. Only those grocery shopping trips with a total basket value of at least five dollars was
considered. For each household, the total number of bacon purchases in their four eligible shopping
trips and the total number of egg purchases (usually a package of eggs) for the same trips, were
counted.

Source
Bell, D. R. and Lattin, J. M. (1998) Shopping Behavior and Consumer Preference for Store Price
Format: Why Large Basket Shoppers Prefer EDLP. Marketing Science, 17, 6688.

References
Danaher, P. J. and Hardie, B. G. S. (2005) Bacon with Your Eggs? Applications of a New Bivariate
Beta-Binomial Distribution. American Statistician, 59(4), 282286.

Benford Benfords Distribution

Description
Density, distribution function, quantile function, and random generation for Benfords distribution.

Usage
dbenf(x, ndigits = 1, log = FALSE)
pbenf(q, ndigits = 1, lower.tail = TRUE, log.p = FALSE)
qbenf(p, ndigits = 1, lower.tail = TRUE, log.p = FALSE)
rbenf(n, ndigits = 1)

Arguments
x, q Vector of quantiles. See ndigits.
p vector of probabilities.
n number of observations. A single positive integer. Else if length(n) > 1 then
the length is taken to be the number required.
ndigits Number of leading digits, either 1 or 2. If 1 then the support of the distribution
is {1,. . . ,9}, else {10,. . . ,99}.
log, log.p Logical. If log.p = TRUE then all probabilities p are given as log(p).
lower.tail Same meaning as in pnorm or qnorm.

Details
Benfords Law (aka the significant-digit law) is the empirical observation that in many naturally
occuring tables of numerical data, the leading significant (nonzero) digit is not uniformly distributed
in {1, 2, . . . , 9}. Instead, the leading significant digit (= D, say) obeys the law

1
P (D = d) = log10 1 +
d

for d = 1, . . . , 9. This means the probability the first significant digit is 1 is approximately 0.301,
etc.
Benfords Law was apparently first discovered in 1881 by astronomer/mathematician S. Newcombe.
It started by the observation that the pages of a book of logarithms were dirtiest at the beginning
and progressively cleaner throughout. In 1938, a General Electric physicist called F. Benford re-
discovered the law on this same observation. Over several years he collected data from different
sources as different as atomic weights, baseball statistics, numerical data from Readers Digest, and
drainage areas of rivers.
Applications of Benfords Law has been as diverse as to the area of fraud detection in accounting
and the design computers.
Benini 51

Value
dbenf gives the density, pbenf gives the distribution function, and qbenf gives the quantile func-
tion, and rbenf generates random deviates.

Author(s)
T. W. Yee and Kai Huang

References
Benford, F. (1938) The Law of Anomalous Numbers. Proceedings of the American Philosophical
Society, 78, 551572.
Newcomb, S. (1881) Note on the Frequency of Use of the Different Digits in Natural Numbers.
American Journal of Mathematics, 4, 3940.

Examples
dbenf(x <- c(0:10, NA, NaN, -Inf, Inf))
pbenf(x)

## Not run:
xx <- 1:9
barplot(dbenf(xx), col = "lightblue", las = 1, xlab = "Leading digit",
ylab = "Probability", names.arg = as.character(xx),
main = paste("Benford's distribution", sep = ""))

hist(rbenf(n = 1000), border = "blue", prob = TRUE,

main = "1000 random variates from Benford's distribution",
xlab = "Leading digit", sub="Red is the true probability",
breaks = 0:9 + 0.5, ylim = c(0, 0.35), xlim = c(0, 10.0))
lines(xx, dbenf(xx), col = "red", type = "h")
points(xx, dbenf(xx), col = "red")

## End(Not run)

Benini The Benini Distribution

Description
Density, distribution function, quantile function and random generation for the Benini distribution
with parameter shape.

Usage
dbenini(x, y0, shape, log = FALSE)
pbenini(q, y0, shape, lower.tail = TRUE, log.p = FALSE)
qbenini(p, y0, shape, lower.tail = TRUE, log.p = FALSE)
rbenini(n, y0, shape)
52 Benini

Arguments
x, q vector of quantiles.
p vector of probabilities.
n number of observations. Same as runif.
y0 the scale parameter y0 .
shape the shape parameter b.
log Logical. If log = TRUE then the logarithm of the density is returned.
lower.tail, log.p
Same meaning as in pnorm or qnorm.

Details
See benini1, the VGAM family function for estimating the parameter s by maximum likelihood
estimation, for the formula of the probability density function and other details.

Value
dbenini gives the density, pbenini gives the distribution function, qbenini gives the quantile
function, and rbenini generates random deviates.

Author(s)
T. W. Yee and Kai Huang

References
Kleiber, C. and Kotz, S. (2003) Statistical Size Distributions in Economics and Actuarial Sciences,
Hoboken, NJ, USA: Wiley-Interscience.

benini1 Benini Distribution Family Function

Description
Estimating the 1-parameter Benini distribution by maximum likelihood estimation.

Usage
benini1(y0 = stop("argument 'y0' must be specified"), lshape = "loge",
ishape = NULL, imethod = 1, zero = NULL)

Arguments
y0 Positive scale parameter.
lshape Parameter link function and extra argument of the parameter b, which is the
shape parameter. See Links for more choices. A log link is the default because
b is positive.
ishape Optional initial value for the shape parameter. The default is to compute the
value internally.
imethod, zero Details at CommonVGAMffArguments.

Details
The Benini distribution has a probability density function that can be written
f (y) = 2s exp(s[(log(y/y0 ))2 ]) log(y/y0 )/y
for 0 < y0 < y, and shape s > 0. The cumulative distribution function for Y is
F (y) = 1 exp(s[(log(y/y0 ))2 ]).
Here, Newton-Raphson and Fisher scoring coincide. The median of Y is now returned as the fitted
values, by default. This VGAM family function can handle a multiple responses, which is inputted
as a matrix.
On fitting, the extra slot has a component called y0 which contains the value of the y0 argument.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, and vgam.

Note
Yet to do: the 2-parameter Benini distribution estimates another shape parameter a too. Hence, the
code may change in the future.
54 Betabinom

Author(s)
T. W. Yee

References
Kleiber, C. and Kotz, S. (2003) Statistical Size Distributions in Economics and Actuarial Sciences,
Hoboken, NJ, USA: Wiley-Interscience.

Betabinom The Beta-Binomial Distribution

Description
Density, distribution function, and random generation for the beta-binomial distribution and the
inflated beta-binomial distribution.

Usage
dbetabinom(x, size, prob, rho = 0, log = FALSE)
pbetabinom(q, size, prob, rho, log.p = FALSE)
rbetabinom(n, size, prob, rho = 0)
dbetabinom.ab(x, size, shape1, shape2, log = FALSE, Inf.shape = exp(20),
limit.prob = 0.5)
pbetabinom.ab(q, size, shape1, shape2, log.p = FALSE)
rbetabinom.ab(n, size, shape1, shape2, .dontuse.prob = NULL)
dzoibetabinom(x, size, prob, rho = 0, pstr0 = 0, pstrsize = 0, log = FALSE)
pzoibetabinom(q, size, prob, rho, pstr0 = 0, pstrsize = 0,
lower.tail = TRUE, log.p = FALSE)
rzoibetabinom(n, size, prob, rho = 0, pstr0 = 0, pstrsize = 0)
dzoibetabinom.ab(x, size, shape1, shape2, pstr0 = 0, pstrsize = 0, log = FALSE)
pzoibetabinom.ab(q, size, shape1, shape2, pstr0 = 0, pstrsize = 0,
lower.tail = TRUE, log.p = FALSE)
rzoibetabinom.ab(n, size, shape1, shape2, pstr0 = 0, pstrsize = 0)
Betabinom 55

Arguments
x, q vector of quantiles.
size number of trials.
n number of observations. Same as runif.
prob the probability of success . Must be in the unit closed interval [0, 1].
rho the correlation parameter . Usually must be in the unit open interval (0, 1),
however, the value 0 is sometimes supported (if so then it corresponds to the
usual binomial distribution).
shape1, shape2 the two (positive) shape parameters of the standard beta distribution. They are
called a and b in beta respectively.
log, log.p, lower.tail
Same meaning as runif.
Inf.shape Numeric. A large value such that, if shape1 or shape2 exceeds this, then special
measures are taken, e.g., calling dbinom. Also, if shape1 or shape2 is less than
its reciprocal, then special measures are also taken. This feature/approximation
is needed to avoid numerical problem with catastrophic cancellation of multiple
lbeta calls.
limit.prob If either shape parameters are Inf then the binomial limit is taken, with shape1 / (shape1 + shape2)
as the probability of success. In the case where both are Inf this probability will
be a NaN = Inf/Inf, however, the value limit.prob is used instead. Hence the
default is to assume that both shape parameters are equal as the limit is taken.
Purists may assign NaN to this argument.
.dontuse.prob An argument that should be ignored and unused.
pstr0 Probability of a structual zero (i.e., ignoring the beta-binomial distribution). The
default value of pstr0 corresponds to the response having a beta-binomial dis-
tribuion inflated only at size.
pstrsize Probability of a structual maximum value size. The default value of pstrsize
corresponds to the response having a beta-binomial distribution inflated only at
0.

Details
The beta-binomial distribution is a binomial distribution whose probability of success is not a con-
stant but it is generated from a beta distribution with parameters shape1 and shape2. Note that the
mean of this beta distribution is mu = shape1/(shape1+shape2), which therefore is the mean or
the probability of success.
See betabinomial and betabinomialff, the VGAM family functions for estimating the parame-
ters, for the formula of the probability density function and other details.
For the inflated beta-binomial distribution, the probability mass function is
P (Y = y) = (1 pstr0 pstrsize) BB(y) + pstr0 I[y = 0] + pstrsize I[y = size]

where BB(y) is the probability mass function of the beta-binomial distribution with the same shape
parameters (pbetabinom.ab), pstr0 is the inflated probability at 0 and pstrsize is the inflated
probability at 1. The default values of pstr0 and pstrsize mean that these functions behave like
the ordinary Betabinom when only the essential arguments are inputted.
56 Betabinom

Value
dbetabinom and dbetabinom.ab give the density, pbetabinom and pbetabinom.ab give the dis-
tribution function, and rbetabinom and rbetabinom.ab generate random deviates.
dzoibetabinom and dzoibetabinom.ab give the inflated density, pzoibetabinom and pzoibetabinom.ab
give the inflated distribution function, and rzoibetabinom and rzoibetabinom.ab generate ran-
dom inflated deviates.

Note
pzoibetabinom, pzoibetabinom.ab, pbetabinom and pbetabinom.ab can be particularly slow.
The functions here ending in .ab are called from those functions which dont. The simple trans-
formations = /( + ) and = 1/(1 + + ) are used, where and are the two shape
parameters.

Author(s)
T. W. Yee and Xiangjie Xue

See Also
betabinomial, betabinomialff, Zoabeta.

Examples
set.seed(1); rbetabinom(10, 100, prob = 0.5)
set.seed(1); rbinom(10, 100, prob = 0.5) # The same since rho = 0

## Not run: N <- 9; xx <- 0:N; s1 <- 2; s2 <- 3

dy <- dbetabinom.ab(xx, size = N, shape1 = s1, shape2 = s2)
barplot(rbind(dy, dbinom(xx, size = N, prob = s1 / (s1+s2))),
beside = TRUE, col = c("blue","green"), las = 1,
main = paste("Beta-binomial (size=",N,", shape1=", s1,
", shape2=", s2, ") (blue) vs\n",
" Binomial(size=", N, ", prob=", s1/(s1+s2), ") (green)", sep = ""),
names.arg = as.character(xx), cex.main = 0.8)
sum(dy * xx) # Check expected values are equal
sum(dbinom(xx, size = N, prob = s1 / (s1+s2)) * xx)
cumsum(dy) - pbetabinom.ab(xx, N, shape1 = s1, shape2 = s2) # Should be all 0

y <- rbetabinom.ab(n = 1e4, size = N, shape1 = s1, shape2 = s2)

ty <- table(y)
barplot(rbind(dy, ty / sum(ty)),
beside = TRUE, col = c("blue", "orange"), las = 1,
main = paste("Beta-binomial (size=", N, ", shape1=", s1,
", shape2=", s2, ") (blue) vs\n",
" Random generated beta-binomial(size=", N, ", prob=", s1/(s1+s2),
") (orange)", sep = ""), cex.main = 0.8,
names.arg = as.character(xx))

N <- 1e5; size <- 20; pstr0 <- 0.2; pstrsize <- 0.2
kk <- rzoibetabinom.ab(N, size, s1, s2, pstr0, pstrsize)
betabinomial 57

hist(kk, probability = TRUE, border = "blue", ylim = c(0, 0.25),

main = "Blue/green = inflated; orange = ordinary beta-binomial",
breaks = -0.5 : (size + 0.5))
sum(kk == 0) / N # Proportion of 0
sum(kk == size) / N # Proportion of size
lines(0 : size,
dbetabinom.ab(0 : size, size, s1, s2), col = "orange")
lines(0 : size, col = "green", type = "b",
dzoibetabinom.ab(0 : size, size, s1, s2, pstr0, pstrsize))

## End(Not run)

betabinomial Beta-binomial Distribution Family Function

Description
Fits a beta-binomial distribution by maximum likelihood estimation. The two parameters here are
the mean and correlation coefficient.

Usage
betabinomial(lmu = "logit", lrho = "logit", irho = NULL, imethod = 1,
ishrinkage = 0.95, nsimEIM = NULL, zero = "rho")

Arguments
lmu, lrho Link functions applied to the two parameters. See Links for more choices. The
defaults ensure the parameters remain in (0, 1), however, see the warning below.
irho Optional initial value for the correlation parameter. If given, it must be in (0, 1),
and is recyled to the necessary length. Assign this argument a value if a conver-
gence failure occurs. Having irho = NULL means an initial value is obtained
internally, though this can give unsatisfactory results.
imethod An integer with value 1 or 2 or . . . , which specifies the initialization method for
. If failure to converge occurs try the another value and/or else specify a value
for irho.
zero Specifyies which linear/additive predictor is to be modelled as an intercept only.
If assigned, the single value can be either 1 or 2. The default is to have a single
correlation parameter. To model both parameters as functions of the covariates
assign zero = NULL. See CommonVGAMffArguments for more information.
ishrinkage, nsimEIM
See CommonVGAMffArguments for more information. The argument ishrinkage
is used only if imethod = 2. Using the argument nsimEIM may offer large ad-
vantages for large values of N and/or large data sets.
58 betabinomial

Details
There are several parameterizations of the beta-binomial distribution. This family function directly
models the mean and correlation parameter, i.e., the probability of success. The model can be
written T |P = p Binomial(N, p) where P has a beta distribution with shape parameters and
. Here, N is the number of trials (e.g., litter size), T = N Y is the number of successes, and p is
the probability of a success (e.g., a malformation). That is, Y is the proportion of successes. Like
binomialff, the fitted values are the estimated probability of success (i.e., E[Y ] and not E[T ]) and
the prior weights N are attached separately on the object in a slot.
The probability function is

N B( + t, + N t)
P (T = t) =
t B(, )

where t = 0, 1, . . . , N , and B is the beta function with shape parameters and . Recall Y =
T /N is the real response being modelled.
The default model is 1 = logit() and 2 = logit() because both parameters lie between 0 and 1.
The mean (of Y ) is p = = /(+) and the variance (of Y ) is (1)(1+(N 1))/N . Here,
the correlation is given by 1/(1 + + ) and is the correlation between the N individuals within a
litter. A litter effect is typically reflected by a positive value of . It is known as the over-dispersion
parameter.
This family function uses Fisher scoring. Elements of the second-order expected derivatives with
respect to and are computed numerically, which may fail for large , , N or else take a long
time.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm.
Suppose fit is a fitted beta-binomial model. Then fit@y contains the sample proportions y,
fitted(fit) returns estimates of E(Y ), and weights(fit, type="prior") returns the num-
ber of trials N .

Warning
If the estimated rho parameter is close to 0 then it pays to try lrho = "rhobit". One day this may
become the default link function.
This family function is prone to numerical difficulties due to the expected information matrices
not being positive-definite or ill-conditioned over some regions of the parameter space. If problems
occur try setting irho to some numerical value, nsimEIM = 100, say, or else use etastart argument
of vglm, etc.

The extended beta-binomial distribution of Prentice (1986) is currently not implemented in the
VGAM package as it has range-restrictions for the correlation parameter that are currently too
difficult to handle in this package. However, try lrho = "rhobit".

Author(s)
T. W. Yee

References
Moore, D. F. and Tsiatis, A. (1991) Robust estimation of the variance in moment methods for
extra-binomial and extra-Poisson variation. Biometrics, 47, 383401.
Prentice, R. L. (1986) Binary regression using an extended beta-binomial distribution, with discus-
sion of correlation induced by covariate measurement errors. Journal of the American Statistical
Association, 81, 321327.

See Also
betabinomialff, Betabinom, binomialff, betaff, dirmultinomial, lirat, simulate.vlm.

Examples
# Example 1
bdata <- data.frame(N = 10, mu = 0.5, rho = 0.8)
bdata <- transform(bdata,
y = rbetabinom(n = 100, size = N, prob = mu, rho = rho))
fit <- vglm(cbind(y, N-y) ~ 1, betabinomial, data = bdata, trace = TRUE)
coef(fit, matrix = TRUE)
Coef(fit)
head(cbind(depvar(fit), weights(fit, type = "prior")))

# Example 2
fit <- vglm(cbind(R, N-R) ~ 1, betabinomial, lirat,
trace = TRUE, subset = N > 1)
coef(fit, matrix = TRUE)
Coef(fit)
t(fitted(fit))
t(depvar(fit))
t(weights(fit, type = "prior"))

# Example 3, which is more complicated

lirat <- transform(lirat, fgrp = factor(grp))
summary(lirat) # Only 5 litters in group 3
fit2 <- vglm(cbind(R, N-R) ~ fgrp + hb, betabinomial(zero = 2),
data = lirat, trace = TRUE, subset = N > 1)
coef(fit2, matrix = TRUE)
## Not run: with(lirat, plot(hb[N > 1], fit2@misc$rho,
xlab = "Hemoglobin", ylab = "Estimated rho",
pch = as.character(grp[N > 1]), col = grp[N > 1]))
## End(Not run)
60 betabinomialff

## Not run: # cf. Figure 3 of Moore and Tsiatis (1991)

with(lirat, plot(hb, R / N, pch = as.character(grp), col = grp, las = 1,
xlab = "Hemoglobin level", ylab = "Proportion Dead",
main = "Fitted values (lines)"))
smalldf <- with(lirat, lirat[N > 1, ])
for (gp in 1:4) {
xx <- with(smalldf, hb[grp == gp])
yy <- with(smalldf, fitted(fit2)[grp == gp])
ooo <- order(xx)
lines(xx[ooo], yy[ooo], col = gp)
}
## End(Not run)

betabinomialff Beta-binomial Distribution Family Function

Description

Fits a beta-binomial distribution by maximum likelihood estimation. The two parameters here are
the shape parameters of the underlying beta distribution.

Usage

betabinomialff(lshape1 = "loge", lshape2 = "loge", ishape1 = 1,

ishape2 = NULL, imethod = 1, ishrinkage = 0.95,
nsimEIM = NULL, zero = NULL)

Arguments
lshape1, lshape2
Link functions for the two (positive) shape parameters of the beta distribution.
See Links for more choices.
ishape1, ishape2
Initial value for the shape parameters. The first must be positive, and is recyled
to the necessary length. The second is optional. If a failure to converge occurs,
try assigning a different value to ishape1 and/or using ishape2.
zero Can be an integer specifying which linear/additive predictor is to be modelled
as an intercept only. If assigned, the single value should be either 1 or 2. The
default is to model both shape parameters as functions of the covariates. If a
failure to converge occurs, try zero = 2. See CommonVGAMffArguments for
more information.
ishrinkage, nsimEIM, imethod
See CommonVGAMffArguments for more information. The argument ishrinkage
is used only if imethod = 2. Using the argument nsimEIM may offer large ad-
vantages for large values of N and/or large data sets.
betabinomialff 61

Details
There are several parameterizations of the beta-binomial distribution. This family function directly
models the two shape parameters of the associated beta distribution rather than the probability of
success (however, see Note below). The model can be written T |P = p Binomial(N, p) where
P has a beta distribution with shape parameters and . Here, N is the number of trials (e.g.,
litter size), T = N Y is the number of successes, and p is the probability of a success (e.g., a
malformation). That is, Y is the proportion of successes. Like binomialff, the fitted values are
the estimated probability of success (i.e., E[Y ] and not E[T ]) and the prior weights N are attached
separately on the object in a slot.
The probability function is

N B( + t, + N t)
P (T = t) =
t B(, )

where t = 0, 1, . . . , N , and B is the beta function with shape parameters and . Recall Y = T /N
is the real response being modelled.
The default model is 1 = log() and 2 = log() because both parameters are positive. The mean
(of Y ) is p = = /( + ) and the variance (of Y ) is (1 )(1 + (N 1))/N . Here, the
correlation is given by 1/(1 + + ) and is the correlation between the N individuals within a
litter. A litter effect is typically reflected by a positive value of . It is known as the over-dispersion
parameter.
This family function uses Fisher scoring. The two diagonal elements of the second-order expected
derivatives with respect to and are computed numerically, which may fail for large , , N or
else take a long time.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm.
Suppose fit is a fitted beta-binomial model. Then fit@y (better: depvar(fit)) contains the sam-
ple proportions y, fitted(fit) returns estimates of E(Y ), and weights(fit, type = "prior")
returns the number of trials N .

Warning
This family function is prone to numerical difficulties due to the expected information matrices not
being positive-definite or ill-conditioned over some regions of the parameter space. If problems
occur try setting ishape1 to be some other positive value, using ishape2 and/or setting zero = 2.
This family function may be renamed in the future. See the warnings in betabinomial.

Note
This function processes the input in the same way as binomialff. But it does not handle the case
N = 1 very well because there are two parameters to estimate, not one, for each row of the input.
Cases where N = 1 can be omitted via the subset argument of vglm.
Although the two linear/additive predictors given above are in terms of and , basic algebra shows
that the default amounts to fitting a logit link to the probability of success; subtracting the second
62 betabinomialff

linear/additive predictor from the first gives that logistic regression linear/additive predictor. That
is, logit(p) = 1 2 . This is illustated in one of the examples below.
The extended beta-binomial distribution of Prentice (1986) is currently not implemented in the
VGAM package as it has range-restrictions for the correlation parameter that are currently too
difficult to handle in this package.

Author(s)
T. W. Yee

See Also
betabinomial, Betabinom, binomialff, betaff, dirmultinomial, lirat, simulate.vlm.

Examples
# Example 1
N <- 10; s1 <- exp(1); s2 <- exp(2)
y <- rbetabinom.ab(n = 100, size = N, shape1 = s1, shape2 = s2)
fit <- vglm(cbind(y, N-y) ~ 1, betabinomialff, trace = TRUE)
coef(fit, matrix = TRUE)
Coef(fit)
head(fit@misc$rho) # The correlation parameter
head(cbind(depvar(fit), weights(fit, type = "prior")))

# Example 2
fit <- vglm(cbind(R, N-R) ~ 1, betabinomialff, data = lirat,
trace = TRUE, subset = N > 1)
coef(fit, matrix = TRUE)
Coef(fit)
fit@misc$rho # The correlation parameter
t(fitted(fit))
t(depvar(fit))
t(weights(fit, type = "prior"))
# A "loge" link for the 2 shape parameters is a logistic regression:
all.equal(c(fitted(fit)),
as.vector(logit(predict(fit)[, 1] -
predict(fit)[, 2], inverse = TRUE)))

# Example 3, which is more complicated

lirat <- transform(lirat, fgrp = factor(grp))
betaff 63

summary(lirat) # Only 5 litters in group 3

fit2 <- vglm(cbind(R, N-R) ~ fgrp + hb, betabinomialff(zero = 2),
data = lirat, trace = TRUE, subset = N > 1)
coef(fit2, matrix = TRUE)
coef(fit2, matrix = TRUE)[, 1] -
coef(fit2, matrix = TRUE)[, 2] # logit(p)
## Not run: with(lirat, plot(hb[N > 1], fit2@misc$rho,
xlab = "Hemoglobin", ylab = "Estimated rho",
pch = as.character(grp[N > 1]), col = grp[N > 1]))
## End(Not run)
## Not run: # cf. Figure 3 of Moore and Tsiatis (1991)
with(lirat, plot(hb, R / N, pch = as.character(grp), col = grp, las = 1,
xlab = "Hemoglobin level", ylab = "Proportion Dead",
main = "Fitted values (lines)"))

smalldf <- with(lirat, lirat[N > 1, ])

for (gp in 1:4) {
xx <- with(smalldf, hb[grp == gp])
yy <- with(smalldf, fitted(fit2)[grp == gp])
ooo <- order(xx)
lines(xx[ooo], yy[ooo], col = gp)
}
## End(Not run)

betaff The Two-parameter Beta Distribution Family Function

Description
Estimation of the mean and precision parameters of the beta distribution.

Usage
betaff(A = 0, B = 1, lmu = "logit", lphi = "loge",
imu = NULL, iphi = NULL,
gprobs.y = ppoints(8), gphi = exp(-3:5)/4, zero = NULL)

Arguments
A, B Lower and upper limits of the distribution. The defaults correspond to the stan-
dard beta distribution where the response lies between 0 and 1.
lmu, lphi Link function for the mean and precision parameters. The values A and B are
extracted from the min and max arguments of extlogit. Consequently, only
extlogit is allowed.
imu, iphi Optional initial value for the mean and precision parameters respectively. A
NULL value means a value is obtained in the initialize slot.
gprobs.y, gphi, zero
See CommonVGAMffArguments for more information.
64 betaff

Details
The two-parameter beta distribution can be written f (y) =

(y A)1 1 (B y)(11 )1 /[beta(1 , (1 1 )) (B A)1 ]

for A < y < B, and beta(., .) is the beta function (see beta). The parameter 1 satisfies 1 =
(A)/(BA) where is the mean of Y . That is, 1 is the mean of of a standard beta distribution:
E(Y ) = A + (B A) 1 , and these are the fitted values of the object. Also, is positive and
A < < B. Here, the limits A and B are known.
Another parameterization of the beta distribution involving the raw shape parameters is imple-
mented in betaR.
For general A and B, the variance of Y is (B A)2 1 (1 1 )/(1 + ). Then can be
interpreted as a precision parameter in the sense that, for fixed , the larger the value of , the
smaller the variance of Y . Also, 1 = shape1/(shape1 + shape2) and = shape1 + shape2.
Fisher scoring is implemented.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, and vgam.

Note
The response must have values in the interval (A, B). The user currently needs to manually choose
lmu to match the input of arguments A and B, e.g., with extlogit; see the example below.

Author(s)
Thomas W. Yee

References
Ferrari, S. L. P. and Francisco C.-N. (2004) Beta regression for modelling rates and proportions.
Journal of Applied Statistics, 31, 799815.

See Also
betaR,
Beta, dzoabeta, genbetaII, betaII, betabinomialff, betageometric, betaprime, rbetageom,
rbetanorm, kumar, extlogit, simulate.vlm.

Examples
bdata <- data.frame(y = rbeta(nn <- 1000, shape1 = exp(0), shape2 = exp(1)))
fit1 <- vglm(y ~ 1, betaff, data = bdata, trace = TRUE)
coef(fit1, matrix = TRUE)
Coef(fit1) # Useful for intercept-only models

# General A and B, and with a covariate

Betageom 65

bdata <- transform(bdata, x2 = runif(nn))

bdata <- transform(bdata, mu = logit(0.5 - x2, inverse = TRUE),
prec = exp(3.0 + x2)) # prec == phi
bdata <- transform(bdata, shape2 = prec * (1 - mu),
shape1 = mu * prec)
bdata <- transform(bdata,
y = rbeta(nn, shape1 = shape1, shape2 = shape2))
bdata <- transform(bdata, Y = 5 + 8 * y) # From 5 to 13, not 0 to 1
fit <- vglm(Y ~ x2, data = bdata, trace = TRUE,
betaff(A = 5, B = 13, lmu = extlogit(min = 5, max = 13)))
coef(fit, matrix = TRUE)

Betageom The Beta-Geometric Distribution

Description
Density, distribution function, and random generation for the beta-geometric distribution.

Usage
dbetageom(x, shape1, shape2, log = FALSE)
pbetageom(q, shape1, shape2, log.p = FALSE)
rbetageom(n, shape1, shape2)

Arguments
x, q vector of quantiles.
n number of observations. Same as runif.
shape1, shape2 the two (positive) shape parameters of the standard beta distribution. They are
called a and b in beta respectively.
log, log.p Logical. If TRUE then all probabilities p are given as log(p).

Details
The beta-geometric distribution is a geometric distribution whose probability of success is not a
constant but it is generated from a beta distribution with parameters shape1 and shape2. Note that
the mean of this beta distribution is shape1/(shape1+shape2), which therefore is the mean of the
probability of success.

Value
dbetageom gives the density, pbetageom gives the distribution function, and rbetageom generates
random deviates.

Note
pbetageom can be particularly slow.
66 betageometric

Author(s)
T. W. Yee

betageometric Beta-geometric Distribution Family Function

Description
Maximum likelihood estimation for the beta-geometric distribution.

Usage
betageometric(lprob = "logit", lshape = "loge",
iprob = NULL, ishape = 0.1,
moreSummation = c(2, 100), tolerance = 1.0e-10, zero = NULL)

Arguments
lprob, lshape Parameter link functions applied to the parameters p and (called prob and
shape below). The former lies in the unit interval and the latter is positive. See
Links for more choices.
iprob, ishape Numeric. Initial values for the two parameters. A NULL means a value is com-
puted internally.
moreSummation Integer, of length 2. When computing the expected information matrix a se-
ries summation from 0 to moreSummation[1]*max(y)+moreSummation[2] is
made, in which the upper limit is an approximation to infinity. Here, y is the
response.
tolerance Positive numeric. When all terms are less than this then the series is deemed to
have converged.
zero An integer-valued vector specifying which linear/additive predictors are mod-
elled as intercepts only. If used, the value must be from the set {1,2}.
betageometric 67

Details
A random variable Y has a 2-parameter beta-geometric distribution if P (Y = y) = p(1 p)y
for y = 0, 1, 2, . . . where p are generated from a standard beta distribution with shape param-
eters shape1 and shape2. The parameterization here is to focus on the parameters p and =
1/(shape1 + shape2), where is shape. The default link functions for these ensure that the appro-
priate range of the parameters is maintained. The mean of Y is E(Y ) = shape2/(shape1 1) =
(1 p)/(p ) if shape1 > 1, and if so, then this is returned as the fitted values.
The geometric distribution is a special case of the beta-geometric distribution with = 0 (see
geometric). However, fitting data from a geometric distribution may result in numerical problems
because the estimate of log() will converge to -Inf.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, and vgam.

Note
The first iteration may be very slow; if practical, it is best for the weights argument of vglm etc. to
be used rather than inputting a very long vector as the response, i.e., vglm(y ~ 1, ..., weights = wts)
is to be preferred over vglm(rep(y, wts) ~ 1, ...). If convergence problems occur try inputting
some values of argument ishape.
If an intercept-only model is fitted then the misc slot of the fitted object has list components shape1
and shape2.

Author(s)
T. W. Yee

References
Paul, S. R. (2005) Testing goodness of fit of the geometric distribution: an application to human
fecundability data. Journal of Modern Applied Statistical Methods, 4, 425433.

betaII Beta Distribution of the Second Kind

Description
Maximum likelihood estimation of the 3-parameter beta II distribution.

Usage
betaII(lscale = "loge", lshape2.p = "loge", lshape3.q = "loge",
iscale = NULL, ishape2.p = NULL, ishape3.q = NULL, imethod = 1,
gscale = exp(-5:5), gshape2.p = exp(-5:5),
gshape3.q = seq(0.75, 4, by = 0.25),
probs.y = c(0.25, 0.5, 0.75), zero = "shape")

Arguments
lscale, lshape2.p, lshape3.q
Parameter link functions applied to the (positive) parameters scale, p and q.
See Links for more choices.
iscale, ishape2.p, ishape3.q, imethod, zero
See CommonVGAMffArguments for information.
gscale, gshape2.p, gshape3.q
See CommonVGAMffArguments for information.
probs.y See CommonVGAMffArguments for information.

Details
The 3-parameter beta II is the 4-parameter generalized beta II distribution with shape parameter
a = 1. It is also known as the Pearson VI distribution. Other distributions which are special cases
of the 3-parameter beta II include the Lomax (p = 1) and inverse Lomax (q = 1). More details can
be found in Kleiber and Kotz (2003).
The beta II distribution has density

f (y) = y p1 /[bp B(p, q){1 + y/b}p+q ]

for b > 0, p > 0, q > 0, y 0. Here, b is the scale parameter scale, and the others are shape
parameters. The mean is

E(Y ) = b (p + 1) (q 1)/((p) (q))

provided q > 1; these are returned as the fitted values. This family function handles multiple
responses.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, and vgam.
Betanorm 69

Note

See the notes in genbetaII.

Author(s)

T. W. Yee

References

Kleiber, C. and Kotz, S. (2003) Statistical Size Distributions in Economics and Actuarial Sciences,
Hoboken, NJ, USA: Wiley-Interscience.

See Also

betaff, genbetaII, dagum, sinmad, fisk, inv.lomax, lomax, paralogistic, inv.paralogistic.

Examples
bdata <- data.frame(y = rsinmad(2000, shape1.a = 1, shape3.q = exp(2),
scale = exp(1))) # Not genuine data!
fit <- vglm(y ~ 1, betaII, data = bdata, trace = TRUE)
fit <- vglm(y ~ 1, betaII(ishape2.p = 0.7, ishape3.q = 0.7),
data = bdata, trace = TRUE)
coef(fit, matrix = TRUE)
Coef(fit)
summary(fit)

Betanorm The Beta-Normal Distribution

Description

Density, distribution function, quantile function and random generation for the univariate beta-
normal distribution.

Usage

dbetanorm(x, shape1, shape2, mean = 0, sd = 1, log = FALSE)

pbetanorm(q, shape1, shape2, mean = 0, sd = 1,
lower.tail = TRUE, log.p = FALSE)
qbetanorm(p, shape1, shape2, mean = 0, sd = 1,
lower.tail = TRUE, log.p = FALSE)
rbetanorm(n, shape1, shape2, mean = 0, sd = 1)
70 Betanorm

Arguments
x, q vector of quantiles.
p vector of probabilities.
n number of observations. Same as runif.
shape1, shape2 the two (positive) shape parameters of the standard beta distribution. They are
called a and b respectively in beta.
mean, sd the mean and standard deviation of the univariate normal distribution (Normal).
log, log.p Logical. If TRUE then all probabilities p are given as log(p).
lower.tail Logical. If TRUE then the upper tail is returned, i.e., one minus the usual answer.

Details
The function betauninormal, the VGAM family function for estimating the parameters, has not
yet been written.

Value
dbetanorm gives the density, pbetanorm gives the distribution function, qbetanorm gives the quan-
tile function, and rbetanorm generates random deviates.

Author(s)
T. W. Yee

References
pp.146152 of Gupta, A. K. and Nadarajah, S. (2004) Handbook of Beta Distribution and Its Ap-
plications, New York: Marcel Dekker.

Examples
## Not run:
shape1 <- 0.1; shape2 <- 4; m <- 1
x <- seq(-10, 2, len = 501)
plot(x, dbetanorm(x, shape1, shape2, m = m), type = "l", ylim = 0:1, las = 1,
ylab = paste("betanorm(",shape1,", ",shape2,", m=",m, ", sd=1)", sep = ""),
main = "Blue is density, orange is cumulative distribution function",
sub = "Gray lines are the 10,20,...,90 percentiles", col = "blue")
lines(x, pbetanorm(x, shape1, shape2, m = m), col = "orange")
abline(h = 0, col = "black")
probs <- seq(0.1, 0.9, by = 0.1)
Q <- qbetanorm(probs, shape1, shape2, m = m)
lines(Q, dbetanorm(Q, shape1, shape2, m = m), col = "gray50", lty = 2, type = "h")
lines(Q, pbetanorm(Q, shape1, shape2, m = m), col = "gray50", lty = 2, type = "h")
abline(h = probs, col = "gray50", lty = 2)
pbetanorm(Q, shape1, shape2, m = m) - probs # Should be all 0

## End(Not run)
betaprime 71

betaprime The Beta-Prime Distribution

Description
Estimation of the two shape parameters of the beta-prime distribution by maximum likelihood esti-
mation.

Usage
betaprime(lshape = "loge", ishape1 = 2, ishape2 = NULL, zero = NULL)

Arguments
lshape Parameter link function applied to the two (positive) shape parameters. See
Links for more choices.
ishape1, ishape2, zero
See CommonVGAMffArguments.

Details
The beta-prime distribution is given by

f (y) = y shape11 (1 + y)shape1shape2 /B(shape1, shape2)

for y > 0. The shape parameters are positive, and here, B is the beta function. The mean of Y is
shape1/(shape2 1) provided shape2 > 1; these are returned as the fitted values.
If Y has a Beta(shape1, shape2) distribution then Y /(1Y ) and (1Y )/Y have a Betaprime(shape1, shape2)
and Betaprime(shape2, shape1) distribution respectively. Also, if Y1 has a gamma(shape1) dis-
tribution and Y2 has a gamma(shape2) distribution then Y1 /Y2 has a Betaprime(shape1, shape2)
distribution.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, rrvglm and vgam.

Note
The response must have positive values only.
The beta-prime distribution is also known as the beta distribution of the second kind or the inverted
beta distribution.

Author(s)
Thomas W. Yee
72 betaR

References

Johnson, N. L. and Kotz, S. and Balakrishnan, N. (1995) Chapter 25 of: Continuous Univariate
Distributions, 2nd edition, Volume 2, New York: Wiley.

See Also

betaff, Beta.

Examples

nn <- 1000
bdata <- data.frame(shape1 = exp(1), shape2 = exp(3))
bdata <- transform(bdata, yb = rbeta(nn, shape1, shape2))
bdata <- transform(bdata, y1 = (1-yb) / yb,
y2 = yb / (1-yb),
y3 = rgamma(nn, exp(3)) / rgamma(nn, exp(2)))

fit1 <- vglm(y1 ~ 1, betaprime, data = bdata, trace = TRUE)

coef(fit1, matrix = TRUE)

fit2 <- vglm(y2 ~ 1, betaprime, data = bdata, trace = TRUE)

coef(fit2, matrix = TRUE)

fit3 <- vglm(y3 ~ 1, betaprime, data = bdata, trace = TRUE)

coef(fit3, matrix = TRUE)

# Compare the fitted values

with(bdata, mean(y3))
head(fitted(fit3))
Coef(fit3) # Useful for intercept-only models

betaR The Two-parameter Beta Distribution Family Function

Description

Estimation of the shape parameters of the two-parameter beta distribution.

Usage

betaR(lshape1 = "loge", lshape2 = "loge",

i1 = NULL, i2 = NULL, trim = 0.05,
A = 0, B = 1, parallel = FALSE, zero = NULL)
betaR 73

Arguments
lshape1, lshape2, i1, i2
Details at CommonVGAMffArguments. See Links for more choices.
trim An argument which is fed into mean(); it is the fraction (0 to 0.5) of observations
to be trimmed from each end of the response y before the mean is computed.
This is used when computing initial values, and guards against outliers.
A, B Lower and upper limits of the distribution. The defaults correspond to the stan-
dard beta distribution where the response lies between 0 and 1.
parallel, zero See CommonVGAMffArguments for more information.

Details
The two-parameter beta distribution is given by f (y) =

(y A)shape11 (B y)shape21 /[Beta(shape1, shape2) (B A)shape1+shape21 ]

for A < y < B, and Beta(., .) is the beta function (see beta). The shape parameters are pos-
itive, and here, the limits A and B are known. The mean of Y is E(Y ) = A + (B A)
shape1/(shape1 + shape2), and these are the fitted values of the object.
For the standard beta distribution the variance of Y is shape1 shape2/[(1 + shape1 + shape2)
(shape1 + shape2)2 ]. If 2 = 1/(1 + shape1 + shape2) then the variance of Y can be written
2 (1 ) where = shape1/(shape1 + shape2) is the mean of Y .
Another parameterization of the beta distribution involving the mean and a precision parameter is
implemented in betaff.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, rrvglm and vgam.

Note
The response must have values in the interval (A, B). VGAM 0.7-4 and prior called this function
betaff.

Author(s)
Thomas W. Yee

References
Johnson, N. L. and Kotz, S. and Balakrishnan, N. (1995) Chapter 25 of: Continuous Univariate
Distributions, 2nd edition, Volume 2, New York: Wiley.
Gupta, A. K. and Nadarajah, S. (2004) Handbook of Beta Distribution and Its Applications, New
York: Marcel Dekker.
74 Biamhcop

See Also
betaff,
Beta, genbetaII, betaII, betabinomialff, betageometric, betaprime, rbetageom, rbetanorm,
kumar, simulate.vlm.

Examples
bdata <- data.frame(y = rbeta(n = 1000, shape1 = exp(0), shape2 = exp(1)))
fit <- vglm(y ~ 1, betaR(lshape1 = "identitylink", lshape2 = "identitylink"),
data = bdata, trace = TRUE, crit = "coef")
fit <- vglm(y ~ 1, betaR, data = bdata, trace = TRUE, crit = "coef")
coef(fit, matrix = TRUE)
Coef(fit) # Useful for intercept-only models

bdata <- transform(bdata, Y = 5 + 8 * y) # From 5 to 13, not 0 to 1

fit <- vglm(Y ~ 1, betaR(A = 5, B = 13), data = bdata, trace = TRUE)
Coef(fit)
c(meanY = with(bdata, mean(Y)), head(fitted(fit),2))

Biamhcop Ali-Mikhail-Haq Bivariate Distribution

Description
Density, distribution function, and random generation for the (one parameter) bivariate Ali-Mikhail-
Haq distribution.

Usage
dbiamhcop(x1, x2, apar, log = FALSE)
pbiamhcop(q1, q2, apar)
rbiamhcop(n, apar)

Arguments
x1, x2, q1, q2 vector of quantiles.
n number of observations. Same as runif
apar the association parameter.
log Logical. If TRUE then the logarithm is returned.

Details
See biamhcop, the VGAM family functions for estimating the parameter by maximum likelihood
estimation, for the formula of the cumulative distribution function and other details.
biamhcop 75

Value
dbiamhcop gives the density, pbiamhcop gives the distribution function, and rbiamhcop generates
random deviates (a two-column matrix).

Author(s)
T. W. Yee and C. S. Chee

plot(r <- rbiamhcop(n = 1000, apar = apar), col = "blue")

par(mfrow = c(1, 2))
hist(r[, 1]) # Should be uniform
hist(r[, 2]) # Should be uniform

## End(Not run)

biamhcop Ali-Mikhail-Haq Distribution Family Function

Description
Estimate the association parameter of Ali-Mikhail-Haqs bivariate distribution by maximum likeli-
hood estimation.

Usage
biamhcop(lapar = "rhobit", iapar = NULL, imethod = 1, nsimEIM = 250)

Arguments
lapar Link function applied to the association parameter , which is real and 1 <
< 1. See Links for more choices.
iapar Numeric. Optional initial value for . By default, an initial value is chosen inter-
nally. If a convergence failure occurs try assigning a different value. Assigning
a value will override the argument imethod.
76 biamhcop

imethod An integer with value 1 or 2 which specifies the initialization method. If failure
to converge occurs try the other value, or else specify a value for iapar.
nsimEIM See CommonVGAMffArguments for more information.

Details

The cumulative distribution function is

P (Y1 y1 , Y2 y2 ) = y1 y2 /(1 (1 y1 )(1 y2 ))

for 1 < < 1. The support of the function is the unit square. The marginal distributions are
the standard uniform distributions. When = 0 the random variables are independent. This is an
Archimedean copula.

Value

An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm and vgam.

Note

The response must be a two-column matrix. Currently, the fitted value is a matrix with two columns
and values equal to 0.5. This is because each marginal distribution corresponds to a standard uni-
form distribution.

Author(s)

T. W. Yee and C. S. Chee

References

Balakrishnan, N. and Lai, C.-D. (2009) Continuous Bivariate Distributions, 2nd ed. New York:
Springer.

See Also

rbiamhcop, bifgmcop, bigumbelIexp, rbilogis, simulate.vlm.

Examples
ymat <- rbiamhcop(1000, apar = rhobit(2, inverse = TRUE))
fit <- vglm(ymat ~ 1, biamhcop, trace = TRUE)
coef(fit, matrix = TRUE)
Coef(fit)
Biclaytoncop 77

Biclaytoncop Clayton Copula (Bivariate) Distribution

Description
Density and random generation for the (one parameter) bivariate Clayton copula distribution.

Usage
dbiclaytoncop(x1, x2, apar = 0, log = FALSE)
rbiclaytoncop(n, apar = 0)

Arguments
x1, x2 vector of quantiles. The x1 and x2 should both be in the interval (0, 1).
n number of observations. Same as rnorm.
apar the association parameter. Should be in the interval [0, ). The default corre-
sponds to independence.
log Logical. If TRUE then the logarithm is returned.

Details
See biclaytoncop, the VGAM family functions for estimating the parameter by maximum likeli-
hood estimation, for the formula of the cumulative distribution function and other details.

Value
dbiclaytoncop gives the density at point (x1,x2), rbiclaytoncop generates random deviates (a
two-column matrix).

Note
dbiclaytoncop() does not yet handle x1 = 0 and/or x2 = 0.

Author(s)
R. Feyter and T. W. Yee

References
Clayton, D. (1982) A model for association in bivariate survival data. Journal of the Royal Statisti-
cal Society, Series B, Methodological, 44, 414422.

See Also
biclaytoncop, binormalcop, binormal.
78 biclaytoncop

Examples
## Not run: edge <- 0.01 # A small positive value
N <- 101; x <- seq(edge, 1.0 - edge, len = N); Rho <- 0.7
ox <- expand.grid(x, x)
zedd <- dbiclaytoncop(ox[, 1], ox[, 2], apar = Rho, log = TRUE)
par(mfrow = c(1, 2))
contour(x, x, matrix(zedd, N, N), col = "blue", labcex = 1.5, las = 1)
plot(rbiclaytoncop(1000, 2), col = "blue", las = 1)

## End(Not run)

biclaytoncop Clayton Copula (Bivariate) Family Function

Description
Estimate the correlation parameter of the (bivariate) Clayton copula distribution by maximum like-
lihood estimation.

Usage
biclaytoncop(lapar = "loge", iapar = NULL, imethod = 1,
parallel = FALSE, zero = NULL)

Arguments
lapar, iapar, imethod
Details at CommonVGAMffArguments. See Links for more link function choices.
parallel, zero Details at CommonVGAMffArguments. If parallel = TRUE then the constraint
is also applied to the intercept.

Details
The cumulative distribution function is

P (u1 , u2 ; ) = (u
1 + u2 1)
1/

for 0 . Here, is the association parameter. The support of the function is the interior of the
unit square; however, values of 0 and/or 1 are not allowed (currently). The marginal distributions
are the standard uniform distributions. When = 0 the random variables are independent.
This VGAM family function can handle multiple responses, for example, a six-column matrix
where the first 2 columns is the first out of three responses, the next 2 columns being the next
response, etc.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm and vgam.
biclaytoncop 79

Note
The response matrix must have a multiple of two-columns. Currently, the fitted value is a matrix
with the same number of columns and values equal to 0.5. This is because each marginal distribution
corresponds to a standard uniform distribution.
This VGAM family function is fragile; each response must be in the interior of the unit square.

Author(s)
R. Feyter and T. W. Yee

References
Clayton, D. (1982) A model for association in bivariate survival data. Journal of the Royal Statisti-
cal Society, Series B, Methodological, 44, 414422.
Stober, J. and Schepsmeier, U. (2013) Derivatives and Fisher information of bivariate copulas. Sta-
tistical Papers.

See Also
rbiclaytoncop, dbiclaytoncop, kendall.tau.

Examples
ymat <- rbiclaytoncop(n = (nn <- 1000), apar = exp(2))
bdata <- data.frame(y1 = ymat[, 1],
y2 = ymat[, 2],
y3 = ymat[, 1],
y4 = ymat[, 2],
x2 = runif(nn))

summary(bdata)
## Not run: plot(ymat, col = "blue")
fit1 <- vglm(cbind(y1, y2, y3, y4) ~ 1, # 2 responses, e.g., (y1,y2) is the first
biclaytoncop, data = bdata,
trace = TRUE, crit = "coef") # Sometimes a good idea

coef(fit1, matrix = TRUE)

Coef(fit1)
head(fitted(fit1))
summary(fit1)

# Another example; apar is a function of x2

bdata <- transform(bdata, apar = exp(-0.5 + x2))
ymat <- rbiclaytoncop(n = nn, apar = with(bdata, apar))
bdata <- transform(bdata, y5 = ymat[, 1],
y6 = ymat[, 2])
fit2 <- vgam(cbind(y5, y6) ~ s(x2), data = bdata,
biclaytoncop(lapar = "loge"), trace = TRUE)
## Not run: plot(fit2, lcol = "blue", scol = "orange", se = TRUE, las = 1)
80 BICvlm

BICvlm Bayesian Information Criterion

Description
Calculates the Bayesian information criterion (BIC) for a fitted model object for which a log-
likelihood value has been obtained.

Usage
BICvlm(object, ..., k = log(nobs(object)))

Arguments
object, ... Same as AICvlm.
k Numeric, the penalty per parameter to be used; the default is log(n) where n is
the number of observations).

Details
The so-called BIC or SBC (Schwarzs Bayesian criterion) can be computed by calling AICvlm with
a different k argument. See AICvlm for information and caveats.

Value
Returns a numeric value with the corresponding BIC, or . . . , depending on k.

Warning
Like AICvlm, this code has not been double-checked. The general applicability of BIC for the
VGLM/VGAM classes has not been developed fully. In particular, BIC should not be run on some
VGAM family functions because of violation of certain regularity conditions, etc.
Many VGAM family functions such as cumulative can have the number of observations absorbed
into the prior weights argument (e.g., weights in vglm), either before or after fitting. Almost all
VGAM family functions can have the number of observations defined by the weights argument,
e.g., as an observed frequency. BIC simply uses the number of rows of the model matrix, say, as
defining n, hence the user must be very careful of this possible error. Use at your own risk!!

Note
BIC, AIC and other ICs can have have many additive constants added to them. The important thing
are the differences since the minimum value corresponds to the best model. Preliminary testing
shows absolute differences with some VGAM family functions such as gaussianff, however, they
should agree with non-normal families.
BIC has not been defined for QRR-VGLMs yet.
Bifgmcop 81

Author(s)

T. W. Yee.

See Also

AICvlm, VGLMs are described in vglm-class; VGAMs are described in vgam-class; RR-VGLMs
are described in rrvglm-class; BIC, AIC.

Examples
pneumo <- transform(pneumo, let = log(exposure.time))
(fit1 <- vglm(cbind(normal, mild, severe) ~ let,
cumulative(parallel = TRUE, reverse = TRUE), data = pneumo))
coef(fit1, matrix = TRUE)
BIC(fit1)
(fit2 <- vglm(cbind(normal, mild, severe) ~ let,
cumulative(parallel = FALSE, reverse = TRUE), data = pneumo))
coef(fit2, matrix = TRUE)
BIC(fit2)

# These do not agree in absolute terms:

gdata <- data.frame(x2 = sort(runif(n <- 40)))
gdata <- transform(gdata, y1 = 1 + 2*x2 + rnorm(n, sd = 0.1))
fit.v <- vglm(y1 ~ x2, gaussianff, data = gdata)
fit.g <- glm(y1 ~ x2, gaussian , data = gdata)
fit.l <- lm(y1 ~ x2, data = gdata)
c(BIC(fit.l), BIC(fit.g), BIC(fit.v))
c(AIC(fit.l), AIC(fit.g), AIC(fit.v))
c(AIC(fit.l) - AIC(fit.v),
AIC(fit.g) - AIC(fit.v))
c(logLik(fit.l), logLik(fit.g), logLik(fit.v))

Bifgmcop Farlie-Gumbel-Morgensterns Bivariate Distribution

Description

Density, distribution function, and random generation for the (one parameter) bivariate Farlie-
Gumbel-Morgensterns distribution.

Usage

dbifgmcop(x1, x2, apar, log = FALSE)

pbifgmcop(q1, q2, apar)
rbifgmcop(n, apar)
82 Bifgmcop

Arguments

x1, x2, q1, q2 vector of quantiles.

n number of observations. Same as in runif.
apar the association parameter.
log Logical. If TRUE then the logarithm is returned.

Details

See bifgmcop, the VGAM family functions for estimating the parameter by maximum likelihood
estimation, for the formula of the cumulative distribution function and other details.

Value

dbifgmcop gives the density, pbifgmcop gives the distribution function, and rbifgmcop generates
random deviates (a two-column matrix).

Author(s)

T. W. Yee

See Also

bifgmcop.

Examples

## Not run: N <- 101; x <- seq(0.0, 1.0, len = N); apar <- 0.7
ox <- expand.grid(x, x)
zedd <- dbifgmcop(ox[, 1], ox[, 2], apar = apar)
contour(x, x, matrix(zedd, N, N), col = "blue")
zedd <- pbifgmcop(ox[, 1], ox[, 2], apar = apar)
contour(x, x, matrix(zedd, N, N), col = "blue")

plot(r <- rbifgmcop(n = 3000, apar = apar), col = "blue")

par(mfrow = c(1, 2))
hist(r[, 1]) # Should be uniform
hist(r[, 2]) # Should be uniform

## End(Not run)
bifgmcop 83

bifgmcop Farlie-Gumbel-Morgensterns Bivariate Distribution Family Func-

tion

Description
Estimate the association parameter of Farlie-Gumbel-Morgensterns bivariate distribution by maxi-
mum likelihood estimation.

Usage
bifgmcop(lapar = "rhobit", iapar = NULL, imethod = 1)

Arguments
lapar, iapar, imethod
Details at CommonVGAMffArguments. See Links for more link function choices.

Details
The cumulative distribution function is

P (Y1 y1 , Y2 y2 ) = y1 y2 (1 + (1 y1 )(1 y2 ))

for 1 < < 1. The support of the function is the unit square. The marginal distributions are the
standard uniform distributions. When = 0 the random variables are independent.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm and vgam.

Note
The response must be a two-column matrix. Currently, the fitted value is a matrix with two columns
and values equal to 0.5. This is because each marginal distribution corresponds to a standard uni-
form distribution.

Author(s)
T. W. Yee

References
Castillo, E., Hadi, A. S., Balakrishnan, N. Sarabia, J. S. (2005) Extreme Value and Related Models
with Applications in Engineering and Science, Hoboken, NJ, USA: Wiley-Interscience.
Smith, M. D. (2007) Invariance theorems for Fisher information. Communications in Statistics
Theory and Methods, 36(12), 22132222.
84 bifgmexp

See Also
rbifgmcop, bifrankcop, bifgmexp, simulate.vlm.

Examples
ymat <- rbifgmcop(n = 1000, apar = rhobit(3, inverse = TRUE))
## Not run: plot(ymat, col = "blue")
fit <- vglm(ymat ~ 1, fam = bifgmcop, trace = TRUE)
coef(fit, matrix = TRUE)
Coef(fit)
head(fitted(fit))

bifgmexp Bivariate Farlie-Gumbel-Morgenstern Exponential Distribution Fam-

ily Function

Description
Estimate the association parameter of FGM bivariate exponential distribution by maximum likeli-
hood estimation.

Usage
bifgmexp(lapar = "rhobit", iapar = NULL, tola0 = 0.01, imethod = 1)

Arguments
lapar Link function for the association parameter , which lies between 1 and 1.
See Links for more choices and other information.
iapar Numeric. Optional initial value for . By default, an initial value is chosen inter-
nally. If a convergence failure occurs try assigning a different value. Assigning
a value will override the argument imethod.
tola0 Positive numeric. If the estimate of has an absolute value less than this then
it is replaced by this value. This is an attempt to fix a numerical problem when
the estimate is too close to zero.
imethod An integer with value 1 or 2 which specifies the initialization method. If failure
to converge occurs try the other value, or else specify a value for ia.

Details
The cumulative distribution function is

P (Y1 y1 , Y2 y2 ) = ey1 y2 (1 + [1 ey1 ][1 ey2 ]) + 1 ey1 ey2

for between 1 and 1. The support of the function is for y1 > 0 and y2 > 0. The marginal
distributions are an exponential distribution with unit mean. When = 0 then the random variables
bifgmexp 85

are independent, and this causes some problems in the estimation process since the distribution no
longer depends on the parameter.
A variant of Newton-Raphson is used, which only seems to work for an intercept model. It is a very
good idea to set trace = TRUE.

Value

An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm and vgam.

Note

The response must be a two-column matrix. Currently, the fitted value is a matrix with two columns
and values equal to 1. This is because each marginal distribution corresponds to a exponential
distribution with unit mean.
This VGAM family function should be used with caution.

Author(s)

T. W. Yee

References

Castillo, E., Hadi, A. S., Balakrishnan, N. Sarabia, J. S. (2005) Extreme Value and Related Models
with Applications in Engineering and Science, Hoboken, NJ, USA: Wiley-Interscience.

See Also

bifgmcop, bigumbelIexp.

Examples

N <- 1000; mdata <- data.frame(y1 = rexp(N), y2 = rexp(N))

## Not run: plot(ymat)
fit <- vglm(cbind(y1, y2) ~ 1, bifgmexp, data = mdata, trace = TRUE)
fit <- vglm(cbind(y1, y2) ~ 1, bifgmexp, data = mdata, # This may fail
trace = TRUE, crit = "coef")
coef(fit, matrix = TRUE)
Coef(fit)
head(fitted(fit))
86 bifrankcop

bifrankcop Franks Bivariate Distribution Family Function

Description
Estimate the association parameter of Franks bivariate distribution by maximum likelihood estima-
tion.

Usage
bifrankcop(lapar = "loge", iapar = 2, nsimEIM = 250)

Arguments
lapar Link function applied to the (positive) association parameter . See Links for
more choices.
iapar Numeric. Initial value for . If a convergence failure occurs try assigning a
different value.
nsimEIM See CommonVGAMffArguments.

Details
The cumulative distribution function is

P (Y1 y1 , Y2 y2 ) = H (y1 , y2 ) = log [1 + (y1 1)(y2 1)/( 1)]

for 6= 1. Note the logarithm here is to base . The support of the function is the unit square.
When 0 < < 1 the probability density function h (y1 , y2 ) is symmetric with respect to the lines
y2 = y1 and y2 = 1 y1 . When > 1 then h (y1 , y2 ) = h1/ (1 y1 , y2 ).
If = 1 then H(y1 , y2 ) = y1 y2 , i.e., uniform on the unit square. As approaches 0 then
H(y1 , y2 ) = min(y1 , y2 ). As approaches infinity then H(y1 , y2 ) = max(0, y1 + y2 1).
The default is to use Fisher scoring implemented using rbifrankcop. For intercept-only models an
alternative is to set nsimEIM=NULL so that a variant of Newton-Raphson is used.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm and vgam.

Note
The response must be a two-column matrix. Currently, the fitted value is a matrix with two columns
and values equal to a half. This is because the marginal distributions correspond to a standard
uniform distribution.
bigamma.mckay 87

Author(s)
T. W. Yee

References
Genest, C. (1987) Franks family of bivariate distributions. Biometrika, 74, 549555.

See Also
rbifrankcop, bifgmcop, simulate.vlm.

Examples
## Not run:
ymat <- rbifrankcop(n = 2000, apar = exp(4))
plot(ymat, col = "blue")
fit <- vglm(ymat ~ 1, fam = bifrankcop, trace = TRUE)
coef(fit, matrix = TRUE)
Coef(fit)
vcov(fit)
head(fitted(fit))
summary(fit)

## End(Not run)

bigamma.mckay Bivariate Gamma: McKays Distribution

Description
Estimate the three parameters of McKays bivariate gamma distribution by maximum likelihood
estimation.

Usage
bigamma.mckay(lscale = "loge", lshape1 = "loge", lshape2 = "loge",
iscale = NULL, ishape1 = NULL, ishape2 = NULL,
imethod = 1, zero = "shape")

Arguments
lscale, lshape1, lshape2
Link functions applied to the (positive) parameters a, p and q respectively. See
Links for more choices.
iscale, ishape1, ishape2
Optional initial values for a, p and q respectively. The default is to compute
them internally.
imethod, zero See CommonVGAMffArguments.
88 bigamma.mckay

Details
One of the earliest forms of the bivariate gamma distribution has a joint probability density function
given by

f (y1 , y2 ; a, p, q) = (1/a)p+q y1p1 (y2 y1 )q1 exp(y2 /a)/[(p)(q)]

for a > 0, p > 0, q > 0 and 0 < y1 < y2 (Mckay, 1934). Here, is the gamma function, as in
gamma. By default, the linear/additive predictors are 1 = log(a), 2 = log(p), 3 = log(q).
The marginal distributions are gamma, with shape parameters p and p + q respectively, but they
p a common scale parameter a. Pearsons product-moment correlation coefficient of y1 and y2
have
is p/(p + q). This distribution is also known as the bivariate Pearson type III distribution. Also,
Y2 y1 , conditional on Y1 = y1 , has a gamma distribution with shape parameter q.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm and vgam.

Note
The response must be a two column matrix where the first column is y1 and the second y2 . It is
necessary that each element of the vectors y1 and y2 y1 be positive. Currently, the fitted value is
a matrix with two columns; the first column has values ap for the marginal mean of y1 , while the
second column has values a(p + q) for the marginal mean of y2 (all evaluated at the final iteration).

Author(s)
T. W. Yee

References
McKay, A. T. (1934) Sampling from batches. Journal of the Royal Statistical SocietySupplement,
1, 207216.
Kotz, S. and Balakrishnan, N. and Johnson, N. L. (2000) Continuous Multivariate Distributions
Volume 1: Models and Applications, 2nd edition, New York: Wiley.
Balakrishnan, N. and Lai, C.-D. (2009) Continuous Bivariate Distributions, 2nd edition. New York:
Springer.

coef(fit, matrix = TRUE)

Coef(fit)
vcov(fit)

colMeans(depvar(fit)) # Check moments

head(fitted(fit), 1)

bigumbelIexp Gumbels Type I Bivariate Distribution Family Function

Description
Estimate the association parameter of Gumbels Type I bivariate distribution by maximum likeli-
hood estimation.

Usage
bigumbelIexp(lapar = "identitylink", iapar = NULL, imethod = 1)

Arguments
lapar Link function applied to the association parameter . See Links for more
choices.
iapar Numeric. Optional initial value for . By default, an initial value is chosen inter-
nally. If a convergence failure occurs try assigning a different value. Assigning
a value will override the argument imethod.
imethod An integer with value 1 or 2 which specifies the initialization method. If failure
to converge occurs try the other value, or else specify a value for ia.

Details
The cumulative distribution function is

P (Y1 y1 , Y2 y2 ) = ey1 y2 +y1 y2 + 1 ey1 ey2

for real . The support of the function is for y1 > 0 and y2 > 0. The marginal distributions are an
exponential distribution with unit mean.
A variant of Newton-Raphson is used, which only seems to work for an intercept model. It is a very
good idea to set trace=TRUE.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm and vgam.
90 bilogis

Note

Author(s)

T. W. Yee

References

Gumbel, E. J. (1960) Bivariate Exponential Distributions. Journal of the American Statistical As-
sociation, 55, 698707.

See Also

bifgmexp.

Examples
nn <- 1000
gdata <- data.frame(y1 = rexp(nn), y2 = rexp(nn))
## Not run: with(gdata, plot(cbind(y1, y2)))
fit <- vglm(cbind(y1, y2) ~ 1, bigumbelIexp, data = gdata, trace = TRUE)
coef(fit, matrix = TRUE)
Coef(fit)
head(fitted(fit))

bilogis Bivariate Logistic Distribution

Description

Density, distribution function, quantile function and random generation for the 4-parameter bivari-
ate logistic distribution.

Usage

dbilogis(x1, x2, loc1 = 0, scale1 = 1, loc2 = 0, scale2 = 1, log = FALSE)

pbilogis(q1, q2, loc1 = 0, scale1 = 1, loc2 = 0, scale2 = 1)
rbilogis(n, loc1 = 0, scale1 = 1, loc2 = 0, scale2 = 1)
bilogis 91

Arguments
x1, x2, q1, q2 vector of quantiles.
n number of observations. Same as rlogis.
loc1, loc2 the location parameters l1 and l2 .
scale1, scale2 the scale parameters s1 and s2 .
log Logical. If log = TRUE then the logarithm of the density is returned.

Details
See bilogis, the VGAM family function for estimating the four parameters by maximum likeli-
hood estimation, for the formula of the cumulative distribution function and other details.

Value
dbilogis gives the density, pbilogis gives the distribution function, and rbilogis generates
random deviates (a two-column matrix).

Note
Gumbel (1961) proposed two bivariate logistic distributions with logistic distribution marginals,
which he called Type I and Type II. The Type I is this one. The Type II belongs to the Morgenstern
type. The biamhcop distribution has, as a special case, this distribution, which is when the random
variables are independent.

Author(s)
T. W. Yee

References
Gumbel, E. J. (1961) Bivariate logistic distributions. Journal of the American Statistical Associa-
tion, 56, 335349.

contour(x1, x2, matrix(z, N, N), main = "density")

z <- pbilogis(ox[,1], ox[,2], loc1 = 5, loc2 = 7, scale2 = exp(1))
contour(x1, x2, matrix(z, N, N), main = "cdf")
## End(Not run)

bilogistic Bivariate Logistic Distribution Family Function

Description
Estimates the four parameters of the bivariate logistic distribution by maximum likelihood estima-
tion.

Usage
bilogistic(llocation = "identitylink", lscale = "loge",
iloc1 = NULL, iscale1 = NULL, iloc2 = NULL, iscale2 = NULL,
imethod = 1, zero = NULL)

Arguments
llocation Link function applied to both location parameters l1 and l2 . See Links for more
choices.
lscale Parameter link function applied to both (positive) scale parameters s1 and s2 .
See Links for more choices.
iloc1, iloc2 Initial values for the location parameters. By default, initial values are chosen
internally using imethod. Assigning values here will override the argument
imethod.
iscale1, iscale2
Initial values for the scale parameters. By default, initial values are chosen
internally using imethod. Assigning values here will override the argument
imethod.
imethod An integer with value 1 or 2 which specifies the initialization method. If failure
to converge occurs try the other value.
zero An integer-valued vector specifying which linear/additive predictors are mod-
elled as intercepts only. The default is none of them. If used, one can choose
values from the set {1,2,3,4}. See CommonVGAMffArguments for more informa-
tion.

Details
The four-parameter bivariate logistic distribution has a density that can be written as

exp[(y1 l1 )/s1 (y2 l2 )/s2 ]

f (y1 , y2 ; l1 , s1 , l2 , s2 ) = 2 3
s1 s2 (1 + exp[(y1 l1 )/s1 ] + exp[(y2 l2 )/s2 ])
bilogistic 93

where s1 > 0 and s2 > 0 are the scale parameters, and l1 and l2 are the location parameters. Each
of the two responses are unbounded, i.e., < yj < . The mean of Y1 is l1 etc. The fitted
values are returned in a 2-column matrix. The cumulative distribution function is
1
F (y1 , y2 ; l1 , s1 , l2 , s2 ) = (1 + exp[(y1 l1 )/s1 ] + exp[(y2 l2 )/s2 ])

The marginal distribution of Y1 is

1
P (Y1 y1 ) = F (y1 ; l1 , s1 ) = (1 + exp[(y1 l1 )/s1 ]) .

By default, 1 = l1 , 2 = log(s1 ), 3 = l2 , 4 = log(s2 ) are the linear/additive predictors.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, rrvglm and vgam.

Note
This family function uses the BFGS quasi-Newton update formula for the working weight matrices.
Consequently the estimated variance-covariance matrix may be inaccurate or simply wrong! The
standard errors must be therefore treated with caution; these are computed in functions such as
vcov() and summary().

Author(s)
T. W. Yee

References
Gumbel, E. J. (1961) Bivariate logistic distributions. Journal of the American Statistical Associa-
tion, 56, 335349.
Castillo, E., Hadi, A. S., Balakrishnan, N. Sarabia, J. S. (2005) Extreme Value and Related Models
with Applications in Engineering and Science, Hoboken, NJ, USA: Wiley-Interscience.

Binom2.or Bivariate Binary Regression with an Odds Ratio

Description
Density and random generation for a bivariate binary regression model using an odds ratio as the
measure of dependency.

Usage
rbinom2.or(n, mu1,
mu2 = if (exchangeable) mu1 else stop("argument 'mu2' not specified"),
oratio = 1, exchangeable = FALSE, tol = 0.001, twoCols = TRUE,
colnames = if (twoCols) c("y1","y2") else c("00", "01", "10", "11"),
ErrorCheck = TRUE)
dbinom2.or(mu1, mu2 = if (exchangeable) mu1 else stop("'mu2' not specified"),
oratio = 1, exchangeable = FALSE, tol = 0.001,
colnames = c("00", "01", "10", "11"), ErrorCheck = TRUE)

Arguments
n number of observations. Same as in runif. The arguments mu1, mu2, oratio
are recycled to this value.
mu1, mu2 The marginal probabilities. Only mu1 is needed if exchangeable = TRUE.
Values should be between 0 and 1.
oratio Odds ratio. Must be numeric and positive. The default value of unity means the
responses are statistically independent.
exchangeable Logical. If TRUE, the two marginal probabilities are constrained to be equal.
twoCols Logical. If TRUE, then a n 2 matrix of 1s and 0s is returned. If FALSE, then a
n 4 matrix of 1s and 0s is returned.
colnames The dimnames argument of matrix is assigned list(NULL, colnames).
tol Tolerance for testing independence. Should be some small positive numerical
value.
ErrorCheck Logical. Do some error checking of the input parameters?

Details
The function rbinom2.or generates data coming from a bivariate binary response model. The data
might be fitted with the VGAM family function binom2.or.
The function dbinom2.or does not really compute the density (because that does not make sense
here) but rather returns the four joint probabilities.
binom2.or 95

Value
The function rbinom2.or returns either a 2 or 4 column matrix of 1s and 0s, depending on the
argument twoCols.
The function dbinom2.or returns a 4 column matrix of joint probabilities; each row adds up to
unity.

Author(s)
T. W. Yee

bdata <- data.frame(x2 = sort(runif(nn))) # Example 2

bdata <- transform(bdata, mu1 = logit(-2 + 4 * x2, inverse = TRUE),
mu2 = logit(-1 + 3 * x2, inverse = TRUE))
dmat <- with(bdata, dbinom2.or(mu1 = mu1, mu2 = mu2, oratio = exp(2)))
ymat <- with(bdata, rbinom2.or(n = nn, mu1 = mu1, mu2 = mu2, oratio = exp(2)))
fit2 <- vglm(ymat ~ x2, binom2.or, data = bdata)
coef(fit2, matrix = TRUE)
## Not run:
matplot(with(bdata, x2), dmat, lty = 1:4, col = 1:4, type = "l",
main = "Joint probabilities", ylim = 0:1,
ylab = "Probabilities", xlab = "x2", las = 1)
legend("top", lty = 1:4, col = 1:4,
legend = c("1 = (y1=0, y2=0)", "2 = (y1=0, y2=1)",
"3 = (y1=1, y2=0)", "4 = (y1=1, y2=1)"))

## End(Not run)

binom2.or Bivariate Binary Regression with an Odds Ratio (Family Function)

Description
Fits a Palmgren (bivariate odds-ratio model, or bivariate logistic regression) model to two binary
responses. Actually, a bivariate logistic/probit/cloglog/cauchit model can be fitted. The odds ratio
is used as a measure of dependency.
96 binom2.or

Usage
binom2.or(lmu = "logit", lmu1 = lmu, lmu2 = lmu, loratio = "loge",
imu1 = NULL, imu2 = NULL, ioratio = NULL, zero = "oratio",
exchangeable = FALSE, tol = 0.001, more.robust = FALSE)

Arguments
lmu Link function applied to the two marginal probabilities. See Links for more
choices. See the note below.
lmu1, lmu2 Link function applied to the first and second of the two marginal probabilities.
loratio Link function applied to the odds ratio. See Links for more choices.
imu1, imu2, ioratio
Optional initial values for the marginal probabilities and odds ratio. See CommonVGAMffArguments
for more details. In general good initial values are often required so use these
arguments if convergence failure occurs.
zero Which linear/additive predictor is modelled as an intercept only? The default is
for the odds ratio. A NULL means none. See CommonVGAMffArguments for more
details.
exchangeable Logical. If TRUE, the two marginal probabilities are constrained to be equal.
tol Tolerance for testing independence. Should be some small positive numerical
value.
more.robust Logical. If TRUE then some measures are taken to compute the derivatives and
working weights more robustly, i.e., in an attempt to avoid numerical problems.
Currently this feature is not debugged if set TRUE.

Details
Also known informally as the Palmgren model, the bivariate logistic model is a full-likelihood
based model defined as two logistic regressions plus log(oratio) = eta3 where eta3 is the
third linear/additive predictor relating the odds ratio to explanatory variables. Explicitly, the default
model is
logit[P (Yj = 1)] = j , j = 1, 2
for the marginals, and

log[P (Y00 = 1)P (Y11 = 1)/(P (Y01 = 1)P (Y10 = 1))] = 3 ,

specifies the dependency between the two responses. Here, the responses equal 1 for a success and
a 0 for a failure, and the odds ratio is often written = p00 p11 /(p10 p01 ). The model is fitted by
maximum likelihood estimation since the full likelihood is specified. The two binary responses are
independent if and only if the odds ratio is unity, or equivalently, the log odds ratio is 0. Fisher
scoring is implemented.
The default models 3 as a single parameter only, i.e., an intercept-only model, but this can be
circumvented by setting zero = NULL in order to model the odds ratio as a function of all the
explanatory variables. The function binom2.or() can handle other probability link functions such
as probit, cloglog and cauchit links as well, so is quite general. In fact, the two marginal
probabilities can each have a different link function. A similar model is the bivariate probit model
binom2.or 97

(binom2.rho), which is based on a standard bivariate normal distribution, but the bivariate probit
model is less interpretable and flexible.
The exchangeable argument should be used when the error structure is exchangeable, e.g., with
eyes or ears data.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm and vgam.
When fitted, the fitted.values slot of the object contains the four joint probabilities, labelled as
(Y1 , Y2 ) = (0,0), (0,1), (1,0), (1,1), respectively. These estimated probabilities should be extracted
with the fitted generic function.

Note
At present we call binom2.or families a bivariate odds-ratio model. The response should be either
a 4-column matrix of counts (whose columns correspond to (Y1 , Y2 ) = (0,0), (0,1), (1,0), (1,1)
respectively), or a two-column matrix where each column has two distinct values, or a factor with
four levels. The function rbinom2.or may be used to generate such data. Successful convergence
requires at least one case of each of the four possible outcomes.
By default, a constant odds ratio is fitted because zero = 3. Set zero = NULL if you want the odds
ratio to be modelled as a function of the explanatory variables; however, numerical problems are
more likely to occur.
The argument lmu, which is actually redundant, is used for convenience and for upward compat-
ibility: specifying lmu only means the link function will be applied to lmu1 and lmu2. Users
who want a different link function for each of the two marginal probabilities should use the lmu1
and lmu2 arguments, and the argument lmu is then ignored. It doesnt make sense to specify
exchangeable = TRUE and have different link functions for the two marginal probabilities.
Regarding Yee and Dirnbock (2009), the xij (see vglm.control) argument enables environmental
variables with different values at the two time points to be entered into an exchangeable binom2.or
model. See the authors webpage for sample code.

Author(s)
Thomas W. Yee

References
McCullagh, P. and Nelder, J. A. (1989) Generalized Linear Models, 2nd ed. London: Chapman &
Hall.
le Cessie, S. and van Houwelingen, J. C. (1994) Logistic regression for correlated binary data.
Applied Statistics, 43, 95108.
Palmgren, J. (1989) Regression Models for Bivariate Binary Responses. Technical Report no. 101,
Department of Biostatistics, University of Washington, Seattle.
Yee, T. W. and Dirnbock, T. (2009) Models for analysing species presence/absence data at two
time points. Journal of Theoretical Biology, 259(4), 684694.
98 binom2.or

See Also
rbinom2.or, binom2.rho, loglinb2, zipebcom, coalminers, binomialff, logit, probit, cloglog,
cauchit.

Examples
# Fit the model in Table 6.7 in McCullagh and Nelder (1989)
coalminers <- transform(coalminers, Age = (age - 42) / 5)
fit <- vglm(cbind(nBnW, nBW, BnW, BW) ~ Age,
binom2.or(zero = NULL), data = coalminers)
fitted(fit)
summary(fit)
coef(fit, matrix = TRUE)
c(weights(fit, type = "prior")) * fitted(fit) # Table 6.8

## Not run: with(coalminers, matplot(Age, fitted(fit), type = "l", las = 1,

xlab = "(age - 42) / 5", lwd = 2))
with(coalminers, matpoints(Age, depvar(fit), col=1:4))
legend(x = -4, y = 0.5, lty = 1:4, col = 1:4, lwd = 2,
legend = c("1 = (Breathlessness=0, Wheeze=0)",
"2 = (Breathlessness=0, Wheeze=1)",
"3 = (Breathlessness=1, Wheeze=0)",
"4 = (Breathlessness=1, Wheeze=1)"))
## End(Not run)

# Another model: pet ownership

## Not run: data(xs.nz, package = "VGAMdata")
# More homogeneous:
petdata <- subset(xs.nz, ethnicity == "European" & age < 70 & sex == "M")
petdata <- na.omit(petdata[, c("cat", "dog", "age")])
summary(petdata)
with(petdata, table(cat, dog)) # Can compute the odds ratio

fit <- vgam(cbind((1-cat) * (1-dog), (1-cat) * dog,

cat * (1-dog), cat * dog) ~ s(age, df = 5),
binom2.or(zero = 3), data = petdata, trace = TRUE)
colSums(depvar(fit))
coef(fit, matrix = TRUE)

## End(Not run)

## Not run: # Plot the estimated probabilities

ooo <- order(with(petdata, age))
matplot(with(petdata, age)[ooo], fitted(fit)[ooo, ], type = "l",
xlab = "Age", ylab = "Probability", main = "Pet ownership",
ylim = c(0, max(fitted(fit))), las = 1, lwd = 1.5)
legend("topleft", col=1:4, lty = 1:4, leg = c("no cat or dog ",
"dog only", "cat only", "cat and dog"), lwd = 1.5)
## End(Not run)
Binom2.rho 99

Binom2.rho Bivariate Probit Model

Description

Density and random generation for a bivariate probit model. The correlation parameter rho is the
measure of dependency.

Usage

rbinom2.rho(n, mu1,
mu2 = if (exchangeable) mu1 else stop("argument 'mu2' not specified"),
rho = 0, exchangeable = FALSE, twoCols = TRUE,
colnames = if (twoCols) c("y1","y2") else c("00", "01", "10", "11"),
ErrorCheck = TRUE)
dbinom2.rho(mu1,
mu2 = if (exchangeable) mu1 else stop("'mu2' not specified"),
rho = 0, exchangeable = FALSE,
colnames = c("00", "01", "10", "11"), ErrorCheck = TRUE)

Arguments

n number of observations. Same as in runif. The arguments mu1, mu2, rho are
recycled to this value.
mu1, mu2 The marginal probabilities. Only mu1 is needed if exchangeable = TRUE.
Values should be between 0 and 1.
rho The correlation parameter. Must be numeric and lie between 1 and 1. The
default value of zero means the responses are uncorrelated.
exchangeable Logical. If TRUE, the two marginal probabilities are constrained to be equal.
twoCols Logical. If TRUE, then a n 2 matrix of 1s and 0s is returned. If FALSE, then a
n 4 matrix of 1s and 0s is returned.
colnames The dimnames argument of matrix is assigned list(NULL, colnames).
ErrorCheck Logical. Do some error checking of the input parameters?

Details

The function rbinom2.rho generates data coming from a bivariate probit model. The data might
be fitted with the VGAM family function binom2.rho.
The function dbinom2.rho does not really compute the density (because that does not make sense
here) but rather returns the four joint probabilities.
100 binom2.rho

Value

The function rbinom2.rho returns either a 2 or 4 column matrix of 1s and 0s, depending on the
argument twoCols.
The function dbinom2.rho returns a 4 column matrix of joint probabilities; each row adds up to
unity.

Author(s)

T. W. Yee

See Also

binom2.rho.

Examples
(myrho <- rhobit(2, inverse = TRUE)) # Example 1
ymat <- rbinom2.rho(nn <- 2000, mu1 = 0.8, rho = myrho, exch = TRUE)
(mytab <- table(ymat[, 1], ymat[, 2], dnn = c("Y1", "Y2")))
fit <- vglm(ymat ~ 1, binom2.rho(exch = TRUE))
coef(fit, matrix = TRUE)

bdata <- data.frame(x2 = sort(runif(nn))) # Example 2

bdata <- transform(bdata, mu1 = probit(-2+4*x2, inverse = TRUE),
mu2 = probit(-1+3*x2, inverse = TRUE))
dmat <- with(bdata, dbinom2.rho(mu1, mu2, myrho))
ymat <- with(bdata, rbinom2.rho(nn, mu1, mu2, myrho))
fit2 <- vglm(ymat ~ x2, binom2.rho, data = bdata)
coef(fit2, matrix = TRUE)
## Not run: matplot(with(bdata, x2), dmat, lty = 1:4, col = 1:4,
type = "l", main = "Joint probabilities",
ylim = 0:1, lwd = 2, ylab = "Probability")
legend(x = 0.25, y = 0.9, lty = 1:4, col = 1:4, lwd = 2,
legend = c("1 = (y1=0, y2=0)", "2 = (y1=0, y2=1)",
"3 = (y1=1, y2=0)", "4 = (y1=1, y2=1)"))
## End(Not run)

binom2.rho Bivariate Probit Model (Family Function)

Description

Fits a bivariate probit model to two binary responses. The correlation parameter rho is the measure
of dependency.
binom2.rho 101

Usage
binom2.rho(lmu = "probit", lrho = "rhobit", imu1 = NULL, imu2 = NULL,
irho = NULL, imethod = 1, zero = "rho",
exchangeable = FALSE, grho = seq(-0.95, 0.95, by = 0.05),
nsimEIM = NULL)
binom2.Rho(rho = 0, imu1 = NULL, imu2 = NULL,
exchangeable = FALSE, nsimEIM = NULL)

Arguments
lmu Link function applied to the marginal probabilities. Should be left alone.
lrho Link function applied to the association parameter. See Links for more choices.
imu1, imu2 Optional initial values for the two marginal probabilities. May be a vector.
irho Optional initial value for . If given, this should lie between 1 and 1. See
below for more comments.
zero Specifies which linear/additive predictors are modelled as intercept-only. A
NULL means none. Numerically, the parameter is easiest modelled as an in-
tercept only, hence the default. See CommonVGAMffArguments for more infor-
mation.
exchangeable Logical. If TRUE, the two marginal probabilities are constrained to be equal.
imethod, nsimEIM, grho
See CommonVGAMffArguments for more information. A value of at least 100 for
nsimEIM is recommended; the larger the value the better.
rho Numeric vector. Values are recycled to the needed length, and ought to be in
range, which is (1, 1).

Details
The bivariate probit model was one of the earliest regression models to handle two binary responses
jointly. It has a probit link for each of the two marginal probabilities, and models the association
between the responses by the parameter of a standard bivariate normal distribution (with zero
means and unit variances). One can think of the joint probabilities being (1 , 2 ; ) where is
the cumulative distribution function of a standard bivariate normal distribution.
Explicitly, the default model is

probit[P (Yj = 1)] = j , j = 1, 2

for the marginals, and

rhobit[rho] = 3 .
The joint probability P (Y1 = 1, Y2 = 1) = (1 , 2 ; ), and from these the other three joint
probabilities are easily computed. The model is fitted by maximum likelihood estimation since the
full likelihood is specified. Fisher scoring is implemented.
The default models 3 as a single parameter only, i.e., an intercept-only model for rho, but this can
be circumvented by setting zero = NULL in order to model rho as a function of all the explanatory
variables.
102 binom2.rho

The bivariate probit model should not be confused with a bivariate logit model with a probit link
(see binom2.or). The latter uses the odds ratio to quantify the association. Actually, the bivariate
logit model is recommended over the bivariate probit model because the odds ratio is a more natural
way of measuring the association between two binary responses.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, and vgam.
When fitted, the fitted.values slot of the object contains the four joint probabilities, labelled as
(Y1 , Y2 ) = (0,0), (0,1), (1,0), (1,1), respectively.

Note
See binom2.or about the form of input the response should have.
By default, a constant is fitted because zero = "rho". Set zero = NULL if you want the
parameter to be modelled as a function of the explanatory variables. The value lies in the interval
(1, 1), therefore a rhobit link is default.
Converge problems can occur. If so, assign irho a range of values and monitor convergence (e.g.,
set trace = TRUE). Else try imethod. Practical experience shows that local solutions can occur,
and that irho needs to be quite close to the (global) solution. Also, imu1 and imu2 may be used.
This help file is mainly about binom2.rho(). binom2.Rho() fits a bivariate probit model with
known . The inputted rho is saved in the misc slot of the fitted object, with rho as the component
name.
In some econometrics applications (e.g., Freedman 2010, Freedman and Sekhon 2010) one response
is used as an explanatory variable, e.g., a recursive binomial probit model. Such will not work here.
Historically, the bivariate probit model was the first VGAM I ever wrote, based on Ashford and
Sowden (1970). I dont think they ever thought of it either! Hence the criticisms raised go beyond
the use of what was originally intended.

Author(s)
Thomas W. Yee

References
Ashford, J. R. and Sowden, R. R. (1970) Multi-variate probit analysis. Biometrics, 26, 535546.
Freedman, D. A. (2010) Statistical Models and Causal Inference: a Dialogue with the Social Sci-
ences, Cambridge: Cambridge University Press.
Freedman, D. A. and Sekhon, J. S. (2010) Endogeneity in probit response models. Political Analy-
sis, 18, 138150.

See Also
rbinom2.rho, rhobit, pbinorm, binom2.or, loglinb2, coalminers, binomialff, rhobit, fisherz.
binomialff 103

Examples
coalminers <- transform(coalminers, Age = (age - 42) / 5)
fit <- vglm(cbind(nBnW, nBW, BnW, BW) ~ Age,
binom2.rho, data = coalminers, trace = TRUE)
summary(fit)
coef(fit, matrix = TRUE)

binomialff Binomial Family Function

Description
Family function for fitting generalized linear models to binomial responses, where the dispersion
parameter may be known or unknown.

Usage
binomialff(link = "logit", dispersion = 1, multiple.responses = FALSE,
onedpar = !multiple.responses, parallel = FALSE,
zero = NULL, bred = FALSE, earg.link = FALSE)

Arguments
link Link function; see Links and CommonVGAMffArguments for more information.
dispersion Dispersion parameter. By default, maximum likelihood is used to estimate the
model because it is known. However, the user can specify dispersion = 0 to
have it estimated, or else specify a known positive value (or values if multiple.responses
is TRUE).
multiple.responses
Multivariate response? If TRUE, then the response is interpreted as M inde-
pendent binary responses, where M is the number of columns of the response
matrix. In this case, the response matrix should have Q columns consisting of
counts (successes), and the weights argument should have Q columns consist-
ing of the number of trials (successes plus failures).
If FALSE and the response is a (2-column) matrix, then the number of successes
is given in the first column, and the second column is the number of failures.
onedpar One dispersion parameter? If multiple.responses, then a separate dispersion
parameter will be computed for each response (column), by default. Setting
onedpar = TRUE will pool them so that there is only one dispersion parameter
to be estimated.
parallel A logical or formula. Used only if multiple.responses is TRUE. This argument
allows for the parallelism assumption whereby the regression coefficients for a
variable is constrained to be equal over the M linear/additive predictors. If
parallel = TRUE then the constraint is not applied to the intercepts.
104 binomialff

zero An integer-valued vector specifying which linear/additive predictors are mod-

elled as intercepts only. The values must be from the set {1,2,. . . ,M }, where M
is the number of columns of the matrix response. See CommonVGAMffArguments
for more information.
earg.link Details at CommonVGAMffArguments.
bred Details at CommonVGAMffArguments. Setting bred = TRUE should work for
multiple responses (multiple.responses = TRUE) and all VGAM link func-
tions; it has been tested for logit only (and it gives similar results to brglm
but not identical), and further testing is required. One result from fitting bias
reduced binary regression is that finite regression coefficients occur when the
data is separable (see example below).

Details
This function is largely to mimic binomial, however there are some differences.
If the dispersion parameter is unknown, then the resulting estimate is not fully a maximum likeli-
hood estimate (see pp.1248 of McCullagh and Nelder, 1989).
A dispersion parameter that is less/greater than unity corresponds to under-/over-dispersion relative
to the binomial model. Over-dispersion is more common in practice.
Setting multiple.responses = TRUE is necessary when fitting a Quadratic RR-VGLM (see cqo)
because the response is a matrix of M columns (e.g., one column per species). Then there will be
M dispersion parameters (one per column of the response matrix).
When used with cqo and cao, it may be preferable to use the cloglog link.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, vgam, rrvglm, cqo, and cao.

Warning
With a multivariate response, assigning a known dispersion parameter for each response is not
handled well yet. Currently, only a single known dispersion parameter is handled well.
See the above note regarding bred.
The maximum likelihood estimate will not exist if the data is completely separable or quasi-
completely separable. See Chapter 10 of Altman et al. (2004) for more details, and safeBina-
ryRegression. Yet to do: add a sepcheck = TRUE, say, argument to detect this problem and give
an appropriate warning.

Note
If multiple.responses is FALSE (default) then the response can be of one of two formats: a
factor (first level taken as failure), or a 2-column matrix (first column = successes) of counts. The
argument weights in the modelling function can also be specified as any vector of positive values.
In general, 1 means success and 0 means failure (to check, see the y slot of the fitted object). Note
that a general vector of proportions of success is no longer accepted.
The notation M is used to denote the number of linear/additive predictors.
binomialff 105

If multiple.responses is TRUE, then the matrix response can only be of one format: a matrix of
1s and 0s (1 = success).
The call binomialff(dispersion = 0, ...) is equivalent to quasibinomialff(...). The
latter was written so that R users of quasibinomial() would only need to add a ff to the end of
the family function name.
Regardless of whether the dispersion parameter is to be estimated or not, its value can be seen from
the output from the summary() of the object.
Fisher scoring is used. This can sometimes fail to converge by oscillating between successive
iterations (Ridout, 1990). See the example below.

Author(s)
Thomas W. Yee

References
McCullagh, P. and Nelder, J. A. (1989) Generalized Linear Models, 2nd ed. London: Chapman &
Hall.
Altman, M. and Gill, J. and McDonald, M. P. (2004) Numerical Issues in Statistical Computing for
the Social Scientist, Hoboken, NJ, USA: Wiley-Interscience.
Ridout, M. S. (1990) Non-convergence of Fishers method of scoringa simple example. GLIM
Newsletter, 20(6).

See Also
quasibinomialff, Links, rrvglm, cqo, cao, betabinomial, posbinomial, zibinomial, double.expbinomial,
seq2binomial, amlbinomial, simplex, binomial, simulate.vlm, safeBinaryRegression.

Examples
quasibinomialff()
quasibinomialff(link = "probit")

shunua <- hunua[sort.list(with(hunua, altitude)), ] # Sort by altitude

fit <- vglm(agaaus ~ poly(altitude, 2), binomialff(link = cloglog),
data = shunua)
## Not run:
plot(agaaus ~ jitter(altitude), shunua, col = "blue", ylab = "P(Agaaus = 1)",
main = "Presence/absence of Agathis australis", las = 1)
with(shunua, lines(altitude, fitted(fit), col = "orange", lwd = 2))
## End(Not run)

# Fit two species simultaneously

fit2 <- vgam(cbind(agaaus, kniexc) ~ s(altitude),
binomialff(multiple.responses = TRUE), data = shunua)
## Not run:
with(shunua, matplot(altitude, fitted(fit2), type = "l",
main = "Two species response curves", las = 1))
106 Binorm

## End(Not run)

# Shows that Fisher scoring can sometime fail. See Ridout (1990).
ridout <- data.frame(v = c(1000, 100, 10), r = c(4, 3, 3), n = c(5, 5, 5))
(ridout <- transform(ridout, logv = log(v)))
# The iterations oscillates between two local solutions:
glm.fail <- glm(r / n ~ offset(logv) + 1, weight = n,
binomial(link = 'cloglog'), ridout, trace = TRUE)
coef(glm.fail)
# vglm()'s half-stepping ensures the MLE of -5.4007 is obtained:
vglm.ok <- vglm(cbind(r, n-r) ~ offset(logv) + 1,
binomialff(link = cloglog), ridout, trace = TRUE)
coef(vglm.ok)

# Separable data
set.seed(123)
threshold <- 0
bdata <- data.frame(x2 = sort(rnorm(nn <- 100)))
bdata <- transform(bdata, y1 = ifelse(x2 < threshold, 0, 1))
fit <- vglm(y1 ~ x2, binomialff(bred = TRUE),
data = bdata, criter = "coef", trace = TRUE)
coef(fit, matrix = TRUE) # Finite!!
summary(fit)
## Not run: plot(depvar(fit) ~ x2, data = bdata, col = "blue", las = 1)
lines(fitted(fit) ~ x2, data = bdata, col = "orange")
abline(v = threshold, col = "gray", lty = "dashed")
## End(Not run)

Binorm Bivariate Normal Distribution Cumulative Distribution Function

Description
Density, cumulative distribution function and random generation for the bivariate normal distribu-
tion distribution.

Usage
dbinorm(x1, x2, mean1 = 0, mean2 = 0, var1 = 1, var2 = 1, cov12 = 0, log = FALSE)
pbinorm(q1, q2, mean1 = 0, mean2 = 0, var1 = 1, var2 = 1, cov12 = 0)
rbinorm(n, mean1 = 0, mean2 = 0, var1 = 1, var2 = 1, cov12 = 0)
pnorm2(x1, x2, mean1 = 0, mean2 = 0, var1 = 1, var2 = 1, cov12 = 0)

Arguments
x1, x2, q1, q2 vector of quantiles.
mean1, mean2, var1, var2, cov12
vector of means, variances and the covariance.
Binorm 107

n number of observations. Same as rnorm.

log Logical. If log = TRUE then the logarithm of the density is returned.

Details
The default arguments correspond to the standard bivariate normal distribution with correlation
parameter = 0. That is, two independent standard normal distributions. Let sd1 (say) be
sqrt(var1) and written 1 , etc. Then the general formula for the correlation coefficient is =
cov/(1 2 ) where cov is argument cov12. Thus if arguments var1 and var2 are left alone then
cov12 can be inputted with .
One can think of this function as an extension of pnorm to two dimensions, however note that the
argument names have been changed for VGAM 0.9-1 onwards.

Value
dbinorm gives the density, pbinorm gives the cumulative distribution function, rbinorm generates
random deviates (n by 2 matrix).

Warning
Being based on an approximation, the results of pbinorm() may be negative! Also, pnorm2()
should be withdrawn soon; use pbinorm() instead because it is identical.

Note
For rbinorm(), if the ith variance-covariance matrix is not positive-definite then the ith row is all
NAs.

References
pbinorm() is based on Donnelly (1973), the code was translated from FORTRAN to ratfor using
struct, and then from ratfor to C manually. The function was originally called bivnor, and TWY
only wrote a wrapper function.
Donnelly, T. G. (1973) Algorithm 462: Bivariate Normal Distribution. Communications of the
ACM, 16, 638.

## Not run: rhovec <- seq(-0.95, 0.95, by = 0.01)

plot(rhovec, pbinorm(0, 0, cov12 = rhovec), type = "l", col = "blue", las = 1)
abline(v = 0, h = 0.25, col = "gray", lty = "dashed")
## End(Not run)
108 binormal

binormal Bivariate Normal Distribution Family Function

Description
Maximum likelihood estimation of the five parameters of a bivariate normal distribution.

Usage
binormal(lmean1 = "identitylink", lmean2 = "identitylink",
lsd1 = "loge", lsd2 = "loge",
lrho = "rhobit",
imean1 = NULL, imean2 = NULL,
isd1 = NULL, isd2 = NULL,
irho = NULL, imethod = 1,
eq.mean = FALSE, eq.sd = FALSE,
zero = c("sd", "rho"))

Arguments
lmean1, lmean2, lsd1, lsd2, lrho
Link functions applied to the means, standard deviations and rho parameters.
See Links for more choices. Being positive quantities, a log link is the default
for the standard deviations.
imean1, imean2, isd1, isd2, irho, imethod, zero
See CommonVGAMffArguments for more information.
eq.mean, eq.sd Logical or formula. Constrains the means or the standard deviations to be equal.

Details
For the bivariate normal distribution, this fits a linear model (LM) to the means, and by default, the
other parameters are intercept-only. The response should be a two-column matrix. The correlation
parameter is rho, which lies between 1 and 1 (thus the rhobit link is a reasonable choice). The
fitted means are returned as the fitted values, which is in the form of a two-column matrix. Fisher
scoring is implemented.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, and vgam.

Warning
This function may be renamed to normal2() or something like that at a later date.
binormalcop 109

Note

If both equal means and equal standard deviations are desired then use something like constraints = list("(Intercept)"
and maybe zero = NULL etc.

Author(s)

T. W. Yee

See Also

uninormal, gaussianff, pbinorm, bistudentt.

Examples
set.seed(123); nn <- 1000
bdata <- data.frame(x2 = runif(nn), x3 = runif(nn))
bdata <- transform(bdata, y1 = rnorm(nn, 1 + 2 * x2),
y2 = rnorm(nn, 3 + 4 * x2))
fit1 <- vglm(cbind(y1, y2) ~ x2,
binormal(eq.sd = TRUE), data = bdata, trace = TRUE)
coef(fit1, matrix = TRUE)
constraints(fit1)
summary(fit1)

# Estimated P(Y1 <= y1, Y2 <= y2) under the fitted model
var1 <- loge(2 * predict(fit1)[, "loge(sd1)"], inverse = TRUE)
var2 <- loge(2 * predict(fit1)[, "loge(sd2)"], inverse = TRUE)
cov12 <- rhobit(predict(fit1)[, "rhobit(rho)"], inverse = TRUE)
head(with(bdata, pbinorm(y1, y2,
mean1 = predict(fit1)[, "mean1"],
mean2 = predict(fit1)[, "mean2"],
var1 = var1, var2 = var2, cov12 = cov12)))

binormalcop Gaussian Copula (Bivariate) Family Function

Description

Estimate the correlation parameter of the (bivariate) Gaussian copula distribution by maximum
likelihood estimation.

Usage

binormalcop(lrho = "rhobit", irho = NULL, imethod = 1,

parallel = FALSE, zero = NULL)
110 binormalcop

Arguments
lrho, irho, imethod
Details at CommonVGAMffArguments. See Links for more link function choices.
parallel, zero Details at CommonVGAMffArguments. If parallel = TRUE then the constraint
is applied to the intercept too.

Details
The cumulative distribution function is

P (Y1 y1 , Y2 y2 ) = 2 (1 (y1 ), 1 (y2 ); )

for 1 < < 1, 2 is the cumulative distribution function of a standard bivariate normal (see
pbinorm), and is the cumulative distribution function of a standard univariate normal (see pnorm).
The support of the function is the interior of the unit square; however, values of 0 and/or 1 are not
allowed. The marginal distributions are the standard uniform distributions. When = 0 the random
variables are independent.
This VGAM family function can handle multiple responses, for example, a six-column matrix
where the first 2 columns is the first out of three responses, the next 2 columns being the next
response, etc.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm and vgam.

Author(s)
T. W. Yee

References
Schepsmeier, U. and Stober, J. (2013) Derivatives and Fisher information of bivariate copulas. Sta-
tistical Papers.

See Also
rbinormcop, pnorm, kendall.tau.
Binormcop 111

Examples
nn <- 1000
ymat <- rbinormcop(n = nn, rho = rhobit(-0.9, inverse = TRUE))
bdata <- data.frame(y1 = ymat[, 1],
y2 = ymat[, 2],
y3 = ymat[, 1],
y4 = ymat[, 2],
x2 = runif(nn))

summary(bdata)
## Not run: plot(ymat, col = "blue")
fit1 <- vglm(cbind(y1, y2, y3, y4) ~ 1, # 2 responses, e.g., (y1,y2) is the first
fam = binormalcop,
crit = "coef", # Sometimes a good idea
data = bdata, trace = TRUE)

coef(fit1, matrix = TRUE)

Coef(fit1)
head(fitted(fit1))
summary(fit1)

# Another example; rho is a linear function of x2

bdata <- transform(bdata, rho = -0.5 + x2)
ymat <- rbinormcop(n = nn, rho = with(bdata, rho))
bdata <- transform(bdata, y5 = ymat[, 1],
y6 = ymat[, 2])
fit2 <- vgam(cbind(y5, y6) ~ s(x2), data = bdata,
binormalcop(lrho = "identitylink"), trace = TRUE)
## Not run: plot(fit2, lcol = "blue", scol = "orange", se = TRUE, las = 1)

Binormcop Gaussian Copula (Bivariate) Distribution

Description
Density, distribution function, and random generation for the (one parameter) bivariate Gaussian
copula distribution.

Usage
dbinormcop(x1, x2, rho = 0, log = FALSE)
pbinormcop(q1, q2, rho = 0)
rbinormcop(n, rho = 0)

Arguments
x1, x2, q1, q2 vector of quantiles. The x1 and x2 should be in the interval (0, 1). Ditto for q1
and q2.
n number of observations. Same as rnorm.
112 Biplackett

rho the correlation parameter. Should be in the interval (1, 1).

log Logical. If TRUE then the logarithm is returned.

Details
See binormalcop, the VGAM family functions for estimating the parameter by maximum likeli-
hood estimation, for the formula of the cumulative distribution function and other details.

Value
dbinormcop gives the density, pbinormcop gives the distribution function, and rbinormcop gener-
ates random deviates (a two-column matrix).

Note
Yettodo: allow x1 and/or x2 to have values 1, and to allow any values for x1 and/or x2 to be outside
the unit square.

Author(s)
T. W. Yee

Biplackett Placketts Bivariate Copula

Description
Density, distribution function, and random generation for the (one parameter) bivariate Plackett
copula.
Biplackett 113

Usage
dbiplackcop(x1, x2, oratio, log = FALSE)
pbiplackcop(q1, q2, oratio)
rbiplackcop(n, oratio)

Arguments
x1, x2, q1, q2 vector of quantiles.
n number of observations. Same as in runif.
oratio the positive odds ratio .
log Logical. If TRUE then the logarithm is returned.

Details
See biplackettcop, the VGAM family functions for estimating the parameter by maximum like-
lihood estimation, for the formula of the cumulative distribution function and other details.

Value
dbiplackcop gives the density, pbiplackcop gives the distribution function, and rbiplackcop
generates random deviates (a two-column matrix).

Author(s)
T. W. Yee

References
Mardia, K. V. (1967) Some contributions to contingency-type distributions. Biometrika, 54, 235
249.

plot(rr <- rbiplackcop(n = 3000, oratio = oratio))

par(mfrow = c(1, 2))
hist(rr[, 1]) # Should be uniform
hist(rr[, 2]) # Should be uniform
114 biplackettcop

## End(Not run)

biplackettcop Placketts Bivariate Copula Family Function

Description
Estimate the association parameter of Placketts bivariate distribution (copula) by maximum likeli-
hood estimation.

Usage
biplackettcop(link = "loge", ioratio = NULL, imethod = 1, nsimEIM = 200)

Arguments
link Link function applied to the (positive) odds ratio . See Links for more choices
and information.
ioratio Numeric. Optional initial value for . If a convergence failure occurs try as-
signing a value or a different value.
imethod, nsimEIM
See CommonVGAMffArguments.

Details
The defining equation is

= H (1 y1 y2 + H)/((y1 H) (y2 H))

where P (Y1 y1 , Y2 y2 ) = H (y1 , y2 ) is the cumulative distribution function. The density

function is h (y1 , y2 ) =
3/2
[1 + ( 1)(y1 + y2 2y1 y2 )]/ [1 + ( 1)(y1 + y2 )]2 4( 1)y1 y2

for > 0. Some writers call the cross product ratio but it is called the odds ratio here. The
support of the function is the unit square. The marginal distributions here are the standard uniform
although it is commonly generalized to other distributions.
If = 1 then h (y1 , y2 ) = y1 y2 , i.e., independence. As the odds ratio tends to infinity one has
y1 = y2 . As the odds ratio tends to 0 one has y2 = 1 y1 .
Fisher scoring is implemented using rbiplackcop. Convergence is often quite slow.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm and vgam.
biplot-methods 115

Note
The response must be a two-column matrix. Currently, the fitted value is a 2-column matrix with
0.5 values because the marginal distributions correspond to a standard uniform distribution.

Author(s)
T. W. Yee

References
Plackett, R. L. (1965) A class of bivariate distributions. Journal of the American Statistical Associ-
ation, 60, 516522.

biplot-methods Biplot of Constrained Regression Models

Description
biplot is a generic function applied to RR-VGLMs and QRR-VGLMs etc. These apply to rank-
1 and rank-2 models of these only. For RR-VGLMs these plot the second latent variable scores
against the first latent variable scores.

Methods
x The object from which the latent variables are extracted and/or plotted.

Note
See lvplot which is very much related to biplots.
116 Bisa

Bisa The Birnbaum-Saunders Distribution

Description

Density, distribution function, and random generation for the Birnbaum-Saunders distribution.

Usage

dbisa(x, scale = 1, shape, log = FALSE)

pbisa(q, scale = 1, shape, lower.tail = TRUE, log.p = FALSE)
qbisa(p, scale = 1, shape, lower.tail = TRUE, log.p = FALSE)
rbisa(n, scale = 1, shape)

Arguments

x, q vector of quantiles.
p vector of probabilities.
n Same as in runif.
scale, shape the (positive) scale and shape parameters.
log Logical. If TRUE then the logarithm of the density is returned.
lower.tail, log.p
Same meaning as in pnorm or qnorm.

Details

The Birnbaum-Saunders distribution is a distribution which is used in survival analysis. See bisa,
the VGAM family function for estimating the parameters, for more details.

Value

dbisa gives the density, pbisa gives the distribution function, and qbisa gives the quantile func-
tion, and rbisa generates random deviates.

Author(s)

T. W. Yee and Kai Huang

See Also

bisa.
bisa 117

Examples
## Not run:
x <- seq(0, 6, len = 400)
plot(x, dbisa(x, shape = 1), type = "l", col = "blue",
ylab = "Density", lwd = 2, ylim = c(0,1.3), lty = 3,
main = "X ~ Birnbaum-Saunders(shape, scale = 1)")
lines(x, dbisa(x, shape = 2), col = "orange", lty = 2, lwd = 2)
lines(x, dbisa(x, shape = 0.5), col = "green", lty = 1, lwd = 2)
legend(x = 3, y = 0.9, legend = paste("shape = ",c(0.5, 1,2)),
col = c("green","blue","orange"), lty = 1:3, lwd = 2)

shape <- 1; x <- seq(0.0, 4, len = 401)

plot(x, dbisa(x, shape = shape), type = "l", col = "blue", las = 1, ylab = "",
main = "Blue is density, orange is cumulative distribution function",
sub = "Purple lines are the 10,20,...,90 percentiles", ylim = 0:1)
abline(h = 0, col = "blue", lty = 2)
lines(x, pbisa(x, shape = shape), col = "orange")
probs <- seq(0.1, 0.9, by = 0.1)
Q <- qbisa(probs, shape = shape)
lines(Q, dbisa(Q, shape = shape), col = "purple", lty = 3, type = "h")
pbisa(Q, shape = shape) - probs # Should be all zero
abline(h = probs, col = "purple", lty = 3)
lines(Q, pbisa(Q, shape = shape), col = "purple", lty = 3, type = "h")

## End(Not run)

bisa Birnbaum-Saunders Distribution Family Function

Description
Estimates the shape and scale parameters of the Birnbaum-Saunders distribution by maximum like-
lihood estimation.

Usage
bisa(lscale = "loge", lshape = "loge", iscale = 1,
ishape = NULL, imethod = 1, zero = "shape", nowarning = FALSE)

Arguments
nowarning Logical. Suppress a warning? Ignored for VGAM 0.9-7 and higher.
lscale, lshape Parameter link functions applied to the shape and scale parameters (a and b
below). See Links for more choices. A log link is the default for both because
they are positive.
iscale, ishape Initial values for a and b. A NULL means an initial value is chosen internally
using imethod.
118 bisa

imethod An integer with value 1 or 2 or 3 which specifies the initialization method. If

failure to converge occurs try the other value, or else specify a value for ishape
and/or iscale.
zero Specifies which linear/additive predictor is modelled as intercept-only. If used,
choose one value from the set {1,2}. See CommonVGAMffArguments for more
details.

Details
The (two-parameter) Birnbaum-Saunders distribution has a cumulative distribution function that
can be written as
F (y; a, b) = [(y/b)/a]

() is the cumulative distribution function of a standard normal (see pnorm), (t) = t
where
1/ t, y > 0, a > 0 is the shape parameter, b > 0 is the scale parameter. The mean of Y (which
is the fitted value) is b(1 + a2 /2). and the variance is a2 b2 (1 + 54 a2 ). By default, 1 = log(a) and
2 = log(b) for this family function.
Note that a and b are orthogonal, i.e., the Fisher information matrix is diagonal. This family function
implements Fisher scoring, and it is unnecessary to compute any integrals numerically.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, and vgam.

Author(s)
T. W. Yee

References
Lemonte, A. J. and Cribari-Neto, F. and Vasconcellos, K. L. P. (2007) Improved statistical inference
for the two-parameter Birnbaum-Saunders distribution. Computational Statistics \& Data Analysis,
51, 46564681.
Birnbaum, Z. W. and Saunders, S. C. (1969) A new family of life distributions. Journal of Applied
Probability, 6, 319327.
Birnbaum, Z. W. and Saunders, S. C. (1969) Estimation for a family of life distributions with appli-
cations to fatigue. Journal of Applied Probability, 6, 328347.
Engelhardt, M. and Bain, L. J. and Wright, F. T. (1981) Inferences on the parameters of the
Birnbaum-Saunders fatigue life distribution based on maximum likelihood estimation. Techno-
metrics, 23, 251256.
Johnson, N. L. and Kotz, S. and Balakrishnan, N. (1995) Continuous Univariate Distributions, 2nd
edition, Volume 2, New York: Wiley.

See Also
pbisa, inv.gaussianff, CommonVGAMffArguments.
Bistudentt 119

Examples
bdata1 <- data.frame(x2 = runif(nn <- 1000))
bdata1 <- transform(bdata1, shape = exp(-0.5 + x2), scale = exp(1.5))
bdata1 <- transform(bdata1, y = rbisa(nn, scale, shape))
fit1 <- vglm(y ~ x2, bisa(zero = 1), data = bdata1, trace = TRUE)
coef(fit1, matrix = TRUE)

## Not run:
bdata2 <- data.frame(shape = exp(-0.5), scale = exp(0.5))
bdata2 <- transform(bdata2, y = rbisa(nn, scale, shape))
fit <- vglm(y ~ 1, bisa, data = bdata2, trace = TRUE)
with(bdata2, hist(y, prob = TRUE, ylim = c(0, 0.5), col = "lightblue"))
coef(fit, matrix = TRUE)
with(bdata2, mean(y))
head(fitted(fit))
x <- with(bdata2, seq(0, max(y), len = 200))
lines(dbisa(x, Coef(fit)[1], Coef(fit)[2]) ~ x, data = bdata2,
col = "orange", lwd = 2)
## End(Not run)

Bistudentt Bivariate Student-t distribution cumulative distribution function

Description
Density for the bivariate Student-t distribution distribution.

Usage
dbistudentt(x1, x2, df, rho = 0, log = FALSE)

Arguments
x1, x2 vector of quantiles.
df, rho vector of degrees of freedom and correlation parameter. For df, a value Inf is
currently not working.
log Logical. If log = TRUE then the logarithm of the density is returned.

Details
One can think of this function as an extension of dt to two dimensions. See bistudentt for more
information.

Value
dbistudentt gives the density.
120 bistudentt

References
Schepsmeier, U. and Stober, J. (2013) Derivatives and Fisher information of bivariate copulas. Sta-
tistical Papers.

bistudentt Bivariate Student-t Family Function

Description
Estimate the degrees of freedom and correlation parameters of the (bivariate) Student-t distribution
by maximum likelihood estimation.

Usage
bistudentt(ldf = "loglog", lrho = "rhobit",
idf = NULL, irho = NULL, imethod = 1,
parallel = FALSE, zero = "rho")

Arguments
ldf, lrho, idf, irho, imethod
Details at CommonVGAMffArguments. See Links for more link function choices.
parallel, zero Details at CommonVGAMffArguments.

Details
The density function is
1
f (y1 , y2 ; , ) = p (1 + y12 + y22 2y1 y2 )/((1 2 ))(+2)/2
2 1 2
for 1 < < 1, and real y1 and y2 .
This VGAM family function can handle multiple responses, for example, a six-column matrix
where the first 2 columns is the first out of three responses, the next 2 columns being the next
response, etc.
bistudentt 121

Value

An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm and vgam.

Warning

The working weight matrices have not been fully checked.

Note

The response matrix must have a multiple of two-columns. Currently, the fitted value is a matrix
with the same number of columns and values equal to 0.0.

Author(s)

T. W. Yee, with help from Thibault Vatter.

References

Schepsmeier, U. and Stober, J. (2013) Derivatives and Fisher information of bivariate copulas. Sta-
tistical Papers.

See Also

dbistudentt, binormal, pt.

Examples

nn <- 1000
mydof <- loglog(1, inverse = TRUE)
ymat <- cbind(rt(nn, df = mydof), rt(nn, df = mydof))
bdata <- data.frame(y1 = ymat[, 1], y2 = ymat[, 2],
y3 = ymat[, 1], y4 = ymat[, 2], x2 = runif(nn))
summary(bdata)
## Not run: plot(ymat, col = "blue")
fit1 <- vglm(cbind(y1, y2, y3, y4) ~ 1, # 2 responses, e.g., (y1,y2) is the 1st
fam = bistudentt, # crit = "coef", # Sometimes a good idea
data = bdata, trace = TRUE)

coef(fit1, matrix = TRUE)

Coef(fit1)
head(fitted(fit1))
summary(fit1)
122 bmi.nz

bmi.nz Body Mass Index of New Zealand Adults Data

Description
The body mass indexes and ages from an approximate random sample of 700 New Zealand adults.

Usage
data(bmi.nz)

Format
A data frame with 700 observations on the following 2 variables.

age a numeric vector; their age (years).

BMI a numeric vector; their body mass indexes, which is their weight divided by the square of
their height (kg / m2 ).

Details
They are a random sample from the Fletcher Challenge/Auckland Heart and Health survey con-
ducted in the early 1990s.
There are some outliers in the data set.
A variable gender would be useful, and may be added later.

Source
Clinical Trials Research Unit, University of Auckland, New Zealand, http://www.ctru.auckland.
ac.nz.

References
MacMahon, S., Norton, R., Jackson, R., Mackie, M. J., Cheng, A., Vander Hoorn, S., Milne, A.,
McCulloch, A. (1995) Fletcher Challenge-University of Auckland Heart & Health Study: design
and baseline findings. New Zealand Medical Journal, 108, 499502.

Examples
## Not run: with(bmi.nz, plot(age, BMI, col = "blue"))
fit <- vgam(BMI ~ s(age, df = c(2, 4, 2)), lms.yjn, data = bmi.nz, trace = TRUE)
qtplot(fit, pcol = "blue", tcol = "brown", lcol = "brown")
## End(Not run)
borel.tanner 123

borel.tanner Borel-Tanner Distribution Family Function

Description
Estimates the parameter of a Borel-Tanner distribution by maximum likelihood estimation.

Usage
borel.tanner(Qsize = 1, link = "logit", imethod = 1)

Arguments
Qsize A positive integer. It is called Q below and is the initial queue size. The default
value Q = 1 corresponds to the Borel distribution.
link Link function for the parameter; see Links for more choices and for general
information.
imethod See CommonVGAMffArguments. Valid values are 1, 2, 3 or 4.

Details
The Borel-Tanner distribution (Tanner, 1953) describes the distribution of the total number of cus-
tomers served before a queue vanishes given a single queue with random arrival times of customers
(at a constant rate r per unit time, and each customer taking a constant time b to be served). Initially
the queue has Q people and the first one starts to be served. The two parameters appear in the den-
sity only in the form of the product rb, therefore we use a = rb, say, to denote the single parameter
to be estimated. The density function is

Q
f (y; a) = y yQ1 ayQ exp(ay)
(y Q)!

where y = Q, Q + 1, Q + 2, . . .. The case Q = 1 corresponds to the Borel distribution (Borel,

1942). For the Q = 1 case it is necessary for 0 < a < 1 for the distribution to be proper. The Borel
distribution is a basic Lagrangian distribution of the first kind. The Borel-Tanner distribution is an
Q-fold convolution of the Borel distribution.
The mean is Q/(1 a) (returned as the fitted values) and the variance is Qa/(1 a)3 . The
distribution has a very long tail unless a is small. Fisher scoring is implemented.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm and vgam.

Author(s)
T. W. Yee
124 Bort

References
Tanner, J. C. (1953) A problem of interference between two queues. Biometrika, 40, 5869.
Borel, E. (1942) Sur lemploi du theoreme de Bernoulli pour faciliter le calcul dune infinite de
coefficients. Application au probleme de lattente a un guichet. Comptes Rendus, Academie des
Sciences, Paris, Series A, 214, 452456.
Page 328 of Johnson N. L., Kemp, A. W. and Kotz S. (2005) Univariate Discrete Distributions, 3rd
edition, Hoboken, New Jersey: Wiley.
Consul, P. C. and Famoye, F. (2006) Lagrangian Probability Distributions, Boston, MA, USA:
Birkhauser.

Bort The Borel-Tanner Distribution

Description
Density and random generation for the Borel-Tanner distribution.

Usage
dbort(x, Qsize = 1, a = 0.5, log = FALSE)
rbort(n, Qsize = 1, a = 0.5)

Arguments
x vector of quantiles.
n number of observations. Must be a positive integer of length 1.
Qsize, a See borel.tanner.
log Logical. If log = TRUE then the logarithm of the density is returned.

Details
See borel.tanner, the VGAM family function for estimating the parameter, for the formula of the
probability density function and other details.
Brat 125

Value
dbort gives the density, rbort generates random deviates.

Warning
Looping is used for rbort, therefore values of a close to 1 will result in long (or infinite!) compu-
tational times. The default value of a is subjective.

Author(s)
T. W. Yee

Brat Inputting Data to fit a Bradley Terry Model

Description
Takes in a square matrix of counts and outputs them in a form that is accessible to the brat and
bratt family functions.

Usage
Brat(mat, ties = 0 * mat, string = c(">", "=="), whitespace = FALSE)

Arguments
mat Matrix of counts, which is considered M by M in dimension when there are
ties, and M + 1 by M + 1 when there are no ties. The rows are winners and
the columns are losers, e.g., the 2-1 element is now many times Competitor 2
has beaten Competitor 1. The matrices are best labelled with the competitors
names.
ties Matrix of counts. This should be the same dimension as mat. By default, there
are no ties. The matrix must be symmetric, and the diagonal should contain NAs.
string Character. The matrices are labelled with the first value of the descriptor, e.g.,
"NZ > Oz" means NZ beats Australia in rugby. Suggested alternatives include
" beats " or " wins against ". The second value is used to handle ties.
126 Brat

whitespace Logical. If TRUE then a white space is added before and after string; it gener-
ally enhances readability. See CommonVGAMffArguments for some similar-type
information.

Details

In the VGAM package it is necessary for each matrix to be represented as a single row of data by
brat and bratt. Hence the non-diagonal elements of the M + 1 by M + 1 matrix are concatenated
into M (M + 1) values (no ties), while if there are ties, the non-diagonal elements of the M by M
matrix are concatenated into M (M 1) values.

Value

A matrix with 1 row and either M (M + 1) or M (M 1) columns.

Note

This is a data preprocessing function for brat and bratt.

Yet to do: merge InverseBrat into brat.

Author(s)

T. W. Yee

References

Agresti, A. (2013) Categorical Data Analysis, 3rd ed. Hoboken, NJ, USA: Wiley.

See Also

brat, bratt, InverseBrat.

Examples

journal <- c("Biometrika", "Comm Statist", "JASA", "JRSS-B")

mat <- matrix(c( NA, 33, 320, 284, 730, NA, 813, 276,
498, 68, NA, 325, 221, 17, 142, NA), 4, 4)
dimnames(mat) <- list(winner = journal, loser = journal)
Brat(mat) # Less readable
Brat(mat, whitespace = TRUE) # More readable
vglm(Brat(mat, whitespace = TRUE) ~ 1, brat, trace = TRUE)
brat 127

brat Bradley Terry Model

Description
Fits a Bradley Terry model (intercept-only model) by maximum likelihood estimation.

Usage
brat(refgp = "last", refvalue = 1, ialpha = 1)

Arguments
refgp Integer whose value must be from the set {1,. . . ,M + 1}, where there are M + 1
competitors. The default value indicates the last competitor is usedbut dont
input a character string, in general.
refvalue Numeric. A positive value for the reference group.
ialpha Initial values for the s. These are recycled to the appropriate length.

Details
The Bradley Terry model involves M + 1 competitors who either win or lose against each other
(no draws/ties allowed in this implementationsee bratt if there are ties). The probability that
Competitor i beats Competitor j is i /(i + j ), where all the s are positive. Loosely, the s can
be thought of as the competitors abilities. For identifiability, one of the i is set to a known value
refvalue, e.g., 1. By default, this function chooses the last competitor to have this reference value.
The data can be represented in the form of a M + 1 by M + 1 matrix of counts, where winners are
the rows and losers are the columns. However, this is not the way the data should be inputted (see
below).
Excluding the reference value/group, this function chooses log(j ) as the M linear predictors. The
log link ensures that the s are positive.
The Bradley Terry model can be fitted by logistic regression, but this approach is not taken here.
The Bradley Terry model can be fitted with covariates, e.g., a home advantage variable, but unfor-
tunately, this lies outside the VGLM theoretical framework and therefore cannot be handled with
this code.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm.

Warning
Presently, the residuals are wrong, and the prior weights are not handled correctly. Ideally, the total
number of counts should be the prior weights, after the response has been converted to proportions.
This would make it similar to family functions such as multinomial and binomialff.
128 brat

Note

The function Brat is useful for coercing a M + 1 by M + 1 matrix of counts into a one-row
matrix suitable for brat. Diagonal elements are skipped, and the usual S order of c(a.matrix) of
elements is used. There should be no missing values apart from the diagonal elements of the square
matrix. The matrix should have winners as the rows, and losers as the columns. In general, the
response should be a 1-row matrix with M (M + 1) columns.
Only an intercept model is recommended with brat. It doesnt make sense really to include covari-
ates because of the limited VGLM framework.
Notationally, note that the VGAM family function brat has M + 1 contestants, while bratt has
M contestants.

Author(s)

T. W. Yee

References

Agresti, A. (2013) Categorical Data Analysis, 3rd ed. Hoboken, NJ, USA: Wiley.
Stigler, S. (1994) Citation patterns in the journals of statistics and probability. Statistical Science,
9, 94108.
The BradleyTerry2 package has more comprehensive capabilities than this function.

See Also

bratt, Brat, multinomial, binomialff.

Examples
# Citation statistics: being cited is a 'win'; citing is a 'loss'
journal <- c("Biometrika", "Comm.Statist", "JASA", "JRSS-B")
mat <- matrix(c( NA, 33, 320, 284,
730, NA, 813, 276,
498, 68, NA, 325,
221, 17, 142, NA), 4, 4)
dimnames(mat) <- list(winner = journal, loser = journal)
fit <- vglm(Brat(mat) ~ 1, brat(refgp = 1), trace = TRUE)
fit <- vglm(Brat(mat) ~ 1, brat(refgp = 1), trace = TRUE, crit = "coef")
summary(fit)
c(0, coef(fit)) # Log-abilities (in order of "journal")
c(1, Coef(fit)) # Abilities (in order of "journal")
fitted(fit) # Probabilities of winning in awkward form
(check <- InverseBrat(fitted(fit))) # Probabilities of winning
check + t(check) # Should be 1's in the off-diagonals
bratt 129

bratt Bradley Terry Model With Ties

Description

Fits a Bradley Terry model with ties (intercept-only model) by maximum likelihood estimation.

Usage

bratt(refgp = "last", refvalue = 1, ialpha = 1, i0 = 0.01)

Arguments

refgp Integer whose value must be from the set {1,. . . ,M }, where there are M com-
petitors. The default value indicates the last competitor is usedbut dont input
a character string, in general.
refvalue Numeric. A positive value for the reference group.
ialpha Initial values for the s. These are recycled to the appropriate length.
i0 Initial value for 0 . If convergence fails, try another positive value.

Details

There are several models that extend the ordinary Bradley Terry model to handle ties. This family
function implements one of these models. It involves M competitors who either win or lose or
tie against each other. (If there are no draws/ties then use brat). The probability that Competitor
i beats Competitor j is i /(i + j + 0 ), where all the s are positive. The probability that
Competitor i ties with Competitor j is 0 /(i + j + 0 ). Loosely, the s can be thought of as the
competitors abilities, and 0 is an added parameter to model ties. For identifiability, one of the
i is set to a known value refvalue, e.g., 1. By default, this function chooses the last competitor to
have this reference value. The data can be represented in the form of a M by M matrix of counts,
where winners are the rows and losers are the columns. However, this is not the way the data should
be inputted (see below).
Excluding the reference value/group, this function chooses log(j ) as the first M 1 linear predic-
tors. The log link ensures that the s are positive. The last linear predictor is log(0 ).
The Bradley Terry model can be fitted with covariates, e.g., a home advantage variable, but unfor-
tunately, this lies outside the VGLM theoretical framework and therefore cannot be handled with
this code.

Value

An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm.
130 bratt

Note
The function Brat is useful for coercing a M by M matrix of counts into a one-row matrix suitable
for bratt. Diagonal elements are skipped, and the usual S order of c(a.matrix) of elements is
used. There should be no missing values apart from the diagonal elements of the square matrix.
The matrix should have winners as the rows, and losers as the columns. In general, the response
should be a matrix with M (M 1) columns.
Also, a symmetric matrix of ties should be passed into Brat. The diagonal of this matrix should be
all NAs.
Only an intercept model is recommended with bratt. It doesnt make sense really to include
covariates because of the limited VGLM framework.
Notationally, note that the VGAM family function brat has M + 1 contestants, while bratt has
M contestants.

Author(s)
T. W. Yee

References
Torsney, B. (2004) Fitting Bradley Terry models using a multiplicative algorithm. In: Antoch,
J. (ed.) Proceedings in Computational Statistics COMPSTAT 2004, Physica-Verlag: Heidelberg.
Pages 513526.

# Add some ties. This is fictitional data.

ties <- 5 + 0 * mat
ties[2, 1] <- ties[1,2] <- 9

# Now fit the model

fit <- vglm(Brat(mat, ties) ~ 1, bratt(refgp = 1), trace = TRUE)
fit <- vglm(Brat(mat, ties) ~ 1, bratt(refgp = 1), trace = TRUE, crit = "coef")

summary(fit)
c(0, coef(fit)) # Log-abilities (in order of "journal"); last is log(alpha0)
c(1, Coef(fit)) # Abilities (in order of "journal"); last is alpha0

fit@misc$alpha # alpha_1,...,alpha_M
calibrate 131

fit@misc$alpha0 # alpha_0

fitted(fit) # Probabilities of winning and tying, in awkward form

predict(fit)
(check <- InverseBrat(fitted(fit))) # Probabilities of winning
qprob <- attr(fitted(fit), "probtie") # Probabilities of a tie
qprobmat <- InverseBrat(c(qprob), NCo = nrow(ties)) # Probabilities of a tie
check + t(check) + qprobmat # Should be 1s in the off-diagonals

calibrate Model Calibrations

Description
calibrate is a generic function used to produce calibrations from various model fitting functions.
The function invokes particular methods which depend on the class of the first argument.

Usage
calibrate(object, ...)

Arguments
object An object for which a calibration is desired.
... Additional arguments affecting the calibration produced. Usually the most im-
portant argument in ... is newdata which, for calibrate, contains new re-
sponse data, Y, say.

Details
Given a regression model with explanatory variables X and response Y, calibration involves es-
timating X from Y using the regression model. It can be loosely thought of as the opposite of
predict (which takes an X and returns a Y.)

Value
In general, given a new response Y, the explanatory variables X are returned. However, for con-
strained ordination models such as CQO and CAO models, it is usually not possible to return X,
so the latent variables are returned instead (they are linear combinations of the X). See the specific
calibrate methods functions to see what they return.

Note
This function was not called predictx because of the inability of constrained ordination models to
return X; they can only return the latent variable values (site scores) instead.

Author(s)
T. W. Yee
132 calibrate.qrrvglm

siteNos <- 1:2 # Calibrate these sites

cp1 <- calibrate(p1, new = data.frame(depvar(p1)[siteNos, ]), trace = TRUE)

# Graphically compare the actual site scores with their calibrated values
persp(p1, main = "Solid=actual, dashed=calibrated site scores",
label = TRUE, col = "blue", las = 1)
# Actual site scores:
abline(v = latvar(p1)[siteNos], lty = 1, col = 1:length(siteNos))
abline(v = cp1, lty = 2, col = 1:length(siteNos)) # Calibrated values

## End(Not run)

calibrate-methods Calibration for Constrained Regression Models

Description
calibrate is a generic function applied to QRR-VGLMs and RR-VGAMs etc.

Methods
object The object from which the calibration is performed.

calibrate.qrrvglm Calibration for CQO and CAO models

Description
Performs maximum likelihood calibration for constrained and unconstrained quadratic and additive
ordination models (CQO and CAO models are better known as QRR-VGLMs and RR-VGAMs
respectively).
calibrate.qrrvglm 133

Usage
calibrate.qrrvglm(object, newdata = NULL,
type = c("latvar", "predictors", "response", "vcov", "all3or4"),
initial.vals = NULL, ...)

Arguments
object The fitted CQO/CAO model.
newdata A data frame with new response data (usually new species data). The default is
to use the original data used to fit the model; however, the calibration may take
a long time to compute because the computations are expensive.
type What type of result is to be returned. The first are the calibrated latent variables
or site scores. This must be computed always. The "predictors" are the lin-
ear/quadratic or additive predictors evaluated at the calibrated latent variables
or site scores. The "response" are the fitted means evaluated at the calibrated
latent variables or site scores. The "vcov" are the estimated variance-covariance
matrices of the calibrated latent variables or site scores. The "all3or4" is for
all of them, i.e., all types. For CAO models, "vcov" is unavailable, so all 3 are
returned. For CQO models, "vcov" is available, so all 4 are returned.
initial.vals Initial values for the search. For rank-1 models, this should be a vector of length
nrow(newdata), and for rank 2 models this should be a two column matrix with
the number of rows equalling the number of rows in newdata. The default is a
grid defined by arguments in calibrate.qrrvglm.control.
... Arguments that are fed into calibrate.qrrvglm.control.

Details
Given a fitted regression CQO/CAO model, maximum likelihood calibration is theoretically easy
and elegant. However, the method assumes that all species are independent, which is not really true
in practice. More details and references are given in Yee (2012).
The function optim is used to search for the maximum likelihood solution. Good initial values are
needed, and calibrate.qrrvglm.control allows the user some control over the choice of these.

Value
The argument type determines what is returned. If type = "all3or4" then all the type values are
returned in a list, with the following components. Each component has length nrow(newdata).

latvar Calibrated latent variables or site scores.

predictors linear/quadratic or additive predictors. For example, for Poisson families, this
will be on a log scale, and for binomial families, this will be on a logit scale.
response Fitted values of the response, evaluated at the calibrated latent variables or site
scores.
vcov Estimated variance-covariance matrix of the calibrated latent variables or site
scores. Actually, these are stored in an array whose last dimension is nrow(newdata).
134 calibrate.qrrvglm

Warning
This function is computationally expensive. Setting trace = TRUE to get a running log is a good
idea.

Note
Despite the name of this function, CAO models are handled as well.

Author(s)
T. W. Yee

References
Yee, T. W. (2012) On constrained and unconstrained quadratic ordination. Manuscript in prepara-
tion.
ter Braak, C. J. F. 1995. Calibration. In: Data Analysis in Community and Landscape Ecology by
Jongman, R. H. G., ter Braak, C. J. F. and van Tongeren, O. F. R. (Eds.) Cambridge University
Press, Cambridge.

See Also
calibrate.qrrvglm.control, calibrate, cqo, cao.

Examples
## Not run:
hspider[,1:6] <- scale(hspider[, 1:6]) # Standardize the environmental variables
set.seed(123)
p1 <- cqo(cbind(Pardlugu, Pardmont, Pardnigr, Pardpull, Zoraspin) ~
WaterCon + BareSand + FallTwig +
CoveMoss + CoveHerb + ReflLux,
family = poissonff, data = hspider, Rank = 1,
I.toler = TRUE, Crow1positive = TRUE)

siteNos <- 3:4 # Calibrate these sites

cp1 <- calibrate(p1, new = data.frame(depvar(p1)[siteNos, ]), trace = TRUE)

## End(Not run)

## Not run:
# Graphically compare the actual site scores with their calibrated values
persp(p1, main = "Site scores: solid=actual, dashed=calibrated",
label = TRUE, col = "blue", las = 1)
# Actual site scores:
abline(v = latvar(p1)[siteNos], lty = 1, col = 1:length(siteNos))
abline(v = cp1, lty = 2, col = 1:length(siteNos)) # Calibrated values

## End(Not run)
calibrate.qrrvglm.control 135

calibrate.qrrvglm.control
Control Function for CQO/CAO Calibration

Description
Algorithmic constants and parameters for running calibrate.qrrvglm are set using this function.

Usage
calibrate.qrrvglm.control(object, trace = FALSE, Method.optim = "BFGS",
gridSize = if (Rank == 1) 9 else 5, varI.latvar = FALSE, ...)

Arguments
object The fitted CQO/CAO model. The user should ignore this argument.
trace Logical indicating if output should be produced for each iteration. It is a good
idea to set this argument to be TRUE since the computations are expensive.
Method.optim Character. Fed into the method argument of optim.
gridSize Numeric, recycled to length Rank. Controls the resolution of the grid used
for initial values. For each latent variable, an equally spaced grid of length
gridSize is cast from the smallest site score to the largest site score. Then the
likelihood function is evaluated on the grid, and the best fit is chosen as the initial
value. Thus increasing the value of gridSize increases the chance of obtaining
the global solution, however, the computing time increases proportionately.
varI.latvar Logical. For CQO objects only, this argument is fed into Coef.qrrvglm.
... Avoids an error message for extraneous arguments.

Details
Most CQO/CAO users will only need to make use of trace and gridSize. These arguments should
be used inside their call to calibrate.qrrvglm, not this function directly.

Value
A list which with the following components.

trace Numeric (even though the input can be logical).

gridSize Positive integer.
varI.latvar Logical.

Note
Despite the name of this function, CAO models are handled as well.
136 cao

Author(s)

T. W. Yee

References

Yee, T. W. (2013) On constrained and unconstrained quadratic ordination. Manuscript in prepara-

tion.

See Also

calibrate.qrrvglm, Coef.qrrvglm.

Examples
## Not run: hspider[, 1:6] <- scale(hspider[, 1:6]) # Needed when I.tol = TRUE
set.seed(123)
p1 <- cqo(cbind(Alopacce, Alopcune, Pardlugu, Pardnigr,
Pardpull, Trocterr, Zoraspin) ~
WaterCon + BareSand + FallTwig +
CoveMoss + CoveHerb + ReflLux,
family = poissonff, data = hspider, I.tol = TRUE)
sort(deviance(p1, history = TRUE)) # A history of all the iterations

siteNos <- 3:4 # Calibrate these sites

cp1 <- calibrate(p1, new = data.frame(depvar(p1)[siteNos, ]), trace = TRUE)

## End(Not run)

## Not run:
# Graphically compare the actual site scores with their calibrated values
persp(p1, main = "Site scores: solid=actual, dashed=calibrated",
label = TRUE, col = "blue", las = 1)
abline(v = latvar(p1)[siteNos], lty = 1,
col = 1:length(siteNos)) # Actual site scores
abline(v = cp1, lty = 2, col = 1:length(siteNos)) # Calibrated values

## End(Not run)

cao Fitting Constrained Additive Ordination (CAO)

Description

A constrained additive ordination (CAO) model is fitted using the reduced-rank vector generalized
additive model (RR-VGAM) framework.
cao 137

Usage
cao(formula, family, data = list(),
weights = NULL, subset = NULL, na.action = na.fail,
etastart = NULL, mustart = NULL, coefstart = NULL,
control = cao.control(...), offset = NULL,
method = "cao.fit", model = FALSE, x.arg = TRUE, y.arg = TRUE,
contrasts = NULL, constraints = NULL,
extra = NULL, qr.arg = FALSE, smart = TRUE, ...)

Arguments
formula a symbolic description of the model to be fit. The RHS of the formula is used
to construct the latent variables, upon which the smooths are applied. All the
variables in the formula are used for the construction of latent variables except
for those specified by the argument noRRR, which is itself a formula. The LHS
of the formula contains the response variables, which should be a matrix with
each column being a response (species).
family a function of class "vglmff" (see vglmff-class) describing what statistical
model is to be fitted. This is called a VGAM family function. See CommonVGAMffArguments
for general information about many types of arguments found in this type of
function. See cqo for a list of those presently implemented.
data an optional data frame containing the variables in the model. By default the vari-
ables are taken from environment(formula), typically the environment from
which cao is called.
weights an optional vector or matrix of (prior) weights to be used in the fitting process.
For cao, this argument currently should not be used.
subset an optional logical vector specifying a subset of observations to be used in the
fitting process.
na.action a function which indicates what should happen when the data contain NAs. The
default is set by the na.action setting of options, and is na.fail if that is
unset. The factory-fresh default is na.omit.
etastart starting values for the linear predictors. It is a M -column matrix. If M = 1 then
it may be a vector. For cao, this argument currently should not be used.
mustart starting values for the fitted values. It can be a vector or a matrix. Some family
functions do not make use of this argument. For cao, this argument currently
should not be used.
coefstart starting values for the coefficient vector. For cao, this argument currently should
not be used.
control a list of parameters for controlling the fitting process. See cao.control for
details.
offset a vector or M -column matrix of offset values. These are a priori known and are
added to the linear predictors during fitting. For cao, this argument currently
should not be used.
method the method to be used in fitting the model. The default (and presently only)
method cao.fit uses iteratively reweighted least squares (IRLS) within FOR-
TRAN code called from optim.
138 cao

model a logical value indicating whether the model frame should be assigned in the
model slot.
x.arg, y.arg logical values indicating whether the model matrix and response vector/matrix
used in the fitting process should be assigned in the x and y slots. Note the
model matrix is the linear model (LM) matrix.
contrasts an optional list. See the contrasts.arg of model.matrix.default.
constraints an optional list of constraint matrices. For cao, this argument currently should
not be used. The components of the list must be named with the term it cor-
responds to (and it must match in character format). Each constraint matrix
must have M rows, and be of full-column rank. By default, constraint matrices
are the M by M identity matrix unless arguments in the family function itself
override these values. If constraints is used it must contain all the terms; an
incomplete list is not accepted.
extra an optional list with any extra information that might be needed by the family
function. For cao, this argument currently should not be used.
qr.arg For cao, this argument currently should not be used.
smart logical value indicating whether smart prediction (smartpred) will be used.
... further arguments passed into cao.control.

Details
The arguments of cao are a mixture of those from vgam and cqo, but with some extras in cao.control.
Currently, not all of the arguments work properly.
CAO can be loosely be thought of as the result of fitting generalized additive models (GAMs) to
several responses (e.g., species) against a very small number of latent variables. Each latent variable
is a linear combination of the explanatory variables; the coefficients C (called C below) are called
constrained coefficients or canonical coefficients, and are interpreted as weights or loadings. The
C are estimated by maximum likelihood estimation. It is often a good idea to apply scale to each
explanatory variable first.
For each response (e.g., species), each latent variable is smoothed by a cubic smoothing spline,
thus CAO is data-driven. If each smooth were a quadratic then CAO would simplify to con-
strained quadratic ordination (CQO; formerly called canonical Gaussian ordination or CGO). If
each smooth were linear then CAO would simplify to constrained linear ordination (CLO). CLO
can theoretically be fitted with cao by specifying df1.nl=0, however it is more efficient to use
rrvglm.
Currently, only Rank=1 is implemented, and only noRRR = ~1 models are handled.
With binomial data, the default formula is

logit(P [Ys = 1]) = s = fs (), s = 1, 2, . . . , S

where x2 is a vector of environmental variables, and = C T x2 is a R-vector of latent variables.

The s is an additive predictor for species s, and it models the probabilities of presence as an
additive model on the logit scale. The matrix C is estimated from the data, as well as the smooth
functions fs . The argument noRRR = ~ 1 specifies that the vector x1 , defined for RR-VGLMs
and QRR-VGLMs, is simply a 1 for an intercept. Here, the intercept in the model is absorbed into
the functions. A cloglog link may be preferable over a logit link.
cao 139

With Poisson count data, the formula is

log(E[Ys ]) = s = fs ()

which models the mean response as an additive models on the log scale.
The fitted latent variables (site scores) are scaled to have unit variance. The concept of a tolerance
is undefined for CAO models, but the optimums and maximums are defined. The generic functions
Max and Opt should work for CAO objects, but note that if the maximum occurs at the boundary
then Max will return a NA. Inference for CAO models is currently undeveloped.

Value
An object of class "cao" (this may change to "rrvgam" in the future). Several generic functions
can be applied to the object, e.g., Coef, concoef, lvplot, summary.

Warning
CAO is very costly to compute. With version 0.7-8 it took 28 minutes on a fast machine. I hope to
look at ways of speeding things up in the future.
Use set.seed just prior to calling cao() to make your results reproducible. The reason for this
is finding the optimal CAO model presents a difficult optimization problem, partly because the
log-likelihood function contains many local solutions. To obtain the (global) solution the user is
advised to try many initial values. This can be done by setting Bestof some appropriate value (see
cao.control). Trying many initial values becomes progressively more important as the nonlinear
degrees of freedom of the smooths increase.
Currently the dispersion parameter for a gaussianff CAO model is estimated slightly differently
and may be slightly biassed downwards (usually a little too small).

Note
CAO models are computationally expensive, therefore setting trace = TRUE is a good idea, as well
as running it on a simple random sample of the data set instead.
Sometimes the IRLS algorithm does not converge within the FORTRAN code. This results in
warnings being issued. In particular, if an error code of 3 is issued, then this indicates the IRLS
algorithm has not converged. One possible remedy is to increase or decrease the nonlinear degrees
of freedom so that the curves become more or less flexible, respectively.

Author(s)
T. W. Yee

References
Yee, T. W. (2006) Constrained additive ordination. Ecology, 87, 203213.

See Also
cao.control, Coef.cao, cqo, latvar, Opt, Max, persp.cao, poissonff, binomialff, negbinomial,
gamma2, gaussianff, set.seed, gam, trapO.
140 cao.control

Examples
## Not run:
hspider[, 1:6] <- scale(hspider[, 1:6]) # Standardized environmental vars
set.seed(149) # For reproducible results
ap1 <- cao(cbind(Pardlugu, Pardmont, Pardnigr, Pardpull) ~
WaterCon + BareSand + FallTwig + CoveMoss + CoveHerb + ReflLux,
family = poissonff, data = hspider, Rank = 1,
df1.nl = c(Pardpull= 2.7, 2.5),
Bestof = 7, Crow1positive = FALSE)
sort(deviance(ap1, history = TRUE)) # A history of all the iterations

Coef(ap1)
concoef(ap1)

par(mfrow = c(2, 2))

plot(ap1) # All the curves are unimodal; some quite symmetric

par(mfrow = c(1, 1), las = 1)

index <- 1:ncol(depvar(ap1))
lvplot(ap1, lcol = index, pcol = index, y = TRUE)

trplot(ap1, label = TRUE, col = index)

abline(a = 0, b = 1, lty = 2)

trplot(ap1, label = TRUE, col = "blue", log = "xy", which.sp = c(1, 3))
abline(a = 0, b = 1, lty = 2)

persp(ap1, col = index, lwd = 2, label = TRUE)

abline(v = Opt(ap1), lty = 2, col = index)
abline(h = Max(ap1), lty = 2, col = index)

## End(Not run)

cao.control Control Function for RR-VGAMs (CAO)

Description
Algorithmic constants and parameters for a constrained additive ordination (CAO), by fitting a
reduced-rank vector generalized additive model (RR-VGAM), are set using this function. This is
the control function for cao.

Usage
cao.control(Rank = 1, all.knots = FALSE, criterion = "deviance", Cinit = NULL,
Crow1positive = TRUE, epsilon = 1.0e-05, Etamat.colmax = 10,
GradientFunction = FALSE, iKvector = 0.1, iShape = 0.1,
noRRR = ~ 1, Norrr = NA,
SmallNo = 5.0e-13, Use.Init.Poisson.QO = TRUE,
cao.control 141

Bestof = if (length(Cinit)) 1 else 10, maxitl = 10,

imethod = 1, bf.epsilon = 1.0e-7, bf.maxit = 10,
Maxit.optim = 250, optim.maxit = 20, sd.sitescores = 1.0,
sd.Cinit = 0.02, suppress.warnings = TRUE,
trace = TRUE, df1.nl = 2.5, df2.nl = 2.5,
spar1 = 0, spar2 = 0, ...)

Arguments
Rank The numerical rank R of the model, i.e., the number of latent variables. Cur-
rently only Rank = 1 is implemented.
all.knots Logical indicating if all distinct points of the smoothing variables are to be used
as knots. Assigning the value FALSE means fewer knots are chosen when the
number of distinct points is large, meaning less computational expense. See
vgam.control for details.
criterion Convergence criterion. Currently, only one is supported: the deviance is mini-
mized.
Cinit Optional initial C matrix which may speed up convergence.
Crow1positive Logical vector of length Rank (recycled if necessary): are the elements of the
first row of C positive? For example, if Rank is 4, then specifying Crow1positive = c(FALSE, TRUE)
will force C[1,1] and C[1,3] to be negative, and C[1,2] and C[1,4] to be positive.
epsilon Positive numeric. Used to test for convergence for GLMs fitted in FORTRAN.
Larger values mean a loosening of the convergence criterion.
Etamat.colmax Positive integer, no smaller than Rank. Controls the amount of memory used by
.Init.Poisson.QO(). It is the maximum number of columns allowed for the
pseudo-response and its weights. In general, the larger the value, the better the
initial value. Used only if Use.Init.Poisson.QO = TRUE.
GradientFunction
Logical. Whether optims argument gr is used or not, i.e., to compute gradient
values. Used only if FastAlgorithm is TRUE. Currently, this argument must be
set to FALSE.
iKvector, iShape
See qrrvglm.control.
noRRR Formula giving terms that are not to be included in the reduced-rank regression
(or formation of the latent variables). The default is to omit the intercept term
from the latent variables. Currently, only noRRR = ~ 1 is implemented.
Norrr Defunct. Please use noRRR. Use of Norrr will become an error soon.
SmallNo Positive numeric between .Machine$double.eps and 0.0001. Used to avoid
under- or over-flow in the IRLS algorithm.
Use.Init.Poisson.QO
Logical. If TRUE then the function .Init.Poisson.QO is used to obtain initial
values for the canonical coefficients C. If FALSE then random numbers are used
instead.
Bestof Integer. The best of Bestof models fitted is returned. This argument helps guard
against local solutions by (hopefully) finding the global solution from many fits.
142 cao.control

The argument works only when the function generates its own initial value for C,
i.e., when C are not passed in as initial values. The default is only a convenient
minimal number and users are urged to increase this value.
maxitl Positive integer. Maximum number of Newton-Raphson/Fisher-scoring/local-
scoring iterations allowed.
imethod See qrrvglm.control.
bf.epsilon Positive numeric. Tolerance used by the modified vector backfitting algorithm
for testing convergence.
bf.maxit Positive integer. Number of backfitting iterations allowed in the compiled code.
Maxit.optim Positive integer. Number of iterations given to the function optim at each of the
optim.maxit iterations.
optim.maxit Positive integer. Number of times optim is invoked.
sd.sitescores Numeric. Standard deviation of the initial values of the site scores, which are
generated from a normal distribution. Used when Use.Init.Poisson.QO is
FALSE.
sd.Cinit Standard deviation of the initial values for the elements of C. These are normally
distributed with mean zero. This argument is used only if Use.Init.Poisson.QO = FALSE.
suppress.warnings
Logical. Suppress warnings?
trace Logical indicating if output should be produced for each iteration. Having the
value TRUE is a good idea for large data sets.
df1.nl, df2.nl Numeric and non-negative, recycled to length S. Nonlinear degrees of freedom
for smooths of the first and second latent variables. A value of 0 means the
smooth is linear. Roughly, a value between 1.0 and 2.0 often has the approximate
flexibility of a quadratic. The user should not assign too large a value to this
argument, e.g., the value 4.0 is probably too high. The argument df1.nl is
ignored if spar1 is assigned a positive value or values. Ditto for df2.nl.
spar1, spar2 Numeric and non-negative, recycled to length S. Smoothing parameters of the
smooths of the first and second latent variables. The larger the value, the more
smooth (less wiggly) the fitted curves. These arguments are an alternative to
specifying df1.nl and df2.nl. A value 0 (the default) for spar1 means that
df1.nl is used. Ditto for spar2. The values are on a scaled version of the latent
variables. See Green and Silverman (1994) for more information.
... Ignored at present.

Details
Many of these arguments are identical to qrrvglm.control. Here, R is the Rank, M is the number
of additive predictors, and S is the number of responses (species). Thus M = S for binomial and
Poisson responses, and M = 2S for the negative binomial and 2-parameter gamma distributions.
Allowing the smooths too much flexibility means the CAO optimization problem becomes more
difficult to solve. This is because the number of local solutions increases as the nonlinearity of the
smooths increases. In situations of high nonlinearity, many initial values should be used, so that
Bestof should be assigned a larger value. In general, there should be a reasonable value of df1.nl
somewhere between 0 and about 3 for most data sets.
cao.control 143

Value
A list with the components corresponding to its arguments, after some basic error checking.

Note
The argument df1.nl can be inputted in the format c(spp1 = 2, spp2 = 3, 2.5), say, meaning
the default value is 2.5, but two species have alternative values.
If spar1 = 0 and df1.nl = 0 then this represents fitting linear functions (CLO). Currently, this is
handled in the awkward manner of setting df1.nl to be a small positive value, so that the smooth
is almost linear but not quite. A proper fix to this special case should done in the short future.

Author(s)
T. W. Yee

References
Yee, T. W. (2006) Constrained additive ordination. Ecology, 87, 203213.
Green, P. J. and Silverman, B. W. (1994) Nonparametric Regression and Generalized Linear Mod-
els: A Roughness Penalty Approach, London: Chapman & Hall.

par(mfrow = c(1, 1), las = 1)

index <- 1:ncol(depvar(ap1)) # lvplot is jagged because only 28 sites
lvplot(ap1, lcol = index, pcol = index, y = TRUE)

trplot(ap1, label = TRUE, col = index)

abline(a = 0, b = 1, lty = 2)
144 Card

persp(ap1, label = TRUE, col = 1:4)

## End(Not run)

Card Cardioid Distribution

Description
Density, distribution function, quantile function and random generation for the cardioid distribution.

Usage
dcard(x, mu, rho, log = FALSE)
pcard(q, mu, rho, lower.tail = TRUE, log.p = FALSE)
qcard(p, mu, rho, tolerance = 1e-07, maxits = 500,
lower.tail = TRUE, log.p = FALSE)
rcard(n, mu, rho, ...)

Arguments
x, q vector of quantiles.
p vector of probabilities.
n number of observations. Same as in runif.
mu, rho See cardioid for more information.
tolerance, maxits, ...
The first two are control parameters for the algorithm used to solve for the roots
of a nonlinear system of equations; tolerance controls for the accuracy and
maxits is the maximum number of iterations. rcard calls qcard so the ... can
be used to vary the two arguments.
log Logical. If log = TRUE then the logarithm of the density is returned.
lower.tail, log.p
Same meaning as in pnorm or qnorm.

Details
See cardioid, the VGAM family function for estimating the two parameters by maximum likeli-
hood estimation, for the formula of the probability density function and other details.

Value
dcard gives the density, pcard gives the distribution function, qcard gives the quantile function,
and rcard generates random deviates.

Note
Convergence problems might occur with rcard.
cardioid 145

Author(s)
Thomas W. Yee and Kai Huang

probs <- seq(0.1, 0.9, by = 0.1)

Q <- qcard(probs, mu, rho)
lines(Q, dcard(Q, mu, rho), col = "purple", lty = 3, type = "h")
lines(Q, pcard(Q, mu, rho), col = "purple", lty = 3, type = "h")
abline(h = c(0,probs, 1), v = c(0, 2*pi), col = "purple", lty = 3)
max(abs(pcard(Q, mu, rho) - probs)) # Should be 0

## End(Not run)

cardioid Cardioid Distribution Family Function

Description
Estimates the two parameters of the cardioid distribution by maximum likelihood estimation.

Usage
cardioid(lmu = extlogit(min = 0, max = 2*pi),
lrho = extlogit(min = -0.5, max = 0.5),
imu = NULL, irho = 0.3, nsimEIM = 100, zero = NULL)

Arguments
lmu, lrho Parameter link functions applied to the and parameters, respectively. See
Links for more choices.
imu, irho Initial values. A NULL means an initial value is chosen internally. See CommonVGAMffArguments
for more information.
nsimEIM, zero See CommonVGAMffArguments for more information.
146 cardioid

Details
The two-parameter cardioid distribution has a density that can be written as
1
f (y; , ) = (1 + 2 cos(y ))
2
where 0 < y < 2, 0 < < 2, and 0.5 < < 0.5 is the concentration parameter. The default
link functions enforce the range constraints of the parameters.
For positive the distribution is unimodal and symmetric about . The mean of Y (which make up
the fitted values) is + (/)((2 ) sin(2 ) + cos(2 ) sin() cos()).

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, rrvglm and vgam.

Warning
Numerically, this distribution can be difficult to fit because of a log-likelihood having multiple
maximums. The user is therefore encouraged to try different starting values, i.e., make use of imu
and irho.

Note
Fisher scoring using simulation is used.

Author(s)
T. W. Yee

References
Jammalamadaka, S. R. and SenGupta, A. (2001) Topics in Circular Statistics, Singapore: World
Scientific.

See Also
rcard, extlogit, vonmises.
CircStats and circular currently have a lot more R functions for circular data than the VGAM
package.

Examples
## Not run:
cdata <- data.frame(y = rcard(n = 1000, mu = 4, rho = 0.45))
fit <- vglm(y ~ 1, cardioid, data = cdata, trace = TRUE)
coef(fit, matrix=TRUE)
Coef(fit)
c(with(cdata, mean(y)), head(fitted(fit), 1))
summary(fit)
cauchit 147

## End(Not run)

cauchit Cauchit Link Function

Description
Computes the cauchit (tangent) link transformation, including its inverse and the first two deriva-
tives.

Usage
cauchit(theta, bvalue = .Machine$double.eps,
inverse = FALSE, deriv = 0, short = TRUE, tag = FALSE)

Arguments
theta Numeric or character. See below for further details.
bvalue See Links.
inverse, deriv, short, tag
Details at Links.

Details
This link function is an alternative link function for parameters that lie in the unit interval. This type
of link bears the same relation to the Cauchy distribution as the probit link bears to the Gaussian.
One characteristic of this link function is that the tail is heavier relative to the other links (see
examples below).
Numerical values of theta close to 0 or 1 or out of range result in Inf, -Inf, NA or NaN.

Value
For deriv = 0, the tangent of theta, i.e., tan(pi * (theta-0.5)) when inverse = FALSE, and
if inverse = TRUE then 0.5 + atan(theta)/pi.
For deriv = 1, then the function returns d eta / d theta as a function of theta if inverse = FALSE,
else if inverse = TRUE then it returns the reciprocal.

Note
Numerical instability may occur when theta is close to 1 or 0. One way of overcoming this is to
use bvalue.
As mentioned above, in terms of the threshold approach with cumulative probabilities for an ordinal
response this link function corresponds to the Cauchy distribution (see cauchy1).

Author(s)
Thomas W. Yee
148 cauchit

References
McCullagh, P. and Nelder, J. A. (1989) Generalized Linear Models, 2nd ed. London: Chapman &
Hall.

See Also
logit, probit, cloglog, loge, cauchy, cauchy1.

Examples
p <- seq(0.01, 0.99, by=0.01)
cauchit(p)
max(abs(cauchit(cauchit(p), inverse = TRUE) - p)) # Should be 0

p <- c(seq(-0.02, 0.02, by=0.01), seq(0.97, 1.02, by = 0.01))

cauchit(p) # Has no NAs

## Not run:
par(mfrow = c(2, 2), lwd = (mylwd <- 2))
y <- seq(-4, 4, length = 100)
p <- seq(0.01, 0.99, by = 0.01)

for (d in 0:1) {
matplot(p, cbind(logit(p, deriv = d), probit(p, deriv = d)),
type = "n", col = "purple", ylab = "transformation",
las = 1, main = if (d == 0) "Some probability link functions"
else "First derivative")
lines(p, logit(p, deriv = d), col = "limegreen")
lines(p, probit(p, deriv = d), col = "purple")
lines(p, cloglog(p, deriv = d), col = "chocolate")
lines(p, cauchit(p, deriv = d), col = "tan")
if (d == 0) {
abline(v = 0.5, h = 0, lty = "dashed")
legend(0, 4.5, c("logit", "probit", "cloglog", "cauchit"), lwd = mylwd,
col = c("limegreen","purple","chocolate", "tan"))
} else
abline(v = 0.5, lty = "dashed")
}

for (d in 0) {
matplot(y, cbind( logit(y, deriv = d, inverse = TRUE),
probit(y, deriv = d, inverse = TRUE)),
type = "n", col = "purple", xlab = "transformation", ylab = "p",
main = if (d == 0) "Some inverse probability link functions"
else "First derivative", las=1)
lines(y, logit(y, deriv = d, inverse = TRUE), col = "limegreen")
lines(y, probit(y, deriv = d, inverse = TRUE), col = "purple")
lines(y, cloglog(y, deriv = d, inverse = TRUE), col = "chocolate")
lines(y, cauchit(y, deriv = d, inverse = TRUE), col = "tan")
if (d == 0) {
abline(h = 0.5, v = 0, lty = "dashed")
legend(-4, 1, c("logit", "probit", "cloglog", "cauchit"), lwd = mylwd,
cauchy 149

col = c("limegreen", "purple", "chocolate", "tan"))

}
}
par(lwd = 1)

## End(Not run)

cauchy Cauchy Distribution Family Function

Description
Estimates either the location parameter or both the location and scale parameters of the Cauchy
distribution by maximum likelihood estimation.

Usage
cauchy(llocation = "identitylink", lscale = "loge",
ilocation = NULL, iscale = NULL,
iprobs = seq(0.2, 0.8, by = 0.2),
imethod = 1, nsimEIM = NULL, zero = "scale")
cauchy1(scale.arg = 1, llocation = "identitylink",
ilocation = NULL, imethod = 1)

Arguments
llocation, lscale
Parameter link functions for the location parameter a and the scale parameter b.
See Links for more choices.
ilocation, iscale
Optional initial value for a and b. By default, an initial value is chosen internally
for each.
imethod Integer, either 1 or 2 or 3. Initial method, three algorithms are implemented. The
user should try all possible values to help avoid converging to a local solution.
Also, choose the another value if convergence fails, or use ilocation and/or
iscale.
iprobs Probabilities used to find the respective sample quantiles; used to compute iscale.
zero, nsimEIM See CommonVGAMffArguments for information.
scale.arg Known (positive) scale parameter, called b below.

Details
The Cauchy distribution has density function
1
f (y; a, b) = b[1 + ((y a)/b)2 ]

150 cauchy

where y and a are real and finite, and b > 0. The distribution is symmetric about a and has a heavy
tail. Its median and mode are a, but the mean does not exist. The fitted values are the estimates of
a. Fisher scoring is the default but if nsimEIM is specified then Fisher scoring with simulation is
used.
If the scale parameter is known (cauchy1) then there may be multiple local maximum likelihood
solutions for the location parameter. However, if both location and scale parameters are to be
estimated (cauchy) then there is a unique maximum likelihood solution provided n > 2 and less
than half the data are located at any one point.

Value

An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, and vgam.

Warning

It is well-known that the Cauchy distribution may have local maximums in its likelihood function;
make full use of imethod, ilocation, iscale etc.

Note

Good initial values are needed. By default these VGAM family functions search for a starting value
for a on a grid. It also pays to select a wide range of initial values via the ilocation and/or iscale
and/or imethod arguments.

Author(s)

T. W. Yee

References

Forbes, C., Evans, M., Hastings, N. and Peacock, B. (2011) Statistical Distributions, Hoboken, NJ,
USA: John Wiley and Sons, Fourth edition.
Barnett, V. D. (1966) Evaluation of the maximum-likehood estimator where the likelihood equation
has multiple roots. Biometrika, 53, 151165.
Copas, J. B. (1975) On the unimodality of the likelihood for the Cauchy distribution. Biometrika,
62, 701704.
Efron, B. and Hinkley, D. V. (1978) Assessing the accuracy of the maximum likelihood estimator:
Observed versus expected Fisher information. Biometrika, 65, 457481.

See Also

Cauchy, cauchit, studentt, simulate.vlm.

cdf.lmscreg 151

Examples
# Both location and scale parameters unknown
set.seed(123)
cdata <- data.frame(x2 = runif(nn <- 1000))
cdata <- transform(cdata, loc = exp(1 + 0.5 * x2), scale = exp(1))
cdata <- transform(cdata, y2 = rcauchy(nn, loc, scale))
fit2 <- vglm(y2 ~ x2, cauchy(lloc = "loge"), data = cdata, trace = TRUE)
coef(fit2, matrix = TRUE)
head(fitted(fit2)) # Location estimates
summary(fit2)

# Location parameter unknown

cdata <- transform(cdata, scale1 = 0.4)
cdata <- transform(cdata, y1 = rcauchy(nn, loc, scale1))
fit1 <- vglm(y1 ~ x2, cauchy1(scale = 0.4), data = cdata, trace = TRUE)
coef(fit1, matrix = TRUE)

cdf.lmscreg Cumulative Distribution Function for LMS Quantile Regression

Description
Computes the cumulative distribution function (CDF) for observations, based on a LMS quantile
regression.

Usage
cdf.lmscreg(object, newdata = NULL, ...)

Arguments
object A VGAM quantile regression model, i.e., an object produced by modelling func-
tions such as vglm and vgam with a family function beginning with "lms.".
newdata Data frame where the predictions are to be made. If missing, the original data is
used.
... Parameters which are passed into functions such as cdf.lms.yjn.

Details
The CDFs returned here are values lying in [0,1] giving the relative probabilities associated with
the quantiles newdata. For example, a value near 0.75 means it is close to the upper quartile of the
distribution.

Value
A vector of CDF values lying in [0,1].
152 cens.gumbel

Note

The data are treated like quantiles, and the percentiles are returned. The opposite is performed by
qtplot.lmscreg.
The CDF values of the model have been placed in @post$cdf when the model was fitted.

Author(s)

Thomas W. Yee

References

Yee, T. W. (2004) Quantile regression via vector generalized additive models. Statistics in Medicine,
23, 22952315.

See Also

deplot.lmscreg, qtplot.lmscreg, lms.bcn, lms.bcg, lms.yjn.

Examples

fit <- vgam(BMI ~ s(age, df=c(4, 2)), lms.bcn(zero = 1), data = bmi.nz)
head(fit@post$cdf)
head(cdf(fit)) # Same
head(depvar(fit))
head(fitted(fit))

cdf(fit, data.frame(age = c(31.5, 39), BMI = c(28.4, 24)))

cens.gumbel Censored Gumbel Distribution

Description

Maximum likelihood estimation of the 2-parameter Gumbel distribution when there are censored
observations. A matrix response is not allowed.

Usage

cens.gumbel(llocation = "identitylink", lscale = "loge", iscale = NULL,

mean = TRUE, percentiles = NULL, zero = "scale")
cens.gumbel 153

Arguments
llocation, lscale
Character. Parameter link functions for the location and (positive) scale param-
eters. See Links for more choices.
iscale Numeric and positive. Initial value for scale. Recycled to the appropriate length.
In general, a larger value is better than a smaller value. The default is to choose
the value internally.
mean Logical. Return the mean? If TRUE then the mean is returned, otherwise per-
centiles given by the percentiles argument.
percentiles Numeric with values between 0 and 100. If mean=FALSE then the fitted values
are percentiles which must be specified by this argument.
zero An integer-valued vector specifying which linear/additive predictors are mod-
elled as intercepts only. The value (possibly values) must be from the set {1,2}
corresponding respectively to location and scale. If zero=NULL then all lin-
ear/additive predictors are modelled as a linear combination of the explanatory
variables. The default is to fit the shape parameter as an intercept only.

Details
This VGAM family function is like gumbel but handles observations that are left-censored (so
that the true value would be less than the observed value) else right-censored (so that the true
value would be greater than the observed value). To indicate which type of censoring, input
extra = list(leftcensored = vec1, rightcensored = vec2) where vec1 and vec2 are
logical vectors the same length as the response. If the two components of this list are missing then
the logical values are taken to be FALSE. The fitted object has these two components stored in the
extra slot.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm and vgam.

Warning
Numerical problems may occur if the amount of censoring is excessive.

Note
See gumbel for details about the Gumbel distribution. The initial values are based on assuming all
uncensored observations, therefore could be improved upon.

Author(s)
T. W. Yee

References
Coles, S. (2001) An Introduction to Statistical Modeling of Extreme Values. London: Springer-
Verlag.
154 cens.normal

See Also
gumbel, gumbelff, rgumbel, guplot, gev, venice.

Examples
# Example 1
ystar <- venice[["r1"]] # Use the first order statistic as the response
nn <- length(ystar)
L <- runif(nn, 100, 104) # Lower censoring points
U <- runif(nn, 130, 135) # Upper censoring points
y <- pmax(L, ystar) # Left censored
y <- pmin(U, y) # Right censored
extra <- list(leftcensored = ystar < L, rightcensored = ystar > U)
fit <- vglm(y ~ scale(year), data = venice, trace = TRUE, extra = extra,
cens.gumbel(mean = FALSE, perc = c(5, 25, 50, 75, 95)))
coef(fit, matrix = TRUE)
head(fitted(fit))
fit@extra

# Example 2: simulated data

nn <- 1000
ystar <- rgumbel(nn, loc = 1, scale = exp(0.5)) # The uncensored data
L <- runif(nn, -1, 1) # Lower censoring points
U <- runif(nn, 2, 5) # Upper censoring points
y <- pmax(L, ystar) # Left censored
y <- pmin(U, y) # Right censored
## Not run: par(mfrow = c(1, 2)); hist(ystar); hist(y);
extra <- list(leftcensored = ystar < L, rightcensored = ystar > U)
fit <- vglm(y ~ 1, trace = TRUE, extra = extra, cens.gumbel)
coef(fit, matrix = TRUE)

cens.normal Censored Normal Distribution

Description
Maximum likelihood estimation for the normal distribution with left and right censoring.

Usage
cens.normal(lmu = "identitylink", lsd = "loge", imethod = 1, zero = "sd")

Arguments
lmu, lsd Parameter link functions applied to the mean and standard deviation parame-
ters. See Links for more choices. The standard deviation is a positive quantity,
therefore a log link is the default.
imethod Initialization method. Either 1 or 2, this specifies two methods for obtaining
initial values for the parameters.
cens.normal 155

zero A vector, e.g., containing the value 1 or 2; if so, the mean or standard deviation
respectively are modelled as an intercept only. Setting zero = NULL means both
linear/additive predictors are modelled as functions of the explanatory variables.
See CommonVGAMffArguments for more information.

Details
This function is like uninormal but handles observations that are left-censored (so that the true
value would be less than the observed value) else right-censored (so that the true value would be
greater than the observed value). To indicate which type of censoring, input extra = list(leftcensored = vec1, rightce
where vec1 and vec2 are logical vectors the same length as the response. If the two components
of this list are missing then the logical values are taken to be FALSE. The fitted object has these two
components stored in the extra slot.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, and vgam.

Note
This function, which is an alternative to tobit, cannot handle a matrix response and uses different
working weights. If there are no censored observations then uninormal is recommended instead.

Author(s)
T. W. Yee

See Also
tobit, uninormal, double.cens.normal.

Examples
## Not run:
cdata <- data.frame(x2 = runif(nn <- 1000)) # ystar are true values
cdata <- transform(cdata, ystar = rnorm(nn, m = 100 + 15 * x2, sd = exp(3)))
with(cdata, hist(ystar))
cdata <- transform(cdata, L = runif(nn, 80, 90), # Lower censoring points
U = runif(nn, 130, 140)) # Upper censoring points
cdata <- transform(cdata, y = pmax(L, ystar)) # Left censored
cdata <- transform(cdata, y = pmin(U, y)) # Right censored
with(cdata, hist(y))
Extra <- list(leftcensored = with(cdata, ystar < L),
rightcensored = with(cdata, ystar > U))
fit1 <- vglm(y ~ x2, cens.normal, data = cdata, crit = "c", extra = Extra)
fit2 <- vglm(y ~ x2, tobit(Lower = with(cdata, L), Upper = with(cdata, U)),
data = cdata, crit = "c", trace = TRUE)
coef(fit1, matrix = TRUE)
max(abs(coef(fit1, matrix = TRUE) -
coef(fit2, matrix = TRUE))) # Should be 0
156 cens.poisson

names(fit1@extra)

## End(Not run)

cens.poisson Censored Poisson Family Function

Description
Family function for a censored Poisson response.

Usage
cens.poisson(link = "loge", imu = NULL)

Arguments
link Link function applied to the mean; see Links for more choices.
imu Optional initial value; see CommonVGAMffArguments for more information.

Details
Often a table of Poisson counts has an entry J+ meaning J. This family function is similar to
poissonff but handles such censored data. The input requires SurvS4. Only a univariate response
is allowed. The Newton-Raphson algorithm is used.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm and vgam.

Warning
As the response is discrete, care is required with Surv, especially with "interval" censored data
because of the (start, end] format. See the examples below. The examples have y < L as left
censored and y >= U (formatted as U+) as right censored observations, therefore L <= y < U is
for uncensored and/or interval censored observations. Consequently the input must be tweaked to
conform to the (start, end] format.

Note
The function poissonff should be used when there are no censored observations. Also, NAs are not
permitted with SurvS4, nor is type = "counting".

Author(s)
Thomas W. Yee
cens.poisson 157

References
See survival for background.

# Example 2: left censored data

L <- 15
cdata <- transform(cdata,
cY = pmax(L, y),
lcensored = y < L) # Note y < L, not cY == L or y <= L
cdata <- transform(cdata, status = ifelse(lcensored, 0, 1))
with(cdata, table(cY))
with(cdata, table(lcensored))
with(cdata, table(ii <- print(SurvS4(cY, status, type = "left")))) # Check
fit <- vglm(SurvS4(cY, status, type = "left") ~ 1, cens.poisson,
data = cdata, trace = TRUE)
coef(fit, matrix = TRUE)

# Example 3: interval censored data

cdata <- transform(cdata, Lvec = rep(L, len = N),
Uvec = rep(U, len = N))
cdata <-
transform(cdata,
icensored = Lvec <= y & y < Uvec) # Not lcensored or rcensored
with(cdata, table(icensored))
cdata <- transform(cdata, status = rep(3, N)) # 3 means interval censored
cdata <-
transform(cdata,
status = ifelse(rcensored, 0, status)) # 0 means right censored
cdata <-
transform(cdata,
status = ifelse(lcensored, 2, status)) # 2 means left censored
# Have to adjust Lvec and Uvec because of the (start, end] format:
158 cfibrosis

cdata$Lvec[with(cdata, icensored)] <- cdata$Lvec[with(cdata, icensored)] - 1

cdata$Uvec[with(cdata, icensored)] <- cdata$Uvec[with(cdata, icensored)] - 1
# Unchanged:
cdata$Lvec[with(cdata, lcensored)] <- cdata$Lvec[with(cdata, lcensored)]
cdata$Lvec[with(cdata, rcensored)] <- cdata$Uvec[with(cdata, rcensored)]
with(cdata,
table(ii <- print(SurvS4(Lvec, Uvec, status, type = "interval")))) # Check

fit <- vglm(SurvS4(Lvec, Uvec, status, type = "interval") ~ 1,

cens.poisson, data = cdata, trace = TRUE)
coef(fit, matrix = TRUE)
table(print(depvar(fit))) # Another check

# Example 4: Add in some uncensored observations

index <- (1:N)[with(cdata, icensored)]
index <- head(index, 4)
cdata$status[index] <- 1 # actual or uncensored value
cdata$Lvec[index] <- cdata$y[index]
with(cdata, table(ii <- print(SurvS4(Lvec, Uvec, status,
type = "interval")))) # Check

fit <- vglm(SurvS4(Lvec, Uvec, status, type = "interval") ~ 1,

cens.poisson, data = cdata, trace = TRUE, crit = "c")
coef(fit, matrix = TRUE)
table(print(depvar(fit))) # Another check

cfibrosis Cystic Fibrosis Data

Description
This data frame concerns families data and cystic fibrosis.

Usage
data(cfibrosis)

Format
A data frame with 24 rows on the following 4 variables.

siblings, affected, ascertained, families Over ascertained families, the kth ascertained family has
sk siblings of whom rk are affected and ak are ascertained.

Details
The data set allows a classical segregation analysis to be peformed. In particular, to test Mendelian
segregation ratios in nuclear family data. The likelihood has similarities with seq2binomial.
cgo 159

Source
The data is originally from Crow (1965) and appears as Table 2.3 of Lange (2002).
Crow, J. F. (1965) Problems of ascertainment in the analysis of family data. Epidemiology and
Genetics of Chronic Disease. Public Health Service Publication 1163, Neel J. V., Shaw M. W.,
Schull W. J., editors, Department of Health, Education, and Welfare, Washington, DC, USA.
Lange, K. (2002) Mathematical and Statistical Methods for Genetic Analysis. Second Edition.
Springer-Verlag: New York, USA.

Examples
cfibrosis
summary(cfibrosis)

cgo Redirects the user to cqo

Description
Redirects the user to the function cqo.

Usage
cgo(...)

Arguments
... Ignored.

Details
The former function cgo has been renamed cqo because CGO (for canonical Gaussian ordination)
is a confusing and inaccurate name. CQO (for constrained quadratic ordination) is better. This new
nomenclature described in Yee (2006).

Value
Nothing is returned; an error message is issued.

Warning
The code, therefore, in Yee (2004) will not run without changing the "g" to a "q".

Author(s)
Thomas W. Yee
160 chest.nz

chest.nz Chest Pain in NZ Adults Data

Description
Presence/absence of chest pain in 10186 New Zealand adults.

Usage
data(chest.nz)

Format
A data frame with 73 rows and the following 5 variables.

age a numeric vector; age (years).

nolnor a numeric vector of counts; no pain on LHS or RHS.
nolr a numeric vector of counts; no pain on LHS but pain on RHS.
lnor a numeric vector of counts; no pain on RHS but pain on LHS.
lr a numeric vector of counts; pain on LHS and RHS of chest.

Details
Each adult was asked their age and whether they experienced any pain or discomfort in their chest
over the last six months. If yes, they indicated whether it was on their LHS and/or RHS of their
chest.
chinese.nz 161

Source
MacMahon, S., Norton, R., Jackson, R., Mackie, M. J., Cheng, A., Vander Hoorn, S., Milne, A.,
McCulloch, A. (1995) Fletcher Challenge-University of Auckland Heart & Health Study: design
and baseline findings. New Zealand Medical Journal, 108, 499502.

Examples
## Not run:
fit <- vgam(cbind(nolnor, nolr, lnor, lr) ~ s(age, c(4, 3)),
binom2.or(exchan = TRUE, zero = NULL), data = chest.nz)
coef(fit, matrix = TRUE)

## End(Not run)
## Not run: plot(fit, which.cf = 2, se = TRUE)

chinese.nz Chinese Population in New Zealand 18672001 Data

Description
The Chinese population in New Zealand from 1867 to 2001, along with the whole of the New
Zealand population.

Usage
data(chinese.nz)

Format
A data frame with 27 observations on the following 4 variables.
year Year.
male Number of Chinese males.
female Number of Chinese females.
nz Total number in the New Zealand population.

Details
Historically, there was a large exodus of Chinese from the Guangdong region starting in the mid-
1800s to the gold fields of South Island of New Zealand, California, and southern Australia, etc.
Discrimination then meant that only men were allowed entry, to hinder permanent settlement. In
the case of New Zealand, the government relaxed its immigration laws after WWII to allow wives
of Chinese already in NZ to join them because China had been among the Allied powers. Gradual
relaxation in the immigration and an influx during the 1980s meant the Chinese population became
increasingly demographically normal over time.
The NZ total for the years 1867 and 1871 exclude the Maori population. Three modifications have
been made to the female column to make the data internally consistent with the original table.
162 chisq

References
Page 6 of Aliens At My Table: Asians as New Zealanders See Them by M. Ip and N. Murphy, (2005).
Penguin Books. Auckland, New Zealand.

Examples
## Not run: par(mfrow = c(1, 2))
plot(female / (male + female) ~ year, chinese.nz, type = "b",
ylab = "Proportion", col = "blue", las = 1,
cex = 0.015 * sqrt(male + female),
# cex = 0.10 * sqrt((male + female)^1.5 / sqrt(female) / sqrt(male)),
main = "Proportion of NZ Chinese that are female")
abline(h = 0.5, lty = "dashed", col = "gray")

fit1.cnz <- vglm(cbind(female, male) ~ year, binomialff,

data = chinese.nz)
fit2.cnz <- vglm(cbind(female, male) ~ sm.poly(year, 2), binomialff,
data = chinese.nz)
fit4.cnz <- vglm(cbind(female, male) ~ sm.bs(year, 5), binomialff,
data = chinese.nz)

lines(fitted(fit1.cnz) ~ year, chinese.nz, col = "purple", lty = 1)

lines(fitted(fit2.cnz) ~ year, chinese.nz, col = "green", lty = 2)
lines(fitted(fit4.cnz) ~ year, chinese.nz, col = "orange", lwd = 2, lty = 1)
legend("bottomright", col = c("purple", "green", "orange"),
lty = c(1, 2, 1), leg = c("linear", "quadratic", "B-spline"))

plot(100*(male+female)/nz ~ year, chinese.nz, type = "b", ylab = "Percent",

ylim = c(0, max(100*(male+female)/nz)), col = "blue", las = 1,
main = "Percent of NZers that are Chinese")
abline(h = 0, lty = "dashed", col = "gray")
## End(Not run)

chisq Chi-squared Distribution

Description
Maximum likelihood estimation of the degrees of freedom for a chi-squared distribution.

Usage
chisq(link = "loge", zero = NULL)

Arguments
link, zero See CommonVGAMffArguments for information.
clo 163

Details
The degrees of freedom is treated as a parameter to be estimated, and as real (not integer). Being
positive, a log link is used by default. Fisher scoring is used.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, and vgam.

Note
Multiple responses are permitted. There may be convergence problems if the degrees of freedom is
very large or close to zero.

Author(s)
T. W. Yee

References
Forbes, C., Evans, M., Hastings, N. and Peacock, B. (2011) Statistical Distributions, Hoboken, NJ,
USA: John Wiley and Sons, Fourth edition.

clo Redirects the User to rrvglm()

Description
Redirects the user to the function rrvglm.

Usage
clo(...)

Arguments
... Ignored.
164 cloglog

Details
CLO stands for constrained linear ordination, and is fitted with a statistical class of models called
reduced-rank vector generalized linear models (RR-VGLMs). It allows for generalized reduced-
rank regression in that response types such as Poisson counts and presence/absence data can be
handled.
Currently in the VGAM package, rrvglm is used to fit RR-VGLMs. However, the Authors opinion
is that linear responses to a latent variable (composite environmental gradient) is not as common as
unimodal responses, therefore cqo is often more appropriate.
The new CLO/CQO/CAO nomenclature described in Yee (2006).

Value
Nothing is returned; an error message is issued.

Author(s)
Thomas W. Yee

References
Yee, T. W. (2006) Constrained additive ordination. Ecology, 87, 203213.
Yee, T. W. and Hastie, T. J. (2003) Reduced-rank vector generalized linear models. Statistical
Modelling, 3, 1541.

cloglog Complementary Log-log Link Function

Description
Computes the complementary log-log transformation, including its inverse and the first two deriva-
tives.
cloglog 165

Usage
cloglog(theta, bvalue = NULL, inverse = FALSE, deriv = 0,
short = TRUE, tag = FALSE)

Arguments
theta Numeric or character. See below for further details.
bvalue See Links for general information about links.
inverse, deriv, short, tag
Details at Links.

Details
The complementary log-log link function is commonly used for parameters that lie in the unit
interval. Numerical values of theta close to 0 or 1 or out of range result in Inf, -Inf, NA or NaN.

Value
For deriv = 0, the complimentary log-log of theta, i.e., log(-log(1 - theta)) when inverse = FALSE,
and if inverse = TRUE then 1-exp(-exp(theta)).
For deriv = 1, then the function returns d eta / d theta as a function of theta if inverse = FALSE,
else if inverse = TRUE then it returns the reciprocal.
Here, all logarithms are natural logarithms, i.e., to base e.

Note
Numerical instability may occur when theta is close to 1 or 0. One way of overcoming this is to
use bvalue.
Changing 1s to 0s and 0s to 1s in the response means that effectively a loglog link is fitted. That is,
tranform y by 1 y. Thats why only one of cloglog and loglog is written.
With constrained ordination (e.g., cqo and cao) used with binomialff, a complementary log-log
link function is preferred over the default logit link, for a good reason. See the example below.
In terms of the threshold approach with cumulative probabilities for an ordinal response this link
function corresponds to the extreme value distribution.

Author(s)
Thomas W. Yee

References
McCullagh, P. and Nelder, J. A. (1989) Generalized Linear Models, 2nd ed. London: Chapman &
Hall.

See Also
Links, logitoffsetlink, logit, probit, cauchit.
166 coalminers

Examples
p <- seq(0.01, 0.99, by = 0.01)
cloglog(p)
max(abs(cloglog(cloglog(p), inverse = TRUE) - p)) # Should be 0

p <- c(seq(-0.02, 0.02, by = 0.01), seq(0.97, 1.02, by = 0.01))

cloglog(p) # Has NAs
cloglog(p, bvalue = .Machine$double.eps) # Has no NAs

## Not run:
p <- seq(0.01, 0.99, by = 0.01)
plot(p, logit(p), type = "l", col = "limegreen", lwd = 2, las = 1,
main = "Some probability link functions", ylab = "transformation")
lines(p, probit(p), col = "purple", lwd = 2)
lines(p, cloglog(p), col = "chocolate", lwd = 2)
lines(p, cauchit(p), col = "tan", lwd = 2)
abline(v = 0.5, h = 0, lty = "dashed")
legend(0.1, 4, c("logit", "probit", "cloglog", "cauchit"),
col = c("limegreen", "purple", "chocolate", "tan"), lwd = 2)

## End(Not run)

## Not run:
# This example shows that a cloglog link is preferred over the logit
n <- 500; p <- 5; S <- 3; Rank <- 1 # Species packing model:
mydata <- rcqo(n, p, S, eq.tol = TRUE, es.opt = TRUE, eq.max = TRUE,
family = "binomial", hi.abundance = 5, seed = 123,
Rank = Rank)
fitc <- cqo(attr(mydata, "formula"), I.tol = TRUE, data = mydata,
fam = binomialff(multiple.responses = TRUE, link = "cloglog"),
Rank = Rank)
fitl <- cqo(attr(mydata, "formula"), I.tol = TRUE, data = mydata,
fam = binomialff(multiple.responses = TRUE, link = "logit"),
Rank = Rank)

# Compare the fitted models (cols 1 and 3) with the truth (col 2)
cbind(concoef(fitc), attr(mydata, "concoefficients"), concoef(fitl))

## End(Not run)

coalminers Breathlessness and Wheeze Amongst Coalminers Data

Description
Coalminers who are smokers without radiological pneumoconiosis, classified by age, breathlessness
and wheeze.

Usage
data(coalminers)
Coef 167

Format
A data frame with 9 age groups with the following 5 columns.

BW Counts with breathlessness and wheeze.

BnW Counts with breathlessness but no wheeze.
nBW Counts with no breathlessness but wheeze.
nBnW Counts with neither breathlessness or wheeze.
age Age of the coal miners (actually, the midpoints of the 5-year category ranges).

Details
The data were published in Ashford and Sowden (1970). A more recent analysis is McCullagh and
Nelder (1989, Section 6.6).

Source
Ashford, J. R. and Sowden, R. R. (1970) Multi-variate probit analysis. Biometrics, 26, 535546.

References
McCullagh, P. and Nelder, J. A. (1989) Generalized Linear Models. 2nd ed. London: Chapman &
Hall.

Examples
str(coalminers)

Coef Computes Model Coefficients and Quantities

Description
Coef is a generic function which computes model coefficients from objects returned by modelling
functions. It is an auxiliary function to coef that enables extra capabilities for some specific models.

Usage
Coef(object, ...)

Arguments
object An object for which the computation of other types of model coefficients or
quantities is meaningful.
... Other arguments fed into the specific methods function of the model.
168 Coef.qrrvglm

Details
This function can often be useful for vglm objects with just an intercept term in the RHS of the
formula, e.g., y ~ 1. Then often this function will apply the inverse link functions to the parameters.
See the example below.
For reduced-rank VGLMs, this function can return the A, C matrices, etc.
For quadratic and additive ordination models, this function can return ecological meaningful quan-
tities such as tolerances, optimums, maximums.

Value
The value returned depends specifically on the methods function invoked.

Warning
This function may not work for all VGAM family functions. You should check your results on
some artificial data before applying it to models fitted to real data.

Author(s)
Thomas W. Yee

References
Yee, T. W. and Hastie, T. J. (2003) Reduced-rank vector generalized linear models. Statistical
Modelling, 3, 1541.

See Also
coef, Coef.vlm, Coef.rrvglm, Coef.qrrvglm, depvar.

Examples
nn <- 1000
bdata <- data.frame(y = rbeta(nn, shape1 = 1, shape2 = 3)) # Original scale
fit <- vglm(y ~ 1, betaR, data = bdata, trace = TRUE) # Intercept-only model
coef(fit, matrix = TRUE) # Both on a log scale
Coef(fit) # On the original scale

Coef.qrrvglm Returns Important Matrices etc. of a QO Object

Description
This methods function returns important matrices etc. of a QO object.

Usage
Coef.qrrvglm(object, varI.latvar = FALSE, refResponse = NULL, ...)
Coef.qrrvglm 169

Arguments
object A CQO object. The former has class "qrrvglm".
varI.latvar Logical indicating whether to scale the site scores (latent variables) to have
variance-covariance matrix equal to the rank-R identity matrix. All models have
uncorrelated site scores (latent variables), and this option stretches or shrinks the
ordination axes if TRUE. See below for further details.
refResponse Integer or character. Specifies the reference response or reference species. By
default, the reference species is found by searching sequentially starting from
the first species until a positive-definite tolerance matrix is found. Then this
tolerance matrix is transformed to the identity matrix. Then the sites scores
(latent variables) are made uncorrelated. See below for further details.
... Currently unused.

Details
If I.tolerances=TRUE or eq.tolerances=TRUE (and its estimated tolerance matrix is positive-
definite) then all species tolerances are unity by transformation or by definition, and the spread of
the site scores can be compared to them. Vice versa, if one wishes to compare the tolerances with
the sites score variability then setting varI.latvar=TRUE is more appropriate.
For rank-2 QRR-VGLMs, one of the species can be chosen so that the angle of its major axis and
minor axis is zero, i.e., parallel to the ordination axes. This means the effect on the latent vars is
independent on that species, and that its tolerance matrix is diagonal. The argument refResponse
allows one to choose which is the reference species, which must have a positive-definite tolerance
matrix, i.e., is bell-shaped. If refResponse is not specified, then the code will try to choose some
reference species starting from the first species. Although the refResponse argument could possi-
bly be offered as an option when fitting the model, it is currently available after fitting the model,
e.g., in the functions Coef.qrrvglm and lvplot.qrrvglm.

Value
The A, B1, C, T, D matrices/arrays are returned, along with other slots. The returned object has
class "Coef.qrrvglm" (see Coef.qrrvglm-class).

Note
Consider an equal-tolerances Poisson/binomial CQO model with noRRR = ~ 1. For R = 1 it has
about 2S + p2 parameters. For R = 2 it has about 3S + 2p2 parameters. Here, S is the number of
species, and p2 = p 1 is the number of environmental variables making up the latent variable. For
an unequal-tolerances Poisson/binomial CQO model with noRRR = ~ 1, it has about 3S 1 + p2
parameters for R = 1, and about 6S 3 + 2p2 parameters for R = 2. Since the total number of
data points is nS, where n is the number of sites, it pays to divide the number of data points by the
number of parameters to get some idea about how much information the parameters contain.

Author(s)
Thomas W. Yee
170 Coef.qrrvglm-class

References

Yee, T. W. (2004) A new technique for maximum-likelihood canonical Gaussian ordination. Eco-
logical Monographs, 74, 685701.
Yee, T. W. (2006) Constrained additive ordination. Ecology, 87, 203213.

See Also

cqo, Coef.qrrvglm-class, print.Coef.qrrvglm, lvplot.qrrvglm.

Examples

set.seed(123)
x2 <- rnorm(n <- 100)
x3 <- rnorm(n)
x4 <- rnorm(n)
latvar1 <- 0 + x3 - 2*x4
lambda1 <- exp(3 - 0.5 * ( latvar1-0)^2)
lambda2 <- exp(2 - 0.5 * ( latvar1-1)^2)
lambda3 <- exp(2 - 0.5 * ((latvar1+4)/2)^2) # Unequal tolerances
y1 <- rpois(n, lambda1)
y2 <- rpois(n, lambda2)
y3 <- rpois(n, lambda3)
set.seed(111)
# vvv p1 <- cqo(cbind(y1, y2, y3) ~ x2 + x3 + x4, poissonff, trace = FALSE)
## Not run: lvplot(p1, y = TRUE, lcol = 1:3, pch = 1:3, pcol = 1:3)

# vvv Coef(p1)
# vvv print(Coef(p1), digits=3)

Coef.qrrvglm-class Class Coef.qrrvglm

Description

The most pertinent matrices and other quantities pertaining to a QRR-VGLM (CQO model).

Objects from the Class

Objects can be created by calls of the form Coef(object,...) where object is an object of class
"qrrvglm" (created by cqo).
In this document, R is the rank, M is the number of linear predictors and n is the number of
observations.
Coef.qrrvglm-class 171

Slots

A: Of class "matrix", A, which are the linear coefficients of the matrix of latent variables. It is
M by R.
B1: Of class "matrix", B1. These correspond to terms of the argument noRRR.
C: Of class "matrix", C, the canonical coefficients. It has R columns.
Constrained: Logical. Whether the model is a constrained ordination model.
D: Of class "array", D[,,j] is an order-Rank matrix, for j = 1,. . . ,M . Ideally, these are negative-
definite in order to make the response curves/surfaces bell-shaped.
Rank: The rank (dimension, number of latent variables) of the RR-VGLM. Called R.
latvar: n by R matrix of latent variable values.
latvar.order: Of class "matrix", the permutation returned when the function order is applied
to each column of latvar. This enables each column of latvar to be easily sorted.
Maximum: Of class "numeric", the M maximum fitted values. That is, the fitted values at the
optimums for noRRR = ~ 1 models. If noRRR is not ~ 1 then these will be NAs.
NOS: Number of species.
Optimum: Of class "matrix", the values of the latent variables where the optimums are. If the
curves are not bell-shaped, then the value will be NA or NaN.
Optimum.order: Of class "matrix", the permutation returned when the function order is applied
to each column of Optimum. This enables each row of Optimum to be easily sorted.
bellshaped: Vector of logicals: is each response curve/surface bell-shaped?
dispersion: Dispersion parameter(s).
Dzero: Vector of logicals, is each of the response curves linear in the latent variable(s)? It will be
if and only if D[,,j] equals O, for j = 1,. . . ,M .
Tolerance: Object of class "array", Tolerance[,,j] is an order-Rank matrix, for j = 1,. . . ,M ,
being the matrix of tolerances (squared if on the diagonal). These are denoted by T in Yee
(2004). Ideally, these are positive-definite in order to make the response curves/surfaces bell-
shaped. The tolerance matrices satisfy Ts = 21 Ds1 .

Author(s)

Thomas W. Yee

References

Yee, T. W. (2004) A new technique for maximum-likelihood canonical Gaussian ordination. Eco-
logical Monographs, 74, 685701.

See Also

Coef.qrrvglm, cqo, print.Coef.qrrvglm.

172 Coef.rrvglm

Examples
x2 <- rnorm(n <- 100)
x3 <- rnorm(n)
x4 <- rnorm(n)
latvar1 <- 0 + x3 - 2*x4
lambda1 <- exp(3 - 0.5 * ( latvar1-0)^2)
lambda2 <- exp(2 - 0.5 * ( latvar1-1)^2)
lambda3 <- exp(2 - 0.5 * ((latvar1+4)/2)^2)
y1 <- rpois(n, lambda1)
y2 <- rpois(n, lambda2)
y3 <- rpois(n, lambda3)
yy <- cbind(y1, y2, y3)
# vvv p1 <- cqo(yy ~ x2 + x3 + x4, fam = poissonff, trace = FALSE)
## Not run:
lvplot(p1, y = TRUE, lcol = 1:3, pch = 1:3, pcol = 1:3)

## End(Not run)
# vvv print(Coef(p1), digits = 3)

Coef.rrvglm Returns Important Matrices etc. of a RR-VGLM Object

Description
This methods function returns important matrices etc. of a RR-VGLM object.

Usage
Coef.rrvglm(object, ...)

Arguments
object An object of class "rrvglm".
... Currently unused.

Details
The A, B1, C matrices are returned, along with other slots. See rrvglm for details about RR-
VGLMs.

Value
An object of class "Coef.rrvglm" (see Coef.rrvglm-class).

Note
This function is an alternative to coef.rrvglm.
Coef.rrvglm-class 173

Author(s)
Thomas W. Yee

References
Yee, T. W. and Hastie, T. J. (2003) Reduced-rank vector generalized linear models. Statistical
Modelling, 3, 1541.

See Also
Coef.rrvglm-class, print.Coef.rrvglm, rrvglm.

Examples
# Rank-1 stereotype model of Anderson (1984)
pneumo <- transform(pneumo, let = log(exposure.time), x3 = runif(nrow(pneumo)))
fit <- rrvglm(cbind(normal, mild, severe) ~ let + x3, multinomial, data = pneumo)
coef(fit, matrix = TRUE)
Coef(fit)

Coef.rrvglm-class Class Coef.rrvglm

Description
The most pertinent matrices and other quantities pertaining to a RR-VGLM.

Objects from the Class

Objects can be created by calls of the form Coef(object, ...) where object is an object of class
rrvglm (see rrvglm-class).
In this document, M is the number of linear predictors and n is the number of observations.

Slots
A: Of class "matrix", A.
B1: Of class "matrix", B1.
C: Of class "matrix", C.
Rank: The rank of the RR-VGLM.
colx1.index: Index of the columns of the "vlm"-type model matrix corresponding to the variables
in x1. These correspond to B1.
colx2.index: Index of the columns of the "vlm"-type model matrix corresponding to the variables
in x2. These correspond to the reduced-rank regression.
Atilde: Object of class "matrix", the A matrix with the corner rows removed. Thus each of the
elements have been estimated. This matrix is returned only if corner constraints were used.
174 Coef.vlm

Author(s)
Thomas W. Yee

References
Yee, T. W. and Hastie, T. J. (2003) Reduced-rank vector generalized linear models. Statistical
Modelling, 3, 1541.

See Also
Coef.rrvglm, rrvglm, rrvglm-class, print.Coef.rrvglm.

Coef.vlm Extract Model Coefficients for VLM Objects

Description
Amongst other things, this function applies inverse link functions to the parameters of intercept-only
VGLMs.

Usage
Coef.vlm(object, ...)

Arguments
object A fitted model.
... Arguments which may be passed into coef.

Details
Most VGAM family functions apply a link function to the parameters, e.g., positive parameter are
often have a log link, parameters between 0 and 1 have a logit link. This function can back-transform
the parameter estimate to the original scale.

Value
For intercept-only models (e.g., formula is y ~ 1) the back-transformed parameter estimates can
be returned.
coefvgam 175

Warning
This function may not work for all VGAM family functions. You should check your results on
some artificial data before applying it to models fitted to real data.

Author(s)
Thomas W. Yee

References
Yee, T. W. and Hastie, T. J. (2003) Reduced-rank vector generalized linear models. Statistical
Modelling, 3, 1541.

coefvgam Extract Model Coefficients of a vgam() Object

Description
Extracts the estimated coefficients from vgam() objects.

Usage
coefvgam(object, type = c("linear", "nonlinear"), ...)

Arguments
object A vgam object.
type Character. The default is the first choice.
... Optional arguments fed into coefvlm.

Details
For VGAMs, because modified backfitting is performed, each fitted function is decomposed into a
linear and nonlinear (smooth) part. The argument type is used to return which one is wanted.
176 coefvlm

Value
A vector if type = "linear". A list if type = "nonlinear", and each component of this list corre-
sponds to an s term; the component contains an S4 object with slot names such as "Bcoefficients",
"knots", "xmin", "xmax".

Author(s)
Thomas W. Yee

coefvlm Extract Model Coefficients

Description
Extracts the estimated coefficients from VLM objects such as VGLMs.

Usage
coefvlm(object, matrix.out = FALSE, label = TRUE, colon = FALSE)

Arguments
object An object for which the extraction of coefficients is meaningful. This will usu-
ally be a vglm object.
matrix.out Logical. If TRUE then a matrix is returned. The explanatory variables are the
rows. The linear/additive predictors are the columns. The constraint matrices
are used to compute this matrix.
label Logical. If FALSE then the names of the vector of coefficients are set to NULL.
colon Logical. Explanatory variables which appear in more than one linear/additive
predictor are labelled with a colon, e.g., age:1, age:2. However, if it only
appears in one linear/additive predictor then the :1 is omitted by default. Then
setting colon = TRUE will add the :1.
CommonVGAMffArguments 177

Details

This function works in a similar way to applying coef() to a lm or glm object. However, for
VGLMs, there are more options available.

Value

A vector usually. A matrix if matrix.out = TRUE.

Author(s)

Thomas W. Yee

References

Yee, T. W. and Hastie, T. J. (2003) Reduced-rank vector generalized linear models. Statistical
Modelling, 3, 1541.

See Also

vglm, coefvgam, coef.

Examples

zdata <- data.frame(x2 = runif(nn <- 200))

zdata <- transform(zdata, pstr0 = logit(-0.5 + 1*x2, inverse = TRUE),
lambda = loge( 0.5 + 2*x2, inverse = TRUE))
zdata <- transform(zdata, y2 = rzipois(nn, lambda, pstr0 = pstr0))

fit2 <- vglm(y2 ~ x2, zipoisson(zero = 1), data = zdata, trace = TRUE)
coef(fit2, matrix = TRUE) # Always a good idea
coef(fit2)
coef(fit2, colon = TRUE)

CommonVGAMffArguments Common VGAM Family Function Arguments

Description

Here is a description of some common and typical arguments found in many VGAM family func-
tions, e.g., lsigma, isigma, gsigma, nsimEI, parallel and zero.
178 CommonVGAMffArguments

Usage
TypicalVGAMfamilyFunction(lsigma = "loge",
isigma = NULL,
link.list = list("(Default)" = "identitylink",
x2 = "loge",
x3 = "logoff",
x4 = "multilogit",
x5 = "multilogit"),
earg.list = list("(Default)" = list(),
x2 = list(),
x3 = list(offset = -1),
x4 = list(),
x5 = list()),
gsigma = exp(-5:5),
parallel = TRUE,
ishrinkage = 0.95,
nointercept = NULL, imethod = 1,
type.fitted = c("mean", "quantiles",
"pobs0", "pstr0", "onempstr0"),
percentiles = c(25, 50, 75),
probs.x = c(0.15, 0.85),
probs.y = c(0.25, 0.50, 0.75),
multiple.responses = FALSE, earg.link = FALSE,
whitespace = FALSE, bred = FALSE, lss = TRUE,
oim = FALSE, nsimEIM = 100, byrow.arg = FALSE,
zero = NULL)

Arguments
lsigma Character. Link function applied to a parameter and not necessarily a mean.
See Links for a selection of choices. If there is only one parameter then this
argument is often called link.
link.list, earg.list
Some VGAM family functions (such as normal.vcm) implement models with
potentially lots of parameter link functions. These two arguments allow many
such links and extra arguments to be inputted more easily. One has something
like link.list = list("(Default)" = "identitylink", x2 = "loge", x3 = "logoff")
and earg.list = list("(Default)" = list(), x2 = list(), x3 = "list(offset = -1)").
Then any unnamed terms will have the default link with its corresponding extra
argument. Note: the multilogit link is also possible, and if so, at least two
instances of it are necessary. Then the last term is the baseline/reference group.
isigma Optional initial values can often be inputted using an argument beginning with
"i". For example, "isigma" and "ilocation", or just "init" if there is one
parameter. A value of NULL means a value is computed internally, i.e., a self-
starting VGAM family function. If a failure to converge occurs make use of
these types of arguments.
gsigma Grid-search initial values can be inputted using an argument beginning with "g",
CommonVGAMffArguments 179

e.g., "gsigma", "gshape" and "gscale". If argument isigma is inputted then

that has precedence over gsigma, etc.
If the grid search is 2-dimensional then it is advisable not to make the vectors too
long as a nested for loop may be used. Ditto for 3-dimensions etc. Sometimes a
".mux" is added as a suffix, e.g., gshape.mux; this means that the grid is created
relatively and not absolutely, e.g., its values are multipled by some single initial
estimate of the parameter in order to create the grid on an absolute scale.
Some family functions have an argument called gprobs.y. This is fed into the
probs argument of quantile in order to obtain some values of central tendency
of the response, i.e., some spread of values in the middle. when imethod = 1
to obtain an initial value for the mean Some family functions have an argument
called iprobs.y, and if so, then these values can overwrite gprobs.y.
parallel A logical, or a simple formula specifying which terms have equal/unequal co-
efficients. The formula must be simple, i.e., additive with simple main effects
terms. Interactions and nesting etc. are not handled. To handle complex formu-
las use the constraints argument (of vglm etc.); however, there is a lot more
setting up involved and things will not be as convenient.
Here are some examples. 1. parallel = TRUE ~ x2 + x5 means the paral-
lelism assumption is only applied to X2 , X5 and the intercept. 2. parallel = TRUE ~ -1
and parallel = TRUE ~ 0 mean the parallelism assumption is applied to no
variables at all. Similarly, parallel = FALSE ~ -1 and parallel = FALSE ~ 0
mean the parallelism assumption is applied to all the variables including the in-
tercept. 3. parallel = FALSE ~ x2 - 1 and parallel = FALSE ~ x2 + 0
applies the parallelism constraint to all terms (including the intercept) except for
X2 .
This argument is common in VGAM family functions for categorical responses,
e.g., cumulative, acat, cratio, sratio. For the proportional odds model
(cumulative) having parallel constraints applied to each explanatory variable
(except for the intercepts) means the fitted probabilities do not become negative
or greater than 1. However this parallelism or proportional-odds assumption
ought to be checked.
nsimEIM Some VGAM family functions use simulation to obtain an approximate ex-
pected information matrix (EIM). For those that do, the nsimEIM argument spec-
ifies the number of random variates used per observation; the mean of nsimEIM
random variates is taken. Thus nsimEIM controls the accuracy and a larger value
may be necessary if the EIMs are not positive-definite. For intercept-only mod-
els (y ~ 1) the value of nsimEIM can be smaller (since the common value used
is also then taken as the mean over the observations), especially if the number
of observations is large.
Some VGAM family functions provide two algorithms for estimating the EIM.
If applicable, set nsimEIM = NULL to choose the other algorithm.
imethod An integer with value 1 or 2 or 3 or ... which specifies the initialization method
for some parameters or a specific parameter. If failure to converge occurs try
the next higher value, and continue until success. For example, imethod = 1
might be the method of moments, and imethod = 2 might be another method.
If no value of imethod works then it will be necessary to use arguments such as
isigma. For many VGAM family functions it is advisable to try this argument
180 CommonVGAMffArguments

with all possible values to safeguard against problems such as converging to a

local solution. VGAM family functions with this argument usually correspond
to a model or distribution that is relatively hard to fit successfully, therefore care
is needed to ensure the global solution is obtained. So using all possible values
that this argument supplies is a good idea.
type.fitted Character. Type of fitted value returned by the fitted() methods function. The
first choice is always the default. The available choices depends on what kind
of family function it is. Using the first few letters of the chosen choice is okay.
See fittedvlm for more details.
percentiles Numeric vector, with values between 0 and 100 (although it is not recommended
that exactly 0 or 100 be inputted). Used only if type.fitted = "quantiles"
or type.fitted = "percentiles", then this argument specifies the values of
these quantiles. The argument name tries to reinforce that the values lie between
0 and 100. See fittedvlm for more details.
probs.x, probs.y
Numeric, with values in (0, 1). The probabilites that define quantiles with re-
spect to some vector, usually an x or y of some sort. This is used to create
two subsets of data corresponding to low and high values of x or y. Each
value is separately fed into the probs argument of quantile. If the data set size
is small then it may be necessary to increase/decrease slightly the first/second
values respectively.
lss Logical. This stands for the ordering: location, scale and shape. Should the
ordering of the parameters be in this order? Almost all VGAM family functions
have this order by default, but in order to match the arguments of existing R
functions, one might need to set lss = FALSE. For example, the arguments
of weibullR are scale and shape, whereas rweibull are shape and scale. As
a temporary measure (from VGAM 0.9-7 onwards but prior to version 1.0-0),
some family functions such as sinmad have an lss argument without a default.
For these, setting lss = FALSE will work. Later, lss = TRUE will be the default.
Be careful for the dpqr-type functions, e.g., rsinmad.
whitespace Logical. Should white spaces (" ") be used in the labelling of the linear/additive
predictors? Setting TRUE usually results in more readability but it occupies more
columns of the output.
oim Logical. Should the observed information matrices (OIMs) be used for the
working weights? In general, setting oim = TRUE means the Newton-Raphson
algorithm, and oim = FALSE means Fisher-scoring. The latter uses the EIM,
and is usually recommended. If oim = TRUE then nsimEIM is ignored.
zero Either an integer vector, or a vector of character strings.
If an integer, then it specifies which linear/additive predictor is modelled as
intercept-only. That is, the regression coefficients are set to zero for all co-
variates except for the intercept. If zero is specified then it may be a vector with
values from the set {1, 2, . . . , M }. The value zero = NULL means model all
linear/additive predictors as functions of the explanatory variables. Here, M is
the number of linear/additive predictors. Technically, if zero contains the value
j then the jth row of every constraint matrix (except for the intercept) consists
of all 0 values.
CommonVGAMffArguments 181

Some VGAM family functions allow the zero argument to accept negative val-
ues; if so then its absolute value is recycled over each (usual) response. For ex-
ample, zero = -2 for the two-parameter negative binomial distribution would
mean, for each response, the second linear/additive predictor is modelled as
intercepts-only. That is, for all the k parameters in negbinomial (this VGAM
family function can handle a matrix of responses).
Suppose zero = zerovec where zerovec is a vector of negative values. If G
is the usual M value for a univariate response then the actual values for argument
zero are all values in c(abs(zerovec), G + abs(zerovec), 2*G + abs(zerovec), ... )
lying in the integer range 1 to M . For example, setting zero = -c(2, 3) for
a matrix response of 4 columns with zinegbinomial (which usually has G =
M = 3 for a univariate response) would be equivalent to zero = c(2, 3, 5, 6, 8, 9, 11, 12).
This example has M = 12. Note that if zerovec contains negative values then
their absolute values should be elements from the set 1:G.
Note: zero may have positive and negative values, for example, setting zero = c(-2, 3)
in the above example would be equivalent to zero = c(2, 3, 5, 8, 11).
The argument zero also accepts a character vector (for VGAM 1.0-1 onwards).
Each value is fed into grep with fixed = TRUE, meaning that wildcards "*"
are not useful. See the example belowall the variants work; those with LOCAT
issue a warning that that value is unmatched. Importantly, the parameter names
are c("location1", "scale1", "location2", "scale2") because there
are 2 responses. Yee (2015) described zero for only numerical input. Allowing
character input is particularly important when the number of parameters cannot
be determined without having the actual data first. For example, with time series
data, an ARMA(p,q) process might have parameters 1 , . . . , p which should be
intercept-only by default. Then specifying a numerical default value for zero
would be too difficult (there are the drift and scale parameters too). However, it
is possible with the character representation: zero = "theta" would achieve
this. In the future, most VGAM family functions might be converted to the
character representationthe advantage being that it is more readable. When
programming a VGAM family function that allows character input, the variable
predictors.names must be assigned correctly.
ishrinkage Shrinkage factor s used for obtaining initial values. Numeric, between 0 and
1. In general, the formula used is something like s + (1 s)y where is a
measure of central tendency such as a weighted mean or median, and y is the
response vector. For example, the initial values are slight perturbations of the
mean towards the actual data. For many types of models this method seems to
work well and is often reasonably robust to outliers in the response. Often this
argument is only used if the argument imethod is assigned a certain value.
nointercept An integer-valued vector specifying which linear/additive predictors have no
intercepts. Any values must be from the set {1,2,. . . ,M }. A value of NULL
means no such constraints.
multiple.responses
Logical. Some VGAM family functions allow a multivariate or vector response.
If so, then usually the response is a matrix with columns corresponding to the
individual response variables. They are all fitted simultaneously. Arguments
such as parallel may then be useful to allow for relationships between the
regressions of each response variable. If multiple.responses = TRUE then
182 CommonVGAMffArguments

sometimes the response is interpreted differently, e.g., posbinomial chooses the

first column of a matrix response as success and combines the other columns
as failure, but when multiple.responses = TRUE then each column of the
response matrix is the number of successes and the weights argument is of the
same dimension as the response and contains the number of trials.
earg.link Sometimes the link argument can receive earg-type input, such as quasibinomial
calling binomial. This argument should be generally ignored.
byrow.arg Logical. Some VGAM family functions that handle multiple responses have ar-
guments that allow input to be fed in which affect all the responses, e.g., imu for
initalizing a mu parameter. In such cases it is sometime more convenient to input
one value per response by setting byrow.arg = TRUE; then values are recycled
in order to form a matrix of the appropriate dimension. This argument matches
byrow in matrix; in fact it is fed into such using matrix(..., byrow = byrow.arg).
This argument has no effect when there is one response.
bred Logical. Some VGAM family functions will allow bias-reduction based on
the work by Kosmidis and Firth. Sometimes half-stepping is a good idea; set
stepsize = 0.5 and monitor convergence by setting trace = TRUE.

Details
Full details will be given in documentation yet to be written, at a later date!

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm and vgam.

Warning
The zero argument is supplied for convenience but conflicts can arise with other arguments, e.g.,
the constraints argument of vglm and vgam. See Example 5 below for an example. If not sure,
use, e.g., constraints(fit) and coef(fit, matrix = TRUE) to check the result of a fit fit.
The arguments zero and nointercept can be inputted with values that fail. For example, multinomial(zero = 2, nointer
means the second linear/additive predictor is identically zero, which will cause a failure.
Be careful about the use of other potentially contradictory constraints, e.g., multinomial(zero = 2, parallel = TRUE ~ x3
If in doubt, apply constraints() to the fitted object to check.
VGAM family functions with the nsimEIM may have inaccurate working weight matrices. If
so, then the standard errors of the regression coefficients may be inaccurate. Thus output from
summary(fit), vcov(fit), etc. may be misleading.
Changes relating to the codelss argument have very important consequences and users must beware.
Good programming style is to rely on the argument names and not on the order.

Note
See Links regarding a major change in link functions, for version 0.9-0 and higher (released during
the 2nd half of 2012).
CommonVGAMffArguments 183

Author(s)
T. W. Yee

References
Yee, T. W. (2015) Vector Generalized Linear and Additive Models: With an Implementation in R.
New York, USA: Springer.
Kosmidis, I. and Firth, D. (2009) Bias reduction in exponential family nonlinear models. Biometrika,
96(4), 793804.

See Also
Links, vglmff-class, UtilitiesVGAM, normal.vcm, multilogit.

Examples
# Example 1
cumulative()
cumulative(link = "probit", reverse = TRUE, parallel = TRUE)

# Example 2
wdata <- data.frame(x2 = runif(nn <- 1000))
wdata <- transform(wdata,
y = rweibull(nn, shape = 2 + exp(1 + x2), scale = exp(-0.5)))
fit <- vglm(y ~ x2, weibullR(lshape = logoff(offset = -2), zero = 2), data = wdata)
coef(fit, mat = TRUE)

# Example 3; multivariate (multiple) response

## Not run:
ndata <- data.frame(x = runif(nn <- 500))
ndata <- transform(ndata,
y1 = rnbinom(nn, mu = exp(3+x), size = exp(1)), # k is size
y2 = rnbinom(nn, mu = exp(2-x), size = exp(0)))
fit <- vglm(cbind(y1, y2) ~ x, negbinomial(zero = -2), data = ndata)
coef(fit, matrix = TRUE)

## End(Not run)
# Example 4
## Not run:
# fit1 and fit2 are equivalent
fit1 <- vglm(ymatrix ~ x2 + x3 + x4 + x5,
cumulative(parallel = FALSE ~ 1 + x3 + x5), data = cdata)
fit2 <- vglm(ymatrix ~ x2 + x3 + x4 + x5,
cumulative(parallel = TRUE ~ x2 + x4), data = cdata)

## End(Not run)

# Example 5
udata <- data.frame(x2 = rnorm(nn <- 200))
udata <- transform(udata,
y1 = rnorm(nn, mean = 1 - 3*x2, sd = exp(1 + 0.2*x2)),
184 concoef

y2 = rnorm(nn, mean = 1 - 3*x2, sd = exp(1)))

args(uninormal)
fit1 <- vglm(y1 ~ x2, uninormal, data = udata) # This is okay
fit2 <- vglm(y2 ~ x2, uninormal(zero = 2), data = udata) # This is okay

# This creates potential conflict

clist <- list("(Intercept)" = diag(2), "x2" = diag(2))
fit3 <- vglm(y2 ~ x2, uninormal(zero = 2), data = udata,
constraints = clist) # Conflict!
coef(fit3, matrix = TRUE) # Shows that clist[["x2"]] was overwritten,
constraints(fit3) # i.e., 'zero' seems to override the 'constraints' arg

# Example 6 ('whitespace' argument)

pneumo <- transform(pneumo, let = log(exposure.time))
fit1 <- vglm(cbind(normal, mild, severe) ~ let,
sratio(whitespace = FALSE, parallel = TRUE), data = pneumo)
fit2 <- vglm(cbind(normal, mild, severe) ~ let,
sratio(whitespace = TRUE, parallel = TRUE), data = pneumo)
head(predict(fit1), 2) # No white spaces
head(predict(fit2), 2) # Uses white spaces

# Example 7 ('zero' argument with character input)

set.seed(123); n <- 1000
ldata <- data.frame(x2 = runif(n))
ldata <- transform(ldata, y1 = rlogis(n, loc = 0+5*x2, scale = exp(2)))
ldata <- transform(ldata, y2 = rlogis(n, loc = 0+5*x2, scale = exp(0+1*x2)))
ldata <- transform(ldata, w1 = runif(n))
ldata <- transform(ldata, w2 = runif(n))
fit7 <- vglm(cbind(y1, y2) ~ x2,
# logistic(zero = "location1"), # location1 is intercept-only
# logistic(zero = "location2"),
# logistic(zero = "location*"), # Not okay... all is unmatched
# logistic(zero = "scale1"),
# logistic(zero = "scale2"),
# logistic(zero = "scale"), # Both scale parameters are matched
logistic(zero = c("location", "scale2")), # All but scale1
# logistic(zero = c("LOCAT", "scale2")), # Only scale2 is matched
# logistic(zero = c("LOCAT")), # Nothing is matched
# trace = TRUE,
# weights = cbind(w1, w2),
weights = w1,
data = ldata)
coef(fit7, matrix = TRUE)

concoef Extract Model Constrained/Canonical Coefficients

Description
concoef is a generic function which extracts the constrained (canonical) coefficients from objects
returned by certain modelling functions.
concoef 185

Usage
concoef(object, ...)

Arguments
object An object for which the extraction of canonical coefficients is meaningful.
... Other arguments fed into the specific methods function of the model.

Details
For constrained quadratic and ordination models, canonical coefficients are the elements of the C
matrix used to form the latent variables. They are highly interpretable in ecology, and are looked at
as weights or loadings.
They are also applicable for reduced-rank VGLMs.

Value
The value returned depends specifically on the methods function invoked.

Warning
concoef replaces ccoef; the latter is deprecated.
For QO models, there is a direct inverse relationship between the scaling of the latent variables (site
scores) and the tolerances. One normalization is for the latent variables to have unit variance. An-
other normalization is for all the species tolerances to be unit (provided eq.tolerances is TRUE).
These two normalizations cannot simultaneously hold in general. For rank R models with R > 1
it becomes more complicated because the latent variables are also uncorrelated. An important ar-
gument when fitting quadratic ordination models is whether eq.tolerances is TRUE or FALSE. See
Yee (2004) for details.

Author(s)
Thomas W. Yee

See Also
concoef-method, concoef.qrrvglm, concoef.cao, coef.
186 confintvglm

Examples
## Not run: set.seed(111) # This leads to the global solution
hspider[,1:6] <- scale(hspider[,1:6]) # Standardized environmental vars
p1 <- cqo(cbind(Alopacce, Alopcune, Alopfabr, Arctlute, Arctperi,
Auloalbi, Pardlugu, Pardmont, Pardnigr, Pardpull,
Trocterr, Zoraspin) ~
WaterCon + BareSand + FallTwig + CoveMoss + CoveHerb + ReflLux,
fam = quasipoissonff, data = hspider, Crow1positive = FALSE)
concoef(p1)

## End(Not run)

concoef-methods Constrained (Canonical) Coefficients

Description
concoef is a generic function used to return the constrained (canonical) coefficients of a constrained
ordination model. The function invokes particular methods which depend on the class of the first
argument.

Methods
object The object from which the constrained coefficients are extracted.

confintvglm Confidence Intervals for Parameters of VGLMs

Description
Computes confidence intervals (CIs) for one or more parameters in a fitted model. Currently the
object must be a "vglm" object.

Usage
confintvglm(object, parm, level = 0.95, method = c("wald", "profile"),
trace = NULL, ...)

Arguments
object A fitted model object.
parm, level, ...
Same as confint.
confintvglm 187

method Character. The default is the first method. Abbreviations are allowed. Currently
"profile" is basically working; and it is likely to be more accurate especially
for small samples, as it is based on a profile log likelihood, however it is com-
putationally intensive.
trace Logical. If TRUE then one can monitor the computation as it progresses (be-
cause it is expensive). The default is the orginal models trace value (see
vglm.control). Setting FALSE suppresses all intermediate output.

Details
The default for this methods function is based on confint.default and assumes asymptotic nor-
mality. In particular, the coef and vcov methods functions are used for vglm-class objects.
When method = "profile" the function profilevglm() is called to do the profiling. The code is
very heavily based on profile.glm which was originally written by D. M. Bates and W. N. Ven-
ables (For S in 1996) and subsequently corrected by B. D. Ripley. Sometimes the profiling method
can give problems, for example, cumulative requires the M linear predictors not to intersect in
the data cloud. Such numerical problems are less common when method = "wald", however,
it is well-known that inference based on profile likelihoods is generally more accurate than Wald,
especially when the sample size is small. The deviance (deviance(object)) is used if possible,
else the difference 2 * (logLik(object) - ell) is computed, where ell are the values of the
loglikelihood on a grid.
For Wald CIs and rrvglm-class objects, currently an error message is produced because I havent
gotten around to write the methods function; its not too hard, but am too busy! An interim measure
is to coerce the object into a "vglm" object, but then the confidence intervals will tend to be too
narrow because the estimated constraint matrices are treated as known.
For Wald CIs and vgam-class objects, currently an error message is produced because the theory
is undeveloped.

Value
Same as confint.

Note
The order of the values of argument method may change in the future without notice. The functions
plot.profile.glm and pairs.profile.glm from MASS appear to work with output from this
function.

Author(s)
Thomas Yee adapted confint.lm to handle "vglm" objects, for Wald-type confidence intervals.
Also, profile.glm was originally written by D. M. Bates and W. N. Venables (For S in 1996) and
subsequently corrected by B. D. Ripley. This function effectively calls confint.profile.glm()
in MASS.

See Also
vcovvlm, summaryvglm, confint, profile.glm, plot.profile.glm, pairs.profile.glm.
188 constraints

Examples
# Example 1: this is based on a glm example
counts <- c(18,17,15,20,10,20,25,13,12)
outcome <- gl(3, 1, 9); treatment <- gl(3, 3)
glm.D93 <- glm(counts ~ outcome + treatment, family = poisson())
vglm.D93 <- vglm(counts ~ outcome + treatment, family = poissonff)
confint(glm.D93) # needs MASS to be present on the system
confint.default(glm.D93) # based on asymptotic normality
confint(vglm.D93)
confint(vglm.D93) - confint(glm.D93) # Should be all 0s
confint(vglm.D93) - confint.default(glm.D93) # based on asympt. normality

# Example 2: simulated negative binomial data with multiple responses

ndata <- data.frame(x2 = runif(nn <- 100))
ndata <- transform(ndata, y1 = rnbinom(nn, mu = exp(3+x2), size = exp(1)),
y2 = rnbinom(nn, mu = exp(2-x2), size = exp(0)))
fit1 <- vglm(cbind(y1, y2) ~ x2, negbinomial, data = ndata, trace = TRUE)
coef(fit1)
coef(fit1, matrix = TRUE)
confint(fit1)
confint(fit1, "x2:1") # This might be improved to "x2" some day...
## Not run:
confint(fit1, method = "profile") # Computationally expensive
confint(fit1, "x2:1", method = "profile", trace = FALSE)

## End(Not run)

fit2 <- rrvglm(y1 ~ x2, negbinomial(zero = NULL), data = ndata)

confint(as(fit2, "vglm")) # Too narrow (SEs are biased downwards)

constraints Constraint Matrices

Description
Extractor function for the constraint matrices of objects in the VGAM package.

Usage
constraints(object, ...)
constraints.vlm(object, type = c("lm", "term"), all = TRUE, which,
matrix.out = FALSE, colnames.arg = TRUE, ...)

Arguments
object Some VGAM object, for example, having class vglmff-class.
type Character. Whether LM- or term-type constraints are to be returned. The num-
ber of such matrices returned is equal to nvar(object, type = "lm") and the
number of terms, respectively.
constraints 189

all, which If all = FALSE then which gives the integer index or a vector of logicals spec-
ifying the selection.
matrix.out Logical. If TRUE then the constraint matrices are cbind()ed together. The result
is usually more compact because the default is a list of constraint matrices.
colnames.arg Logical. If TRUE then column names are assigned corresponding to the variables.
... Other possible arguments such as type.

Details
Constraint matrices describe the relationship of coefficients/component functions of a particular
explanatory variable between the linear/additive predictors in VGLM/VGAM models. For example,
they may be all different (constraint matrix is the identity matrix) or all the same (constraint matrix
has one column and has unit values).
VGLMs and VGAMs have constraint matrices which are known. The class of RR-VGLMs have
constraint matrices which are unknown and are to be estimated.

Value
The extractor function constraints() returns a list comprising of constraint matricesusually one
for each column of the VLM model matrix, and in that order. The list is labelled with the variable
names. Each constraint matrix has M rows, where M is the number of linear/additive predictors,
and whose rank is equal to the number of columns. A model with no constraints at all has an order
M identity matrix as each variables constraint matrix.
For vglm and vgam objects, feeding in type = "term" constraint matrices back into the same model
should work and give an identical model. The default are the "lm"-type constraint matrices; this is
a list with one constraint matrix per column of the LM matrix. See the constraints argument of
vglm, and the example below.

Note
In all VGAM family functions zero = NULL means none of the linear/additive predictors are
modelled as intercepts-only. Other arguments found in certain VGAM family functions which
affect constraint matrices include parallel and exchangeable.
The constraints argument in vglm and vgam allows constraint matrices to be inputted. If so, then
constraints(fit, type = "lm") can be fed into the constraints argument of the same object
to get the same model.
The xij argument does not affect constraint matrices; rather, it allows each row of the constraint
matrix to be multiplied by a specified vector.

Author(s)
T. W. Yee

References
Yee, T. W. and Wild, C. J. (1996) Vector generalized additive models. Journal of the Royal Statisti-
cal Society, Series B, Methodological, 58, 481493.
190 corbet

Yee, T. W. and Hastie, T. J. (2003) Reduced-rank vector generalized linear models. Statistical
Modelling, 3, 1541.

See Also
is.parallel, is.zero. VGLMs are described in vglm-class; RR-VGLMs are described in
rrvglm-class.
Arguments such as zero and parallel found in many VGAM family functions are a way of creat-
ing/modifying constraint matrices conveniently, e.g., see zero. See CommonVGAMffArguments for
more information.

Examples
# Fit the proportional odds model:
pneumo <- transform(pneumo, let = log(exposure.time))
(fit1 <- vglm(cbind(normal, mild, severe) ~ sm.bs(let, 3),
cumulative(parallel = TRUE, reverse = TRUE), data = pneumo))
coef(fit1, matrix = TRUE)
constraints(fit1) # Parallel assumption results in this
constraints(fit1, type = "term") # Same as the default ("vlm"-type)
is.parallel(fit1)

# An equivalent model to fit1 (needs the type "term" constraints):

clist.term <- constraints(fit1, type = "term") # "term"-type constraints
(fit2 <- vglm(cbind(normal, mild, severe) ~ sm.bs(let, 3), data = pneumo,
cumulative(reverse = TRUE), constraints = clist.term))
abs(max(coef(fit1, matrix = TRUE) -
coef(fit2, matrix = TRUE))) # Should be zero

# Fit a rank-1 stereotype (RR-multinomial logit) model:

fit <- rrvglm(Country ~ Width + Height + HP, multinomial, data = car.all)
constraints(fit) # All except the first are the estimated A matrix

corbet Corbets Butterfly Data

Description
About 3300 individual butterflies were caught in Malaya by naturalist Corbet trapping butterflies.
They were classified to about 500 species.

Usage
data(corbet)

Format
A data frame with 24 observations on the following 2 variables.
species Number of species.
ofreq Observed frequency of individual butterflies of that species.
cqo 191

Details
In the early 1940s Corbet spent two years trapping butterflies in Malaya. Of interest was the total
number of species. Some species were so rare (e.g., 118 species had only one specimen) that it was
thought likely that there were many unknown species.

References
Fisher, R. A., Corbet, A. S. and Williams, C. B. (1943) The Relation Between the Number of
Species and the Number of Individuals in a Random Sample of an Animal Population. Journal of
Animal Ecology, 12, 4258.

Examples
summary(corbet)

cqo Fitting Constrained Quadratic Ordination (CQO)

Description
A constrained quadratic ordination (CQO; formerly called canonical Gaussian ordination or CGO)
model is fitted using the quadratic reduced-rank vector generalized linear model (QRR-VGLM)
framework.

Usage
cqo(formula, family, data = list(), weights = NULL, subset = NULL,
na.action = na.fail, etastart = NULL, mustart = NULL,
coefstart = NULL, control = qrrvglm.control(...), offset = NULL,
method = "cqo.fit", model = FALSE, x.arg = TRUE, y.arg = TRUE,
contrasts = NULL, constraints = NULL, extra = NULL,
smart = TRUE, ...)

data an optional data frame containing the variables in the model. By default the vari-
ables are taken from environment(formula), typically the environment from
which cqo is called.
weights an optional vector or matrix of (prior) weights to be used in the fitting process.
Currently, this argument should not be used.
subset an optional logical vector specifying a subset of observations to be used in the
fitting process.
na.action a function which indicates what should happen when the data contain NAs. The
default is set by the na.action setting of options, and is na.fail if that is
unset. The factory-fresh default is na.omit.
etastart starting values for the linear predictors. It is a M -column matrix. If M = 1 then
it may be a vector. Currently, this argument probably should not be used.
mustart starting values for the fitted values. It can be a vector or a matrix. Some family
functions do not make use of this argument. Currently, this argument probably
should not be used.
coefstart starting values for the coefficient vector. Currently, this argument probably
should not be used.
control a list of parameters for controlling the fitting process. See qrrvglm.control
for details.
offset This argument must not be used.
method the method to be used in fitting the model. The default (and presently only)
method cqo.fit uses iteratively reweighted least squares (IRLS).
model a logical value indicating whether the model frame should be assigned in the
model slot.
x.arg, y.arg logical values indicating whether the model matrix and response matrix used in
the fitting process should be assigned in the x and y slots. Note the model matrix
is the LM model matrix.
contrasts an optional list. See the contrasts.arg of model.matrix.default.
constraints an optional list of constraint matrices. The components of the list must be named
with the term it corresponds to (and it must match in character format). Each
constraint matrix must have M rows, and be of full-column rank. By default,
constraint matrices are the M by M identity matrix unless arguments in the
family function itself override these values. If constraints is used it must
contain all the terms; an incomplete list is not accepted. Constraint matrices for
x2 variables are taken as the identity matrix.
extra an optional list with any extra information that might be needed by the family
function.
smart logical value indicating whether smart prediction (smartpred) will be used.
... further arguments passed into qrrvglm.control.

Details
QRR-VGLMs or constrained quadratic ordination (CQO) models are estimated here by maximum
likelihood estimation. Optimal linear combinations of the environmental variables are computed,
cqo 193

called latent variables (these appear as latvar for R = 1 else latvar1, latvar2, etc. in the
output). Here, R is the rank or the number of ordination axes. Each species response is then a
regression of these latent variables using quadratic polynomials on a transformed scale (e.g., log for
Poisson counts, logit for presence/absence responses). The solution is obtained iteratively in order
to maximize the log-likelihood function, or equivalently, minimize the deviance.
The central formula (for Poisson and binomial species data) is given by
M
X
= B1T x1 + A + ( T Dm )em
m=1

where x1 is a vector (usually just a 1 for an intercept), x2 is a vector of environmental variables,

= C T x2 is a R-vector of latent variables, em is a vector of 0s but with a 1 in the mth position.
The are a vector of linear/additive predictors, e.g., the mth element is m = log(E[Ym ]) for
the mth species. The matrices B1 , A, C and Dm are estimated from the data, i.e., contain the
regression coefficients. The tolerance matrices satisfy Ts = 21 Ds1 . Many important CQO details
are directly related to arguments in qrrvglm.control, e.g., the argument noRRR specifies which
variables comprise x1 .
Theoretically, the four most popular VGAM family functions to be used with cqo correspond to the
Poisson, binomial, normal, and negative binomial distributions. The latter is a 2-parameter model.
All of these are implemented, as well as the 2-parameter gamma. The Poisson is or should be catered
for by quasipoissonff and poissonff, and the binomial by quasibinomialff and binomialff.
Those beginning with "quasi" have dispersion parameters that are estimated for each species.
For initial values, the function .Init.Poisson.QO should work reasonably well if the data is Pois-
son with species having equal tolerances. It can be quite good on binary data too. Otherwise the
Cinit argument in qrrvglm.control can be used.
It is possible to relax the quadratic form to an additive model. The result is a data-driven approach
rather than a model-driven approach, so that CQO is extended to constrained additive ordination
(CAO) when R = 1. See cao for more details.
In this documentation, M is the number of linear predictors, S is the number of responses (species).
Then M = S for Poisson and binomial species data, and M = 2S for negative binomial and gamma
distributed species data.
Incidentally, Unconstrained quadratic ordination (UQO) may be performed by, e.g., fitting a Good-
mans RC association model; see uqo and the Yee and Hadi (2014) referenced there. For UQO, the
response is the usual site-by-species matrix and there are no environmental variables; the site scores
are free parameters. UQO can be performed under the assumption that all species have the same
tolerance matrices.

Value
An object of class "qrrvglm".

Warning
Local solutions are not uncommon when fitting CQO models. To increase the chances of obtaining
the global solution, increase the value of the argument Bestof in qrrvglm.control. For repro-
ducibility of the results, it pays to set a different random number seed before calling cqo (the func-
tion set.seed does this). The function cqo chooses initial values for C using .Init.Poisson.QO()
if Use.Init.Poisson.QO = TRUE, else random numbers.
194 cqo

Unless I.tolerances = TRUE or eq.tolerances = FALSE, CQO is computationally expensive

with memory and time. It pays to keep the rank down to 1 or 2. If eq.tolerances = TRUE and
I.tolerances = FALSE then the cost grows quickly with the number of species and sites (in terms
of memory requirements and time). The data needs to conform quite closely to the statistical model,
and the environmental range of the data should be wide in order for the quadratics to fit the data well
(bell-shaped response surfaces). If not, RR-VGLMs will be more appropriate because the response
is linear on the transformed scale (e.g., log or logit) and the ordination is called constrained linear
ordination or CLO.
Like many regression models, CQO is sensitive to outliers (in the environmental and species data),
sparse data, high leverage points, multicollinearity etc. For these reasons, it is necessary to examine
the data carefully for these features and take corrective action (e.g., omitting certain species, sites,
environmental variables from the analysis, transforming certain environmental variables, etc.). Any
optimum lying outside the convex hull of the site scores should not be trusted. Fitting a CAO is
recommended first, then upon transformations etc., possibly a CQO can be fitted.
For binary data, it is necessary to have enough data. In general, the number of sites n ought to be
much larger than the number of species S, e.g., at least 100 sites for two species. Compared to count
(Poisson) data, numerical problems occur more frequently with presence/absence (binary) data. For
example, if Rank = 1 and if the response data for each species is a string of all absences, then all
presences, then all absences (when enumerated along the latent variable) then infinite parameter
estimates will occur. In general, setting I.tolerances = TRUE may help.
This function was formerly called cgo. It has been renamed to reinforce a new nomenclature de-
scribed in Yee (2006).

Note
The input requires care, preparation and thoughta lot more than other ordination methods. Here
is a partial checklist.

(1) The number of species should be kept reasonably low, e.g., 12 max. Feeding in 100+ species
wholesale is a recipe for failure. Choose a few species carefully. Using 10 well-chosen species
is better than 100+ species thrown in willy-nilly.
(2) Each species should be screened individually first, e.g., for presence/absence is the species to-
tally absent or totally present at all sites? For presence/absence data sort(colMeans(data))
can help avoid such species.
(3) The number of explanatory variables should be kept low, e.g., 7 max.
(4) Each explanatory variable should be screened individually first, e.g., is it heavily skewed or are
there outliers? They should be plotted and then transformed where needed. They should not
be too highly correlated with each other.
(5) Each explanatory variable should be scaled, e.g., to mean 0 and unit variance. This is especially
needed for I.tolerance = TRUE.
(6) Keep the rank low. Only if the data is very good should a rank-2 model be attempted. Usually
a rank-1 model is all that is practically possible even after a lot of work. The rank-1 model
should always be attempted first. Then might be clever and try use this for initial values for a
rank-2 model.
(7) If the number of sites is large then choose a random sample of them. For example, choose a
maximum of 500 sites. This will reduce the memory and time expense of the computations.
cqo 195

(8) Try I.tolerance = TRUE or eq.tolerance = FALSE if the inputted data set is large, so as
to reduce the computational expense. Thats because the default, I.tolerance = FALSE and
eq.tolerance = TRUE, is very memory hungry.

By default, a rank-1 equal-tolerances QRR-VGLM model is fitted (see qrrvglm.control for the
default control parameters). If Rank > 1 then the latent variables are always transformed so that
they are uncorrelated. By default, the argument trace is TRUE meaning a running log is printed
out while the computations are taking place. This is because the algorithm is computationally
expensive, therefore users might think that their computers have frozen if trace = FALSE!
The argument Bestof in qrrvglm.control controls the number of models fitted (each uses differ-
ent starting values) to the data. This argument is important because convergence may be to a local
solution rather than the global solution. Using more starting values increases the chances of finding
the global solution. Always plot an ordination diagram (use the generic function lvplot) and see
if it looks sensible. Local solutions arise because the optimization problem is highly nonlinear, and
this is particularly true for CAO.
Many of the arguments applicable to cqo are common to vglm and rrvglm.control. The most
important arguments are Rank, noRRR, Bestof, I.tolerances, eq.tolerances, isd.latvar, and
MUXfactor.
When fitting a 2-parameter model such as the negative binomial or gamma, it pays to have eq.tolerances = TRUE
and I.tolerances = FALSE. This is because numerical problems can occur when fitting the model
far away from the global solution when I.tolerances = TRUE. Setting the two arguments as
described will slow down the computation considerably, however it is numerically more stable.
In Example 1 below, an unequal-tolerances rank-1 QRR-VGLM is fitted to the hunting spiders
dataset, and Example 2 is the equal-tolerances version. The latter is less likely to have convergence
problems compared to the unequal-tolerances model. In Example 3 below, an equal-tolerances
rank-2 QRR-VGLM is fitted to the hunting spiders dataset. The numerical difficulties encountered
in fitting the rank-2 model suggests a rank-1 model is probably preferable. In Example 4 below,
constrained binary quadratic ordination (in old nomenclature, constrained Gaussian logit ordina-
tion) is fitted to some simulated data coming from a species packing model. With multivariate
binary responses, one must use multiple.responses = TRUE to indicate that the response (ma-
trix) is multivariate. Otherwise, it is interpreted as a single binary response variable. In Example 5
below, the deviance residuals are plotted for each species. This is useful as a diagnostic plot. This
is done by (re)regressing each species separately against the latent variable.
Sometime in the future, this function might handle input of the form cqo(x, y), where x and y are
matrices containing the environmental and species data respectively.

Author(s)
Thomas W. Yee. Thanks to Alvin Sou for converting a lot of the original FORTRAN code into C.

References
Yee, T. W. (2004) A new technique for maximum-likelihood canonical Gaussian ordination. Eco-
logical Monographs, 74, 685701.
ter Braak, C. J. F. and Prentice, I. C. (1988) A theory of gradient analysis. Advances in Ecological
Research, 18, 271317.
Yee, T. W. (2006) Constrained additive ordination. Ecology, 87, 203213.
196 cqo

See Also
qrrvglm.control, Coef.qrrvglm, predictqrrvglm, rcqo, cao, uqo, rrvglm, poissonff, binomialff,
negbinomial, gamma2, lvplot.qrrvglm, perspqrrvglm, trplot.qrrvglm, vglm, set.seed, hspider,
trapO.

Examples
## Not run:
# Example 1; Fit an unequal tolerances model to the hunting spiders data
hspider[,1:6] <- scale(hspider[,1:6]) # Standardized environmental variables
set.seed(1234) # For reproducibility of the results
p1ut <- cqo(cbind(Alopacce, Alopcune, Alopfabr, Arctlute, Arctperi,
Auloalbi, Pardlugu, Pardmont, Pardnigr, Pardpull,
Trocterr, Zoraspin) ~
WaterCon + BareSand + FallTwig + CoveMoss + CoveHerb + ReflLux,
fam = poissonff, data = hspider, Crow1positive = FALSE,
eq.tol = FALSE)
sort(deviance(p1ut, history = TRUE)) # A history of all the iterations
if (deviance(p1ut) > 1177) warning("suboptimal fit obtained")

S <- ncol(depvar(p1ut)) # Number of species

clr <- (1:(S+1))[-7] # Omits yellow
lvplot(p1ut, y = TRUE, lcol = clr, pch = 1:S, pcol = clr,
las = 1) # Ordination diagram
legend("topright", leg = colnames(depvar(p1ut)), col = clr,
pch = 1:S, merge = TRUE, bty = "n", lty = 1:S, lwd = 2)
(cp <- Coef(p1ut))

(a <- latvar(cp)[[email protected]]) # Ordered site scores along the gradient

# Names of the ordered sites along the gradient:
rownames(latvar(cp))[[email protected]]
(aa <- Opt(cp)[, [email protected]]) # Ordered optimums along the gradient
aa <- aa[!is.na(aa)] # Delete the species that is not unimodal
names(aa) # Names of the ordered optimums along the gradient

trplot(p1ut, which.species = 1:3, log = "xy", type = "b", lty = 1, lwd = 2,

col = c("blue","red","green"), label = TRUE) -> ii # Trajectory plot
legend(0.00005, 0.3, paste(ii$species[, 1], ii$species[, 2], sep = " and "),
lwd = 2, lty = 1, col = c("blue", "red", "green"))
abline(a = 0, b = 1, lty = "dashed")

S <- ncol(depvar(p1ut)) # Number of species

clr <- (1:(S+1))[-7] # Omits yellow
persp(p1ut, col = clr, label = TRUE, las = 1) # Perspective plot

# Example 2; Fit an equal tolerances model. Less numerically fraught.

poissonff, data = hspider, Crow1positive = FALSE)

sort(deviance(p1et, history = TRUE)) # A history of all the iterations
if (deviance(p1et) > 1586) warning("suboptimal fit obtained")
S <- ncol(depvar(p1et)) # Number of species
clr <- (1:(S+1))[-7] # Omits yellow
persp(p1et, col = clr, label = TRUE, las = 1)

# Example 3: A rank-2 equal tolerances CQO model with Poisson data

# This example is numerically fraught... need I.toler = TRUE too.
set.seed(555)
p2 <- cqo(cbind(Alopacce, Alopcune, Alopfabr, Arctlute, Arctperi,
Auloalbi, Pardlugu, Pardmont, Pardnigr, Pardpull,
Trocterr, Zoraspin) ~
WaterCon + BareSand + FallTwig + CoveMoss + CoveHerb + ReflLux,
poissonff, data = hspider, Crow1positive = FALSE,
I.toler = TRUE, Rank = 2, Bestof = 3, isd.latvar = c(2.1, 0.9))
sort(deviance(p2, history = TRUE)) # A history of all the iterations
if (deviance(p2) > 1127) warning("suboptimal fit obtained")
lvplot(p2, ellips = FALSE, label = TRUE, xlim = c(-3,4),
C = TRUE, Ccol = "brown", sites = TRUE, scol = "grey",
pcol = "blue", pch = "+", chull = TRUE, ccol = "grey")

# Example 4: species packing model with presence/absence data

set.seed(2345)
n <- 200; p <- 5; S <- 5
mydata <- rcqo(n, p, S, fam = "binomial", hi.abundance = 4,
eq.tol = TRUE, es.opt = TRUE, eq.max = TRUE)
myform <- attr(mydata, "formula")
set.seed(1234)
b1et <- cqo(myform, binomialff(multiple.responses = TRUE, link = "cloglog"),
data = mydata)
sort(deviance(b1et, history = TRUE)) # A history of all the iterations
lvplot(b1et, y = TRUE, lcol = 1:S, pch = 1:S, pcol = 1:S, las = 1)
Coef(b1et)

# Compare the fitted model with the 'truth'

cbind(truth = attr(mydata, "concoefficients"), fitted = concoef(b1et))

# Example 5: Plot the deviance residuals for diagnostic purposes

set.seed(1234)
p1et <- cqo(cbind(Alopacce, Alopcune, Alopfabr, Arctlute, Arctperi,
Auloalbi, Pardlugu, Pardmont, Pardnigr, Pardpull,
Trocterr, Zoraspin) ~
WaterCon + BareSand + FallTwig + CoveMoss + CoveHerb + ReflLux,
poissonff, data = hspider, eq.tol = TRUE, trace = FALSE)
sort(deviance(p1et, history = TRUE)) # A history of all the iterations
if (deviance(p1et) > 1586) warning("suboptimal fit obtained")
S <- ncol(depvar(p1et))
par(mfrow = c(3, 4))
for (ii in 1:S) {
198 crashes

tempdata <- data.frame(latvar1 = c(latvar(p1et)),

sppCounts = depvar(p1et)[, ii])
tempdata <- transform(tempdata, myOffset = -0.5 * latvar1^2)

# For species ii, refit the model to get the deviance residuals
fit1 <- vglm(sppCounts ~ offset(myOffset) + latvar1, poissonff,
data = tempdata, trace = FALSE)

# For checking: this should be 0

# print("max(abs(c(Coef(p1et)@B1[1,ii],Coef(p1et)@A[ii,1])-coef(fit1)))")
# print( max(abs(c(Coef(p1et)@B1[1,ii],Coef(p1et)@A[ii,1])-coef(fit1))) )

# Plot the deviance residuals

devresid <- resid(fit1, type = "deviance")
predvalues <- predict(fit1) + fit1@offset
ooo <- with(tempdata, order(latvar1))
plot(predvalues + devresid ~ latvar1, data = tempdata, col = "red",
xlab = "latvar1", ylab = "", main = colnames(depvar(p1et))[ii])
with(tempdata, lines(latvar1[ooo], predvalues[ooo], col = "blue"))
}

## End(Not run)

crashes Crashes on New Zealand Roads in 2009

Description
A variety of reported crash data cross-classified by time (hour of the day) and day of the week,
accumulated over 2009. These include fatalities and injuries (by car), trucks, motor cycles, bicycles
and pedestrians. There are some alcohol-related data too.

Usage
data(crashi)
data(crashf)
data(crashtr)
data(crashmc)
data(crashbc)
data(crashp)
data(alcoff)
data(alclevels)

Format
Data frames with hourly times as rows and days of the week as columns. The alclevels dataset
has hourly times and alcohol levels.

Mon, Tue, Wed, Thu, Fri, Sat, Sun Day of the week.
crashes 199

0-30, 31-50, 51-80, 81-100, 101-120, 121-150, 151-200, 201-250, 251-300, 301-350, 350+ Blood al-
cohol level (milligrams alcohol per 100 millilitres of blood).

Details
Each cell is the aggregate number of crashes reported at each hour-day combination, over the 2009
calendar year. The rownames of each data frame is the start time (hourly from midnight onwards)
on a 24 hour clock, e.g., 21 means 9.00pm to 9.59pm.
For crashes, chrashi are the number of injuries by car, crashf are the number of fatalities by car
(not included in chrashi), crashtr are the number of crashes involving trucks, crashmc are the
number of crashes involving motorcyclists, crashbc are the number of crashes involving bicycles,
and crashp are the number of crashes involving pedestrians. For alcohol-related offences, alcoff
are the number of alcohol offenders from breath screening drivers, and alclevels are the blood
alcohol levels of fatally injured drivers.

Source
http://www.transport.govt.nz/research/Pages/Motor-Vehicle-Crashes-in-New-Zealand-2009.
aspx. Thanks to Warwick Goold and Alfian F. Hadi for assistance.

References
Motor Vehicles Crashes in New Zealand 2009; Statistical Statement Calendar Year 2009. Ministry
of Transport, NZ Government; Yearly Report 2010. ISSN: 1176-3949

Clabels = as.character(1:23), xlim = c(-1.3, 2.3),

ylim = c(-1.2, 1))
## End(Not run)

cratio Ordinal Regression with Continuation Ratios

Description
Fits a continuation ratio logit/probit/cloglog/cauchit/... regression model to an ordered (preferably)
factor response.

Usage
cratio(link = "logit", parallel = FALSE, reverse = FALSE, zero = NULL,
whitespace = FALSE)

Arguments
link Link function applied to the M continuation ratio probabilities. See Links for
more choices.
parallel A logical, or formula specifying which terms have equal/unequal coefficients.
reverse Logical. By default, the continuation ratios used are j = logit(P [Y > j|Y
j]) for j = 1, . . . , M . If reverse is TRUE, then j = logit(P [Y < j + 1|Y
j + 1]) will be used.
zero An integer-valued vector specifying which linear/additive predictors are mod-
elled as intercepts only. The values must be from the set {1,2,. . . ,M }. The
default value means none are modelled as intercept-only terms.
whitespace See CommonVGAMffArguments for information.

Details
In this help file the response Y is assumed to be a factor with ordered values 1, 2, . . . , M + 1, so
that M is the number of linear/additive predictors j .
There are a number of definitions for the continuation ratio in the literature. To make life easier, in
the VGAM package, we use continuation ratios and stopping ratios (see sratio). Stopping ratios
deal with quantities such as logit(P[Y=j|Y>=j]).

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, rrvglm and vgam.

Warning
No check is made to verify that the response is ordinal if the response is a matrix; see ordered.
cumulative 201

Note
The response should be either a matrix of counts (with row sums that are all positive), or a factor.
In both cases, the y slot returned by vglm/vgam/rrvglm is the matrix of counts.
For a nominal (unordered) factor response, the multinomial logit model (multinomial) is more
appropriate.
Here is an example of the usage of the parallel argument. If there are covariates x1, x2 and x3,
then parallel = TRUE ~ x1 + x2 -1 and parallel = FALSE ~ x3 are equivalent. This would
constrain the regression coefficients for x1 and x2 to be equal; those of the intercepts and x3 would
be different.

Author(s)
Thomas W. Yee

References
Agresti, A. (2013) Categorical Data Analysis, 3rd ed. Hoboken, NJ, USA: Wiley.
McCullagh, P. and Nelder, J. A. (1989) Generalized Linear Models, 2nd ed. London: Chapman &
Hall.
Yee, T. W. (2010) The VGAM package for categorical data analysis. Journal of Statistical Software,
32, 134. http://www.jstatsoft.org/v32/i10/.

See Also
sratio, acat, cumulative, multinomial, margeff, pneumo, logit, probit, cloglog, cauchit.

Examples
pneumo <- transform(pneumo, let = log(exposure.time))
(fit <- vglm(cbind(normal, mild, severe) ~ let,
cratio(parallel = TRUE), data = pneumo))
coef(fit, matrix = TRUE)
constraints(fit)
predict(fit)
predict(fit, untransform = TRUE)
margeff(fit)

cumulative Ordinal Regression with Cumulative Probabilities

Description
Fits a cumulative link regression model to a (preferably ordered) factor response.
202 cumulative

Usage
cumulative(link = "logit", parallel = FALSE, reverse = FALSE,
multiple.responses = FALSE, whitespace = FALSE)

Arguments
link Link function applied to the J cumulative probabilities. See Links for more
choices, e.g., for the cumulative probit/cloglog/cauchit/. . . models.
parallel A logical or formula specifying which terms have equal/unequal coefficients.
See below for more information about the parallelism assumption. The default
results in what some people call the generalized ordered logit model to be fitted.
If parallel = TRUE then it does not apply to the intercept.
reverse Logical. By default, the cumulative probabilities used are P (Y 1), P (Y
2), . . . , P (Y J). If reverse is TRUE then P (Y 2), P (Y 3), . . . ,
P (Y J + 1) are used.
This should be set to TRUE for link= golf, polf, nbolf. For these links the
cutpoints must be an increasing sequence; if reverse = FALSE for then the
cutpoints must be an decreasing sequence.
multiple.responses
Logical. Multiple responses? If TRUE then the input should be a matrix with
values 1, 2, . . . , L, where L = J + 1 is the number of levels. Each column
of the matrix is a response, i.e., multiple responses. A suitable matrix can be
obtained from Cut.
whitespace See CommonVGAMffArguments for information.

Details
In this help file the response Y is assumed to be a factor with ordered values 1, 2, . . . , J + 1. Hence
M is the number of linear/additive predictors j ; for cumulative() one has M = J.
This VGAM family function fits the class of cumulative link models to (hopefully) an ordinal re-
sponse. By default, the non-parallel cumulative logit model is fitted, i.e.,
j = logit(P [Y j])
where j = 1, 2, . . . , M and the j are not constrained to be parallel. This is also known as the non-
proportional odds model. If the logit link is replaced by a complementary log-log link (cloglog)
then this is known as the proportional-hazards model.
In almost all the literature, the constraint matrices associated with this family of models are known.
For example, setting parallel = TRUE will make all constraint matrices (except for the intercept)
equal to a vector of M 1s. If the constraint matrices are equal, unknown and to be estimated, then
this can be achieved by fitting the model as a reduced-rank vector generalized linear model (RR-
VGLM; see rrvglm). Currently, reduced-rank vector generalized additive models (RR-VGAMs)
have not been implemented here.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, and vgam.
cumulative 203

Warning
No check is made to verify that the response is ordinal if the response is a matrix; see ordered.

Note
The response should be either a matrix of counts (with row sums that are all positive), or a fac-
tor. In both cases, the y slot returned by vglm/vgam/rrvglm is the matrix of counts. The formula
must contain an intercept term. Other VGAM family functions for an ordinal response include
acat, cratio, sratio. For a nominal (unordered) factor response, the multinomial logit model
(multinomial) is more appropriate.
With the logit link, setting parallel = TRUE will fit a proportional odds model. Note that the
TRUE here does not apply to the intercept term. In practice, the validity of the proportional odds
assumption needs to be checked, e.g., by a likelihood ratio test (LRT). If acceptable on the data,
then numerical problems are less likely to occur during the fitting, and there are less parameters.
Numerical problems occur when the linear/additive predictors cross, which results in probabilities
outside of (0, 1); setting parallel = TRUE will help avoid this problem.
Here is an example of the usage of the parallel argument. If there are covariates x2, x3 and x4,
then parallel = TRUE ~ x2 + x3 -1 and parallel = FALSE ~ x4 are equivalent. This would
constrain the regression coefficients for x2 and x3 to be equal; those of the intercepts and x4 would
be different.
If the data is inputted in long format (not wide format, as in pneumo below) and the self-starting
initial values are not good enough then try using mustart, coefstart and/or etatstart. See the
example below.
To fit the proportional odds model one can use the VGAM family function propodds. Note that
propodds(reverse) is equivalent to cumulative(parallel = TRUE, reverse = reverse)
(which is equivalent to cumulative(parallel = TRUE, reverse = reverse, link = "logit")).
It is for convenience only. A call to cumulative() is preferred since it reminds the user that a par-
allelism assumption is made, as well as being a lot more flexible.

Author(s)
Thomas W. Yee

References
Agresti, A. (2013) Categorical Data Analysis, 3rd ed. Hoboken, NJ, USA: Wiley.
Agresti, A. (2010) Analysis of Ordinal Categorical Data, 2nd ed. Hoboken, NJ, USA: Wiley.
Dobson, A. J. and Barnett, A. (2008) An Introduction to Generalized Linear Models, 3rd ed. Boca
Raton: Chapman & Hall/CRC Press.
McCullagh, P. and Nelder, J. A. (1989) Generalized Linear Models, 2nd ed. London: Chapman &
Hall.
Simonoff, J. S. (2003) Analyzing Categorical Data, New York: Springer-Verlag.
Yee, T. W. (2010) The VGAM package for categorical data analysis. Journal of Statistical Software,
32, 134. http://www.jstatsoft.org/v32/i10/.
Yee, T. W. and Wild, C. J. (1996) Vector generalized additive models. Journal of the Royal Statisti-
cal Society, Series B, Methodological, 58, 481493.
204 cumulative

See Also
propodds, prplot, margeff, acat, cratio, sratio, multinomial, pneumo, Links, logit, probit,
cloglog, cauchit, golf, polf, nbolf, logistic1.

Examples
# Fit the proportional odds model, p.179, in McCullagh and Nelder (1989)
pneumo <- transform(pneumo, let = log(exposure.time))
(fit <- vglm(cbind(normal, mild, severe) ~ let,
cumulative(parallel = TRUE, reverse = TRUE), data = pneumo))
depvar(fit) # Sample proportions (good technique)
fit@y # Sample proportions (bad technique)
weights(fit, type = "prior") # Number of observations
coef(fit, matrix = TRUE)
constraints(fit) # Constraint matrices
apply(fitted(fit), 1, which.max) # Classification
apply(predict(fit, newdata = pneumo, type = "response"),
1, which.max) # Classification

# Check that the model is linear in let ----------------------

fit2 <- vgam(cbind(normal, mild, severe) ~ s(let, df = 2),
cumulative(reverse = TRUE), data = pneumo)
## Not run: plot(fit2, se = TRUE, overlay = TRUE, lcol = 1:2, scol = 1:2)

# Check the proportional odds assumption with a LRT ----------

(fit3 <- vglm(cbind(normal, mild, severe) ~ let,
cumulative(parallel = FALSE, reverse = TRUE), data = pneumo))
pchisq(2 * (logLik(fit3) - logLik(fit)),
df = length(coef(fit3)) - length(coef(fit)), lower.tail = FALSE)
lrtest(fit3, fit) # More elegant

# A factor() version of fit ----------------------------------

# This is in long format (cf. wide format above)
Nobs <- round(depvar(fit) * c(weights(fit, type = "prior")))
sumNobs <- colSums(Nobs) # apply(Nobs, 2, sum)

pneumo.long <-
data.frame(symptoms = ordered(rep(rep(colnames(Nobs), nrow(Nobs)),
times = c(t(Nobs))),
levels = colnames(Nobs)),
let = rep(rep(with(pneumo, let), each = ncol(Nobs)),
times = c(t(Nobs))))
with(pneumo.long, table(let, symptoms)) # Should be same as pneumo

(fit.long1 <- vglm(symptoms ~ let, data = pneumo.long, trace = TRUE,

cumulative(parallel = TRUE, reverse = TRUE)))
coef(fit.long1, matrix = TRUE) # Should be as coef(fit, matrix = TRUE)
# Could try using mustart if fit.long1 failed to converge.
mymustart <- matrix(sumNobs / sum(sumNobs),
nrow(pneumo.long), ncol(Nobs), byrow = TRUE)
fit.long2 <- vglm(symptoms ~ let, mustart = mymustart,
Dagum 205

cumulative(parallel = TRUE, reverse = TRUE),

data = pneumo.long, trace = TRUE)
coef(fit.long2, matrix = TRUE) # Should be as coef(fit, matrix = TRUE)

Dagum The Dagum Distribution

Description
Density, distribution function, quantile function and random generation for the Dagum distribution
with shape parameters a and p, and scale parameter scale.

Usage
ddagum(x, scale = 1, shape1.a, shape2.p, log = FALSE)
pdagum(q, scale = 1, shape1.a, shape2.p, lower.tail = TRUE, log.p = FALSE)
qdagum(p, scale = 1, shape1.a, shape2.p, lower.tail = TRUE, log.p = FALSE)
rdagum(n, scale = 1, shape1.a, shape2.p)

Arguments
x, q vector of quantiles.
p vector of probabilities.
n number of observations. If length(n) > 1, the length is taken to be the number
required.
shape1.a, shape2.p
shape parameters.
scale scale parameter.
log Logical. If log = TRUE then the logarithm of the density is returned.
lower.tail, log.p
Same meaning as in pnorm or qnorm.

Details
See dagum, which is the VGAM family function for estimating the parameters by maximum likeli-
hood estimation.

Value
ddagum gives the density, pdagum gives the distribution function, qdagum gives the quantile function,
and rdagum generates random deviates.

Note
The Dagum distribution is a special case of the 4-parameter generalized beta II distribution.
206 dagum

Author(s)

T. W. Yee and Kai Huang

References

Kleiber, C. and Kotz, S. (2003) Statistical Size Distributions in Economics and Actuarial Sciences,
Hoboken, NJ, USA: Wiley-Interscience.

See Also

dagum, genbetaII.

Examples
probs <- seq(0.1, 0.9, by = 0.1)
shape1.a <- 1; shape2.p <- 2
# Should be 0:
max(abs(pdagum(qdagum(p = probs, shape1.a = shape1.a, shape2.p = shape2.p),
shape1.a = shape1.a, shape2.p = shape2.p) - probs))

## Not run: par(mfrow = c(1, 2))

x <- seq(-0.01, 5, len = 401)
plot(x, dexp(x), type = "l", col = "black", ylab = "", las = 1, ylim = c(0, 1),
main = "Black is standard exponential, others are ddagum(x, ...)")
lines(x, ddagum(x, shape1.a = shape1.a, shape2.p = 1), col = "orange")
lines(x, ddagum(x, shape1.a = shape1.a, shape2.p = 2), col = "blue")
lines(x, ddagum(x, shape1.a = shape1.a, shape2.p = 5), col = "green")
legend("topright", col = c("orange","blue","green"), lty = rep(1, len = 3),
legend = paste("shape1.a =", shape1.a, ", shape2.p =", c(1, 2, 5)))

plot(x, pexp(x), type = "l", col = "black", ylab = "", las = 1,

main = "Black is standard exponential, others are pdagum(x, ...)")
lines(x, pdagum(x, shape1.a = shape1.a, shape2.p = 1), col = "orange")
lines(x, pdagum(x, shape1.a = shape1.a, shape2.p = 2), col = "blue")
lines(x, pdagum(x, shape1.a = shape1.a, shape2.p = 5), col = "green")
legend("bottomright", col = c("orange","blue","green"), lty = rep(1, len = 3),
legend = paste("shape1.a =", shape1.a, ", shape2.p =", c(1, 2, 5)))

## End(Not run)

dagum Dagum Distribution Family Function

Description

Maximum likelihood estimation of the 3-parameter Dagum distribution.

dagum 207

Usage
dagum(lscale = "loge", lshape1.a = "loge", lshape2.p = "loge",
iscale = NULL, ishape1.a = NULL, ishape2.p = NULL, imethod = 1,
lss = TRUE, gscale = exp(-5:5), gshape1.a = seq(0.75, 4, by = 0.25),
gshape2.p = exp(-5:5), probs.y = c(0.25, 0.5, 0.75), zero = "shape")

Arguments
lss See CommonVGAMffArguments for important information.
lshape1.a, lscale, lshape2.p
Parameter link functions applied to the (positive) parameters a, scale, and p.
See Links for more choices.
iscale, ishape1.a, ishape2.p, imethod, zero
See CommonVGAMffArguments for information. For imethod = 2 a good initial
value for ishape2.p is needed to obtain a good estimate for the other parameter.
gscale, gshape1.a, gshape2.p
See CommonVGAMffArguments for information.
probs.y See CommonVGAMffArguments for information.

Details
The 3-parameter Dagum distribution is the 4-parameter generalized beta II distribution with shape
parameter q = 1. It is known under various other names, such as the Burr III, inverse Burr, beta-
K, and 3-parameter kappa distribution. It can be considered a generalized log-logistic distribution.
Some distributions which are special cases of the 3-parameter Dagum are the inverse Lomax (a =
1), Fisk (p = 1), and the inverse paralogistic (a = p). More details can be found in Kleiber and
Kotz (2003).
The Dagum distribution has a cumulative distribution function

F (y) = [1 + (y/b)a ]p

which leads to a probability density function

f (y) = apy ap1 /[bap {1 + (y/b)a }p+1 ]

for a > 0, b > 0, p > 0, y 0. Here, b is the scale parameter scale, and the others are shape
parameters. The mean is

E(Y ) = b (p + 1/a) (1 1/a)/(p)

provided ap < 1 < a; these are returned as the fitted values. This family function handles
multiple responses.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, and vgam.
208 dAR1

Note

See the notes in genbetaII.

From Kleiber and Kotz (2003), the MLE is rather sensitive to isolated observations located suf-
ficiently far from the majority of the data. Reliable estimation of the scale parameter require
n > 7000, while estimates for a and p can be considered unbiased for n > 2000 or 3000.

Author(s)

T. W. Yee

References

Kleiber, C. and Kotz, S. (2003) Statistical Size Distributions in Economics and Actuarial Sciences,
Hoboken, NJ, USA: Wiley-Interscience.

See Also

Dagum, genbetaII, betaII, sinmad, fisk, inv.lomax, lomax, paralogistic, inv.paralogistic,

simulate.vlm.

Examples
ddata <- data.frame(y = rdagum(n = 3000, scale = exp(2),
shape1 = exp(1), shape2 = exp(1)))
fit <- vglm(y ~ 1, dagum(lss = FALSE), data = ddata, trace = TRUE)
fit <- vglm(y ~ 1, dagum(lss = FALSE, ishape1.a = exp(1)),
data = ddata, trace = TRUE)
coef(fit, matrix = TRUE)
Coef(fit)
summary(fit)

dAR1 The AR-1 Autoregressive Process

Description

Density for the AR-1 model.

Usage

dAR1(x, drift = 0, var.error = 1, ARcoef1 = 0.0,

type.likelihood = c("exact", "conditional"), log = FALSE)
dAR1 209

Arguments
x, vector of quantiles.
drift the scaled mean (also known as the drift parameter), . Note that the mean is
/(1 ). The default corresponds to observations that have mean 0.
log Logical. If TRUE then the logarithm of the density is returned.
type.likelihood, var.error, ARcoef1
See AR1. The argument ARcoef1 is . The argument var.error is the variance
of the i.i.d. random noise, i.e., 2 . If type.likelihood = "conditional" then
the first element or row of the result is currently assigned NAthis is because
the density of the first observation is effectively ignored.

Details
Most of the background to this function is given in AR1. All the arguments are converted into
matrices, and then all their dimensions are obtained. They are then coerced into the same size: the
number of rows is the maximum of all the single rows, and ditto for the number of columns.

Value
dAR1 gives the density.

Author(s)
T. W. Yee and Victor Miranda

fit2 <- vglm(TS1 ~ 1, AR1(type.likelihood = "cond"), data = tdata, trace = TRUE)

(Cfit2 <- Coef(fit2)) # Okay for intercept-only models
logLik(fit2)
head(keep <- dAR1(depvar(fit2), drift = Cfit2[1], var.error = (Cfit2[2])^2,
ARcoef1 = Cfit2[3], type.likelihood = "cond", log = TRUE))
sum(keep[-1])
210 deermice

deermice Captures of Peromyscus maniculatus, also known as deer mice.

Description
Captures of Peromyscus maniculatus collected at East Stuart Gulch, Colorado, USA.

Usage
data(deermice)

Format
The format is a data frame.

Details
Peromyscus maniculatus is a rodent native to North America. The deer mouse is small in size, only
about 8 to 10 cm long, not counting the length of the tail.
Originally, the columns of this data frame represent the sex (m or f), the ages (y: young, sa: semi-
adult, a: adult), the weights in grams, and the capture histories of 38 individuals over 6 trapping
occasions (1: captured, 0: not captured).
The data set was collected by V. Reid and distributed with the CAPTURE program of Otis et al.
(1978).
deermice has 38 deermice whereas Perom had 36 deermice (Perom has been withdrawn.) In
deermice the two semi-adults have been classified as adults. The sex variable has 1 for female,
and 0 for male.

References
Huggins, R. M. (1991) Some practical aspects of a conditional likelihood approach to capture ex-
periments. Biometrics, 47, 725732.
Otis, D. L. et al. (1978) Statistical inference from capture data on closed animal populations,
Wildlife Monographs, 62, 3135.

See Also
posbernoulli.b, posbernoulli.t, fill1.

Examples
head(deermice)
## Not run:
fit1 <- vglm(cbind(y1, y2, y3, y4, y5, y6) ~ sex + age,
posbernoulli.t(parallel.t = TRUE), data = deermice, trace = TRUE)
coef(fit1)
coef(fit1, matrix = TRUE)
deplot.lmscreg 211

## End(Not run)

deplot.lmscreg Density Plot for LMS Quantile Regression

Description
Plots a probability density function associated with a LMS quantile regression.

Usage
deplot.lmscreg(object, newdata = NULL, x0, y.arg, show.plot = TRUE, ...)

Arguments
object A VGAM quantile regression model, i.e., an object produced by modelling func-
tions such as vglm and vgam with a family function beginning with "lms.", e.g.,
lms.yjn.
newdata Optional data frame containing secondary variables such as sex. It should have
a maximum of one row. The default is to use the original data.
x0 Numeric. The value of the primary variable at which to make the slice.
y.arg Numerical vector. The values of the response variable at which to evaluate the
density. This should be a grid that is fine enough to ensure the plotted curves are
smooth.
show.plot Logical. Plot it? If FALSE no plot will be done.
... Graphical parameter that are passed into plotdeplot.lmscreg.

Details
This function calls, e.g., deplot.lms.yjn in order to compute the density function.

Value
The original object but with a list placed in the slot post, called @post$deplot. The list has
components

newdata The argument newdata above, or a one-row data frame constructed out of the
x0 argument.
y The argument y.arg above.
density Vector of the density function values evaluated at y.arg.

Note
plotdeplot.lmscreg actually does the plotting.
212 depvar

Author(s)
Thomas W. Yee

References
Yee, T. W. (2004) Quantile regression via vector generalized additive models. Statistics in Medicine,
23, 22952315.

See Also
plotdeplot.lmscreg, qtplot.lmscreg, lms.bcn, lms.bcg, lms.yjn.

Examples
## Not run:
fit <- vgam(BMI ~ s(age, df = c(4, 2)), fam = lms.bcn(zero = 1), data = bmi.nz)
ygrid <- seq(15, 43, by = 0.25)
deplot(fit, x0 = 20, y = ygrid, xlab = "BMI", col = "green", llwd = 2,
main = "BMI distribution at ages 20 (green), 40 (blue), 60 (red)")
deplot(fit, x0 = 40, y = ygrid, add = TRUE, col = "blue", llwd = 2)
deplot(fit, x0 = 60, y = ygrid, add = TRUE, col = "red", llwd = 2) -> a

names(a@post$deplot)
a@post$deplot$newdata
head(a@post$deplot$y)
head(a@post$deplot$density)

## End(Not run)

depvar Response Variable Extracted

Description
A generic function that extracts the response/dependent variable from objects.

Usage
depvar(object, ...)

Arguments
object An object that has some response/dependent variable.
... Other arguments fed into the specific methods function of the model. In partic-
ular, sometimes type = c("lm", "lm2") is available, in which case the first
one is chosen if the user does not input a value. The latter value corresponds to
argument form2, and sometimes a response for that is optional.
df.residual 213

Details

By default this function is preferred to calling fit@y, say.

Value

The response/dependent variable, usually as a matrix or vector.

Author(s)

Thomas W. Yee

See Also

model.matrix, vglm.

Examples
pneumo <- transform(pneumo, let = log(exposure.time))
(fit <- vglm(cbind(normal, mild, severe) ~ let, propodds, data = pneumo))
fit@y # Sample proportions (not recommended)
depvar(fit) # Better than using fit@y; dependent variable (response)
weights(fit, type = "prior") # Number of observations

df.residual Residual Degrees-of-Freedom

Description

Returns the residual degrees-of-freedom extracted from a fitted VGLM object.

Usage

df.residual_vlm(object, type = c("vlm", "lm"), ...)

Arguments

object an object for which the degrees-of-freedom are desired, e.g., a vglm object.
type the type of residual degrees-of-freedom wanted. In some applications the usual
LM-type value may be more appropriate. The default is the first choice.
... additional optional arguments.
214 dgenpois

Details

When a VGLM is fitted, a large (VLM) generalized least squares (GLS) fit is done at each IRLS
iteration. To do this, an ordinary least squares (OLS) fit is performed by transforming the GLS using
Cholesky factors. The number of rows is M times the ordinary number of rows of the LM-type
model: nM . Here, M is the number of linear/additive predictors. So the formula for the VLM-type
residual degrees-of-freedom is nM p where p is the number of columns of the big VLM
matrix. The formula for the LM-type residual degrees-of-freedom is n pj where pj is the number
of columns of the ordinary LM matrix corresponding to the jth linear/additive predictor.

Value

The value of the residual degrees-of-freedom extracted from the object. When type = "vlm" this
is a single integer, and when type = "lm" this is a M -vector of integers.

See Also

vglm, deviance, lm.

Examples
pneumo <- transform(pneumo, let = log(exposure.time))
(fit <- vglm(cbind(normal, mild, severe) ~ let, propodds, data = pneumo))
head(model.matrix(fit, type = "vlm"))
head(model.matrix(fit, type = "lm"))

df.residual(fit, type = "vlm") # n * M - p_VLM

nobs(fit, type = "vlm") # n * M
nvar(fit, type = "vlm") # p_VLM

df.residual(fit, type = "lm") # n - p_LM(j); Useful in some situations

nobs(fit, type = "lm") # n
nvar(fit, type = "lm") # p_LM
nvar_vlm(fit, type = "lm") # p_LM(j) (<= p_LM elementwise)

dgenpois The Generalized Poisson Distribution

Description

Density for the Generalized Poisson Distribution.

Usage

dgenpois(x, lambda = 0, theta, log = FALSE)

dgenpois 215

Arguments

x, vector of quantiles.
lambda, theta See genpoisson. The default value of lambda corresponds to an ordinary Pois-
son distribution.
log Logical. If TRUE then the logarithm of the density is returned.

Details

Most of the background to this function is given in genpoisson. Some warnings relevant to this
distribution are given there, especially relating to the complicated range of the parameter lambda
about or near 1.
Note that numerical round off errors etc. can occur; see below for an example.

Value

dgenpois gives the density. The value NaN is returned for elements not satisfying the parameter
restrictions, e.g., if > 1.

Author(s)

T. W. Yee

See Also

genpoisson, dpois.

Examples

sum(dgenpois(0:1000, lambda = -0.5, theta = 2)) # Not perfect...

## Not run:
lambda <- -0.2; theta <- 2; y <- 0:10
proby <- dgenpois(y, lambda = lambda, theta = theta, log = FALSE)
plot(y, proby, type = "h", col = "blue", lwd = 2, ylab = "P[Y=y]",
main = paste("Y ~ Generalized Poisson(lambda=", lambda,
", theta=", theta, ")", sep = ""), las = 1,
sub = "Orange is the Poisson probability function")
sum(proby)
lines(y + 0.1, dpois(y, theta), type = "h", lwd = 2, col = "orange")

## End(Not run)
216 dhuber

dhuber Hubers Least Favourable Distribution

Description
Density, distribution function, quantile function and random generation for Hubers least favourable
distribution, see Huber and Ronchetti (2009).

Usage
dhuber(x, k = 0.862, mu = 0, sigma = 1, log = FALSE)
edhuber(x, k = 0.862, mu = 0, sigma = 1, log = FALSE)
rhuber(n, k = 0.862, mu = 0, sigma = 1)
qhuber(p, k = 0.862, mu = 0, sigma = 1, lower.tail = TRUE, log.p = FALSE)
phuber(q, k = 0.862, mu = 0, sigma = 1, lower.tail = TRUE, log.p = FALSE)

Arguments
x, q numeric vector, vector of quantiles.
p vector of probabilities.
n number of random values to be generated. If length(n) > 1 then the length is
taken to be the number required.
k numeric. Borderline value of central Gaussian part of the distribution. This is
known as the tuning constant, and should be positive. For example, k = 0.862
refers to a 20% contamination neighborhood of the Gaussian distribution. If
k = 1.40 then this is 5% contamination.
mu numeric. distribution mean.
sigma numeric. Distribution scale (sigma = 1 defines the distribution in standard
form, with standard Gaussian centre).
log Logical. If log = TRUE then the logarithm of the result is returned.
lower.tail, log.p
Same meaning as in pnorm or qnorm.

Details
Details are given in huber2, the VGAM family function for estimating the parameters mu and
sigma.

Value
dhuber gives out a vector of density values.
edhuber gives out a list with components val (density values) and eps (contamination proportion).
rhuber gives out a vector of random numbers generated by Hubers least favourable distribution.
phuber gives the distribution function, qhuber gives the quantile function.
Diffzeta 217

Author(s)
Christian Hennig wrote [d,ed,r]huber() (from smoothmest) and slight modifications were made
by T. W. Yee to replace looping by vectorization and addition of the log argument. Arash Ardalan
wrote [pq]huber(), and two arguments for these were implemented by Kai Huang. This helpfile
was adapted from smoothmest.

Diffzeta Differenced Zeta Distribution

Description
Density, distribution function, quantile function, and random generation for the differenced zeta
distribution.

Usage
ddiffzeta(x, shape, start = 1, log = FALSE)
pdiffzeta(q, shape, start = 1, lower.tail = TRUE)
qdiffzeta(p, shape, start = 1)
rdiffzeta(n, shape, start = 1)
218 Diffzeta

Arguments

x, q, p, n Same as in runif.
shape, start Details at diffzeta.
log, lower.tail
Same as in runif.

Details

This distribution appears to work well on the distribution of English words in such texts. Some
more details are given in diffzeta.

Value

ddiffzeta gives the density, pdiffzeta gives the distribution function, qdiffzeta gives the quan-
tile function, and rdiffzeta generates random deviates.

Note

Given some response data, the VGAM family function diffzeta estimates the parameter shape.
Function pdiffzeta() suffers from the problems that plog sometimes has, i.e., when p is very
close to 1.

Author(s)

T. W. Yee

See Also

diffzeta, zetaff, zipf, Oizeta.

Examples

ddiffzeta(1:20, 0.5, start = 2)

rdiffzeta(20, 0.5)

## Not run: shape <- 0.8; x <- 1:10

plot(x, ddiffzeta(x, shape = shape), type = "h", ylim = 0:1,
sub = "shape=0.8", las = 1, col = "blue", ylab = "Probability",
main = "Differenced zeta distribution: blue=PMF; orange=CDF")
lines(x + 0.1, pdiffzeta(x, shape = shape), col = "orange", lty = 3, type = "h")
## End(Not run)
diffzeta 219

diffzeta Differenced Zeta Distribution Family Function

Description

Estimates the parameter of the differenced zeta distribution.

Usage

diffzeta(start = 1, lshape = "loge", ishape = NULL)

Arguments

lshape, ishape Same as zetaff.

start Smallest value of the support of the distribution. Must be a positive integer.

Details

The PMF is

P (Y = y) = (a/y)s (a/(1 + y))s , s > 0, y = a, a + 1, . . . ,

where s is the positive shape parameter, and a is start. According to Moreno-Sanchez et al. (2016),
this model fits quite well to about 40 percent of all the English books in the Project Gutenberg data
base (about 30,000 texts). Multiple responses are handled.

Value

An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, and vgam.

Author(s)

T. W. Yee

References

Moreno-Sanchez, I. and Font-Clos, F. and Corral, A. Large-Scale Analysis of Zipfs Law in English
Texts, 2016. PLoS ONE, 11(1), 119.

See Also

Diffzeta, zetaff, zeta, zipf, zipf.

220 dirichlet

Examples
odata <- data.frame(x2 = runif(nn <- 1000)) # Artificial data
odata <- transform(odata, shape = loge(-0.25 + x2, inverse = TRUE))
odata <- transform(odata, y1 = rdiffzeta(nn, shape))
with(odata, table(y1))
ofit <- vglm(y1 ~ x2, diffzeta, data = odata, trace = TRUE, crit = "coef")
coef(ofit, matrix = TRUE)

dirichlet Fitting a Dirichlet Distribution

Description
Fits a Dirichlet distribution to a matrix of compositions.

Usage
dirichlet(link = "loge", parallel = FALSE, zero = NULL, imethod = 1)

Arguments
link Link function applied to each of the M (positive) shape parameters j . See
Links for more choices. The default gives j = log(j ).
parallel, zero, imethod
See CommonVGAMffArguments for more information.

Details
In this help file the response is assumed to be a M -column matrix with positive values and whose
rows each sum to unity. Such data can be thought of as compositional data. There are M lin-
ear/additive predictors j .
The Dirichlet distribution is commonly used to model compositional data, including applications in
genetics. Suppose (Y1 , . . . , YM )T is the response. Then it has a Dirichlet distribution if (Y1 , . . . , YM 1 )T
has density
M
(+ ) Y j 1
QM yj
j=1 (j ) j=1

where + = 1 + + M , j > 0, and the density is defined on the unit simplex

M
X
M = (y1 , . . . , yM )T : y1 > 0, . . . , yM > 0, yj = 1 .

j=1

One has E(Yj ) = j /+ , which are returned as the fitted values. For this distribution Fisher
scoring corresponds to Newton-Raphson.
The Dirichlet distribution can be motivated by considering the random variables (G1 , . . . , GM )T
which are each independent and identically distributed as a gamma distribution with density f (gj ) =
1
gj j egj /(j ). Then the Dirichlet distribution arises when Yj = Gj /(G1 + + GM ).
dirmul.old 221

Note
The response should be a matrix of positive values whose rows each sum to unity. Similar to this is
count data, where probably a multinomial logit model (multinomial) may be appropriate. Another
similar distribution to the Dirichlet is the Dirichlet-multinomial (see dirmultinomial).

Author(s)
Thomas W. Yee

See Also
rdiric, dirmultinomial, multinomial, simplex.

Examples
ddata <- data.frame(rdiric(n = 1000,
shape = exp(c(y1 = -1, y2 = 1, y3 = 0))))
fit <- vglm(cbind(y1, y2, y3) ~ 1, dirichlet,
data = ddata, trace = TRUE, crit = "coef")
Coef(fit)
coef(fit, matrix = TRUE)
head(fitted(fit))

dirmul.old Fitting a Dirichlet-Multinomial Distribution

Description
Fits a Dirichlet-multinomial distribution to a matrix of non-negative integers.

Usage
dirmul.old(link = "loge", ialpha = 0.01, parallel = FALSE, zero = NULL)
222 dirmul.old

Arguments
link Link function applied to each of the M (positive) shape parameters j for j =
1, . . . , M . See Links for more choices. Here, M is the number of columns of
the response matrix.
ialpha Numeric vector. Initial values for the alpha vector. Must be positive. Recycled
to length M .
parallel A logical, or formula specifying which terms have equal/unequal coefficients.
zero An integer-valued vector specifying which linear/additive predictors are mod-
elled as intercepts only. The values must be from the set {1,2,. . . ,M }.

Details
The Dirichlet-multinomial distribution, which is somewhat similar to a Dirichlet distribution, has
probability function
M
2y (+ ) Y (yj + j )
P (Y1 = y1 , . . . , YM = yM ) =
y1 , . . . , yM (2y + + ) j=1 (j )

a

for j > 0, + = 1 + + M , and 2y = y1 + + yM . Here, b means a choose b and
refers to combinations (see choose). The (posterior) mean is

E(Yj ) = (yj + j )/(2y + + )

for j = 1, . . . , M , and these are returned as the fitted values as a M -column matrix.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, rrvglm and vgam.

Note
The response should be a matrix of non-negative values. Convergence seems to slow down if there
are zero values. Currently, initial values can be improved upon.
This function is almost defunct and may be withdrawn soon. Use dirmultinomial instead.

Author(s)
Thomas W. Yee

References
Lange, K. (2002) Mathematical and Statistical Methods for Genetic Analysis, 2nd ed. New York:
Springer-Verlag.
Forbes, C., Evans, M., Hastings, N. and Peacock, B. (2011) Statistical Distributions, Hoboken, NJ,
USA: John Wiley and Sons, Fourth edition.
Paul, S. R., Balasooriya, U. and Banerjee, T. (2005) Fisher information matrix of the Dirichlet-
multinomial distribution. Biometrical Journal, 47, 230236.
dirmultinomial 223

Tvedebrink, T. (2010) Overdispersion in allelic counts and -correction in forensic genetics. Theo-
retical Population Biology, 78, 200210.

See Also
dirmultinomial, dirichlet, betabinomialff, multinomial.

Examples
# Data from p.50 of Lange (2002)
alleleCounts <- c(2, 84, 59, 41, 53, 131, 2, 0,
0, 50, 137, 78, 54, 51, 0, 0,
0, 80, 128, 26, 55, 95, 0, 0,
0, 16, 40, 8, 68, 14, 7, 1)
dim(alleleCounts) <- c(8, 4)
alleleCounts <- data.frame(t(alleleCounts))
dimnames(alleleCounts) <- list(c("White","Black","Chicano","Asian"),
paste("Allele", 5:12, sep = ""))

set.seed(123) # @initialize uses random numbers

fit <- vglm(cbind(Allele5,Allele6,Allele7,Allele8,Allele9,
Allele10,Allele11,Allele12) ~ 1, dirmul.old,
trace = TRUE, crit = "c", data = alleleCounts)

(sfit <- summary(fit))

vcov(sfit)
round(eta2theta(coef(fit), fit@misc$link, fit@misc$earg), digits = 2) # not preferred
round(Coef(fit), digits = 2) # preferred
round(t(fitted(fit)), digits = 4) # 2nd row of Table 3.5 of Lange (2002)
coef(fit, matrix = TRUE)

pfit <- vglm(cbind(Allele5,Allele6,Allele7,Allele8,Allele9,

Allele10,Allele11,Allele12) ~ 1,
dirmul.old(parallel = TRUE), trace = TRUE,
data = alleleCounts)
round(eta2theta(coef(pfit, matrix = TRUE), pfit@misc$link,
pfit@misc$earg), digits = 2) # 'Right' answer
round(Coef(pfit), digits = 2) # 'Wrong' answer due to parallelism constraint

dirmultinomial Fitting a Dirichlet-Multinomial Distribution

Description
Fits a Dirichlet-multinomial distribution to a matrix response.

Usage
dirmultinomial(lphi = "logit", iphi = 0.10, parallel = FALSE, zero = "M")
224 dirmultinomial

Arguments
lphi Link function applied to the parameter, which lies in the open unit interval
(0, 1). See Links for more choices.
iphi Numeric. Initial value for . Must be in the open unit interval (0, 1). If a failure
to converge occurs then try assigning this argument a different value.
parallel A logical (formula not allowed here) indicating whether the probabilities 1 , . . . , M 1
are to be equal via equal coefficients. Note M will generally be different from
the other probabilities. Setting parallel = TRUE will only work if you also set
zero = NULL because of interference between these arguments (with respect to
the intercept term).
zero An integer-valued vector specifying which linear/additive predictors are mod-
elled as intercepts only. The values must be from the set {1, 2, . . . , M }. If the
character "M" then this means the numerical value M , which corresponds to lin-
ear/additive predictor associated with . Setting zero = NULL means none of
the values from the set {1, 2, . . . , M }.

Details
The Dirichlet-multinomial distribution arises from a multinomial distribution where the probability
parameters are not constant but are generated from a multivariate distribution called the Dirichlet
distribution. The Dirichlet-multinomial distribution has probability function
QM Qyj
r=1 (j (1 ) + (r 1))

N j=1
P (Y1 = y1 , . . . , YM = yM ) = QN
y1 , . . . , y M r=1 (1 + (r 1))

where is the over-dispersion parameter and N = y1 + + yM . Here, ab means a choose

b and refers to combinations (see choose). The above formula applies to each row of the matrix
response. In this VGAM family function the first M 1 linear/additive predictors correspond to
the first M 1 probabilities via

j = log(P [Y = j]/P [Y = M ]) = log(j /M )

where j is the jth linear/additive predictor (M = 0 by definition for P [Y = M ] but not for )
and j = 1, . . . , M 1. The M th linear/additive predictor corresponds to lphi applied to .
Note that E(Yj ) = N j but the probabilities (returned as the fitted values) j are bundled together
as a M -column matrix. The quantities N are returned as the prior weights.
The beta-binomial distribution is a special case of the Dirichlet-multinomial distribution when M =
2; see betabinomial. It is easy to show that the first shape parameter of the beta distribution is
shape1 = (1/ 1) and the second shape parameter is shape2 = (1 )(1/ 1). Also,
= 1/(1 + shape1 + shape2), which is known as the intra-cluster correlation coefficient.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, rrvglm and vgam.
If the model is an intercept-only model then @misc (which is a list) has a component called shape
which is a vector with the M values j (1/ 1).
dirmultinomial 225

Warning
This VGAM family function is prone to numerical problems, especially when there are covariates.

Note
The response can be a matrix of non-negative integers, or else a matrix of sample proportions and
the total number of counts in each row specified using the weights argument. This dual input option
is similar to multinomial.
To fit a parallel model with the parameter being an intercept-only you will need to use the
constraints argument.
Currently, Fisher scoring is implemented. To compute the expected information matrix a for loop
is used; this may be very slow when the counts are large. Additionally, convergence may be slower
than usual due to round-off error when computing the expected information matrices.

Author(s)
Thomas W. Yee

References
Paul, S. R., Balasooriya, U. and Banerjee, T. (2005) Fisher information matrix of the Dirichlet-
multinomial distribution. Biometrical Journal, 47, 230236.
Tvedebrink, T. (2010) Overdispersion in allelic counts and -correction in forensic genetics. Theo-
retical Population Biology, 78, 200210.
Yu, P. and Shaw, C. A. (2014). An Efficient Algorithm for Accurate Computation of the Dirichlet-
Multinomial Log-Likelihood Function. Bioinformatics, 30, 154754.

See Also
dirmul.old, betabinomial, betabinomialff, dirichlet, multinomial.

Examples
nn <- 5; M <- 4; set.seed(1)
ydata <- data.frame(round(matrix(runif(nn * M, max = 100), nn, M))) # Integer counts
colnames(ydata) <- paste("y", 1:M, sep = "")

fit <- vglm(cbind(y1, y2, y3, y4) ~ 1, dirmultinomial,

data = ydata, trace = TRUE)
head(fitted(fit))
depvar(fit) # Sample proportions
weights(fit, type = "prior", matrix = FALSE) # Total counts per row

## Not run:
ydata <- transform(ydata, x2 = runif(nn))
fit <- vglm(cbind(y1, y2, y3, y4) ~ x2, dirmultinomial,
data = ydata, trace = TRUE)
Coef(fit)
coef(fit, matrix = TRUE)
226 dlogF

(sfit <- summary(fit))

vcov(sfit)

## End(Not run)

dlogF log F Distribution

Description
Density for the log F distribution.

Usage
dlogF(x, shape1, shape2, log = FALSE)

Arguments
x Vector of quantiles.
shape1, shape2 Positive shape parameters.
log if TRUE then the log density is returned, else the density.

Details
The details are given in logF.

Value
dlogF gives the density.

Author(s)
T. W. Yee

double.cens.normal Univariate Normal Distribution with Double Censoring

Description
Maximum likelihood estimation of the two parameters of a univariate normal distribution when
there is double censoring.

Usage
double.cens.normal(r1 = 0, r2 = 0, lmu = "identitylink", lsd = "loge",
imu = NULL, isd = NULL, zero = "sd")

Arguments
r1, r2 Integers. Number of smallest and largest values censored, respectively.
lmu, lsd Parameter link functions applied to the mean and standard deviation. See Links
for more choices.
imu, isd, zero See CommonVGAMffArguments for more information.

Details
This family function uses the Fisher information matrix given in Harter and Moore (1966). The
matrix is not diagonal if either r1 or r2 are positive.
By default, the mean is the first linear/additive predictor and the log of the standard deviation is the
second linear/additive predictor.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, and vgam.

Note
This family function only handles a vector or one-column matrix response. The weights argument,
if used, are interpreted as frequencies, therefore it must be a vector with positive integer values.
With no censoring at all (the default), it is better (and equivalent) to use uninormal.

Author(s)
T. W. Yee

References
Harter, H. L. and Moore, A. H. (1966) Iterative maximum-likelihood estimation of the parameters
of normal populations from singly and doubly censored samples. Biometrika, 53, 205213.
228 double.expbinomial

double.expbinomial Double Exponential Binomial Distribution Family Function

Description
Fits a double exponential binomial distribution by maximum likelihood estimation. The two pa-
rameters here are the mean and dispersion parameter.

Usage
double.expbinomial(lmean = "logit", ldispersion = "logit",
idispersion = 0.25, zero = "dispersion")

Arguments
lmean, ldispersion
Link functions applied to the two parameters, called and respectively below.
See Links for more choices. The defaults cause the parameters to be restricted
to (0, 1).
idispersion Initial value for the dispersion parameter. If given, it must be in range, and is
recyled to the necessary length. Use this argument if convergence failure occurs.
double.expbinomial 229

zero A vector specifying which linear/additive predictor is to be modelled as intercept-

only. If assigned, the single value can be either 1 or 2. The default is to have a
single dispersion parameter value. To model both parameters as functions of the
covariates assign zero = NULL. See CommonVGAMffArguments for more details.

Details
This distribution provides a way for handling overdispersion in a binary response. The double
exponential binomial distribution belongs the family of double exponential distributions proposed
by Efron (1986). Below, equation numbers refer to that original article. Briefly, the idea is that
an ordinary one-parameter exponential family allows the addition of a second parameter which
varies the dispersion of the family without changing the mean. The extended family behaves like
the original family with sample size changed from n to n. The extended family is an exponential
family in when n and are fixed, and an exponential family in when n and are fixed. Having
0 < < 1 corresponds to overdispersion with respect to the binomial distribution. See Efron
(1986) for full details.
This VGAM family function implements an approximation (2.10) to the exact density (2.4). It
replaces the normalizing constant by unity since the true value nearly equals 1. The default model
fitted is 1 = logit() and 2 = logit(). This restricts both parameters to lie between 0 and 1,
although the dispersion parameter can be modelled over a larger parameter space by assigning the
arguments ldispersion and edispersion.
Approximately, the mean (of Y ) is . The effective sample size is the dispersion parameter mul-
tiplied by the original sample size, i.e., n. This family function uses Fisher scoring, and the two
estimates are asymptotically independent because the expected information matrix is diagonal.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm.

Warning
Numerical difficulties can occur; if so, try using idispersion.

Note
This function processes the input in the same way as binomialff, however multiple responses are
not allowed (binomialff(multiple.responses = FALSE)).

Author(s)
T. W. Yee

References
Efron, B. (1986) Double exponential families and their use in generalized linear regression. Journal
of the American Statistical Association, 81, 709721.
230 double.expbinomial

See Also
binomialff, toxop, CommonVGAMffArguments.

Examples
# This example mimics the example in Efron (1986).
# The results here differ slightly.

# Scale the variables

toxop <- transform(toxop,
phat = positive / ssize,
srainfall = scale(rainfall), # (6.1)
sN = scale(ssize)) # (6.2)

# A fit similar (should be identical) to Section 6 of Efron (1986).

# But does not use poly(), and M = 1.25 here, as in (5.3)
cmlist <- list("(Intercept)" = diag(2),
"I(srainfall)" = rbind(1, 0),
"I(srainfall^2)" = rbind(1, 0),
"I(srainfall^3)" = rbind(1, 0),
"I(sN)" = rbind(0, 1),
"I(sN^2)" = rbind(0, 1))
fit <- vglm(cbind(phat, 1 - phat) * ssize ~
I(srainfall) + I(srainfall^2) + I(srainfall^3) +
I(sN) + I(sN^2),
double.expbinomial(ldisp = extlogit(min = 0, max = 1.25),
idisp = 0.2, zero = NULL),
toxop, trace = TRUE, constraints = cmlist)

# Now look at the results

coef(fit, matrix = TRUE)
head(fitted(fit))
summary(fit)
vcov(fit)
sqrt(diag(vcov(fit))) # Standard errors

# Effective sample size (not quite the last column of Table 1)

head(predict(fit))
Dispersion <- extlogit(predict(fit)[,2], min = 0, max = 1.25, inverse = TRUE)
c(round(weights(fit, type = "prior") * Dispersion, digits = 1))

# Ordinary logistic regression (gives same results as (6.5))

ofit <- vglm(cbind(phat, 1 - phat) * ssize ~
I(srainfall) + I(srainfall^2) + I(srainfall^3),
binomialff, toxop, trace = TRUE)

# Same as fit but it uses poly(), and can be plotted (cf. Figure 1)
cmlist2 <- list("(Intercept)" = diag(2),
"poly(srainfall, degree = 3)" = rbind(1, 0),
"poly(sN, degree = 2)" = rbind(0, 1))
ducklings 231

fit2 <- vglm(cbind(phat, 1 - phat) * ssize ~

poly(srainfall, degree = 3) + poly(sN, degree = 2),
double.expbinomial(ldisp = extlogit(min = 0, max = 1.25),
idisp = 0.2, zero = NULL),
toxop, trace = TRUE, constraints = cmlist2)
## Not run: par(mfrow = c(1, 2))
plot(as(fit2, "vgam"), se = TRUE, lcol = "blue", scol = "orange") # Cf. Figure 1

# Cf. Figure 1(a)

par(mfrow = c(1,2))
ooo <- with(toxop, sort.list(rainfall))
with(toxop, plot(rainfall[ooo], fitted(fit2)[ooo], type = "l",
col = "blue", las = 1, ylim = c(0.3, 0.65)))
with(toxop, points(rainfall[ooo], fitted(ofit)[ooo], col = "orange",
type = "b", pch = 19))

# Cf. Figure 1(b)

ooo <- with(toxop, sort.list(ssize))
with(toxop, plot(ssize[ooo], Dispersion[ooo], type = "l", col = "blue",
las = 1, xlim = c(0, 100)))
## End(Not run)

ducklings Relative Frequencies of Serum Proteins in White Pekin Ducklings

Description
Relative frequencies of serum proteins in white Pekin ducklings as determined by electrophoresis.

Usage
data(ducklings)

Format
The format is: chr "ducklings"

Details
Columns p1, p2, p3 stand for pre-albumin, albumin, globulins respectively. These were collected
from 3-week old white Pekin ducklings. Let Y1 be proportional to the total milligrams of pre-
albumin in the blood serum of a duckling. Similarly, let Y2 and Y3 be directly proportional to the
same factor as Y1 to the total milligrams respectively of albumin and globulins in its blood serum.
The proportion of pre-albumin is given by Y1 /(Y1 + Y2 + Y3 ), and similarly for the others.

Source
Mosimann, J. E. (1962) On the compound multinomial distribution, the multivariate -distribution,
and correlations among proportions, Biometrika, 49, 6582.
232 enzyme

See Also

dirichlet.

Examples
print(ducklings)

enzyme Enzyme Data

Description

Enzyme velocity and substrate concentration.

Usage

data(enzyme)

Format

A data frame with 12 observations on the following 2 variables.

conc a numeric explanatory vector; substrate concentration

velocity a numeric response vector; enzyme velocity

Details

Sorry, more details need to be included later.

Source

Sorry, more details need to be included later.

References

Watts, D. G. (1981) An introduction to nonlinear least squares. In: L. Endrenyi (Ed.), Kinetic Data
Analysis: Design and Analysis of Enzyme and Pharmacokinetic Experiments, pp.124. New York:
Plenum Press.

See Also

micmen.
erf 233

Examples
## Not run:
fit <- vglm(velocity ~ 1, micmen, data = enzyme, trace = TRUE,
form2 = ~ conc - 1, crit = "crit")
summary(fit)

## End(Not run)

erf Error Function, and variants

Description
Computes the error function, or its inverse, based on the normal distribution. Also computes the
complement of the error function, or its inverse,

Usage
erf(x, inverse = FALSE)
erfc(x, inverse = FALSE)

Arguments
x Numeric.
inverse Logical. Of length 1.

Details
Erf (x) is defined as Z x
2
Erf (x) = exp(t2 )dt
0

so that it is closely related to pnorm. The inverse function is defined for x in (1, 1).

Value
Returns the value of the function evaluated at x.

Note

Some authors omit the term 2/ from the definition of Erf (x). Although defined for complex
arguments, this function only works for real arguments.
The complementary error function erf c(x) is defined as 1 erf (x), and is implemented by erfc.
Its inverse function is defined for x in (0, 2).

Author(s)
T. W. Yee
234 erlang

References
Abramowitz, M. and Stegun, I. A. (1972) Handbook of Mathematical Functions with Formulas,
Graphs, and Mathematical Tables, New York: Dover Publications Inc.

erlang Erlang Distribution

Description
Estimates the scale parameter of the Erlang distribution by maximum likelihood estimation.

Usage
erlang(shape.arg, lscale = "loge", imethod = 1, zero = NULL)

Arguments
shape.arg The shape parameters. The user must specify a positive integer, or integers for
multiple responses. They are recycled by.row = TRUE according to matrix.
lscale Link function applied to the (positive) scale parameter. See Links for more
choices.
imethod, zero See CommonVGAMffArguments for more details.

Details
The Erlang distribution is a special case of the gamma distribution with shape that is a positive
integer. If shape.arg = 1 then it simplifies to the exponential distribution. As illustrated in the
example below, the Erlang distribution is the distribution of the sum of shape.arg independent and
identically distributed exponential random variates.
The probability density function of the Erlang distribution is given by
f (y) = exp(y/scale)y shape1 scaleshape /(shape)
for known positive integer shape, unknown scale > 0 and y > 0. Here, (shape) is the gamma
function, as in gamma. The mean of Y is = shape scale and its variance is shape scale2 .
The linear/additive predictor, by default, is = log(scale).
Expectiles-Exponential 235

Value

An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm and vgam.

Note

Multiple responses are permitted. The rate parameter found in gammaR is 1/scale heresee also
rgamma.

Author(s)

T. W. Yee

References

Most standard texts on statistical distributions describe this distribution, e.g.,

Forbes, C., Evans, M., Hastings, N. and Peacock, B. (2011) Statistical Distributions, Hoboken, NJ,
USA: John Wiley and Sons, Fourth edition.

See Also

gammaR, exponential, simulate.vlm.

Examples
rate <- exp(2); myshape <- 3
edata <- data.frame(y = rep(0, nn <- 1000))
for (ii in 1:myshape)
edata <- transform(edata, y = y + rexp(nn, rate = rate))
fit <- vglm(y ~ 1, erlang(shape = myshape), data = edata, trace = TRUE)
coef(fit, matrix = TRUE)
Coef(fit) # Answer = 1/rate
1/rate
summary(fit)

Expectiles-Exponential
Expectiles of the Exponential Distribution

Description

Density function, distribution function, and expectile function and random generation for the distri-
bution associated with the expectiles of an exponential distribution.
236 Expectiles-Exponential

Usage
deexp(x, rate = 1, log = FALSE)
peexp(q, rate = 1, lower.tail = TRUE, log.p = FALSE)
qeexp(p, rate = 1, Maxit.nr = 10, Tol.nr = 1.0e-6,
lower.tail = TRUE, log.p = FALSE)
reexp(n, rate = 1)

Arguments
x, p, q See deunif.
n, rate, log See rexp.
lower.tail, log.p
Same meaning as in pexp or qexp.
Maxit.nr, Tol.nr
See deunif.

Details
General details are given in deunif including a note regarding the terminology used. Here, exp
corresponds to the distribution of interest, F , and eexp corresponds to G. The addition of e is for
the other distribution associated with the parent distribution. Thus deexp is for g, peexp is for G,
qeexp is for the inverse of G, reexp generates random variates from g.
For qeexp the Newton-Raphson algorithm is used to solve for y satisfying p = G(y). Numerical
problems may occur when values of p are very close to 0 or 1.

Value
deexp(x) gives the density function g(x). peexp(q) gives the distribution function G(q). qeexp(p)
gives the expectile function: the value y such that G(y) = p. reexp(n) gives n random variates
from G.

Author(s)
T. W. Yee and Kai Huang

## Not run: par(mfrow = c(2,1))

yy <- seq(-0, 4, len = nn)
plot(yy, deexp(yy), col = "blue", ylim = 0:1, xlab = "y", ylab = "g(y)",
type = "l", main = "g(y) for Exp(1); dotted green is f(y) = dexp(y)")
Expectiles-Normal 237

lines(yy, dexp(yy), col = "darkgreen", lty = "dotted", lwd = 2) # 'original'

plot(yy, peexp(yy), type = "l", col = "blue", ylim = 0:1,

xlab = "y", ylab = "G(y)", main = "G(y) for Exp(1)")
abline(v = 1, h = 0.5, col = "red", lty = "dashed")
lines(yy, pexp(yy), col = "darkgreen", lty = "dotted", lwd = 2)
## End(Not run)

Expectiles-Normal Expectiles of the Normal Distribution

Description
Density function, distribution function, and expectile function and random generation for the distri-
bution associated with the expectiles of a normal distribution.

Usage
denorm(x, mean = 0, sd = 1, log = FALSE)
penorm(q, mean = 0, sd = 1, lower.tail = TRUE, log.p = FALSE)
qenorm(p, mean = 0, sd = 1, Maxit.nr = 10, Tol.nr = 1.0e-6,
lower.tail = TRUE, log.p = FALSE)
renorm(n, mean = 0, sd = 1)

Arguments
x, p, q See deunif.
n, mean, sd, log
See rnorm.
lower.tail, log.p
Same meaning as in pnorm or qnorm.
Maxit.nr, Tol.nr
See deunif.

Details
General details are given in deunif including a note regarding the terminology used. Here, norm
corresponds to the distribution of interest, F , and enorm corresponds to G. The addition of e is
for the other distribution associated with the parent distribution. Thus denorm is for g, penorm is
for G, qenorm is for the inverse of G, renorm generates random variates from g.
For qenorm the Newton-Raphson algorithm is used to solve for y satisfying p = G(y). Numerical
problems may occur when values of p are very close to 0 or 1.

Value
denorm(x) gives the density function g(x). penorm(q) gives the distribution function G(q). qenorm(p)
gives the expectile function: the value y such that G(y) = p. renorm(n) gives n random variates
from G.
238 Expectiles-sc.t2

Author(s)

T. W. Yee and Kai Huang

See Also

deunif, deexp, dnorm, amlnormal, lms.bcn.

Examples
my.p <- 0.25; y <- rnorm(nn <- 1000)
(myexp <- qenorm(my.p))
sum(myexp - y[y <= myexp]) / sum(abs(myexp - y)) # Should be my.p

# Non-standard normal
mymean <- 1; mysd <- 2
yy <- rnorm(nn, mymean, mysd)
(myexp <- qenorm(my.p, mymean, mysd))
sum(myexp - yy[yy <= myexp]) / sum(abs(myexp - yy)) # Should be my.p
penorm(-Inf, mymean, mysd) # Should be 0
penorm( Inf, mymean, mysd) # Should be 1
penorm(mean(yy), mymean, mysd) # Should be 0.5
abs(qenorm(0.5, mymean, mysd) - mean(yy)) # Should be 0
abs(penorm(myexp, mymean, mysd) - my.p) # Should be 0
integrate(f = denorm, lower = -Inf, upper = Inf,
mymean, mysd) # Should be 1

## Not run:
par(mfrow = c(2, 1))
yy <- seq(-3, 3, len = nn)
plot(yy, denorm(yy), type = "l", col="blue", xlab = "y", ylab = "g(y)",
main = "g(y) for N(0,1); dotted green is f(y) = dnorm(y)")
lines(yy, dnorm(yy), col = "darkgreen", lty = "dotted", lwd = 2) # 'original'

plot(yy, penorm(yy), type = "l", col = "blue", ylim = 0:1,

xlab = "y", ylab = "G(y)", main = "G(y) for N(0,1)")
abline(v = 0, h = 0.5, col = "red", lty = "dashed")
lines(yy, pnorm(yy), col = "darkgreen", lty = "dotted", lwd = 2)
## End(Not run)

Expectiles-sc.t2 Expectiles/Quantiles of the Scaled Student t Distribution with 2 Df

Description

Density function, distribution function, and quantile/expectile function and random generation for
the scaled Student t distribution with 2 degrees of freedom.
Expectiles-sc.t2 239

Usage
dsc.t2(x, location = 0, scale = 1, log = FALSE)
psc.t2(q, location = 0, scale = 1, lower.tail = TRUE, log.p = FALSE)
qsc.t2(p, location = 0, scale = 1, lower.tail = TRUE, log.p = FALSE)
rsc.t2(n, location = 0, scale = 1)

Arguments
x, q Vector of expectiles/quantiles. See the terminology note below.
p Vector of probabilities. These should lie in (0, 1).
n, log See runif.
location, scale
Location and scale parameters. The latter should have positive values. Values
of these vectors are recyled.
lower.tail, log.p
Same meaning as in pt or qt.

Details
A Student-t distribution with 2 degrees of freedom and a scale parameter of sqrt(2) is equivalent
to the standard form of this distribution (called Koenkers distribution below). Further details about
this distribution are given in sc.studentt2.

Value
dsc.t2(x) gives the density function. psc.t2(q) gives the distribution function. qsc.t2(p) gives
the expectile and quantile function. rsc.t2(n) gives n random variates.

Author(s)
T. W. Yee and Kai Huang

# Non-standard Koenker distribution

myloc <- 1; myscale <- 2
yy <- rsc.t2(nn, myloc, myscale)
(myexp <- qsc.t2(my.p, myloc, myscale))
sum(myexp - yy[yy <= myexp]) / sum(abs(myexp - yy)) # Should be my.p
psc.t2(mean(yy), myloc, myscale) # Should be 0.5
abs(qsc.t2(0.5, myloc, myscale) - mean(yy)) # Should be 0
abs(psc.t2(myexp, myloc, myscale) - my.p) # Should be 0
integrate(f = dsc.t2, lower = -Inf, upper = Inf,
locat = myloc, scale = myscale) # Should be 1

y <- seq(-7, 7, len = 201)

max(abs(dsc.t2(y) - dt(y / sqrt(2), df = 2) / sqrt(2))) # Should be 0
## Not run: plot(y, dsc.t2(y), type = "l", col = "blue", las = 1,
ylim = c(0, 0.4), main = "Blue = Koenker; orange = N(0, 1)")
lines(y, dnorm(y), type = "l", col = "orange")
abline(h = 0, v = 0, lty = 2)
## End(Not run)

Expectiles-Uniform Expectiles of the Uniform Distribution

Description
Density function, distribution function, and expectile function and random generation for the distri-
bution associated with the expectiles of a uniform distribution.

Usage
deunif(x, min = 0, max = 1, log = FALSE)
peunif(q, min = 0, max = 1, lower.tail = TRUE, log.p = FALSE)
qeunif(p, min = 0, max = 1, Maxit.nr = 10, Tol.nr = 1.0e-6,
lower.tail = TRUE, log.p = FALSE)
reunif(n, min = 0, max = 1)

Arguments
x, q Vector of expectiles. See the terminology note below.
p Vector of probabilities. These should lie in (0, 1).
n, min, max, log
See runif.
lower.tail, log.p
Same meaning as in punif or qunif.
Maxit.nr Numeric. Maximum number of Newton-Raphson iterations allowed. A warning
is issued if convergence is not obtained for all p values.
Tol.nr Numeric. Small positive value specifying the tolerance or precision to which the
expectiles are computed.
Expectiles-Uniform 241

Details
Jones (1994) elucidated on the property that the expectiles of a random variable X with distribution
function F (x) correspond to the quantiles of a distribution G(x) where G is related by an explicit
formula to F . In particular, let y be the p-expectile of F . Then y is the p-quantile of G where

p = G(y) = (P (y) yF (y))/(2[P (y) yF (y)] + y ),

and is the mean of X. The derivative of G is

g(y) = (F (y) P (y))/(2[P (y) yF (y)] + y )2 .

Ry
Here, P (y) is the partial moment xf (x) dx and 0 < p < 1. The 0.5-expectile is the mean
and the 0.5-quantile is the median.
A note about the terminology used here. Recall in the S language there are the dpqr-type functions
associated with a distribution, e.g., dunif, punif, qunif, runif, for the uniform distribution. Here,
unif corresponds to F and eunif corresponds to G. The addition of e (for expectile) is for the
other distribution associated with the parent distribution. Thus deunif is for g, peunif is for G,
qeunif is for the inverse of G, reunif generates random variates from g.
For qeunif the Newton-Raphson algorithm is used to solve for y satisfying p = G(y). Numerical
problems may occur when values of p are very close to 0 or 1.

Value
deunif(x) gives the density function g(x). peunif(q) gives the distribution function G(q). qeunif(p)
gives the expectile function: the expectile y such that G(y) = p. reunif(n) gives n random vari-
ates from G.

Author(s)
T. W. Yee and Kai Huang

References
Jones, M. C. (1994) Expectiles and M-quantiles are quantiles. Statistics and Probability Letters,
20, 149153.
Yee, T. W. (2012) Vector generalized linear and additive quantile and expectile regression. In prepa-
ration.

I2 <- mean(y > myexp) * mean(-myexp + y[y > myexp])

I1 / (I1 + I2) # Should be my.p
# Or:
I1 <- sum( myexp - y[y <= myexp])
I2 <- sum(-myexp + y[y > myexp])

# Non-standard uniform
mymin <- 1; mymax <- 8
yy <- runif(nn, mymin, mymax)
(myexp <- qeunif(my.p, mymin, mymax))
sum(myexp - yy[yy <= myexp]) / sum(abs(myexp - yy)) # Should be my.p
peunif(mymin, mymin, mymax) # Should be 0
peunif(mymax, mymin, mymax) # Should be 1
peunif(mean(yy), mymin, mymax) # Should be 0.5
abs(qeunif(0.5, mymin, mymax) - mean(yy)) # Should be 0
abs(qeunif(0.5, mymin, mymax) - (mymin+mymax)/2) # Should be 0
abs(peunif(myexp, mymin, mymax) - my.p) # Should be 0
integrate(f = deunif, lower = mymin - 3, upper = mymax + 3,
min = mymin, max = mymax) # Should be 1

## Not run:
par(mfrow = c(2,1))
yy <- seq(0.0, 1.0, len = nn)
plot(yy, deunif(yy), type = "l", col = "blue", ylim = c(0, 2),
xlab = "y", ylab = "g(y)", main = "g(y) for Uniform(0,1)")
lines(yy, dunif(yy), col = "darkgreen", lty = "dotted", lwd = 2) # 'original'

plot(yy, peunif(yy), type = "l", col = "blue", ylim = 0:1,

xlab = "y", ylab = "G(y)", main = "G(y) for Uniform(0,1)")
abline(a = 0.0, b = 1.0, col = "darkgreen", lty = "dotted", lwd = 2)
abline(v = 0.5, h = 0.5, col = "red", lty = "dashed")
## End(Not run)

expexpff Exponentiated Exponential Distribution

Description
Estimates the two parameters of the exponentiated exponential distribution by maximum likelihood
estimation.

Usage
expexpff(lrate = "loge", lshape = "loge",
irate = NULL, ishape = 1.1, tolerance = 1.0e-6, zero = NULL)

Arguments
lshape, lrate Parameter link functions for the and parameters. See Links for more
choices. The defaults ensure both parameters are positive.
expexpff 243

ishape Initial value for the parameter. If convergence fails try setting a different value
for this argument.
irate Initial value for the parameter. By default, an initial value is chosen internally
using ishape.
tolerance Numeric. Small positive value for testing whether values are close enough to 1
and 2.
zero An integer-valued vector specifying which linear/additive predictors are mod-
elled as intercepts only. The default is none of them. If used, choose one value
from the set {1,2}.

Details
The exponentiated exponential distribution is an alternative to the Weibull and the gamma distribu-
tions. The formula for the density is

f (y; , ) = (1 exp(y))1 exp(y)

where y > 0, > 0 and > 0. The mean of Y is (( + 1) (1))/ (returned as the fitted
values) where is the digamma function. The variance of Y is ( 0 (1) 0 ( + 1))/2 where 0
is the trigamma function.
This distribution has been called the two-parameter generalized exponential distribution by Gupta
and Kundu (2006). A special case of the exponentiated exponential distribution: = 1 is the
exponential distribution.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm and vgam.

Warning
Practical experience shows that reasonably good initial values really helps. In particular, try setting
different values for the ishape argument if numerical problems are encountered or failure to con-
vergence occurs. Even if convergence occurs try perturbing the initial value to make sure the global
solution is obtained and not a local solution. The algorithm may fail if the estimate of the shape
parameter is too close to unity.

Note
Fisher scoring is used, however, convergence is usually very slow. This is a good sign that there is a
bug, but I have yet to check that the expected information is correct. Also, I have yet to implement
Type-I right censored data using the results of Gupta and Kundu (2006).
Another algorithm for fitting this model is implemented in expexpff1.

Author(s)
T. W. Yee
244 expexpff1

References
Gupta, R. D. and Kundu, D. (2001) Exponentiated exponential family: an alternative to gamma and
Weibull distributions, Biometrical Journal, 43, 117130.
Gupta, R. D. and Kundu, D. (2006) On the comparison of Fisher information of the Weibull and
GE distributions, Journal of Statistical Planning and Inference, 136, 31303144.

See Also
expexpff1, gammaR, weibullR, CommonVGAMffArguments.

Examples
# A special case: exponential data
edata <- data.frame(y = rexp(n <- 1000))
fit <- vglm(y ~ 1, fam = expexpff, data = edata, trace = TRUE, maxit = 99)
coef(fit, matrix = TRUE)
Coef(fit)

# Ball bearings data (number of million revolutions before failure)

edata <- data.frame(bbearings = c(17.88, 28.92, 33.00, 41.52, 42.12, 45.60,
48.80, 51.84, 51.96, 54.12, 55.56, 67.80, 68.64, 68.64,
68.88, 84.12, 93.12, 98.64, 105.12, 105.84, 127.92,
128.04, 173.40))
fit <- vglm(bbearings ~ 1, fam = expexpff(irate = 0.05, ish = 5),
trace = TRUE, maxit = 300, data = edata)
coef(fit, matrix = TRUE)
Coef(fit) # Authors get c(rate=0.0314, shape=5.2589)
logLik(fit) # Authors get -112.9763

# Failure times of the airconditioning system of an airplane

eedata <- data.frame(acplane = c(23, 261, 87, 7, 120, 14, 62, 47,
225, 71, 246, 21, 42, 20, 5, 12, 120, 11, 3, 14,
71, 11, 14, 11, 16, 90, 1, 16, 52, 95))
fit <- vglm(acplane ~ 1, fam = expexpff(ishape = 0.8, irate = 0.15),
trace = TRUE, maxit = 99, data = eedata)
coef(fit, matrix = TRUE)
Coef(fit) # Authors get c(rate=0.0145, shape=0.8130)
logLik(fit) # Authors get log-lik -152.264

expexpff1 Exponentiated Exponential Distribution

Description
Estimates the two parameters of the exponentiated exponential distribution by maximizing a profile
(concentrated) likelihood.
expexpff1 245

Usage
expexpff1(lrate = "loge", irate = NULL, ishape = 1)

Arguments
lrate Parameter link function for the (positive) parameter. See Links for more
choices.
irate Initial value for the parameter. By default, an initial value is chosen internally
using ishape.
ishape Initial value for the parameter. If convergence fails try setting a different value
for this argument.

Details
See expexpff for details about the exponentiated exponential distribution. This family function
uses a different algorithm for fitting the model. Given , the MLE of can easily be solved in
terms of . This family function maximizes a profile (concentrated) likelihood with respect to .
Newton-Raphson is used, which compares with Fisher scoring with expexpff.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm and vgam.

Warning
The standard errors produced by a summary of the model may be wrong.

Note
This family function works only for intercept-only models, i.e., y ~ 1 where y is the response.
The estimate of is attached to the misc slot of the object, which is a list and contains the compo-
nent shape.
As Newton-Raphson is used, the working weights are sometimes negative, and some adjustment is
made to these to make them positive.
Like expexpff, good initial values are needed. Convergence may be slow.

Author(s)
T. W. Yee

References
Gupta, R. D. and Kundu, D. (2001) Exponentiated exponential family: an alternative to gamma and
Weibull distributions, Biometrical Journal, 43, 117130.
246 expgeom

See Also
expexpff, CommonVGAMffArguments.

Examples
# Ball bearings data (number of million revolutions before failure)
edata <- data.frame(bbearings = c(17.88, 28.92, 33.00, 41.52, 42.12, 45.60,
48.80, 51.84, 51.96, 54.12, 55.56, 67.80, 68.64, 68.64,
68.88, 84.12, 93.12, 98.64, 105.12, 105.84, 127.92,
128.04, 173.40))
fit <- vglm(bbearings ~ 1, expexpff1(ishape = 4), trace = TRUE,
maxit = 250, checkwz = FALSE, data = edata)
coef(fit, matrix = TRUE)
Coef(fit) # Authors get c(0.0314, 5.2589) with log-lik -112.9763
logLik(fit)
fit@misc$shape # Estimate of shape

# Failure times of the airconditioning system of an airplane

eedata <- data.frame(acplane = c(23, 261, 87, 7, 120, 14, 62, 47,
225, 71, 246, 21, 42, 20, 5, 12, 120, 11, 3, 14,
71, 11, 14, 11, 16, 90, 1, 16, 52, 95))
fit <- vglm(acplane ~ 1, expexpff1(ishape = 0.8), trace = TRUE,
maxit = 50, checkwz = FALSE, data = eedata)
coef(fit, matrix = TRUE)
Coef(fit) # Authors get c(0.0145, 0.8130) with log-lik -152.264
logLik(fit)
fit@misc$shape # Estimate of shape

expgeom The Exponential Geometric Distribution

Description
Density, distribution function, quantile function and random generation for the exponential geomet-
ric distribution.

Usage
dexpgeom(x, scale = 1, shape, log = FALSE)
pexpgeom(q, scale = 1, shape)
qexpgeom(p, scale = 1, shape)
rexpgeom(n, scale = 1, shape)

Arguments
x, q vector of quantiles.
p vector of probabilities.
expgeom 247

n number of observations. If length(n) > 1 then the length is taken to be the

number required.
scale, shape positive scale and shape parameters.
log Logical. If log = TRUE then the logarithm of the density is returned.

Details

See expgeometric, the VGAM family function for estimating the parameters, for the formula of
the probability density function and other details.

Value

dexpgeom gives the density, pexpgeom gives the distribution function, qexpgeom gives the quantile
function, and rexpgeom generates random deviates.

Note

We define scale as the reciprocal of the scale parameter used by Adamidis and Loukas (1998).

Author(s)

J. G. Lauder and T. W. Yee

See Also

expgeometric, exponential, geometric.

Examples
## Not run:
shape <- 0.5; scale <- 1; nn <- 501
x <- seq(-0.10, 3.0, len = nn)
plot(x, dexpgeom(x, scale, shape), type = "l", las = 1, ylim = c(0, 2),
ylab = paste("[dp]expgeom(shape = ", shape, ", scale = ", scale, ")"),
col = "blue", cex.main = 0.8,
main = "Blue is density, red is cumulative distribution function",
sub = "Purple lines are the 10,20,...,90 percentiles")
lines(x, pexpgeom(x, scale, shape), col = "red")
probs <- seq(0.1, 0.9, by = 0.1)
Q <- qexpgeom(probs, scale, shape)
lines(Q, dexpgeom(Q, scale, shape), col = "purple", lty = 3, type = "h")
lines(Q, pexpgeom(Q, scale, shape), col = "purple", lty = 3, type = "h")
abline(h = probs, col = "purple", lty = 3)
max(abs(pexpgeom(Q, scale, shape) - probs)) # Should be 0

## End(Not run)
248 expgeometric

expgeometric Exponential Geometric Distribution Family Function

Description
Estimates the two parameters of the exponential geometric distribution by maximum likelihood
estimation.

Usage
expgeometric(lscale = "loge", lshape = "logit",
iscale = NULL, ishape = NULL,
tol12 = 1e-05, zero = 1, nsimEIM = 400)

Arguments
lscale, lshape Link function for the two parameters. See Links for more choices.
iscale, ishape Numeric. Optional initial values for the scale and shape parameters.
tol12 Numeric. Tolerance for testing whether a parameter has value 1 or 2.
zero, nsimEIM See CommonVGAMffArguments.

Details
The exponential geometric distribution has density function

f (y; c = scale, s = shape) = (1/c)(1 s)ey/c (1 sey/c )2

where y > 0, c > 0 and s (0, 1). The mean, (c(s 1)/s) log(1 s) is returned as the fitted
values. Note the median is c log(2 s). Simulated Fisher scoring is implemented.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm and vgam.

Note
We define scale as the reciprocal of the scale parameter used by Adamidis and Loukas (1998).

Author(s)
J. G. Lauder and T. W. Yee

References
Adamidis, K., Loukas, S. (1998). A lifetime distribution with decreasing failure rate. Statistics and
Probability Letters, 39, 3542.
expint 249

See Also
dexpgeom, exponential, geometric.

Examples
## Not run:
Scale <- exp(2); shape = logit(-1, inverse = TRUE);
edata <- data.frame(y = rexpgeom(n = 2000, scale = Scale, shape = shape))
fit <- vglm(y ~ 1, expgeometric, edata, trace = TRUE)
c(with(edata, mean(y)), head(fitted(fit), 1))
coef(fit, matrix = TRUE)
Coef(fit)
summary(fit)

## End(Not run)

expint The Exponential Integral and Variants

Description
Computes the exponential integral Ei(x) for real values, as well as exp(x) Ei(x) and E1 (x)
and their derivatives (up to the 3rd derivative).

Usage
expint(x, deriv = 0)
expexpint(x, deriv = 0)
expint.E1(x, deriv = 0)

Arguments
x Numeric. Ideally a vector of positive reals.
deriv Integer. Either 0, 1, 2 or 3.

Details
The exponential integral Ei(x) function is the integral of exp(t)/t from 0 to x, for positive real x.
The function E1 (x) is the integral of exp(t)/t from x to infinity, for positive real x.

Value
Function expint(x, deriv = n) returns the nth derivative of Ei(x) (up to the 3rd), function
expexpint(x, deriv = n) returns the nth derivative of exp(x)Ei(x) (up to the 3rd), function
expint.E1(x, deriv = n) returns the nth derivative of E1 (x) (up to the 3rd).
250 explink

Warning
These functions have not been tested thoroughly.

Author(s)
T. W. Yee has simply written a small wrapper function to call the NETLIB FORTRAN code.
Xiangjie Xue modified the functions to calculate derivatives. Higher derivatives can actually be
calculatedplease let me know if you need it.

References
http://www.netlib.org/specfun/ei.

curve(expexpint, 0.01, 2, xlim = c(0, 2), ylim = c(-3, 2),

las = 1, col = "orange")
abline(v = (-3):2, h = (-4):5, lwd = 2, lty = "dotted", col = "gray")
abline(h = 0, v = 0, lty = "dashed", col = "blue")

curve(expint.E1, 0.01, 2, xlim = c(0, 2), ylim = c(0, 5),

las = 1, col = "orange")
abline(v = (-3):2, h = (-4):5, lwd = 2, lty = "dotted", col = "gray")
abline(h = 0, v = 0, lty = "dashed", col = "blue")

## End(Not run)

explink Exponential Link Function

Description
Computes the exponential transformation, including its inverse and the first two derivatives.

Usage
explink(theta, bvalue = NULL, inverse = FALSE, deriv = 0, short = TRUE, tag = FALSE)
explink 251

Arguments

theta Numeric or character. See below for further details.

bvalue See cloglog.
inverse, deriv, short, tag
Details at Links.

Details

The exponential link function is potentially suitable for parameters that are positive. Numerical
values of theta close to negative or positive infinity may result in 0, Inf, -Inf, NA or NaN.

Value

For explink with deriv = 0, the exponential of theta, i.e., exp(theta) when inverse = FALSE.
And if inverse = TRUE then log(theta); if theta is not positive then it will return NaN.
For deriv = 1, then the function returns d eta / d theta as a function of theta if inverse = FALSE,
else if inverse = TRUE then it returns the reciprocal.
Here, all logarithms are natural logarithms, i.e., to base e.

Note

This function has particular use for computing quasi-variances when used with rcim and uninormal.
Numerical instability may occur when theta is close to negative or positive infinity. One way of
overcoming this (one day) is to use bvalue.

Author(s)

Thomas W. Yee

See Also

Links, loge, rcim, Qvar, uninormal.

Examples

theta <- rnorm(30)

explink(theta)
max(abs(explink(explink(theta), inverse = TRUE) - theta)) # Should be 0
252 explog

explog The Exponential Logarithmic Distribution

Description
Density, distribution function, quantile function and random generation for the exponential loga-
rithmic distribution.

Usage
dexplog(x, scale = 1, shape, log = FALSE)
pexplog(q, scale = 1, shape)
qexplog(p, scale = 1, shape)
rexplog(n, scale = 1, shape)

Arguments
x, q vector of quantiles.
p vector of probabilities.
n number of observations. If length(n) > 1 then the length is taken to be the
number required.
scale, shape positive scale and shape parameters.
log Logical. If log = TRUE then the logarithm of the density is returned.

Details
See explogff, the VGAM family function for estimating the parameters, for the formula of the
probability density function and other details.

Value
dexplog gives the density, pexplog gives the distribution function, qexplog gives the quantile
function, and rexplog generates random deviates.

Note
We define scale as the reciprocal of the scale parameter used by Tahmasabi and Rezaei (2008).

Author(s)
J. G. Lauder and T. W. Yee

See Also
explogff, exponential.
explogff 253

Examples
## Not run:
shape <- 0.5; scale <- 2; nn <- 501
x <- seq(-0.50, 6.0, len = nn)
plot(x, dexplog(x, scale, shape), type = "l", las = 1, ylim = c(0, 1.1),
ylab = paste("[dp]explog(shape = ", shape, ", scale = ", scale, ")"),
col = "blue", cex.main = 0.8,
main = "Blue is density, orange is cumulative distribution function",
sub = "Purple lines are the 10,20,...,90 percentiles")
lines(x, pexplog(x, scale, shape), col = "orange")
probs <- seq(0.1, 0.9, by = 0.1)
Q <- qexplog(probs, scale, shape = shape)
lines(Q, dexplog(Q, scale, shape = shape), col = "purple", lty = 3, type = "h")
lines(Q, pexplog(Q, scale, shape = shape), col = "purple", lty = 3, type = "h")
abline(h = probs, col = "purple", lty = 3)
max(abs(pexplog(Q, scale, shape = shape) - probs)) # Should be 0

## End(Not run)

explogff Exponential Logarithmic Distribution Family Function

Description
Estimates the two parameters of the exponential logarithmic distribution by maximum likelihood
estimation.

Usage
explogff(lscale = "loge", lshape = "logit",
iscale = NULL, ishape = NULL,
tol12 = 1e-05, zero = 1, nsimEIM = 400)

Arguments
lscale, lshape See CommonVGAMffArguments for information.
tol12 Numeric. Tolerance for testing whether a parameter has value 1 or 2.
iscale, ishape, zero, nsimEIM
See CommonVGAMffArguments.

Details
The exponential logarithmic distribution has density function

f (y; c, s) = (1/( log p))(((1/c)(1 s)ey/c )/(1 (1 s)ey/c ))

where y > 0, scale parameter c > 0, and shape parameter s (0, 1). The mean, (polylog(2, 1
p)c)/ log(s) is not returned as the fitted values. Note the median is c log(1 + s) and it is currently
returned as the fitted values. Simulated Fisher scoring is implemented.
254 exponential

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm and vgam.

Note
We define scale as the reciprocal of the rate parameter used by Tahmasabi and Sadegh (2008).
Yet to do: find a polylog() function.

Author(s)
J. G. Lauder and T. W .Yee

References
Tahmasabi, R., Sadegh, R. (2008). A two-parameter lifetime distribution with decreasing failure
rate. Computational Statistics and Data Analysis, 52, 38893901.

exponential Exponential Distribution

Description
Maximum likelihood estimation for the exponential distribution.

Usage
exponential(link = "loge", location = 0, expected = TRUE,
ishrinkage = 0.95, parallel = FALSE, zero = NULL)
exponential 255

Arguments
link Parameter link function applied to the positive parameter rate. See Links for
more choices.
location Numeric of length 1, the known location parameter, A, say.
expected Logical. If TRUE Fisher scoring is used, otherwise Newton-Raphson. The latter
is usually faster.
ishrinkage, parallel, zero
See CommonVGAMffArguments for information.

Details
The family function assumes the response Y has density

f (y) = exp((y A))

for y > A, where A is the known location parameter. By default, A = 0. Then E(Y ) = A + 1/
and V ar(Y ) = 1/2 .

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, and vgam.

Note
Suppose A = 0. For a fixed time interval, the number of events is Poisson with mean if the time
between events has a geometric distribution with mean 1 . The argument rate in exponential
is the same as rexp etc. The argument lambda in rpois is somewhat the same as rate here.

Author(s)
T. W. Yee

References
Forbes, C., Evans, M., Hastings, N. and Peacock, B. (2011) Statistical Distributions, Hoboken, NJ,
USA: John Wiley and Sons, Fourth edition.

See Also
amlexponential, gpd, laplace, expgeometric, explogff, poissonff, mix2exp, freund61, simulate.vlm,
Exponential.

Examples
edata <- data.frame(x2 = runif(nn <- 100) - 0.5)
edata <- transform(edata, x3 = runif(nn) - 0.5)
edata <- transform(edata, eta = 0.2 - 0.7 * x2 + 1.9 * x3)
edata <- transform(edata, rate = exp(eta))
256 exppois

edata <- transform(edata, y = rexp(nn, rate = rate))

with(edata, stem(y))

fit.slow <- vglm(y ~ x2 + x3, exponential, data = edata, trace = TRUE)

fit.fast <- vglm(y ~ x2 + x3, exponential(exp = FALSE), data = edata,
trace = TRUE, crit = "coef")
coef(fit.slow, mat = TRUE)
summary(fit.slow)

# Compare results with a GPD. Has a threshold.

threshold <- 0.5
gdata <- data.frame(y1 = threshold + rexp(n = 3000, rate = exp(1.5)))

fit.exp <- vglm(y1 ~ 1, exponential(location = threshold), data = gdata)

coef(fit.exp, matrix = TRUE)
Coef(fit.exp)
logLik(fit.exp)

fit.gpd <- vglm(y1 ~ 1, gpd(threshold = threshold), data = gdata)

coef(fit.gpd, matrix = TRUE)
Coef(fit.gpd)
logLik(fit.gpd)

exppois The Exponential Poisson Distribution

Description
Density, distribution function, quantile function and random generation for the exponential poisson
distribution.

Usage
dexppois(x, rate = 1, shape, log = FALSE)
pexppois(q, rate = 1, shape, lower.tail = TRUE, log.p = FALSE)
qexppois(p, rate = 1, shape, lower.tail = TRUE, log.p = FALSE)
rexppois(n, rate = 1, shape)

Arguments
x, q vector of quantiles.
p vector of probabilities.
n number of observations. If length(n) > 1 then the length is taken to be the
number required.
shape, rate positive parameters.
log Logical. If log = TRUE then the logarithm of the density is returned.
lower.tail, log.p
Same meaning as in pnorm or qnorm.
exppoisson 257

Details
See exppoisson, the VGAM family function for estimating the parameters, for the formula of the
probability density function and other details.

Value
dexppois gives the density, pexppois gives the distribution function, qexppois gives the quantile
function, and rexppois generates random deviates.

Author(s)
Kai Huang and J. G. Lauder

exppoisson Exponential Poisson Distribution Family Function

Description
Estimates the two parameters of the exponential Poisson distribution by maximum likelihood esti-
mation.

Usage
exppoisson(lrate = "loge", lshape = "loge",
irate = 2, ishape = 1.1, zero = NULL)
258 exppoisson

Arguments
lshape, lrate Link function for the two positive parameters. See Links for more choices.
ishape, irate Numeric. Initial values for the shape and rate parameters. Currently this func-
tion is not intelligent enough to obtain better initial values.
zero See CommonVGAMffArguments.

Details
The exponential Poisson distribution has density function

f (y; = rate, = shape) = ey+ exp (y)
1 e
where y > 0, and the parameters shape, , and rate, , are positive. The distribution implies a
population facing discrete hazard rates which are multiples of a base hazard. This VGAM fam-
ily function requires the hypergeo package (to use their genhypergeo function). The median is
returned as the fitted value.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm and vgam.

Warning
This VGAM family function does not work properly!

Author(s)
J. G. Lauder, [email protected]

References
Kus, C., (2007). A new lifetime distribution. Computational Statistics and Data Analysis, 51,
44974509.

See Also
dexppois, exponential, poisson.

Examples
## Not run:
shape <- exp(1); rate <- exp(2)
rdata <- data.frame(y = rexppois(n = 1000, rate = rate, shape = shape))
library("hypergeo") # Required!
fit <- vglm(y ~ 1, exppoisson, data = rdata, trace = FALSE, maxit = 1200)
c(with(rdata, median(y)), head(fitted(fit), 1))
coef(fit, matrix = TRUE)
Coef(fit)
Felix 259

summary(fit)

## End(Not run)

Felix The Felix Distribution

Description

Density for the Felix distribution.

Usage

dfelix(x, rate = 0.25, log = FALSE)

Arguments

x vector of quantiles.
rate See felix.
log Logical. If log = TRUE then the logarithm of the density is returned.

Details

See felix, the VGAM family function for estimating the parameter, for the formula of the proba-
bility density function and other details.

Value

dfelix gives the density.

Warning

The default value of rate is subjective.

Author(s)

T. W. Yee

See Also

felix.
260 felix

Examples
## Not run:
rate <- 0.25; x <- 1:15
plot(x, dfelix(x, rate), type = "h", las = 1, col = "blue",
ylab = paste("dfelix(rate=", rate, ")"),
main = "Felix density function")

## End(Not run)

felix Felix Distribution Family Function

Description
Estimates the parameter of a Felix distribution by maximum likelihood estimation.

Usage
felix(lrate = extlogit(min = 0, max = 0.5), imethod = 1)

Arguments
lrate Link function for the parameter, called a below; see Links for more choices and
for general information.
imethod See CommonVGAMffArguments. Valid values are 1, 2, 3 or 4.

Details
The Felix distribution is an important basic Lagrangian distribution. The density function is
1
f (y; a) = y (y3)/2 a(y1)/2 exp(ay)
((y 1)/2)!

where y = 1, 3, 5, . . . and 0 < a < 0.5. The mean is 1/(1 2a) (returned as the fitted values).
Fisher scoring is implemented.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm and vgam.

Author(s)
T. W. Yee

References
Consul, P. C. and Famoye, F. (2006) Lagrangian Probability Distributions, Boston, USA: Birkhauser.
fff 261

fff F Distribution Family Function

Description
Maximum likelihood estimation of the (2-parameter) F distribution.

Usage
fff(link = "loge", idf1 = NULL, idf2 = NULL, nsimEIM = 100,
imethod = 1, zero = NULL)

Arguments
link Parameter link function for both parameters. See Links for more choices. The
default keeps the parameters positive.
idf1, idf2 Numeric and positive. Initial value for the parameters. The default is to choose
each value internally.
nsimEIM, zero See CommonVGAMffArguments for more information.
imethod Initialization method. Either the value 1 or 2. If both fail try setting values for
idf1 and idf2.

Details
The F distribution is named after Fisher and has a density function that has two parameters, called
df1 and df2 here. This function treats these degrees of freedom as positive reals rather than integers.
The mean of the distribution is df 2/(df 2 2) provided df 2 > 2, and its variance is 2df 22 (df 1 +
df 2 2)/(df 1(df 2 2)2 (df 2 4)) provided df 2 > 4. The estimated mean is returned as the fitted
values. Although the F distribution can be defined to accommodate a non-centrality parameter ncp,
it is assumed zero here. Actually it shouldnt be too difficult to handle any known ncp; something
to do in the short future.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm and vgam.
262 fill

Warning
Numerical problems will occur when the estimates of the parameters are too low or too high.

Author(s)
T. W. Yee

References
Forbes, C., Evans, M., Hastings, N. and Peacock, B. (2011) Statistical Distributions, Hoboken, NJ,
USA: John Wiley and Sons, Fourth edition.

fill Creates a Matrix of Appropriate Dimension

Description
A support function for the argument xij, it generates a matrix of an appropriate dimension.

Usage
fill(x, values = 0, ncolx = ncol(x))

Arguments
x A vector or matrix which is used to determine the dimension of the answer, in
particular, the number of rows. After converting x to a matrix if necessary, the
answer is a matrix of values values, of dimension nrow(x) by ncolx.
values Numeric. The answer contains these values, which are recycled columnwise if
necessary, i.e., as matrix(values, ..., byrow=TRUE).
ncolx The number of columns of the returned matrix. The default is the number of
columns of x.
fill 263

Details

The xij argument for vglm allows the user to input variables specific to each linear/additive predic-
tor. For example, consider the bivariate logit model where the first/second linear/additive predictor
is the logistic regression of the first/second binary response respectively. The third linear/additive
predictor is log(OR) = eta3, where OR is the odds ratio. If one has ocular pressure as a covariate
in this model then xij is required to handle the ocular pressure for each eye, since these will be
different in general. [This contrasts with a variable such as age, the age of the person, which has a
common value for both eyes.] In order to input these data into vglm one often finds that functions
fill, fill1, etc. are useful.
All terms in the xij and formula arguments in vglm must appear in the form2 argument too.

Value

matrix(values, nrow=nrow(x), ncol=ncolx), i.e., a matrix consisting of values values, with

the number of rows matching x, and the default number of columns is the number of columns of x.

Note

The effect of the xij argument is after other arguments such as exchangeable and zero. Hence
xij does not affect constraint matrices.
Additionally, there are currently 3 other identical fill functions, called fill1, fill2 and fill3;
if you need more then assign fill4 = fill5 = fill1 etc. The reason for this is that if
more than one fill function is needed then they must be unique. For example, if M = 4 then
xij = op ~ lop + rop + fill(mop) + fill(mop) would reduce to xij = op ~ lop + rop + fill(mop),
whereas xij = op ~ lop + rop + fill1(mop) + fill2(mop) would retain all M terms, which
is needed.
In Examples 1 to 3 below, the xij argument illustrates covariates that are specific to a linear pre-
dictor. Here, lop/rop are the ocular pressures of the left/right eye in an artificial dataset, and mop is
their mean. Variables leye and reye might be the presence/absence of a particular disease on the
LHS/RHS eye respectively.
In Example 3, the xij argument illustrates fitting the (exchangeable) model where there is a com-
mon smooth function of the ocular pressure. One should use regression splines since s in vgam does
not handle the xij argument. However, regression splines such as bs and ns need to have the same
basis functions here for both functions, and Example 3 illustrates a trick involving a function BS to
obtain this, e.g., same knots. Although regression splines create more than a single column per term
in the model matrix, fill(BS(lop,rop)) creates the required (same) number of columns.

Author(s)

T. W. Yee

See Also

vglm.control, vglm, multinomial, Select.

264 fill

Examples

fill(runif(5))
fill(runif(5), ncol = 3)
fill(runif(5), val = 1, ncol = 3)

# Generate eyes data for the examples below. Eyes are independent (OR=1).
nn <- 1000 # Number of people
eyesdata <- data.frame(lop = round(runif(nn), 2),
rop = round(runif(nn), 2),
age = round(rnorm(nn, 40, 10)))
eyesdata <- transform(eyesdata,
mop = (lop + rop) / 2, # Mean ocular pressure
op = (lop + rop) / 2, # Value unimportant unless plotting
# op = lop, # Choose this if plotting
eta1 = 0 - 2*lop + 0.04*age, # Linear predictor for left eye
eta2 = 0 - 2*rop + 0.04*age) # Linear predictor for right eye
eyesdata <- transform(eyesdata,
leye = rbinom(nn, size = 1, prob = logit(eta1, inverse = TRUE)),
reye = rbinom(nn, size = 1, prob = logit(eta2, inverse = TRUE)))

# Example 1
# All effects are linear
fit1 <- vglm(cbind(leye,reye) ~ op + age,
family = binom2.or(exchangeable = TRUE, zero = 3),
data = eyesdata, trace = TRUE,
xij = list(op ~ lop + rop + fill(lop)),
form2 = ~ op + lop + rop + fill(lop) + age)
head(model.matrix(fit1, type = "lm")) # LM model matrix
head(model.matrix(fit1, type = "vlm")) # Big VLM model matrix
coef(fit1)
coef(fit1, matrix = TRUE) # Unchanged with 'xij'
constraints(fit1)
max(abs(predict(fit1)-predict(fit1, new = eyesdata))) # Predicts correctly
summary(fit1)
## Not run:
plotvgam(fit1, se = TRUE) # Wrong, e.g., because it plots against op, not lop.
# So set op = lop in the above for a correct plot.

## End(Not run)

# Example 2
# Model OR as a linear function of mop
fit2 <- vglm(cbind(leye,reye) ~ op + age, data = eyesdata, trace = TRUE,
binom2.or(exchangeable = TRUE, zero = NULL),
xij = list(op ~ lop + rop + mop),
form2 = ~ op + lop + rop + mop + age)
head(model.matrix(fit2, type = "lm")) # LM model matrix
head(model.matrix(fit2, type = "vlm")) # Big VLM model matrix
coef(fit2)
coef(fit2, matrix = TRUE) # Unchanged with 'xij'
fill 265

max(abs(predict(fit2) - predict(fit2, new = eyesdata))) # Predicts correctly

summary(fit2)
## Not run:
plotvgam(fit2, se = TRUE) # Wrong because it plots against op, not lop.

## End(Not run)

# Example 3. This model uses regression splines on ocular pressure.

# It uses a trick to ensure common basis functions.
BS <- function(x, ...)
sm.bs(c(x,...), df = 3)[1:length(x), , drop = FALSE] # trick

fit3 <- vglm(cbind(leye,reye) ~ BS(lop,rop) + age,

family = binom2.or(exchangeable = TRUE, zero = 3),
data = eyesdata, trace = TRUE,
xij = list(BS(lop,rop) ~ BS(lop,rop) +
BS(rop,lop) +
fill(BS(lop,rop))),
form2 = ~ BS(lop,rop) + BS(rop,lop) + fill(BS(lop,rop)) +
lop + rop + age)
head(model.matrix(fit3, type = "lm")) # LM model matrix
head(model.matrix(fit3, type = "vlm")) # Big VLM model matrix
coef(fit3)
coef(fit3, matrix = TRUE)
summary(fit3)
[email protected]
max(abs(predict(fit3) - predict(fit3, new = eyesdata))) # Predicts correctly
predict(fit3, new = head(eyesdata)) # Note the 'scalar' OR, i.e., zero=3
max(abs(head(predict(fit3)) -
predict(fit3, new = head(eyesdata)))) # Should be 0
## Not run:
plotvgam(fit3, se = TRUE, xlab = "lop") # Correct

## End(Not run)

# Example 4. Capture-recapture model with ephemeral and enduring

# memory effects. Similar to Yang and Chao (2005), Biometrics.
deermice <- transform(deermice, Lag1 = y1)
M.tbh.lag1 <-
vglm(cbind(y1, y2, y3, y4, y5, y6) ~ sex + weight + Lag1,
posbernoulli.tb(parallel.t = FALSE ~ 0,
parallel.b = FALSE ~ 0,
drop.b = FALSE ~ 1),
xij = list(Lag1 ~ fill(y1) + fill(y2) + fill(y3) + fill(y4) +
fill(y5) + fill(y6) +
y1 + y2 + y3 + y4 + y5),
form2 = ~ sex + weight + Lag1 +
fill(y1) + fill(y2) + fill(y3) + fill(y4) +
fill(y5) + fill(y6) +
y1 + y2 + y3 + y4 + y5 + y6,
data = deermice, trace = TRUE)
266 finney44

coef(M.tbh.lag1)

finney44 Toxicity trial for insects

Description

A data frame of a toxicity trial.

Usage

data(finney44)

Format

A data frame with 6 observations on the following 3 variables.

pconc a numeric vector, percent concentration of pyrethrins.

hatched number of eggs that hatched.
unhatched number of eggs that did not hatch.

Details

Finney (1944) describes a toxicity trial of five different concentrations of pyrethrins (percent) plus
a control that were administered to eggs of Ephestia kuhniella. The natural mortality rate is large,
and a common adjustment is to use Abbotts formula.

References

Finney, D. J., 1944. The application of the probit method to toxicity test data adjusted for mortality
in the controls. Annals of Applied Biology, 31, 6874.
Abbott, W. S. (1925). A method of computing the effectiveness of an insecticide. Journal of
Economic Entomology, 18, 2657.

Examples
data(finney44)
transform(finney44, mortality = unhatched / (hatched + unhatched))
fisherz 267

fisherz Fishers Z Link Function

Description
Computes the Fisher Z transformation, including its inverse and the first two derivatives.

Usage
fisherz(theta, bminvalue = NULL, bmaxvalue = NULL,
inverse = FALSE, deriv = 0, short = TRUE, tag = FALSE)

Arguments
theta Numeric or character. See below for further details.
bminvalue, bmaxvalue
Optional boundary values. Values of theta which are less than or equal to 1
can be replaced by bminvalue before computing the link function value. Values
of theta which are greater than or equal to 1 can be replaced by bmaxvalue
before computing the link function value. See Links.
inverse, deriv, short, tag
Details at Links.

Details
The fisherz link function is commonly used for parameters that lie between 1 and 1. Numerical
values of theta close to 1 or 1 or out of range result in Inf, -Inf, NA or NaN.

Value
For deriv = 0, 0.5 * log((1+theta)/(1-theta)) (same as atanh(theta)) when inverse = FALSE,
and if inverse = TRUE then (exp(2*theta)-1)/(exp(2*theta)+1) (same as tanh(theta)).
For deriv = 1, then the function returns d eta / d theta as a function of theta if inverse = FALSE,
else if inverse = TRUE then it returns the reciprocal.
Here, all logarithms are natural logarithms, i.e., to base e.

Note
Numerical instability may occur when theta is close to 1 or 1. One way of overcoming this is to
use, e.g., bminvalue.
The link function rhobit is very similar to fisherz, e.g., just twice the value of fisherz. This
link function may be renamed to atanhlink in the near future.

Author(s)
Thomas W. Yee
268 Fisk

References
McCullagh, P. and Nelder, J. A. (1989) Generalized Linear Models, 2nd ed. London: Chapman &
Hall.

x <- c(seq(-1.02, -0.98, by = 0.01), seq(0.97, 1.02, by = 0.01))

fisherz(x) # Has NAs
fisherz(x, bminvalue = -1 + .Machine$double.eps,
bmaxvalue = 1 - .Machine$double.eps) # Has no NAs

Fisk The Fisk Distribution

Description
Density, distribution function, quantile function and random generation for the Fisk distribution
with shape parameter a and scale parameter scale.

Usage
dfisk(x, scale = 1, shape1.a, log = FALSE)
pfisk(q, scale = 1, shape1.a, lower.tail = TRUE, log.p = FALSE)
qfisk(p, scale = 1, shape1.a, lower.tail = TRUE, log.p = FALSE)
rfisk(n, scale = 1, shape1.a)

Arguments
x, q vector of quantiles.
p vector of probabilities.
n number of observations. If length(n) > 1 then the length is taken to be the
number required.
shape1.a shape parameter.
scale scale parameter.
log Logical. If log = TRUE then the logarithm of the density is returned.
lower.tail, log.p
Same meaning as in pnorm or qnorm.
fisk 269

Details
See fisk, which is the VGAM family function for estimating the parameters by maximum likeli-
hood estimation.

Value
dfisk gives the density, pfisk gives the distribution function, qfisk gives the quantile function,
and rfisk generates random deviates.

Note
The Fisk distribution is a special case of the 4-parameter generalized beta II distribution.

Author(s)
T. W. Yee and Kai Huang

References
Kleiber, C. and Kotz, S. (2003) Statistical Size Distributions in Economics and Actuarial Sciences,
Hoboken, NJ, USA: Wiley-Interscience.

fisk Fisk Distribution family function

Description
Maximum likelihood estimation of the 2-parameter Fisk distribution.

Usage
fisk(lscale = "loge", lshape1.a = "loge", iscale = NULL,
ishape1.a = NULL, imethod = 1, lss = TRUE, gscale = exp(-5:5),
gshape1.a = seq(0.75, 4, by = 0.25), probs.y = c(0.25, 0.5, 0.75),
zero = "shape")
270 fisk

Arguments
lss See CommonVGAMffArguments for important information.
lshape1.a, lscale
Parameter link functions applied to the (positive) parameters a and scale. See
Links for more choices.
iscale, ishape1.a, imethod, zero
See CommonVGAMffArguments for information. For imethod = 2 a good initial
value for iscale is needed to obtain a good estimate for the other parameter.
gscale, gshape1.a
See CommonVGAMffArguments for information.
probs.y See CommonVGAMffArguments for information.

Details
The 2-parameter Fisk (aka log-logistic) distribution is the 4-parameter generalized beta II distribu-
tion with shape parameter q = p = 1. It is also the 3-parameter Singh-Maddala distribution with
shape parameter q = 1, as well as the Dagum distribution with p = 1. More details can be found in
Kleiber and Kotz (2003).
The Fisk distribution has density

f (y) = ay a1 /[ba {1 + (y/b)a }2 ]

for a > 0, b > 0, y 0. Here, b is the scale parameter scale, and a is a shape parameter. The
cumulative distribution function is

F (y) = 1 [1 + (y/b)a ]1 = [1 + (y/b)a ]1 .

The mean is
E(Y ) = b (1 + 1/a) (1 1/a)
provided a > 1; these are returned as the fitted values. This family function handles multiple
responses.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, and vgam.

Note
See the notes in genbetaII.

Author(s)
T. W. Yee

References
Kleiber, C. and Kotz, S. (2003) Statistical Size Distributions in Economics and Actuarial Sciences,
Hoboken, NJ, USA: Wiley-Interscience.
fittedvlm 271

See Also
Fisk, genbetaII, betaII, dagum, sinmad, inv.lomax, lomax, paralogistic, inv.paralogistic,
simulate.vlm.

Examples
fdata <- data.frame(y = rfisk(n = 200, shape = exp(1), scale = exp(2)))
fit <- vglm(y ~ 1, fisk(lss = FALSE), data = fdata, trace = TRUE)
fit <- vglm(y ~ 1, fisk(ishape1.a = exp(2)), data = fdata, trace = TRUE)
coef(fit, matrix = TRUE)
Coef(fit)
summary(fit)

fittedvlm Fitted Values of a VLM object

Description
Extractor function for the fitted values of a model object that inherits from a vector linear model
(VLM), e.g., a model of class "vglm".

Usage
fittedvlm(object, drop = FALSE, type.fitted = NULL,
percentiles = NULL, ...)

Arguments
object a model object that inherits from a VLM.
drop Logical. If FALSE then the answer is a matrix. If TRUE then the answer is a
vector.
type.fitted Character. Some VGAM family functions have a type.fitted argument. If
so then a different type of fitted value can be returned. It is recomputed from
the model after convergence. Note: this is an experimental feature and not all
VGAM family functions have this implemented yet. See CommonVGAMffArguments
for more details.
percentiles See CommonVGAMffArguments for details.
... Currently unused.

Details
The fitted values usually corresponds to the mean response, however, because the VGAM pack-
age fits so many models, this sometimes refers to quantities such as quantiles. The mean may even
not exist, e.g., for a Cauchy distribution.
Note that the fitted value is output from the @linkinv slot of the VGAM family function, where
the eta argument is the n M matrix of linear predictors.
272 fittedvlm

Value
The fitted values evaluated at the final IRLS iteration.

Note
This function is one of several extractor functions for the VGAM package. Others include coef,
deviance, weights and constraints etc. This function is equivalent to the methods function for
the generic function fitted.values.
If fit is a VLM or VGLM then fitted(fit) and predict(fit, type = "response") should be
equivalent (see predictvglm). The latter has the advantage in that it handles a newdata argument
so that the fitted values can be computed for a different data set.

Author(s)
Thomas W. Yee

References
Chambers, J. M. and T. J. Hastie (eds) (1992) Statistical Models in S. Wadsworth & Brooks/Cole.

See Also
fitted, predictvglm, vglmff-class.

Examples
# Categorical regression example 1
pneumo <- transform(pneumo, let = log(exposure.time))
(fit1 <- vglm(cbind(normal, mild, severe) ~ let, propodds, data = pneumo))
fitted(fit1)

# LMS quantile regression example 2

fit2 <- vgam(BMI ~ s(age, df = c(4, 2)),
lms.bcn(zero = 1), data = bmi.nz, trace = TRUE)
head(predict(fit2, type = "response")) # Equal to the the following two:
head(fitted(fit2))
predict(fit2, type = "response", newdata = head(bmi.nz))

# Zero-inflated example 3
zdata <- data.frame(x2 = runif(nn <- 1000))
zdata <- transform(zdata, pstr0.3 = logit(-0.5 , inverse = TRUE),
lambda.3 = loge(-0.5 + 2*x2, inverse = TRUE))
zdata <- transform(zdata, y1 = rzipois(nn, lambda = lambda.3, pstr0 = pstr0.3))
fit3 <- vglm(y1 ~ x2, zipoisson(zero = NULL), data = zdata, trace = TRUE)
head(fitted(fit3, type.fitted = "mean" )) # E(Y), which is the default
head(fitted(fit3, type.fitted = "pobs0")) # P(Y = 0)
head(fitted(fit3, type.fitted = "pstr0")) # Prob of a structural 0
head(fitted(fit3, type.fitted = "onempstr0")) # 1 - prob of a structural 0
flourbeetle 273

flourbeetle Mortality of Flour Beetles from Carbon Disulphide

Description
The flourbeetle data frame has 8 rows and 4 columns. Two columns are explanatory, the other
two are responses.

Usage
data(flourbeetle)

Format
This data frame contains the following columns:

logdose log10 applied to CS2mgL.

CS2mgL a numeric vector, the concentration of gaseous carbon disulphide in mg per litre.
exposed a numeric vector, counts; the number of beetles exposed to the poison.
killed a numeric vector, counts; the numbers killed.

Details
These data were originally given in Table IV of Bliss (1935) and are the combination of two series of
toxicological experiments involving Tribolium confusum, also known as the flour beetle. Groups of
such adult beetles were exposed for 5 hours of gaseous carbon disulphide at different concentrations,
and their mortality measured.

Source
Bliss, C.I., 1935. The calculation of the dosage-mortality curve. Annals of Applied Biology, 22,
134167.

Foldnorm The Folded-Normal Distribution

Description
Density, distribution function, quantile function and random generation for the (generalized) folded-
normal distribution.

Usage
dfoldnorm(x, mean = 0, sd = 1, a1 = 1, a2 = 1, log = FALSE)
pfoldnorm(q, mean = 0, sd = 1, a1 = 1, a2 = 1,
lower.tail = TRUE, log.p = FALSE)
qfoldnorm(p, mean = 0, sd = 1, a1 = 1, a2 = 1,
lower.tail = TRUE, log.p = FALSE, ...)
rfoldnorm(n, mean = 0, sd = 1, a1 = 1, a2 = 1)

Arguments
x, q vector of quantiles.
p vector of probabilities.
n number of observations. Same as rnorm.
mean, sd see rnorm.
a1, a2 see foldnormal.
log Logical. If TRUE then the log density is returned.
lower.tail, log.p
Same meaning as in pnorm or qnorm.
... Arguments that can be passed into uniroot.

Details
See foldnormal, the VGAM family function for estimating the parameters, for the formula of the
probability density function and other details.

Value
dfoldnorm gives the density, pfoldnorm gives the distribution function, qfoldnorm gives the quan-
tile function, and rfoldnorm generates random deviates.

Note
qfoldnorm() runs very slowly because it calls uniroot for each value of the argument p. The solu-
tion is consequently not exact; the ... can be used to obtain a more accurate solution if necessary.
foldnormal 275

Author(s)
T. W. Yee and Kai Huang

foldnormal Folded Normal Distribution Family Function

Description
Fits a (generalized) folded (univariate) normal distribution.

Usage
foldnormal(lmean = "identitylink", lsd = "loge", imean = NULL, isd = NULL,
a1 = 1, a2 = 1, nsimEIM = 500, imethod = 1, zero = NULL)

Arguments
lmean, lsd Link functions for the mean and standard deviation parameters of the usual uni-
variate normal distribution. They are and respectively. See Links for more
choices.
imean, isd Optional initial values for and . A NULL means a value is computed internally.
See CommonVGAMffArguments.
a1, a2 Positive weights, called a1 and a2 below. Each must be of length 1.
nsimEIM, imethod, zero
See CommonVGAMffArguments.
276 foldnormal

Details
If a random variable has an ordinary univariate normal distribution then the absolute value of that
random variable has an ordinary folded normal distribution. That is, the sign has not been recorded;
only the magnitude has been measured.
More generally, suppose X is normal with mean mean and standard deviation sd. Let Y =
max(a1 X, a2 X) where a1 and a2 are positive weights. This means that Y = a1 X for X > 0,
and Y = a2 X for X < 0. Then Y is said to have a generalized folded normal distribution. The
ordinary folded normal distribution corresponds to the special case a1 = a2 = 1.
The probability density function of the ordinary folded normal distribution can be written dnorm(y, mean, sd) + dnorm(y,
for y 0. By default, mean and log(sd) are the linear/additive predictors. Having mean=0 and
sd=1 results in the half-normal distribution. The mean of an ordinary folded normal distribution is
p
E(Y ) = 2/ exp(2 /(2 2 )) + [1 2(/)]

and these are returned as the fitted values. Here, () is the cumulative distribution function of a
standard normal (pnorm).

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm and vgam.

Warning
Under- or over-flow may occur if the data is ill-conditioned. It is recommended that several different
initial values be used to help avoid local solutions.

Note
The response variable for this family function is the same as uninormal except positive values are
required. Reasonably good initial values are needed. Fisher scoring using simulation is imple-
mented.
See CommonVGAMffArguments for general information about many of these arguments.
Yet to do: implement the results of Johnson (1962) which gives expressions for the EIM, albeit,
under a different parameterization. Also, one element of the EIM appears to require numerical
integration.

Author(s)
Thomas W. Yee

References
Lin, P. C. (2005) Application of the generalized folded-normal distribution to the process capability
measures. International Journal of Advanced Manufacturing Technology, 26, 825830.
Johnson, N. L. (1962) The folded normal distribution: accuracy of estimation by maximum likeli-
hood. Technometrics, 4, 249256.
foldsqrt 277

See Also
rfoldnorm, uninormal, dnorm, skewnormal.

Examples
## Not run: m <- 2; SD <- exp(1)
fdata <- data.frame(y = rfoldnorm(n <- 1000, m = m, sd = SD))
hist(with(fdata, y), prob = TRUE, main = paste("foldnormal(m = ", m,
", sd = ", round(SD, 2), ")"))
fit <- vglm(y ~ 1, foldnormal, data = fdata, trace = TRUE)
coef(fit, matrix = TRUE)
(Cfit <- Coef(fit))
# Add the fit to the histogram:
mygrid <- with(fdata, seq(min(y), max(y), len = 200))
lines(mygrid, dfoldnorm(mygrid, Cfit[1], Cfit[2]), col = "orange")

## End(Not run)

foldsqrt Folded Square Root Link Function

Description
Computes the folded square root transformation, including its inverse and the first two derivatives.

Usage
foldsqrt(theta, min = 0, max = 1, mux = sqrt(2),
inverse = FALSE, deriv = 0, short = TRUE, tag = FALSE)

Arguments
theta Numeric or character. See below for further details.
min, max, mux These are called L, U and K below.
inverse, deriv, short, tag
Details at Links.

Details
The folded square root link function can be applied to parameters that lie between L and U inclusive.
Numerical values of theta out of range result in NA or NaN.

Value

For foldsqrt with deriv = 0: K( L U ) or mux * (sqrt(theta-min) - sqrt(max-theta))
when inverse = FALSE, and if inverse = TRUE then some more complicated function that returns
a NA unless theta is between -mux*sqrt(max-min) and mux*sqrt(max-min).
For deriv = 1, then the function returns d eta / d theta as a function of theta if inverse = FALSE,
else if inverse = TRUE then it returns the reciprocal.
278 foldsqrt

Note
The default has, if theta is 0 or 1, the link function value is -sqrt(2) and +sqrt(2) respec-
tively. These are finite values, therefore one cannot use this link function for general modelling of
probabilities because of numerical problem, e.g., with binomialff, cumulative. See the example
below.

Author(s)
Thomas W. Yee

p <- c(seq(-0.02, 0.02, by = 0.01), seq(0.97, 1.02, by = 0.01))

foldsqrt(p) # Has NAs

## Not run:
p <- seq(0.01, 0.99, by = 0.01)
par(mfrow = c(2, 2), lwd = (mylwd <- 2))
y <- seq(-4, 4, length = 100)
for (d in 0:1) {
matplot(p, cbind(logit(p, deriv = d), foldsqrt(p, deriv = d)),
type = "n", col = "purple", ylab = "transformation", las = 1,
main = if (d == 0) "Some probability link functions"
else "First derivative")
lines(p, logit(p, deriv = d), col = "limegreen")
lines(p, probit(p, deriv = d), col = "purple")
lines(p, cloglog(p, deriv = d), col = "chocolate")
lines(p, foldsqrt(p, deriv = d), col = "tan")
if (d == 0) {
abline(v = 0.5, h = 0, lty = "dashed")
legend(0, 4.5, c("logit", "probit", "cloglog", "foldsqrt"), lwd = 2,
col = c("limegreen","purple","chocolate", "tan"))
} else
abline(v = 0.5, lty = "dashed")
}

for (d in 0) {
matplot(y, cbind(logit(y, deriv = d, inverse = TRUE),
foldsqrt(y, deriv = d, inverse = TRUE)),
type = "n", col = "purple", xlab = "transformation", ylab = "p",
lwd = 2, las = 1,
main = if (d == 0) "Some inverse probability link functions"
else "First derivative")
lines(y, logit(y, deriv = d, inverse = TRUE), col = "limegreen")
formulavlm 279

lines(y, probit(y, deriv = d, inverse = TRUE), col = "purple")

lines(y, cloglog(y, deriv = d, inverse = TRUE), col = "chocolate")
lines(y, foldsqrt(y, deriv = d, inverse = TRUE), col = "tan")
if (d == 0) {
abline(h = 0.5, v = 0, lty = "dashed")
legend(-4, 1, c("logit", "probit", "cloglog", "foldsqrt"), lwd = 2,
col = c("limegreen","purple","chocolate", "tan"))
}
}
par(lwd = 1)

## End(Not run)

# This is lucky to converge

fit.h <- vglm(agaaus ~ sm.bs(altitude), binomialff(link = foldsqrt(mux = 5)),
data = hunua, trace = TRUE)
## Not run:
plotvgam(fit.h, se = TRUE, lcol = "orange", scol = "orange",
main = "Orange is Hunua, Blue is Waitakere")
## End(Not run)
head(predict(fit.h, hunua, type = "response"))

## Not run:
# The following fails.
pneumo <- transform(pneumo, let = log(exposure.time))
fit <- vglm(cbind(normal, mild, severe) ~ let,
cumulative(link = foldsqrt(mux = 10), par = TRUE, rev = TRUE),
data = pneumo, trace = TRUE, maxit = 200)
## End(Not run)

formulavlm Model Formulae and Term Names for VGLMs

Description

The methods function for formula to extract the formula from a fitted object, as well as a methods
function to return the names of the terms in the formula.

Usage

## S3 method for class 'vlm'

formula(x, ...)
formulavlm(x, form.number = 1, ...)
term.names(model, ...)
term.namesvlm(model, form.number = 1, ...)
280 Frank

Arguments
x, model A fitted model object.
form.number Formula number, is 1 or 2. which correspond to the arguments formula and
form2 respectively.
... Same as formula.

Details
The formula methods function is based on formula.

Value
The formula methods function should return something similar to formula. The term.names
methods function should return a character string with the terms in the formula; this includes any
intercept (which is denoted by "(Intercept)" as the first element.)

Author(s)
Thomas W. Yee

Frank Franks Bivariate Distribution

Description
Density, distribution function, and random generation for the (one parameter) bivariate Frank dis-
tribution.
Frank 281

Usage
dbifrankcop(x1, x2, apar, log = FALSE)
pbifrankcop(q1, q2, apar)
rbifrankcop(n, apar)

Arguments
x1, x2, q1, q2 vector of quantiles.
n number of observations. Same as in runif.
apar the positive association parameter.
log Logical. If log = TRUE then the logarithm of the density is returned.

Details
See bifrankcop, the VGAM family functions for estimating the association parameter by maxi-
mum likelihood estimation, for the formula of the cumulative distribution function and other details.

Value
dbifrankcop gives the density, pbifrankcop gives the distribution function, and rbifrankcop
generates random deviates (a two-column matrix).

Author(s)
T. W. Yee

References
Genest, C. (1987) Franks family of bivariate distributions. Biometrika, 74, 549555.

plot(rr <- rbifrankcop(n = 3000, apar = exp(4)))

par(mfrow = c(1, 2))
hist(rr[, 1]); hist(rr[, 2]) # Should be uniform

## End(Not run)
282 Frechet

Frechet The Frechet Distribution

Description
Density, distribution function, quantile function and random generation for the three parameter
Frechet distribution.

Usage
dfrechet(x, location = 0, scale = 1, shape, log = FALSE)
pfrechet(q, location = 0, scale = 1, shape,
lower.tail = TRUE, log.p = FALSE)
qfrechet(p, location = 0, scale = 1, shape,
lower.tail = TRUE, log.p = FALSE)
rfrechet(n, location = 0, scale = 1, shape)

Arguments
x, q vector of quantiles.
p vector of probabilities.
n number of observations. Passed into runif.
location, scale, shape
the location parameter a, scale parameter b, and shape parameter s.
log Logical. If log = TRUE then the logarithm of the density is returned.
lower.tail, log.p
Same meaning as in punif or qunif.

Details
See frechet, the VGAM family function for estimating the 2 parameters (without location param-
eter) by maximum likelihood estimation, for the formula of the probability density function and
range restrictions on the parameters.

Value
dfrechet gives the density, pfrechet gives the distribution function, qfrechet gives the quantile
function, and rfrechet generates random deviates.

Author(s)
T. W. Yee and Kai Huang

frechet Frechet Distribution Family Function

Description
Maximum likelihood estimation of the 2-parameter Frechet distribution.

Usage
frechet(location = 0, lscale = "loge", lshape = logoff(offset = -2),
iscale = NULL, ishape = NULL, nsimEIM = 250, zero = NULL)

Arguments
location Numeric. Location parameter. It is called a below.
lscale, lshape Link functions for the parameters; see Links for more choices.
iscale, ishape, zero, nsimEIM
See CommonVGAMffArguments for information.

Details
The (3-parameter) Frechet distribution has a density function that can be written
sb
f (y) = [b/(y a)]s1 exp[(b/(y a))s ]
(y a)2
for y > a and scale parameter b > 0. The positive shape parameter is s. The cumulative distribution
function is
F (y) = exp[(b/(y a))s ].
The mean of Y is a + b(1 1/s) for s > 1 (these are returned as the fitted values). The variance
of Y is b2 [(1 2/s) 2 (1 1/s)] for s > 2.
Family frechet has a known, and log(b) and log(s 2) are the default linear/additive predictors.
The working weights are estimated by simulated Fisher scoring.
284 freund61

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm and vgam.

Warning
Family function frechet may fail for low values of the shape parameter, e.g., near 2 or lower.

Author(s)
T. W. Yee

References
Castillo, E., Hadi, A. S., Balakrishnan, N. Sarabia, J. S. (2005) Extreme Value and Related Models
with Applications in Engineering and Science, Hoboken, NJ, USA: Wiley-Interscience.

freund61 Freunds (1961) Bivariate Extension of the Exponential Distribution

Description
Estimate the four parameters of the Freund (1961) bivariate extension of the exponential distribution
by maximum likelihood estimation.

Usage
freund61(la = "loge", lap = "loge", lb = "loge", lbp = "loge",
ia = NULL, iap = NULL, ib = NULL, ibp = NULL,
independent = FALSE, zero = NULL)
freund61 285

Arguments
la, lap, lb, lbp
Link functions applied to the (positive) parameters , 0 , and 0 , respectively
(the p stands for prime). See Links for more choices.
ia, iap, ib, ibp
Initial value for the four parameters respectively. The default is to estimate them
all internally.
independent Logical. If TRUE then the parameters are constrained to satisfy = 0 and
= 0 , which implies that y1 and y2 are independent and each have an ordinary
exponential distribution.
zero A vector specifying which linear/additive predictors are modelled as intercepts
only. The values can be from the set {1,2,3,4}. The default is none of them. See
CommonVGAMffArguments for more information.

Details
This model represents one type of bivariate extension of the exponential distribution that is appli-
cable to certain problems, in particular, to two-component systems which can function if one of the
components has failed. For example, engine failures in two-engine planes, paired organs such as
peoples eyes, ears and kidneys. Suppose y1 and y2 are random variables representing the lifetimes
of two components A and B in a two component system. The dependence between y1 and y2 is
essentially such that the failure of the B component changes the parameter of the exponential life
distribution of the A component from to 0 , while the failure of the A component changes the
parameter of the exponential life distribution of the B component from to 0 .
The joint probability density function is given by

f (y1 , y2 ) = 0 exp( 0 y2 ( + 0 )y1 )

for 0 < y1 < y2 , and

f (y1 , y2 ) = 0 exp(0 y1 ( + 0 )y2 )

for 0 < y2 < y1 . Here, all four parameters are positive, as well as the responses y1 and y2 . Under
this model, the probability that component A is the first to fail is /( + ). The time to the first
failure is distributed as an exponential distribution with rate + . Furthermore, the distribution
of the time from first failure to failure of the other component is a mixture of Exponential(0 ) and
Exponential( 0 ) with proportions /( + ) and /( + ) respectively.
The marginal distributions are, in general, not exponential. By default, the linear/additive predictors
are 1 = log(), 2 = log(0 ), 3 = log(), 4 = log( 0 ).
A special case is when = 0 and = 0 , which means that y1 and y2 are independent, and both
have an ordinary exponential distribution with means 1/ and 1/ respectively.
Fisher scoring is used, and the initial values correspond to the MLEs of an intercept model. Conse-
quently, convergence may take only one iteration.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm and vgam.
286 freund61

Note
To estimate all four parameters, it is necessary to have some data where y1 < y2 and y2 < y1 .
The response must be a two-column matrix, with columns y1 and y2 . Currently, the fitted value is
a matrix with two columns; the first column has values (0 + )/(0 ( + )) for the mean of y1 ,
while the second column has values ( 0 + )/( 0 ( + )) for the mean of y2 . The variance of y1 is

(0 )2 + 2 + 2
,
(0 )2 ( + )2

the variance of y2 is
( 0 )2 + 2 + 2
,
( 0 )2 ( + )2
the covariance of y1 and y2 is
0 0
.
0 0 ( + )2

Author(s)
T. W. Yee

References
Freund, J. E. (1961) A bivariate extension of the exponential distribution. Journal of the American
Statistical Association, 56, 971977.

# y1 and y2 are independent, so fit an independence model

fit2 <- vglm(cbind(y1, y2) ~ 1, freund61(indep = TRUE),
data = fdata, trace = TRUE)
coef(fit2, matrix = TRUE)
constraints(fit2)
pchisq(2 * (logLik(fit1) - logLik(fit2)), # p-value
df = df.residual(fit2) - df.residual(fit1), lower.tail = FALSE)
lrtest(fit1, fit2) # Better alternative
gamma1 287

gamma1 1-parameter Gamma Distribution

Description
Estimates the 1-parameter gamma distribution by maximum likelihood estimation.

Usage
gamma1(link = "loge", zero = NULL)

Arguments
link Link function applied to the (positive) shape parameter. See Links for more
choices and general information.
zero Details at CommonVGAMffArguments.

Details
The density function is given by

f (y) = exp(y) y shape1 /(shape)

for shape > 0 and y > 0. Here, (shape) is the gamma function, as in gamma. The mean of Y
(returned as the fitted values) is = shape, and the variance is 2 = shape.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm and vgam.

Note
This VGAM family function can handle a multiple responses, which is inputted as a matrix.
The parameter shape matches with shape in rgamma. The argument rate in rgamma is assumed 1
for this family function.
If rate is unknown use the family function gammaR to estimate it too.

Author(s)
T. W. Yee

References
Most standard texts on statistical distributions describe the 1-parameter gamma distribution, e.g.,
Forbes, C., Evans, M., Hastings, N. and Peacock, B. (2011) Statistical Distributions, Hoboken, NJ,
USA: John Wiley and Sons, Fourth edition.
288 gamma2

See Also

gammaR for the 2-parameter gamma distribution, lgamma1, lindley, simulate.vlm.

Examples
gdata <- data.frame(y = rgamma(n = 100, shape = exp(3)))
fit <- vglm(y ~ 1, gamma1, data = gdata, trace = TRUE, crit = "coef")
coef(fit, matrix = TRUE)
Coef(fit)
summary(fit)

gamma2 2-parameter Gamma Distribution

Description

Estimates the 2-parameter gamma distribution by maximum likelihood estimation.

Usage

gamma2(lmu = "loge", lshape = "loge",

imethod = 1, ishape = NULL,
parallel = FALSE, deviance.arg = FALSE, zero = "shape")

Arguments

lmu, lshape Link functions applied to the (positive) mu and shape parameters (called and
a respectively). See Links for more choices.
ishape Optional initial value for shape. A NULL means a value is computed internally.
If a failure to converge occurs, try using this argument. This argument is ignored
if used within cqo; see the iShape argument of qrrvglm.control instead.
imethod An integer with value 1 or 2 which specifies the initialization method for the
parameter. If failure to converge occurs try another value (and/or specify a value
for ishape).
deviance.arg Logical. If TRUE, the deviance function is attached to the object. Under ordi-
nary circumstances, it should be left alone because it really assumes the shape
parameter is at the maximum likelihood estimate. Consequently, one cannot use
that criterion to minimize within the IRLS algorithm. It should be set TRUE only
when used with cqo under the fast algorithm.
zero See CommonVGAMffArguments for information.
parallel Details at CommonVGAMffArguments. If parallel = TRUE then the constraint
is not applied to the intercept.
gamma2 289

Details
This distribution can model continuous skewed responses. The density function is given by

exp(ay/) (ay/)a1 a
f (y; , a) =
(a)

for > 0, a > 0 and y > 0. Here, () is the gamma function, as in gamma. The mean of Y is
= (returned as the fitted values) with variance 2 = 2 /a. If 0 < a < 1 then the density has a
pole at the origin and decreases monotonically as y increases. If a = 1 then this corresponds to the
exponential distribution. If a > 1 then the density is zero at the origin and is unimodal with mode
at y = /a; this can be achieved with lshape="loglog".
By default, the two linear/additive predictors are 1 = log() and 2 = log(a). This family function
implements Fisher scoring and the working weight matrices are diagonal.
This VGAM family function handles multivariate responses, so that a matrix can be used as the
response. The number of columns is the number of species, say, and zero=-2 means that all species
have a shape parameter equalling a (different) intercept only.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm and vgam.

Note
The response must be strictly positive. A moment estimator for the shape parameter may be imple-
mented in the future.
If mu and shape are vectors, then rgamma(n = n, shape = shape, scale = mu/shape) will
generate random gamma variates of this parameterization, etc.; see GammaDist.
For cqo and cao, taking the logarithm of the response means (approximately) a gaussianff family
may be used on the transformed data.

Author(s)
T. W. Yee

References
The parameterization of this VGAM family function is the 2-parameter gamma distribution de-
scribed in the monograph
McCullagh, P. and Nelder, J. A. (1989) Generalized Linear Models, 2nd ed. London: Chapman &
Hall.

See Also
gamma1 for the 1-parameter gamma distribution, gammaR for another parameterization of the 2-
parameter gamma distribution that is directly matched with rgamma, bigamma.mckay for a bivari-
ate gamma distribution, expexpff, GammaDist, golf, CommonVGAMffArguments, simulate.vlm,
negloge.
290 gammahyperbola

Examples
# Essentially a 1-parameter gamma
gdata <- data.frame(y = rgamma(n = 100, shape = exp(1)))
fit1 <- vglm(y ~ 1, gamma1, data = gdata)
fit2 <- vglm(y ~ 1, gamma2, data = gdata, trace = TRUE, crit = "coef")
coef(fit2, matrix = TRUE)
c(Coef(fit2), colMeans(gdata))

# Essentially a 2-parameter gamma

gdata <- data.frame(y = rgamma(n = 500, rate = exp(-1), shape = exp(2)))
fit2 <- vglm(y ~ 1, gamma2, data = gdata, trace = TRUE, crit = "coef")
coef(fit2, matrix = TRUE)
c(Coef(fit2), colMeans(gdata))
summary(fit2)

gammahyperbola Gamma Hyperbola Bivariate Distribution

Description
Estimate the parameter of a gamma hyperbola bivariate distribution by maximum likelihood esti-
mation.

Usage
gammahyperbola(ltheta = "loge", itheta = NULL, expected = FALSE)

Arguments
ltheta Link function applied to the (positive) parameter . See Links for more choices.
itheta Initial value for the parameter. The default is to estimate it internally.
expected Logical. FALSE means the Newton-Raphson (using the observed information
matrix) algorithm, otherwise the expected information matrix is used (Fisher
scoring algorithm).

Details
The joint probability density function is given by

f (y1 , y2 ) = exp(e y1 / y2 )

for > 0, y1 > 0, y2 > 1. The random variables Y1 and Y2 are independent. The marginal
distribution of Y1 is an exponential distribution with rate parameter exp()/. The marginal
distribution of Y2 is an exponential distribution that has been shifted to the right by 1 and with rate
parameter . The fitted values are stored in a two-column matrix with the marginal means, which
are exp() and 1 + 1/.
The default algorithm is Newton-Raphson because Fisher scoring tends to be much slower for this
distribution.
gammaR 291

Value

An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm and vgam.

Note

The response must be a two-column matrix.

Author(s)

T. W. Yee

References

Reid, N. (2003) Asymptotics and the theory of inference. Annals of Statistics, 31, 16951731.

See Also

exponential.

Examples
gdata <- data.frame(x2 = runif(nn <- 1000))
gdata <- transform(gdata, theta = exp(-2 + x2))
gdata <- transform(gdata, y1 = rexp(nn, rate = exp(-theta)/theta),
y2 = rexp(nn, rate = theta) + 1)
fit <- vglm(cbind(y1, y2) ~ x2, gammahyperbola(expected = TRUE), data = gdata)
coef(fit, matrix = TRUE)
Coef(fit)
head(fitted(fit))
summary(fit)

gammaR 2-parameter Gamma Distribution

Description

Estimates the 2-parameter gamma distribution by maximum likelihood estimation.

Usage

gammaR(lrate = "loge", lshape = "loge", irate = NULL,

ishape = NULL, lss = TRUE, zero = "shape")
292 gammaR

Arguments

lrate, lshape Link functions applied to the (positive) rate and shape parameters. See Links
for more choices.
irate, ishape Optional initial values for rate and shape. A NULL means a value is computed
internally. If a failure to converge occurs, try using these arguments.
zero, lss Details at CommonVGAMffArguments.

Details

The density function is given by

f (y) = exp(rate y) y shape1 rateshape /(shape)

for shape > 0, rate > 0 and y > 0. Here, (shape) is the gamma function, as in gamma. The
mean of Y is = shape/rate (returned as the fitted values) with variance 2 = 2 /shape =
shape/rate2 . By default, the two linear/additive predictors are 1 = log(shape) and 2 =
log(rate).

Value

An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm and vgam.

Note

The parameters rate and shape match with the arguments rate and shape of rgamma. The order
of the arguments agree too. Here, scale = 1/rate is used, so one can use negloge. Multiple
responses are handled.
If rate = 1 use the family function gamma1 to estimate shape.

Author(s)

T. W. Yee

References

Most standard texts on statistical distributions describe the 2-parameter gamma distribution, e.g.,
Forbes, C., Evans, M., Hastings, N. and Peacock, B. (2011) Statistical Distributions, Hoboken, NJ,
USA: John Wiley and Sons, Fourth edition.

See Also

gamma1 for the 1-parameter gamma distribution, gamma2 for another parameterization of the 2-
parameter gamma distribution, bigamma.mckay for a bivariate gamma distribution, expexpff, simulate.vlm,
rgamma, negloge.
garma 293

Examples
# Essentially a 1-parameter gamma
gdata <- data.frame(y1 = rgamma(n <- 100, shape = exp(1)))
fit1 <- vglm(y1 ~ 1, gamma1, data = gdata, trace = TRUE)
fit2 <- vglm(y1 ~ 1, gammaR, data = gdata, trace = TRUE, crit = "coef")
coef(fit2, matrix = TRUE)
Coef(fit2)

# Essentially a 2-parameter gamma

gdata <- data.frame(y2 = rgamma(n = 500, rate = exp(1), shape = exp(2)))
fit2 <- vglm(y2 ~ 1, gammaR, data = gdata, trace = TRUE, crit = "coef")
coef(fit2, matrix = TRUE)
Coef(fit2)
summary(fit2)

garma GARMA (Generalized Autoregressive Moving-Average) Models

Description
Fits GARMA models to time series data.

Usage
garma(link = "identitylink", p.ar.lag = 1, q.ma.lag = 0,
coefstart = NULL, step = 1)

Arguments
link Link function applied to the mean response. The default is suitable for continu-
ous responses. The link loge should be chosen if the data are counts. The link
reciprocal can be chosen if the data are counts and the variance assumed for
this is 2 . The links logit, probit, cloglog, and cauchit are supported and
suitable for binary responses.
Note that when the log or logit link is chosen: for log and logit, zero values can
be replaced by bvalue. See loge and logit etc. for specific information about
each link function.
p.ar.lag A positive integer, the lag for the autoregressive component. Called p below.
q.ma.lag A non-negative integer, the lag for the moving-average component. Called q
below.
coefstart Starting values for the coefficients. Assigning this argument is highly recom-
mended. For technical reasons, the argument coefstart in vglm cannot be
used.
step Numeric. Step length, e.g., 0.5 means half-stepsizing.
294 garma

Details

This function draws heavily on Benjamin et al. (1998). See also Benjamin et al. (2003). GARMA
models extend the ARMA time series model to generalized responses in the exponential family,
e.g., Poisson counts, binary responses. Currently, this function is rudimentary and can handle only
certain continuous, count and binary responses only. The user must choose an appropriate link for
the link argument.
The GARMA(p, q) model is defined by firstly having a response belonging to the exponential family

yt t b(t )
f (yt |Dt ) = exp + c(yt , /At )
/At

where t and are the canonical and scale parameters respectively, and At are known prior weights.
The mean t = E(Yt |Dt ) = b0 (t ) is related to the linear predictor t by the link function g.
Here, Dt = {xt , . . . , x1 , yt1 , . . . , y1 , t1 , . . . , 1 } is the previous information set. Secondly, the
GARMA(p, q) model is defined by
p
X q
X
g(t ) = t = xTt + k (g(ytk ) xTtk ) + k (g(ytk ) tk ).
k=1 k=1

Parameter vectors , and are estimated by maximum likelihood.

Value

An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm.

Warning

This VGAM family function is non-standard in that the model does need some coercing to get it
into the VGLM framework. Special code is required to get it running. A consequence is that some
methods functions may give wrong results when applied to the fitted object.

Note

This function is unpolished and is requires lots of improvements. In particular, initialization is very
poor. Results appear very sensitive to quality of initial values. A limited amount of experience has
shown that half-stepsizing is often needed for convergence, therefore choosing crit = "coef" is
not recommended.
Overdispersion is not handled. For binomial responses it is currently best to input a vector of 1s and
0s rather than the cbind(successes, failures) because the initialize slot is rudimentary.

Author(s)

T. W. Yee
gaussianff 295

References
Benjamin, M. A., Rigby, R. A. and Stasinopoulos, M. D. (1998) Fitting Non-Gaussian Time Series
Models. Pages 191196 in: Proceedings in Computational Statistics COMPSTAT 1998 by Payne,
R. and P. J. Green. Physica-Verlag.
Benjamin, M. A., Rigby, R. A. and Stasinopoulos, M. D. (2003) Generalized Autoregressive Mov-
ing Average Models. Journal of the American Statistical Association, 98: 214223.
Zeger, S. L. and Qaqish, B. (1988) Markov regression models for time series: a quasi-likelihood
approach. Biometrics, 44: 10191031.

Examples
gdata <- data.frame(interspike = c(68, 41, 82, 66, 101, 66, 57, 41, 27, 78,
59, 73, 6, 44, 72, 66, 59, 60, 39, 52,
50, 29, 30, 56, 76, 55, 73, 104, 104, 52,
25, 33, 20, 60, 47, 6, 47, 22, 35, 30,
29, 58, 24, 34, 36, 34, 6, 19, 28, 16,
36, 33, 12, 26, 36, 39, 24, 14, 28, 13,
2, 30, 18, 17, 28, 9, 28, 20, 17, 12,
19, 18, 14, 23, 18, 22, 18, 19, 26, 27,
23, 24, 35, 22, 29, 28, 17, 30, 34, 17,
20, 49, 29, 35, 49, 25, 55, 42, 29, 16)) # See Zeger and Qaqish (1988)
gdata <- transform(gdata, spikenum = seq(interspike))
bvalue <- 0.1 # .Machine$double.xmin # Boundary value
fit <- vglm(interspike ~ 1, trace = TRUE, data = gdata,
garma(loge(bvalue = bvalue),
p = 2, coefstart = c(4, 0.3, 0.4)))
summary(fit)
coef(fit, matrix = TRUE)
Coef(fit) # A bug here
## Not run: with(gdata, plot(interspike, ylim = c(0, 120), las = 1,
xlab = "Spike Number", ylab = "Inter-Spike Time (ms)", col = "blue"))
with(gdata, lines(spikenum[-(1:fit@misc$plag)], fitted(fit), col = "orange"))
abline(h = mean(with(gdata, interspike)), lty = "dashed", col = "gray")
## End(Not run)

gaussianff Gaussian (Normal) Family Function

Description
Fits a generalized linear model to a response with Gaussian (normal) errors. The dispersion param-
eter may be known or unknown.

Usage
gaussianff(dispersion = 0, parallel = FALSE, zero = NULL)
296 gaussianff

Arguments
parallel A logical or formula. If a formula, the response of the formula should be a
logical and the terms of the formula indicates whether or not those terms are
parallel.
dispersion Dispersion parameter. If 0 then it is estimated and the moment estimate is put in
object@misc$dispersion; it is assigned the value
n
X
(yi i )T Wi (yi i )/(nM p)
i=1

where p is the total number of parameters estimated (for RR-VGLMs the value
used is the number of columns in the large X model matrix; this may not be
correct). If the argument is assigned a positive quantity then it is assumed to be
known with that value.
zero An integer-valued vector specifying which linear/additive predictors are mod-
elled as intercepts only. The values must be from the set {1,2,. . . ,M } where M
is the number of columns of the matrix response.

Details
This function is usually used in conjunction with vglm, else vlm is recommended instead. The
notation M is used to denote the number of linear/additive predictors. This function can handle
any finite M , and the default is to use ordinary least squares. A vector linear/additive model can be
fitted by minimizing
Xn
(yi i )T Wi (yi i )
i=1

where yi is a M -vector, i is the vector of linear/additive predictors. The Wi is any positive-

definite matrix, and the default is the order-M identity matrix. The Wi can be inputted using
the weights argument of vlm/vglm/vgam etc., and the format is the matrix-band format whereby
it is a n A matrix with the diagonals are passed first, followed by next the upper band, all
the way to the (1, M ) element. Here, A has maximum value of M (M + 1)/2 and a minimum
value of M . Usually the weights argument of vlm/vglm/vgam/rrvglm is just a vector, in which
case each element is multiplied by a order-M identity matrix. If in doubt, type something like
weights(object, type="working") after the model has been fitted.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, rrvglm and vgam.

Note
This VGAM family function is supposed to be similar to gaussian but is is not compatible with
glm. The "ff" in the name is added to avoid any masking problems.

Author(s)
Thomas W. Yee
GenbetaII 297

References

McCullagh, P. and Nelder, J. A. (1989) Generalized Linear Models, 2nd ed. London: Chapman &
Hall.
Yee, T. W. and Wild, C. J. (1996) Vector generalized additive models. Journal of the Royal Statisti-
cal Society, Series B, Methodological, 58, 481493.

See Also

uninormal, huber2, lqnorm, binormal, SURff. vlm, vglm, vgam, rrvglm.

Examples
gdata <- data.frame(x2 = sort(runif(n <- 40)))
gdata <- transform(gdata, y1 = 1 + 2*x2 + rnorm(n, sd = 0.1),
y2 = 3 + 4*x2 + rnorm(n, sd = 0.1),
y3 = 7 + 4*x2 + rnorm(n, sd = 0.1))
fit <- vglm(cbind(y1,y2) ~ x2, gaussianff, data = gdata)
coef(fit, matrix = TRUE)

# For comparison:
coef( lmfit <- lm(y1 ~ x2, data = gdata))
coef(glmfit <- glm(y2 ~ x2, data = gdata, gaussian))
vcov(fit)
vcov(lmfit)

t(weights(fit, type = "prior")) # Unweighted observations

head(weights(fit, type = "working")) # Identity matrices

# Reduced-rank VLM (rank-1)

fit2 <- rrvglm(cbind(y1, y2, y3) ~ x2, gaussianff, data = gdata)
Coef(fit2)

GenbetaII The Generalized Beta II Distribution

Description

Density for the generalized beta II distribution with shape parameters a and p and q, and scale
parameter scale.

Usage

dgenbetaII(x, scale = 1, shape1.a, shape2.p, shape3.q, log = FALSE)

298 genbetaII

Arguments
x vector of quantiles.
shape1.a, shape2.p, shape3.q
positive shape parameters.
scale positive scale parameter.
log Logical. If log = TRUE then the logarithm of the density is returned.

Details
See genbetaII, which is the VGAM family function for estimating the parameters by maximum
likelihood estimation. Several distributions, such as the Singh-Maddala, are special case of this
flexible 4-parameter distribution. The product of shape1.a and shape2.p determines the behaviour
of the density at the origin.

Value
dgenbetaII gives the density.

Author(s)
T. W. Yee

References
Kleiber, C. and Kotz, S. (2003) Statistical Size Distributions in Economics and Actuarial Sciences,
Hoboken, NJ, USA: Wiley-Interscience.

genbetaII Generalized Beta Distribution of the Second Kind

Description
Maximum likelihood estimation of the 4-parameter generalized beta II distribution.
genbetaII 299

Usage
genbetaII(lscale = "loge", lshape1.a = "loge", lshape2.p = "loge",
lshape3.q = "loge", iscale = NULL, ishape1.a = NULL,
ishape2.p = NULL, ishape3.q = NULL, lss = TRUE,
gscale = exp(-5:5), gshape1.a = exp(-5:5),
gshape2.p = exp(-5:5), gshape3.q = exp(-5:5),
zero = "shape")

Arguments
lss See CommonVGAMffArguments for important information.
lshape1.a, lscale, lshape2.p, lshape3.q
Parameter link functions applied to the shape parameter a, scale parameter scale,
shape parameter p, and shape parameter q. All four parameters are positive. See
Links for more choices.
iscale, ishape1.a, ishape2.p, ishape3.q
Optional initial values for the parameters. A NULL means a value is computed
internally using the arguments gscale, gshape1.a, etc.
gscale, gshape1.a, gshape2.p, gshape3.q
See CommonVGAMffArguments for information. Replaced by iscale, ishape1.a
etc. if given.
zero The default is to set all the shape parameters to be intercept-only. See CommonVGAMffArguments
for information.

Details
This distribution is most useful for unifying a substantial number of size distributions. For example,
the Singh-Maddala, Dagum, Fisk (log-logistic), Lomax (Pareto type II), inverse Lomax, beta distri-
bution of the second kind distributions are all special cases. Full details can be found in Kleiber and
Kotz (2003), and Brazauskas (2002). The argument names given here are used by other families
that are special cases of this family. Fisher scoring is used here and for the special cases too.
The 4-parameter generalized beta II distribution has density

f (y) = ay ap1 /[bap B(p, q){1 + (y/b)a }p+q ]

for a > 0, b > 0, p > 0, q > 0, y 0. Here B is the beta function, and b is the scale parameter
scale, while the others are shape parameters. The mean is

E(Y ) = b (p + 1/a) (q 1/a)/((p) (q))

provided ap < 1 < aq; these are returned as the fitted values.
This family function handles multiple responses.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, and vgam.
300 genbetaII

Warning
This distribution is very flexible and it is not generally recommended to use this family function
when the sample size is smallnumerical problems easily occur with small samples. Probably
several hundred observations at least are needed in order to estimate the parameters with any level
of confidence. Neither is the inclusion of covariates recommended at allnot unless there are
several thousand observations. The mean is finite only when ap < 1 < aq, and this can be easily
violated by the parameter estimates for small sample sizes. Try fitting some of the special cases of
this distribution (e.g., sinmad, fisk, etc.) first, and then possibly use those models for initial values
for this distribution.

Note
The default is to use a grid search with respect to all four parameters; this is quite costly and is time
consuming. If the self-starting initial values fail, try experimenting with the initial value arguments.
Also, the constraint ap < 1 < aq may be violated as the iterations progress so it pays to monitor
convergence, e.g., set trace = TRUE. Successful convergence depends on having very good initial
values. This is rather difficult for this distribution so that a grid search is conducted by default. One
suggestion for increasing the estimation reliability is to set stepsize = 0.5 and maxit = 100; see
vglm.control.

Author(s)
T. W. Yee, with help from Victor Miranda.

References
Kleiber, C. and Kotz, S. (2003) Statistical Size Distributions in Economics and Actuarial Sciences,
Hoboken, NJ, USA: Wiley-Interscience.
Brazauskas, V. (2002) Fisher information matrix for the Feller-Pareto distribution. Statistics &
Probability Letters, 59, 159167.

See Also
dgenbetaII, betaff, betaII, dagum, sinmad, fisk, lomax, inv.lomax, paralogistic, inv.paralogistic,
lino, CommonVGAMffArguments, vglm.control.

Examples
## Not run:
gdata <- data.frame(y = rsinmad(3000, shape1 = exp(1), scale = exp(2),
shape3 = exp(1))) # A special case!
fit <- vglm(y ~ 1, genbetaII(lss = FALSE), data = gdata, trace = TRUE)
fit <- vglm(y ~ 1, data = gdata, trace = TRUE,
genbetaII(ishape1.a = 3, iscale = 7, ishape3.q = 2.3))
coef(fit, matrix = TRUE)
Coef(fit)
summary(fit)

## End(Not run)
gengamma.stacy 301

gengamma.stacy Generalized Gamma distribution family function

Description

Estimation of the 3-parameter generalized gamma distribution proposed by Stacy (1962).

Usage

gengamma.stacy(lscale = "loge", ld = "loge", lk = "loge",

iscale = NULL, id = NULL, ik = NULL,
gscale = exp(-5:5), gshape1.d = exp(-5:5),
gshape2.k = exp(-5:5), zero = NULL)

Arguments

lscale, ld, lk Parameter link function applied to each of the positive parameters b, d and k,
respectively. See Links for more choices.
iscale, id, ik Initial value for b, d and k, respectively. The defaults mean an initial value is
determined internally for each.
gscale, gshape1.d, gshape2.k
See CommonVGAMffArguments for information. Replaced by iscale, id etc. if
given.
zero See CommonVGAMffArguments for information.

Details

The probability density function can be written

f (y; b, d, k) = dbdk y dk1 exp[(y/b)d ]/(k)

for scale parameter b > 0, and d > 0, k > 0, and y > 0. The mean of Y is b (k + 1/d)/(k)
(returned as the fitted values), which equals bk if d = 1.
There are many special cases, as given in Table 1 of Stacey and Mihram (1965). In the fol-
lowing, the parameters are in the order b, d, k. The special cases are: Exponential f (y; b, 1, 1),
Gamma f (y; b, 1, k), Weibull f (y; b, d, 1), Chi Squared f (y; 2, 1, a/2)
with a degrees of freedom,
Chi f(y; 2, 2, a/2) with a degrees of freedom, Half-normal f (y; 2, 2, 1/2), Circular normal
f (y; 2, 2, 1), Spherical normal f (y; 2, 2, 3/2), Rayleigh f (y; c 2, 2, 1) where c > 0.

Value

An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, and vgam.
302 gengamma.stacy

Warning

Several authors have considered maximum likelihood estimation for the generalized gamma distri-
bution and have found that the Newton-Raphson algorithm does not work very well and that the
existence of solutions to the log-likelihood equations is sometimes in doubt. Although Fisher scor-
ing is used here, it is likely that the same problems will be encountered. It appears that large samples
are required, for example, the estimator of k became asymptotically normal only with 400 or more
observations. It is not uncommon for maximum likelihood estimates to fail to converge even with
two or three hundred observations. With covariates, even more observations are needed to increase
the chances of convergence. Using covariates is not advised unless the sample size is at least a few
thousand.

Note

The notation used here differs from Stacy (1962) and Prentice (1974). Poor initial values may result
in failure to converge so if there are covariates and there are convergence problems, try using the
zero argument (e.g., zero = 2:3) or the ik argument.

Author(s)

T. W. Yee

References

Stacy, E. W. (1962) A generalization of the gamma distribution. Annals of Mathematical Statistics,

33(3), 11871192.
Stacy, E. W. and Mihram, G. A. (1965) Parameter estimation for a generalized gamma distribution.
Technometrics, 7, 349358.
Prentice, R. L. (1974) A log gamma model and its maximum likelihood estimation. Biometrika, 61,
539544.

See Also

rgengamma.stacy, gamma1, gamma2, prentice74, simulate.vlm.

Examples

k <- exp(-1); Scale = exp(1)

gdata <- data.frame(y = rgamma(1000, shape = k, scale = Scale))
fit <- vglm(y ~ 1, gengamma.stacy, data = gdata, trace = TRUE)
coef(fit, matrix = TRUE)
gengammaUC 303

gengammaUC The Generalized Gamma Distribution

Description
Density, distribution function, quantile function and random generation for the generalized gamma
distribution with scale parameter scale, and parameters d and k.

Usage
dgengamma.stacy(x, scale = 1, d = 1, k = 1, log = FALSE)
pgengamma.stacy(q, scale = 1, d = 1, k = 1,
lower.tail = TRUE, log.p = FALSE)
qgengamma.stacy(p, scale = 1, d = 1, k = 1,
lower.tail = TRUE, log.p = FALSE)
rgengamma.stacy(n, scale = 1, d = 1, k = 1)

Arguments
x, q vector of quantiles.
p vector of probabilities.
n number of observations. Same as in runif.
scale the (positive) scale parameter b.
d, k the (positive) parameters d and k.
log Logical. If log = TRUE then the logarithm of the density is returned.
lower.tail, log.p
Same meaning as in pnorm or qnorm.

Details
See gengamma.stacy, the VGAM family function for estimating the generalized gamma distribu-
tion by maximum likelihood estimation, for formulae and other details. Apart from n, all the above
arguments may be vectors and are recyled to the appropriate length if necessary.

Value
dgengamma.stacy gives the density, pgengamma.stacy gives the distribution function, qgengamma.stacy
gives the quantile function, and rgengamma.stacy generates random deviates.

Author(s)
T. W. Yee and Kai Huang

References
Stacy, E. W. and Mihram, G. A. (1965) Parameter estimation for a generalized gamma distribution.
Technometrics, 7, 349358.
304 genpoisson

genpoisson Generalized Poisson distribution

Description
Estimation of the two-parameter generalized Poisson distribution.

Usage
genpoisson(llambda = "rhobit", ltheta = "loge",
ilambda = NULL, itheta = NULL,
use.approx = TRUE, imethod = 1, ishrinkage = 0.95,
zero = "lambda")

Arguments
llambda, ltheta
Parameter link functions for and . See Links for more choices. The pa-
rameter lies at least within the interval [1, 1]; see below for more details, and
an alternative link is rhobit. The parameter is positive, therefore the default
is the log link.
ilambda, itheta
Optional initial values for and . The default is to choose values internally.
use.approx Logical. If TRUE then an approximation to the expected information matrix is
used, otherwise Newton-Raphson is used.
imethod An integer with value 1 or 2 or 3 which specifies the initialization method for the
parameters. If failure to converge occurs try another value and/or else specify a
value for ilambda and/or itheta.
ishrinkage, zero
See CommonVGAMffArguments for information.
genpoisson 305

Details
The generalized Poisson distribution has density

f (y) = ( + y)y1 exp( y)/y!

for > 0 and y = 0, 1, 2, . . .. Now max(1, /m) 1 where m( 4) is the greatest

positive integer satisfying + m > 0 when < 0 [and then P (Y = y) = 0 for y > m]. Note
the complicated support for this distribution means, for some data sets, the default link for llambda
will not always work, and some tinkering may be required to get it running.
As Consul and Famoye (2006) state on p.165, the lower limits on and m 4 are imposed to
ensure that there are at least 5 classes with nonzero probability when is negative.
An ordinary Poisson distribution corresponds to = 0. The mean (returned as the fitted values) is
E(Y ) = /(1 ) and the variance is /(1 )3 .
For more information see Consul and Famoye (2006) for a summary and Consul (1989) for full
details.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, and vgam.

Warning
Monitor convergence! This family function is fragile. Dont get confused because theta (and not
lambda) here really matches more closely with lambda of dpois.

Note
This family function handles multiple responses. This distribution is potentially useful for disper-
sion modelling. Convergence problems may occur when lambda is very close to 0 or 1. If a failure
occurs then you might want to try something like llambda = extlogit(min = -0.9, max = 1) to
handle the LHS complicated constraint, and if that doesnt work, try llambda = extlogit(min = -0.8, max = 1),
etc.

Author(s)
T. W. Yee

References
Consul, P. C. and Famoye, F. (2006) Lagrangian Probability Distributions, Boston, USA: Birkhauser.
Jorgensen, B. (1997) The Theory of Dispersion Models. London: Chapman & Hall
Consul, P. C. (1989) Generalized Poisson Distributions: Properties and Applications. New York,
USA: Marcel Dekker.

See Also
poissonff, dpois. dgenpois, rhobit, extlogit.
306 genray

Examples
gdata <- data.frame(x2 = runif(nn <- 200))
gdata <- transform(gdata, y1 = rpois(nn, exp(2 - x2))) # Poisson data
fit <- vglm(y1 ~ x2, genpoisson, data = gdata, trace = TRUE)
coef(fit, matrix = TRUE)
summary(fit)

genray The Generalized Rayleigh Distribution

Description
Density, distribution function, quantile function and random generation for the generalized Rayleigh
distribution.

Usage
dgenray(x, scale = 1, shape, log = FALSE)
pgenray(q, scale = 1, shape, lower.tail = TRUE, log.p = FALSE)
qgenray(p, scale = 1, shape, lower.tail = TRUE, log.p = FALSE)
rgenray(n, scale = 1, shape)

Details
See genrayleigh, the VGAM family function for estimating the parameters, for the formula of the
probability density function and other details.

Value
dgenray gives the density, pgenray gives the distribution function, qgenray gives the quantile
function, and rgenray generates random deviates.

Note
We define scale as the reciprocal of the scale parameter used by Kundu and Raqab (2005).
genrayleigh 307

Author(s)
Kai Huang and J. G. Lauder and T. W. Yee

genrayleigh Generalized Rayleigh Distribution Family Function

Description
Estimates the two parameters of the generalized Rayleigh distribution by maximum likelihood esti-
mation.

Usage
genrayleigh(lscale = "loge", lshape = "loge",
iscale = NULL, ishape = NULL,
tol12 = 1e-05, nsimEIM = 300, zero = 2)

Arguments
lscale, lshape Link function for the two positive parameters, scale and shape. See Links for
more choices.
iscale, ishape Numeric. Optional initial values for the scale and shape parameters.
nsimEIM, zero See CommonVGAMffArguments.
tol12 Numeric and positive. Tolerance for testing whether the second shape parameter
is either 1 or 2. If so then the working weights need to handle these singularities.
308 genrayleigh

Details

The generalized Rayleigh distribution has density function

2 2
f (y; b = scale, s = shape) = (2sy/b2 )e(y/b) (1 e(y/b) )s1

where y > 0 and the two parameters, b and s, are positive. The mean cannot be expressed nicely
so the median is returned as the fitted values. Applications of the generalized Rayleigh distribution
include modeling strength data and general lifetime data. Simulated Fisher scoring is implemented.

Value

An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm and vgam.

Note

We define scale as the reciprocal of the scale parameter used by Kundu and Raqab (2005).

Author(s)

J. G. Lauder and T. W. Yee

References

Kundu, D., Raqab, M. C. (2005). Generalized Rayleigh distribution: different methods of estima-
tions. Computational Statistics and Data Analysis, 49, 187200.

See Also

dgenray, rayleigh.

Examples

Scale <- exp(1); shape <- exp(1)

rdata <- data.frame(y = rgenray(n = 1000, scale = Scale, shape = shape))
fit <- vglm(y ~ 1, genrayleigh, data = rdata, trace = TRUE)
c(with(rdata, mean(y)), head(fitted(fit), 1))
coef(fit, matrix = TRUE)
Coef(fit)
summary(fit)
geometric 309

geometric Geometric (Truncated and Untruncated) Distributions

Description
Maximum likelihood estimation for the geometric and truncated geometric distributions.

Usage
geometric(link = "logit", expected = TRUE, imethod = 1,
iprob = NULL, zero = NULL)
truncgeometric(upper.limit = Inf,
link = "logit", expected = TRUE, imethod = 1,
iprob = NULL, zero = NULL)

Arguments
link Parameter link function applied to the probability parameter p, which lies in the
unit interval. See Links for more choices.
expected Logical. Fisher scoring is used if expected = TRUE, else Newton-Raphson.
iprob, imethod, zero
See CommonVGAMffArguments for details.
upper.limit Numeric. Upper values. As a vector, it is recycled across responses first. The
default value means both family functions should give the same result.

Details
A random variable Y has a 1-parameter geometric distribution if P (Y = y) = p(1 p)y for
y = 0, 1, 2, . . .. Here, p is the probability of success, and Y is the number of (independent) trials
that are fails until a success occurs. Thus the response Y should be a non-negative integer. The mean
of Y is E(Y ) = (1 p)/p and its variance is V ar(Y ) = (1 p)/p2 . The geometric distribution is
a special case of the negative binomial distribution (see negbinomial). The geometric distribution
is also a special case of the Borel distribution, which is a Lagrangian distribution. If Y has a
geometric distribution with parameter p then Y + 1 has a positive-geometric distribution with the
same parameter. Multiple responses are permitted.
For truncgeometric(), the (upper) truncated geometric distribution can have response integer val-
ues from 0 to upper.limit. It has density prob * (1 - prob)^y / [1-(1-prob)^(1+upper.limit)].
For a generalized truncated geometric distribution with integer values L to U , say, subtract L from
the response and feed in U L as the upper limit.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, and vgam.
310 get.smart

Author(s)
T. W. Yee. Help from Viet Hoang Quoc is gratefully acknowledged.

References
Forbes, C., Evans, M., Hastings, N. and Peacock, B. (2011) Statistical Distributions, Hoboken, NJ,
USA: John Wiley and Sons, Fourth edition.

See Also
negbinomial, Geometric, betageometric, expgeometric, zageometric, zigeometric, rbetageom,
simulate.vlm.

Examples
gdata <- data.frame(x2 = runif(nn <- 1000) - 0.5)
gdata <- transform(gdata, x3 = runif(nn) - 0.5,
x4 = runif(nn) - 0.5)
gdata <- transform(gdata, eta = -1.0 - 1.0 * x2 + 2.0 * x3)
gdata <- transform(gdata, prob = logit(eta, inverse = TRUE))
gdata <- transform(gdata, y1 = rgeom(nn, prob))
with(gdata, table(y1))
fit1 <- vglm(y1 ~ x2 + x3 + x4, geometric, data = gdata, trace = TRUE)
coef(fit1, matrix = TRUE)
summary(fit1)

# Truncated geometric (between 0 and upper.limit)

upper.limit <- 5
tdata <- subset(gdata, y1 <= upper.limit)
nrow(tdata) # Less than nn
fit2 <- vglm(y1 ~ x2 + x3 + x4, truncgeometric(upper.limit),
data = tdata, trace = TRUE)
coef(fit2, matrix = TRUE)

# Generalized truncated geometric (between lower.limit and upper.limit)

lower.limit <- 1
upper.limit <- 8
gtdata <- subset(gdata, lower.limit <= y1 & y1 <= upper.limit)
with(gtdata, table(y1))
nrow(gtdata) # Less than nn
fit3 <- vglm(y1 - lower.limit ~ x2 + x3 + x4,
truncgeometric(upper.limit - lower.limit),
data = gtdata, trace = TRUE)
coef(fit3, matrix = TRUE)

get.smart Retrieve One Component of .smart.prediction

get.smart.prediction 311

Description
Retrieve one component of the list .smart.prediction from smartpredenv.

Usage
get.smart()

Details
get.smart is used in "read" mode within a smart function: it retrieves parameters saved at the
time of fitting, and is used for prediction. get.smart is only used in smart functions such as
sm.poly; get.smart.prediction is only used in modelling functions such as lm and glm. The
function get.smart gets only a part of .smart.prediction whereas get.smart.prediction gets
the entire .smart.prediction.

Value
Returns with one list component of .smart.prediction from smartpredenv, in fact, .smart.prediction[[.smart.predi
The whole procedure mimics a first-in first-out stack (better known as a queue).

Side Effects
The variable .smart.prediction.counter in smartpredenv is incremented beforehand, and then
written back to smartpredenv.

get.smart.prediction Retrieves .smart.prediction

Description
Retrieves .smart.prediction from smartpredenv.

Usage
get.smart.prediction()

Details
A smart modelling function such as lm allows smart functions such as sm.bs to write to a data struc-
ture called .smart.prediction in smartpredenv. At the end of fitting, get.smart.prediction
retrieves this data structure. It is then attached to the object, and used for prediction later.
312 gev

Value
Returns with the list .smart.prediction from smartpredenv.

gev Generalized Extreme Value Distribution Family Function

Description
Maximum likelihood estimation of the 3-parameter generalized extreme value (GEV) distribution.

Usage
gev(llocation = "identitylink", lscale = "loge",
lshape = logoff(offset = 0.5), percentiles = c(95, 99),
ilocation = NULL, iscale = NULL, ishape = NULL, imethod = 1,
gprobs.y = (1:9)/10, gscale.mux = exp((-5:5)/6),
gshape = (-5:5) / 11 + 0.01,
iprobs.y = NULL, tolshape0 = 0.001,
type.fitted = c("percentiles", "mean"),
zero = c("scale", "shape"))
gevff(llocation = "identitylink", lscale = "loge",
lshape = logoff(offset = 0.5), percentiles = c(95, 99),
ilocation = NULL, iscale = NULL, ishape = NULL, imethod = 1,
gprobs.y = (1:9)/10, gscale.mux = exp((-5:5)/6),
gshape = (-5:5) / 11 + 0.01,
iprobs.y = NULL, tolshape0 = 0.001,
type.fitted = c("percentiles", "mean"), zero = c("scale", "shape"))

Arguments
llocation, lscale, lshape
Parameter link functions for , and respectively. See Links for more choices.
For the shape parameter, the default logoff link has an offset called A below;
and then the linear/additive predictor is log( + A) which means that > A.
For technical reasons (see Details) it is a good idea for A = 0.5.
gev 313

percentiles Numeric vector of percentiles used for the fitted values. Values should be be-
tween 0 and 100. This argument is ignored if type.fitted = "mean".
type.fitted See CommonVGAMffArguments for information. The default is to use the percentiles
argument. If "mean" is chosen, then the mean + ((1 ) 1)/ is returned
as the fitted values, and these are only defined for < 1.
ilocation, iscale, ishape
Numeric. Initial value for the location parameter, and . A NULL means a
value is computed internally. The argument ishape is more important than the
other two. If a failure to converge occurs, or even to obtain initial values occurs,
try assigning ishape some value (positive or negative; the sign can be very
important). Also, in general, a larger value of iscale tends to be better than a
smaller value.
imethod Initialization method. Either the value 1 or 2. If both methods fail then try using
ishape. See CommonVGAMffArguments for information.
gshape Numeric vector. The values are used for a grid search for an initial value for .
See CommonVGAMffArguments for information.
gprobs.y, gscale.mux, iprobs.y
Numeric vectors, used for the initial values. See CommonVGAMffArguments for
information.
tolshape0 Passed into dgev when computing the log-likelihood.
zero A specifying which linear/additive predictors are modelled as intercepts only.
The values can be from the set {1,2,3} corresponding respectively to , , . If
zero = NULL then all linear/additive predictors are modelled as a linear com-
bination of the explanatory variables. For many data sets having zero = 3 is a
good idea. See CommonVGAMffArguments for information.

Details
The GEV distribution function can be written
1/
G(y) = exp([(y )/]+ )
where > 0, < < , and 1 + (y )/ > 0. Here, x+ = max(x, 0). The , ,
are known as the location, scale and shape parameters respectively. The cases > 0, < 0,
= 0 correspond to the Frechet, Weibull, and Gumbel types respectively. It can be noted that
the Gumbel (or Type I) distribution accommodates many commonly-used distributions such as the
normal, lognormal, logistic, gamma, exponential and Weibull.
For the GEV distribution, the kth moment about the mean exists if < 1/k. Provided they exist,
the mean and variance are given by + {(1 ) 1}/ and 2 {(1 2) 2 (1 )}/ 2
respectively, where is the gamma function.
Smith (1985) established that when > 0.5, the maximum likelihood estimators are completely
regular. To have some control over the estimated try using lshape = logoff(offset = 0.5),
say, or lshape = extlogit(min = -0.5, max = 0.5), say.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, and vgam.
314 gev

Warning

Currently, if an estimate of is too close to 0 then an error may occur for gev() with multivariate
responses. In general, gevff() is more reliable than gev().
Fitting the GEV by maximum likelihood estimation can be numerically fraught. If 1+(y)/
0 then some crude evasive action is taken but the estimation process can still fail. This is particularly
the case if vgam with s is used; then smoothing is best done with vglm with regression splines (bs
or ns) because vglm implements half-stepsizing whereas vgam doesnt (half-stepsizing helps handle
the problem of straying outside the parameter space.)

Note

The VGAM family function gev can handle a multivariate (matrix) response, cf. multiple responses.
If so, each row of the matrix is sorted into descending order and NAs are put last. With a vector or
one-column matrix response using gevff will give the same result but be faster and it handles the
= 0 case. The function gev implements Tawn (1988) while gevff implements Prescott and
Walden (1980).
Function egev() has been replaced by the new family function gevff(). It now conforms to the
usual VGAM philosophy of having M1 linear predictors per (independent) response. This is the
usual way multiple responses are handled. Hence vglm(cbind(y1, y2)..., gevff, ...) will
have 6 linear predictors and it is possible to constrain the linear predictors so that the answer is
similar to gev(). Missing values in the response of gevff() will be deleted; this behaviour is the
same as with almost every other VGAM family function.
The shape parameter is difficult to estimate accurately unless there is a lot of data. Convergence
is slow when is near 0.5. Given many explanatory variables, it is often a good idea to make
sure zero = 3. The range restrictions of the parameter are not enforced; thus it is possible for a
violation to occur.
Successful convergence often depends on having a reasonably good initial value for . If failure
occurs try various values for the argument ishape, and if there are covariates, having zero = 3 is
advised.

Author(s)

T. W. Yee

References

Yee, T. W. and Stephenson, A. G. (2007) Vector generalized linear and additive extreme value
models. Extremes, 10, 119.
Tawn, J. A. (1988) An extreme-value theory model for dependent observations. Journal of Hydrol-
ogy, 101, 227250.
Prescott, P. and Walden, A. T. (1980) Maximum likelihood estimation of the parameters of the
generalized extreme-value distribution. Biometrika, 67, 723724.
Smith, R. L. (1985) Maximum likelihood estimation in a class of nonregular cases. Biometrika, 72,
6790.
gevUC 315

See Also
rgev, gumbel, gumbelff, guplot, rlplot.gevff, gpd, weibullR, frechet, extlogit, oxtemp,
venice, CommonVGAMffArguments.

Examples
## Not run:
# Multivariate example
fit1 <- vgam(cbind(r1, r2) ~ s(year, df = 3), gev(zero = 2:3),
data = venice, trace = TRUE)
coef(fit1, matrix = TRUE)
head(fitted(fit1))
par(mfrow = c(1, 2), las = 1)
plot(fit1, se = TRUE, lcol = "blue", scol = "forestgreen",
main = "Fitted mu(year) function (centered)", cex.main = 0.8)
with(venice, matplot(year, depvar(fit1)[, 1:2], ylab = "Sea level (cm)",
col = 1:2, main = "Highest 2 annual sea levels", cex.main = 0.8))
with(venice, lines(year, fitted(fit1)[,1], lty = "dashed", col = "blue"))
legend("topleft", lty = "dashed", col = "blue", "Fitted 95 percentile")

# Univariate example
(fit <- vglm(maxtemp ~ 1, gevff, data = oxtemp, trace = TRUE))
head(fitted(fit))
coef(fit, matrix = TRUE)
Coef(fit)
vcov(fit)
vcov(fit, untransform = TRUE)
sqrt(diag(vcov(fit))) # Approximate standard errors
rlplot(fit)

## End(Not run)

gevUC The Generalized Extreme Value Distribution

Description
Density, distribution function, quantile function and random generation for the generalized extreme
value distribution (GEV) with location parameter location, scale parameter scale and shape pa-
rameter shape.

Usage
dgev(x, location = 0, scale = 1, shape = 0, log = FALSE,
tolshape0 = sqrt(.Machine$double.eps))
pgev(q, location = 0, scale = 1, shape = 0, lower.tail = TRUE, log.p = FALSE)
qgev(p, location = 0, scale = 1, shape = 0, lower.tail = TRUE, log.p = FALSE)
rgev(n, location = 0, scale = 1, shape = 0)
316 gevUC

Arguments

x, q vector of quantiles.
p vector of probabilities.
n number of observations. If length(n) > 1 then the length is taken to be the
number required.
location the location parameter .
scale the (positive) scale parameter . Must consist of positive values.
shape the shape parameter .
log Logical. If log = TRUE then the logarithm of the density is returned.
lower.tail, log.p
Same meaning as in punif or qunif.
tolshape0 Positive numeric. Threshold/tolerance value for resting whether is zero. If the
absolute value of the estimate of is less than this value then it will be assumed
zero and a Gumbel distribution will be used.

Details

See gev, the VGAM family function for estimating the 3 parameters by maximum likelihood esti-
mation, for formulae and other details. Apart from n, all the above arguments may be vectors and
are recyled to the appropriate length if necessary.

Value

dgev gives the density, pgev gives the distribution function, qgev gives the quantile function, and
rgev generates random deviates.

Note

The default value of = 0 means the default distribution is the Gumbel.

Currently, these functions have different argument names compared with those in the evd package.

Author(s)

T. W. Yee

References

Coles, S. (2001) An Introduction to Statistical Modeling of Extreme Values. London: Springer-

Verlag.

See Also

gev, gevff, vglm.control.

gew 317

Examples
loc <- 2; sigma <- 1; xi <- -0.4
pgev(qgev(seq(0.05, 0.95, by = 0.05), loc, sigma, xi), loc, sigma, xi)
## Not run: x <- seq(loc - 3, loc + 3, by = 0.01)
plot(x, dgev(x, loc, sigma, xi), type = "l", col = "blue", ylim = c(0,1),
main = "Blue is density, orange is cumulative distribution function",
sub = "Purple are 5,10,...,95 percentiles", ylab = "", las = 1)
abline(h = 0, col = "blue", lty = 2)
lines(qgev(seq(0.05, 0.95, by = 0.05), loc, sigma, xi),
dgev(qgev(seq(0.05, 0.95, by = 0.05), loc, sigma, xi), loc, sigma, xi),
col = "purple", lty = 3, type = "h")
lines(x, pgev(x, loc, sigma, xi), type = "l", col = "orange")
abline(h = 0, lty = 2, col = "gray50")

## End(Not run)

gew General Electric and Westinghouse Data

Description

General Electric and Westinghouse capital data.

Usage

data(gew)

Format

A data frame with 20 observations on the following 7 variables. All variables are numeric vectors.
Variables ending in .g correspond to General Electric and those ending in .w are Westinghouse.

year The observations are the years from 1934 to 1953

invest.g, invest.w investment figures. These are I = Gross investment = additions to plant and
equipment plus maintenance and repairs in millions of dollars deflated by P1 .
capital.g, capital.w capital stocks. These are C = The stock of plant and equipment = accumu-
lated sum of net additions to plant and equipment deflated by P1 minus depreciation allowance
deflated by P3 .
value.g, value.w market values. These are F = Value of the firm = price of common and preferred
shares at December 31 (or average price of December 31 and January 31 of the following
year) times number of common and preferred shares outstanding plus total book value of debt
at December 31 in millions of dollars deflated by P2 .
318 golf

Details
These data are a subset of a table in Boot and de Wit (1960), also known as the Grunfeld data. It is
used a lot in econometrics, e.g., for seemingly unrelated regressions (see SURff).
Here, P1 = Implicit price deflator of producers durable equipment (base 1947), P2 = Implicit price
deflator of G.N.P. (base 1947), P3 = Depreciation expense deflator = ten years moving average of
wholesale price index of metals and metal products (base 1947).

Source
Table 10 of: Boot, J. C. G. and de Wit, G. M. (1960) Investment Demand: An Empirical Contribu-
tion to the Aggregation Problem. International Economic Review, 1, 330.
Grunfeld, Y. (1958) The Determinants of Corporate Investment. Unpublished PhD Thesis (Chicago).

References
Zellner, A. (1962) An efficient method of estimating seemingly unrelated regressions and tests for
aggregation bias. Journal of the American Statistical Association, 57, 348368.

See Also
SURff, http://statmath.wu.ac.at/~zeileis/grunfeld.

Examples
str(gew)

golf Gamma-Ordinal Link Function

Description
Computes the gamma-ordinal transformation, including its inverse and the first two derivatives.

Usage
golf(theta, lambda = 1, cutpoint = NULL,
inverse = FALSE, deriv = 0, short = TRUE, tag = FALSE)

Arguments
theta Numeric or character. See below for further details.
lambda, cutpoint
The former is the shape parameter in gamma2. cutpoint is optional; if NULL then
cutpoint is ignored from the GOLF definition. If given, the cutpoints should
be non-negative integers. If golf() is used as the link function in cumulative
then, if the cutpoints are known, then one should choose reverse = TRUE, parallel = FALSE ~ -1.
If the cutpoints are unknown, then choose reverse = TRUE, parallel = TRUE.
inverse, deriv, short, tag
Details at Links.
golf 319

Details
The gamma-ordinal link function (GOLF) can be applied to a parameter lying in the unit interval.
Its purpose is to link cumulative probabilities associated with an ordinal response coming from an
underlying 2-parameter gamma distribution.
See Links for general information about VGAM link functions.

Value
See Yee (2012) for details.

Warning
Prediction may not work on vglm or vgam etc. objects if this link function is used.

Note
Numerical values of theta too close to 0 or 1 or out of range result in large positive or negative
values, or maybe 0 depending on the arguments. Although measures have been taken to handle
cases where theta is too close to 1 or 0, numerical instabilities may still arise.
In terms of the threshold approach with cumulative probabilities for an ordinal response this link
function corresponds to the gamma distribution (see gamma2) that has been recorded as an ordinal
response using known cutpoints.

Author(s)
Thomas W. Yee

References
Yee, T. W. (2012) Ordinal ordination with normalizing link functions for count data, (in prepara-
tion).

See Also
Links, gamma2, polf, nbolf, cumulative.

Examples
## Not run:
golf("p", lambda = 1, short = FALSE)
golf("p", lambda = 1, tag = TRUE)

p <- seq(0.02, 0.98, len = 201)

y <- golf(p, lambda = 1)
y. <- golf(p, lambda = 1, deriv = 1, inverse = TRUE)
max(abs(golf(y, lambda = 1, inverse = TRUE) - p)) # Should be 0

#\ dontrun{par(mfrow = c(2, 1), las = 1)

#plot(p, y, type = "l", col = "blue", main = "golf()")
#abline(h = 0, v = 0.5, col = "orange", lty = "dashed")
320 Gompertz

#plot(p, y., type = "l", col = "blue",

# main = "(Reciprocal of) first GOLF derivative")
#}

# Another example
gdata <- data.frame(x2 = sort(runif(nn <- 1000)))
gdata <- transform(gdata, x3 = runif(nn))
gdata <- transform(gdata, mymu = exp( 3 + 1 * x2 - 2 * x3))
lambda <- 4
gdata <- transform(gdata,
y1 = rgamma(nn, shape = lambda, scale = mymu / lambda))
cutpoints <- c(-Inf, 10, 20, Inf)
gdata <- transform(gdata, cuty = Cut(y1, breaks = cutpoints))

#\ dontrun{ par(mfrow = c(1, 1), las = 1)

#with(gdata, plot(x2, x3, col = cuty, pch = as.character(cuty))) }
with(gdata, table(cuty) / sum(table(cuty)))
fit <- vglm(cuty ~ x2 + x3, cumulative(multiple.responses = TRUE,
reverse = TRUE, parallel = FALSE ~ -1,
link = golf(cutpoint = cutpoints[2:3], lambda = lambda)),
data = gdata, trace = TRUE)
head(depvar(fit))
head(fitted(fit))
head(predict(fit))
coef(fit)
coef(fit, matrix = TRUE)
constraints(fit)
fit@misc

## End(Not run)

Gompertz The Gompertz Distribution

Description
Density, cumulative distribution function, quantile function and random generation for the Gom-
pertz distribution.

Usage
dgompertz(x, scale = 1, shape, log = FALSE)
pgompertz(q, scale = 1, shape, lower.tail = TRUE, log.p = FALSE)
qgompertz(p, scale = 1, shape, lower.tail = TRUE, log.p = FALSE)
rgompertz(n, scale = 1, shape)

Arguments
x, q vector of quantiles.
Gompertz 321

p vector of probabilities.
n number of observations. Same as in runif.
log Logical. If log = TRUE then the logarithm of the density is returned.
lower.tail, log.p
Same meaning as in pnorm or qnorm.
scale, shape positive scale and shape parameters.

Details

See gompertz for details.

Value

dgompertz gives the density, pgompertz gives the cumulative distribution function, qgompertz
gives the quantile function, and rgompertz generates random deviates.

Author(s)

T. W. Yee and Kai Huang

See Also

gompertz, dgumbel, dmakeham.

Examples
probs <- seq(0.01, 0.99, by = 0.01)
Shape <- exp(1); Scale <- exp(1)
max(abs(pgompertz(qgompertz(p = probs, Scale, shape = Shape),
Scale, shape = Shape) - probs)) # Should be 0

## Not run: x <- seq(-0.1, 1.0, by = 0.001)

plot(x, dgompertz(x, Scale,shape = Shape), type = "l", col = "blue", las = 1,
main = "Blue is density, orange is cumulative distribution function",
sub = "Purple lines are the 10,20,...,90 percentiles",
ylab = "")
abline(h = 0, col = "blue", lty = 2)
lines(x, pgompertz(x, Scale, shape = Shape), col = "orange")
probs <- seq(0.1, 0.9, by = 0.1)
Q <- qgompertz(probs, Scale, shape = Shape)
lines(Q, dgompertz(Q, Scale, shape = Shape), col = "purple",
lty = 3, type = "h")
pgompertz(Q, Scale, shape = Shape) - probs # Should be all zero
abline(h = probs, col = "purple", lty = 3)
## End(Not run)
322 gompertz

gompertz Gompertz Distribution Family Function

Description
Maximum likelihood estimation of the 2-parameter Gompertz distribution.

Usage
gompertz(lscale = "loge", lshape = "loge",
iscale = NULL, ishape = NULL,
nsimEIM = 500, zero = NULL, nowarning = FALSE)

Arguments
nowarning Logical. Suppress a warning? Ignored for VGAM 0.9-7 and higher.
lshape, lscale Parameter link functions applied to the shape parameter a, scale parameter scale.
All parameters are positive. See Links for more choices.
ishape, iscale Optional initial values. A NULL means a value is computed internally.
nsimEIM, zero See CommonVGAMffArguments.

Details
The Gompertz distribution has a cumulative distribution function

F (x; , ) = 1 exp[(/) (exp(x) 1)]

which leads to a probability density function

f (x; , ) = exp(x) exp[(/) (exp(x) 1)]

for > 0, > 0, x > 0. Here, is called the scale parameter scale, and is called the shape
parameter (one could refer to as a location parameter and as a shape parametersee Lenart
(2012)). The mean is involves an exponential integral function. Simulated Fisher scoring is used
and multiple responses are handled.
The Makeham distibution has an additional parameter compared to the Gompertz distribution. If
X is defined to be the result of sampling from a Gumbel distribution until a negative value Z is
produced, then X = Z has a Gompertz distribution.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, and vgam.

Warning
The same warnings in makeham apply here too.
gpd 323

Author(s)
T. W. Yee

References
Lenart, A. (2012) The moments of the Gompertz distribution and maximum likelihood estimation
of its parameters. Scandinavian Actuarial Journal, in press.

See Also
dgompertz, makeham, simulate.vlm.

Examples
## Not run:
gdata <- data.frame(x2 = runif(nn <- 1000))
gdata <- transform(gdata, eta1 = -1,
eta2 = -1 + 0.2 * x2,
ceta1 = 1,
ceta2 = -1 + 0.2 * x2)
gdata <- transform(gdata, shape1 = exp(eta1),
shape2 = exp(eta2),
scale1 = exp(ceta1),
scale2 = exp(ceta2))
gdata <- transform(gdata, y1 = rgompertz(nn, scale = scale1, shape = shape1),
y2 = rgompertz(nn, scale = scale2, shape = shape2))

fit1 <- vglm(y1 ~ 1, gompertz, data = gdata, trace = TRUE)

fit2 <- vglm(y2 ~ x2, gompertz, data = gdata, trace = TRUE)
coef(fit1, matrix = TRUE)
Coef(fit1)
summary(fit1)
coef(fit2, matrix = TRUE)
summary(fit2)

## End(Not run)

gpd Generalized Pareto Distribution Family Function

Description
Maximum likelihood estimation of the 2-parameter generalized Pareto distribution (GPD).

Usage
gpd(threshold = 0, lscale = "loge", lshape = logoff(offset = 0.5),
percentiles = c(90, 95), iscale = NULL, ishape = NULL,
tolshape0 = 0.001, type.fitted = c("percentiles", "mean"),
imethod = 1, zero = "shape")
324 gpd

Arguments
threshold Numeric, values are recycled if necessary. The threshold value(s), called be-
low.
lscale Parameter link function for the scale parameter . See Links for more choices.
lshape Parameter link function for the shape parameter . See Links for more choices.
The default constrains the parameter to be greater than 0.5 because if
0.5 then Fisher scoring does not work. See the Details section below for more
information.
For the shape parameter, the default logoff link has an offset called A below;
and then the second linear/additive predictor is log( + A) which means that
> A. The working weight matrices are positive definite if A = 0.5.
percentiles Numeric vector of percentiles used for the fitted values. Values should be be-
tween 0 and 100. See the example below for illustration. This argument is
ignored if type.fitted = "mean".
type.fitted See CommonVGAMffArguments for information. The default is to use the percentiles
argument. If "mean" is chosen, then the mean + /(1 ) is returned as the
fitted values, and these are only defined for < 1.
iscale, ishape Numeric. Optional initial values for and . The default is to use imethod
and compute a value internally for each parameter. Values of ishape should be
between 0.5 and 1. Values of iscale should be positive.
tolshape0 Passed into dgpd when computing the log-likelihood.
imethod Method of initialization, either 1 or 2. The first is the method of moments, and
the second is a variant of this. If neither work, try assigning values to arguments
ishape and/or iscale.
zero Can be an integer-valued vector specifying which linear/additive predictors are
modelled as intercepts only. For one response, the value should be from the set
{1,2} corresponding respectively to and . It is often a good idea for the
parameter only to be modelled through a linear combination of the explanatory
variables because the shape parameter is probably best left as an intercept only:
zero = 2. Setting zero = NULL means both parameters are modelled with
explanatory variables. See CommonVGAMffArguments for more details.

Details
The distribution function of the GPD can be written
1/
G(y) = 1 [1 + (y )/]+

where is the location parameter (known, with value threshold), > 0 is the scale parameter,
is the shape parameter, and h+ = max(h, 0). The function 1 G is known as the survivor function.
The limit 0 gives the shifted exponential as a special case:

G(y) = 1 exp[(y )/].

The support is y > for > 0, and < y < / for < 0.
gpd 325

Smith (1985) showed that if <= 0.5 then this is known as the nonregular case and prob-
lems/difficulties can arise both theoretically and numerically. For the (regular) case > 0.5 the
classical asymptotic theory of maximum likelihood estimators is applicable; this is the default.
Although for < 0.5 the usual asymptotic properties do not apply, the maximum likelihood
estimator generally exists and is superefficient for 1 < < 0.5, so it is better than normal.
When < 1 the maximum likelihood estimator generally does not exist as it effectively becomes
a two parameter problem.
The mean of Y does not exist unless < 1, and the variance does not exist unless < 0.5. So if
you want to fit a model with finite variance use lshape = "extlogit".

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm and vgam. However, for this VGAM family function, vglm is probably preferred over vgam
when there is smoothing.

Warning
Fitting the GPD by maximum likelihood estimation can be numerically fraught. If 1+(y )/
0 then some crude evasive action is taken but the estimation process can still fail. This is particularly
the case if vgam with s is used. Then smoothing is best done with vglm with regression splines (bs
or ns) because vglm implements half-stepsizing whereas vgam doesnt. Half-stepsizing helps handle
the problem of straying outside the parameter space.

Note
The response in the formula of vglm and vgam is y. Internally, y is computed. This VGAM
family function can handle a multiple responses, which is inputted as a matrix. The response stored
on the object is the original uncentred data.
With functions rgpd, dgpd, etc., the argument location matches with the argument threshold
here.

Author(s)
T. W. Yee

References
Yee, T. W. and Stephenson, A. G. (2007) Vector generalized linear and additive extreme value
models. Extremes, 10, 119.
Coles, S. (2001) An Introduction to Statistical Modeling of Extreme Values. London: Springer-
Verlag.
Smith, R. L. (1985) Maximum likelihood estimation in a class of nonregular cases. Biometrika, 72,
6790.

See Also
rgpd, meplot, gev, paretoff, vglm, vgam, s.
326 gpdUC

Examples

# Simulated data from an exponential distribution (xi = 0)

Threshold <- 0.5
gdata <- data.frame(y1 = Threshold + rexp(n = 3000, rate = 2))
fit <- vglm(y1 ~ 1, gpd(threshold = Threshold), data = gdata, trace = TRUE)
head(fitted(fit))
summary(depvar(fit)) # The original uncentred data
coef(fit, matrix = TRUE) # xi should be close to 0
Coef(fit)
summary(fit)

head(fit@extra$threshold) # Note the threshold is stored here

# Check the 90 percentile

ii <- depvar(fit) < fitted(fit)[1, "90%"]
100 * table(ii) / sum(table(ii)) # Should be 90%

# Check the 95 percentile

ii <- depvar(fit) < fitted(fit)[1, "95%"]
100 * table(ii) / sum(table(ii)) # Should be 95%

## Not run: plot(depvar(fit), col = "blue", las = 1,

main = "Fitted 90% and 95% quantiles")
matlines(1:length(depvar(fit)), fitted(fit), lty = 2:3, lwd = 2)
## End(Not run)

# Another example
gdata <- data.frame(x2 = runif(nn <- 2000))
Threshold <- 0; xi <- exp(-0.8) - 0.5
gdata <- transform(gdata, y2 = rgpd(nn, scale = exp(1 + 0.1*x2), shape = xi))
fit <- vglm(y2 ~ x2, gpd(Threshold), data = gdata, trace = TRUE)
coef(fit, matrix = TRUE)

## Not run: # Nonparametric fits

# Not so recommended:
fit1 <- vgam(y2 ~ s(x2), gpd(Threshold), data = gdata, trace = TRUE)
par(mfrow = c(2, 1))
plot(fit1, se = TRUE, scol = "blue")
# More recommended:
fit2 <- vglm(y2 ~ sm.bs(x2), gpd(Threshold), data = gdata, trace = TRUE)
plot(as(fit2, "vgam"), se = TRUE, scol = "blue")
## End(Not run)

gpdUC The Generalized Pareto Distribution

gpdUC 327

Description
Density, distribution function, quantile function and random generation for the generalized Pareto
distribution (GPD) with location parameter location, scale parameter scale and shape parameter
shape.

Usage
dgpd(x, location = 0, scale = 1, shape = 0, log = FALSE,
tolshape0 = sqrt(.Machine$double.eps))
pgpd(q, location = 0, scale = 1, shape = 0,
lower.tail = TRUE, log.p = FALSE)
qgpd(p, location = 0, scale = 1, shape = 0,
lower.tail = TRUE, log.p = FALSE)
rgpd(n, location = 0, scale = 1, shape = 0)

Arguments
x, q vector of quantiles.
p vector of probabilities.
n number of observations. If length(n) > 1 then the length is taken to be the
number required.
location the location parameter .
scale the (positive) scale parameter .
shape the shape parameter .
log Logical. If log = TRUE then the logarithm of the density is returned.
lower.tail, log.p
Same meaning as in punif or qunif.
tolshape0 Positive numeric. Threshold/tolerance value for resting whether is zero. If the
absolute value of the estimate of is less than this value then it will be assumed
zero and an exponential distribution will be used.

Details
See gpd, the VGAM family function for estimating the two parameters by maximum likelihood
estimation, for formulae and other details. Apart from n, all the above arguments may be vectors
and are recyled to the appropriate length if necessary.

Value
dgpd gives the density, pgpd gives the distribution function, qgpd gives the quantile function, and
rgpd generates random deviates.

Note
The default values of all three parameters, especially = 0, means the default distribution is the
exponential.
Currently, these functions have different argument names compared with those in the evd package.
328 grain.us

Author(s)
T. W. Yee and Kai Huang

References
Coles, S. (2001) An Introduction to Statistical Modeling of Extreme Values. London: Springer-
Verlag.

pgpd(qgpd(seq(0.05, 0.95, by = 0.05), loc, sigma, xi), loc, sigma, xi)

## End(Not run)

grain.us Grain Prices Data in USA

Description
A 4-column matrix.

Usage
data(grain.us)

Format
The columns are:
wheat.flour numeric
corn numeric
wheat numeric
rye numeric
grc 329

Details
Monthly averages of grain prices in the United States for wheat flour, corn, wheat, and rye for the
period January 1961 through October 1972. The units are US dollars per 100 pound sack for wheat
flour, and per bushel for corn, wheat and rye.

Source
Ahn and Reinsel (1988).

References
Ahn, S. K and Reinsel, G. C. (1988) Nested reduced-rank autoregressive models for multiple time
series. Journal of the American Statistical Association, 83, 849856.

Examples
## Not run:
cgrain <- scale(grain.us, scale = FALSE) # Center the time series only
fit <- vglm(cgrain ~ 1, rrar(Rank = c(4, 1)),
epsilon = 1e-3, stepsize = 0.5, trace = TRUE, maxit = 50)
summary(fit)

## End(Not run)

grc Row-Column Interaction Models including Goodmans RC Associa-

tion Model and Unconstrained Quadratic Ordination

Description
Fits a Goodmans RC association model (GRC) to a matrix of counts, and more generally, row-
column interaction models (RCIMs). RCIMs allow for unconstrained quadratic ordination (UQO).

Usage
grc(y, Rank = 1, Index.corner = 2:(1 + Rank),
str0 = 1, summary.arg = FALSE, h.step = 1e-04, ...)
rcim(y, family = poissonff, Rank = 0, M1 = NULL,
weights = NULL, which.linpred = 1,
Index.corner = ifelse(is.null(str0), 0, max(str0)) + 1:Rank,
rprefix = "Row.", cprefix = "Col.", iprefix = "X2.",
offset = 0, str0 = if (Rank) 1 else NULL,
summary.arg = FALSE, h.step = 0.0001,
rbaseline = 1, cbaseline = 1,
has.intercept = TRUE, M = NULL,
rindex = 2:nrow(y), cindex = 2:ncol(y), iindex = 2:nrow(y), ...)
330 grc

Arguments

y For grc(): a matrix of counts. For rcim(): a general matrix response depending
on family. Output from table() is acceptable; it is converted into a matrix.
Note that y should be at least 3 by 3 in dimension.
family A VGAM family function. By default, the first linear/additive predictor is fitted
using main effects plus an optional rank-Rank interaction term. Not all family
functions are suitable or make sense. All other linear/additive predictors are fit-
ted using an intercept-only, so it has a common value over all rows and columns.
For example, zipoissonff may be suitable for counts but not zipoisson be-
cause of the ordering of the linear/additive predictors. If the VGAM family
function does not have an infos slot then M1 needs to be inputted (the num-
ber of linear predictors for an ordinary (usually univariate) response, aka M ).
The VGAM family function also needs to be able to handle multiple responses
(currently not all of them can do this).
Rank An integer from the set {0,. . . ,min(nrow(y), ncol(y))}. This is the dimension
of the fit in terms of the interaction. For grc() this argument must be positive. A
value of 0 means no interactions (i.e., main effects only); each row and column
is represented by an indicator variable.
weights Prior weights. Fed into rrvglm or vglm.
which.linpred Single integer. Specifies which linear predictor is modelled as the sum of an
intercept, row effect, column effect plus an optional interaction term. It should
be one value from the set 1:M1.
Index.corner A vector of Rank integers. These are used to store the Rank by Rank identity
matrix in the A matrix; corner constraints are used.
rprefix, cprefix, iprefix
Character, for rows and columns and interactions respectively. For labelling the
indicator variables.
offset Numeric. Either a matrix of the right dimension, else a single numeric expanded
into such a matrix.
str0 Ignored if Rank = 0, else an integer from the set {1,. . . ,min(nrow(y), ncol(y))},
specifying the row that is used as the structural zero. Passed into rrvglm.control
if Rank > 0. Set str0 = NULL for none.
summary.arg Logical. If TRUE then a summary is returned. If TRUE then y may be the output
(fitted object) of grc().
h.step A small positive value that is passed into summary.rrvglm(). Only used when
summary.arg = TRUE.
... Arguments that are passed into rrvglm.control().
M1 The number of linear predictors of the VGAM family function for an ordi-
nary (univariate) response. Then the number of linear predictors of the rcim()
fit is usually the number of columns of y multiplied by M1. The default is to
evaluate the infos slot of the VGAM family function to try to evaluate it; see
vglmff-class. If this information is not yet supplied by the family function
then the value needs to be inputted manually using this argument.
grc 331

rbaseline, cbaseline
Baseline reference levels for the rows and columns. Currently stored on the
object but not used.
has.intercept Logical. Include an intercept?
M, cindex M is the usual VGAM M , viz. the number of linear/additive predictors in total.
Also, cindex means column index, and these point to the columns of y which
are part of the vector of linear/additive predictor main effects.
For family = multinomial it is necessary to input these arguments as M = ncol(y)-1
and cindex = 2:(ncol(y)-1).
rindex, iindex rindex means row index, and these are similar to cindex. iindex means inter-
action index, and these are similar to cindex.

Details
Goodmans RC association model fits a reduced-rank approximation to a table of counts. A Poisson
model is assumed. The log of each cell mean is decomposed as an intercept plus a row effect plus
a column effect plus a reduced-rank component. The latter can be collectively written A %*% t(C),
the product of two thin matrices. Indeed, A and C have Rank columns. By default, the first column
and row of the interaction matrix A %*% t(C) is chosen to be structural zeros, because str0 = 1.
This means the first row of A are all zeros.
This function uses options()$contrasts to set up the row and column indicator variables. In
particular, Equation (4.5) of Yee and Hastie (2003) is used. These are called Row. and Col. (by
default) followed by the row or column number.
The function rcim() is more general than grc(). Its default is a no-interaction model of grc(),
i.e., rank-0 and a Poisson distribution. This means that each row and column has a dummy variable
associated with it. The first row and first column are baseline. The power of rcim() is that many
VGAM family functions can be assigned to its family argument. For example, uninormal fits
something in between a 2-way ANOVA with and without interactions, alaplace2 with Rank = 0
is something like medpolish. Others include zipoissonff and negbinomial. Hopefully one day
all VGAM family functions will work when assigned to the family argument, although the result
may not have meaning.
Unconstrained quadratic ordination (UQO) can be performed using rcim() and grc(). This has
been called unconstrained Gaussian ordination in the literature, however the word Gaussian has
two meanings which is confusing; it is better to use quadratic because the bell-shape response
surface is meant. UQO is similar to CQO (cqo) except there are no environmental/explanatory
variables. Here, a GLM is fitted to each column (species) that is a quadratic function of hypothet-
ical latent variables or gradients. Thus each row of the response has an associated site score, and
each column of the response has an associated optimum and tolerance matrix. UQO can be per-
formed here under the assumption that all species have the same tolerance matrices. See Yee and
Hadi (2014) for details. It is not recommended that presence/absence data be inputted because the
information content is so low for each site-species cell. The example below uses Poisson counts.

Value
An object of class "grc", which currently is the same as an "rrvglm" object. Currently, a rank-0
rcim() object is of class rcim0-class, else of class "rcim" (this may change in the future).
332 grc

Warning
The function rcim() is experimental at this stage and may have bugs. Quite a lot of expertise is
needed when fitting and in its interpretion thereof. For example, the constraint matrices applies
the reduced-rank regression to the first (see which.linpred) linear predictor and the other linear
predictors are intercept-only and have a common value throughout the entire data set. This means
that, by default, family = zipoissonff is appropriate but not family = zipoisson. Else set
family = zipoisson and which.linpred = 2. To understand what is going on, do examine the
constraint matrices of the fitted object, and reconcile this with Equations (4.3) to (4.5) of Yee and
Hastie (2003).
The functions temporarily create a permanent data frame called .grc.df or .rcim.df, which used
to be needed by summary.rrvglm(). Then these data frames are deleted before exiting the function.
If an error occurs then the data frames may be present in the workspace.

Note
These functions set up the indicator variables etc. before calling rrvglm or vglm. The ... is passed
into rrvglm.control or vglm.control, This means, e.g., Rank = 1 is default for grc().
The data should be labelled with rownames and colnames. Setting trace = TRUE is recommended
to monitor convergence. Using criterion = "coefficients" can result in slow convergence.
If summary = TRUE then y can be a "grc" object, in which case a summary can be returned. That
is, grc(y, summary = TRUE) is equivalent to summary(grc(y)). It is not possible to plot a
grc(y, summary = TRUE) or rcim(y, summary = TRUE) object.

Author(s)
Thomas W. Yee, with assistance from Alfian F. Hadi.

References
Yee, T. W. and Hastie, T. J. (2003) Reduced-rank vector generalized linear models. Statistical
Modelling, 3, 1541.
Yee, T. W. and Hadi, A. F. (2014) Row-column interaction models, with an R implementation.
Computational Statistics, 29, 14271445.
Goodman, L. A. (1981) Association models and canonical correlation in the analysis of cross-
classifications having ordered categories. Journal of the American Statistical Association, 76, 320
334.

See Also
rrvglm, rrvglm.control, rrvglm-class, summary.grc, moffset, Rcim, Select, Qvar, plotrcim0,
cqo, multinomial, alcoff, crashi, auuc, olym08, olym12, poissonff, medpolish.

Examples
# Example 1: Undergraduate enrolments at Auckland University in 1990
fitted(grc1 <- grc(auuc))
summary(grc1)
grc 333

grc2 <- grc(auuc, Rank = 2, Index.corner = c(2, 5))

fitted(grc2)
summary(grc2)

model3 <- rcim(auuc, Rank = 1, fam = multinomial,

M = ncol(auuc)-1, cindex = 2:(ncol(auuc)-1), trace = TRUE)
fitted(model3)
summary(model3)

# Median polish but not 100 percent reliable. Maybe call alaplace2()...
## Not run:
rcim0 <- rcim(auuc, fam = alaplace1(tau = 0.5), trace = FALSE, maxit = 500)
round(fitted(rcim0), digits = 0)
round(100 * (fitted(rcim0) - auuc) / auuc, digits = 0) # Discrepancy
depvar(rcim0)
round(coef(rcim0, matrix = TRUE), digits = 2)
Coef(rcim0, matrix = TRUE)
# constraints(rcim0)
names(constraints(rcim0))

# Compare with medpolish():

(med.a <- medpolish(auuc))
fv <- med.a$overall + outer(med.a$row, med.a$col, "+")
round(100 * (fitted(rcim0) - fv) / fv) # Hopefully should be all 0s

## End(Not run)

# Example 2: 2012 Summer Olympic Games in London

## Not run: top10 <- head(olym12, 10)
grc1.oly12 <- with(top10, grc(cbind(gold, silver, bronze)))
round(fitted(grc1.oly12))
round(resid(grc1.oly12, type = "response"), digits = 1) # Response residuals
summary(grc1.oly12)
Coef(grc1.oly12)

## End(Not run)

# Example 3: Unconstrained quadratic ordination (UQO); see Yee and Hadi (2014)
## Not run:
n <- 100; p <- 5; S <- 10
pdata <- rcqo(n, p, S, es.opt = FALSE, eq.max = FALSE,
eq.tol = TRUE, sd.latvar = 0.75) # Poisson counts
true.nu <- attr(pdata, "latvar") # The 'truth'; site scores
attr(pdata, "tolerances") # The 'truth'; tolerances

Y <- Select(pdata, "y", sort = FALSE) # Y matrix (n x S); the "y" vars
uqo.rcim1 <- rcim(Y, Rank = 1,
str0 = NULL, # Delta covers entire n x M matrix
iindex = 1:nrow(Y), # RRR covers the entire Y
has.intercept = FALSE) # Suppress the intercept
334 gumbel

# Plot 1
par(mfrow = c(2, 2))
plot(attr(pdata, "optimums"), Coef(uqo.rcim1)@A,
col = "blue", type = "p", main = "(a) UQO optimums",
xlab = "True optimums", ylab = "Estimated (UQO) optimums")
mylm <- lm(Coef(uqo.rcim1)@A ~ attr(pdata, "optimums"))
abline(coef = coef(mylm), col = "orange", lty = "dashed")

# Plot 2
fill.val <- NULL # Choose this for the new parameterization
plot(attr(pdata, "latvar"), c(fill.val, concoef(uqo.rcim1)),
las = 1, col = "blue", type = "p", main = "(b) UQO site scores",
xlab = "True site scores", ylab = "Estimated (UQO) site scores" )
mylm <- lm(c(fill.val, concoef(uqo.rcim1)) ~ attr(pdata, "latvar"))
abline(coef = coef(mylm), col = "orange", lty = "dashed")

# Plots 3 and 4
myform <- attr(pdata, "formula")
p1ut <- cqo(myform, family = poissonff,
eq.tol = FALSE, trace = FALSE, data = pdata)
c1ut <- cqo(Select(pdata, "y", sort = FALSE) ~ scale(latvar(uqo.rcim1)),
family = poissonff, eq.tol = FALSE, trace = FALSE, data = pdata)
lvplot(p1ut, lcol = 1:S, y = TRUE, pcol = 1:S, pch = 1:S, pcex = 0.5,
main = "(c) CQO fitted to the original data",
xlab = "Estimated (CQO) site scores")
lvplot(c1ut, lcol = 1:S, y = TRUE, pcol = 1:S, pch = 1:S, pcex = 0.5,
main = "(d) CQO fitted to the scaled UQO site scores",
xlab = "Estimated (UQO) site scores")

## End(Not run)

gumbel Gumbel Distribution Family Function

Description

Maximum likelihood estimation of the 2-parameter Gumbel distribution.

Usage

gumbel(llocation = "identitylink", lscale = "loge",

iscale = NULL, R = NA, percentiles = c(95, 99),
mpv = FALSE, zero = NULL)
gumbelff(llocation = "identitylink", lscale = "loge",
iscale = NULL, R = NA, percentiles = c(95, 99),
zero = "scale", mpv = FALSE)
gumbel 335

Arguments
llocation, lscale
Parameter link functions for and . See Links for more choices.
iscale Numeric and positive. Optional initial value for . Recycled to the appropriate
length. In general, a larger value is better than a smaller value. A NULL means
an initial value is computed internally.
R Numeric. Maximum number of values possible. See Details for more details.
percentiles Numeric vector of percentiles used for the fitted values. Values should be be-
tween 0 and 100. This argument uses the argument R if assigned. If percentiles = NULL
then the mean will be returned as the fitted values.
mpv Logical. If mpv = TRUE then the median predicted value (MPV) is computed
and returned as the (last) column of the fitted values. This argument is ignored
if percentiles = NULL. See Details for more details.
zero A vector specifying which linear/additive predictors are modelled as intercepts
only. The value (possibly values) can be from the set {1, 2} corresponding re-
spectively to and . By default all linear/additive predictors are modelled as a
linear combination of the explanatory variables. See CommonVGAMffArguments
for more details.

Details
The Gumbel distribution is a generalized extreme value (GEV) distribution with shape parameter
= 0. Consequently it is more easily estimated than the GEV. See gev for more details.
The quantity R is the maximum number of observations possible, for example, in the Venice data
below, the top 10 daily values are recorded for each year, therefore R = 365 because there are about
365 days per year. The MPV is the value of the response such that the probability of obtaining a
value greater than the MPV is 0.5 out of R observations. For the Venice data, the MPV is the sea
level such that there is an even chance that the highest level for a particular year exceeds the MPV.
If mpv = TRUE then the column labelled "MPV" contains the MPVs when fitted() is applied to the
fitted object.
The formula for the mean of a response Y is + Euler where Euler is a constant that
has value approximately equal to 0.5772. The formula for the percentiles are (if R is not given)
log[ log(P/100)] where P is the percentile argument value(s). If R is given then the
percentiles are log[R(1 P/100)].

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, and vgam.

Warning
When R is not given (the default) the fitted percentiles are that of the data, and not of the overall
population. For example, in the example below, the 50 percentile is approximately the running
median through the data, however, the data are the highest sea level measurements recorded each
year (it therefore equates to the median predicted value or MPV).
336 gumbel

Note
Like many other usual VGAM family functions, gumbelff() handles (independent) multiple re-
sponses.
gumbel() can handle more of a multivariate response, i.e., a matrix with more than one column.
Each row of the matrix is sorted into descending order. Missing values in the response are allowed
but require na.action = na.pass. The response matrix needs to be padded with any missing
values. With a multivariate response one has a matrix y, say, where y[, 2] contains the second
order statistics, etc.

Author(s)
T. W. Yee

References
Yee, T. W. and Stephenson, A. G. (2007) Vector generalized linear and additive extreme value
models. Extremes, 10, 119.
Smith, R. L. (1986) Extreme value theory based on the r largest annual events. Journal of Hydrol-
ogy, 86, 2743.
Rosen, O. and Cohen, A. (1996) Extreme percentile regression. In: Haerdle, W. and Schimek, M. G.
(eds.), Statistical Theory and Computational Aspects of Smoothing: Proceedings of the COMPSTAT
94 Satellite Meeting held in Semmering, Austria, 2728 August 1994, pp.200214, Heidelberg:
Physica-Verlag.
Coles, S. (2001) An Introduction to Statistical Modeling of Extreme Values. London: Springer-
Verlag.

See Also
rgumbel, dgumbelII, cens.gumbel, guplot, gev, gevff, venice.

Examples
# Example 1: Simulated data
gdata <- data.frame(y1 = rgumbel(n = 1000, loc = 100, scale = exp(1)))
fit1 <- vglm(y1 ~ 1, gumbelff(perc = NULL), data = gdata, trace = TRUE)
coef(fit1, matrix = TRUE)
Coef(fit1)
head(fitted(fit1))
with(gdata, mean(y1))

# Example 2: Venice data

(fit2 <- vglm(cbind(r1, r2, r3, r4, r5) ~ year, data = venice,
gumbel(R = 365, mpv = TRUE), trace = TRUE))
head(fitted(fit2))
coef(fit2, matrix = TRUE)
sqrt(diag(vcov(summary(fit2)))) # Standard errors

# Example 3: Try a nonparametric fit ---------------------

# Use the entire data set, including missing values
Gumbel-II 337

# Same as as.matrix(venice[, paste0("r", 1:10)]):

Y <- Select(venice, "r", sort = FALSE)
fit3 <- vgam(Y ~ s(year, df = 3), gumbel(R = 365, mpv = TRUE),
data = venice, trace = TRUE, na.action = na.pass)
depvar(fit3)[4:5, ] # NAs used to pad the matrix

## Not run: # Plot the component functions

par(mfrow = c(2, 3), mar = c(6, 4, 1, 2) + 0.3, xpd = TRUE)
plot(fit3, se = TRUE, lcol = "blue", scol = "limegreen", lty = 1,
lwd = 2, slwd = 2, slty = "dashed")

# Quantile plot --- plots all the fitted values

qtplot(fit3, mpv = TRUE, lcol = c(1, 2, 5), tcol = c(1, 2, 5), lwd = 2,
pcol = "blue", tadj = 0.1, ylab = "Sea level (cm)")

# Plot the 99 percentile only

year <- venice[["year"]]
matplot(year, Y, ylab = "Sea level (cm)", type = "n")
matpoints(year, Y, pch = "*", col = "blue")
lines(year, fitted(fit3)[, "99%"], lwd = 2, col = "orange")

# Check the 99 percentiles with a smoothing spline.

# Nb. (1-0.99) * 365 = 3.65 is approx. 4, meaning the 4th order
# statistic is approximately the 99 percentile.
plot(year, Y[, 4], ylab = "Sea level (cm)", type = "n",
main = "Orange is 99 percentile, Green is a smoothing spline")
points(year, Y[, 4], pch = "4", col = "blue")
lines(year, fitted(fit3)[, "99%"], lty = 1, col = "orange")
lines(smooth.spline(year, Y[, 4], df = 4), col = "limegreen", lty = 2)

## End(Not run)

Gumbel-II The Gumbel-II Distribution

Description

Density, cumulative distribution function, quantile function and random generation for the Gumbel-
II distribution.

Usage

dgumbelII(x, scale = 1, shape, log = FALSE)

pgumbelII(q, scale = 1, shape, lower.tail = TRUE, log.p = FALSE)
qgumbelII(p, scale = 1, shape, lower.tail = TRUE, log.p = FALSE)
rgumbelII(n, scale = 1, shape)
338 Gumbel-II

Arguments
x, q vector of quantiles.
p vector of probabilities.
n number of observations. Same as in runif.
log Logical. If log = TRUE then the logarithm of the density is returned.
lower.tail, log.p
Same meaning as in pnorm or qnorm.
shape, scale positive shape and scale parameters.

Details
See gumbelII for details.

Value
dgumbelII gives the density, pgumbelII gives the cumulative distribution function, qgumbelII
gives the quantile function, and rgumbelII generates random deviates.

Author(s)
T. W. Yee and Kai Huang

## Not run: x <- seq(-0.1, 10, by = 0.01);

plot(x, dgumbelII(x, shape = Shape, Scale), type = "l", col = "blue", las = 1,
main = "Blue is density, orange is cumulative distribution function",
sub = "Purple lines are the 10,20,...,90 percentiles",
ylab = "", ylim = 0:1)
abline(h = 0, col = "blue", lty = 2)
lines(x, pgumbelII(x, shape = Shape, Scale), col = "orange")
probs <- seq(0.1, 0.9, by = 0.1)
Q <- qgumbelII(probs, shape = Shape, Scale)
lines(Q, dgumbelII(Q, Scale, Shape), col = "purple", lty = 3, type = "h")
pgumbelII(Q, shape = Shape, Scale) - probs # Should be all zero
abline(h = probs, col = "purple", lty = 3)
## End(Not run)
gumbelII 339

gumbelII Gumbel-II Distribution Family Function

Description
Maximum likelihood estimation of the 2-parameter Gumbel-II distribution.

Usage
gumbelII(lscale = "loge", lshape = "loge", iscale = NULL, ishape = NULL,
probs.y = c(0.2, 0.5, 0.8), perc.out = NULL, imethod = 1,
zero = "shape", nowarning = FALSE)

Arguments
nowarning Logical. Suppress a warning?
lshape, lscale Parameter link functions applied to the (positive) shape parameter (called s
below) and (positive) scale parameter (called b below). See Links for more
choices.
Parameter link functions applied to the
ishape, iscale Optional initial values for the shape and scale parameters.
imethod See weibullR.
zero, probs.y Details at CommonVGAMffArguments.
perc.out If the fitted values are to be quantiles then set this argument to be the percentiles
of these, e.g., 50 for median.

Details
The Gumbel-II density for a response Y is

f (y; b, s) = sy s1 exp[(y/b)s ]/(bs )

for b > 0, s > 0, y > 0. The cumulative distribution function is

F (y; b, s) = exp[(y/b)s ].

The mean of Y is b (1 1/s) (returned as the fitted values) when s > 1, and the variance is
b2 (1 2/s) when s > 2. This distribution looks similar to weibullR, and is due to Gumbel
(1954).
This VGAM family function currently does not handle censored data. Fisher scoring is used to esti-
mate the two parameters. Probably similar regularity conditions hold for this distribution compared
to the Weibull distribution.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, and vgam.
340 gumbelUC

Note

See weibullR. This VGAM family function handles multiple responses.

Author(s)

T. W. Yee

References

Gumbel, E. J. (1954). Statistical theory of extreme values and some practical applications. Applied
Mathematics Series, volume 33, U.S. Department of Commerce, National Bureau of Standards,
USA.

See Also

dgumbelII, gumbel, gev.

Examples
gdata <- data.frame(x2 = runif(nn <- 1000))
gdata <- transform(gdata, heta1 = +1,
heta2 = -1 + 0.1 * x2,
ceta1 = 0,
ceta2 = 1)
gdata <- transform(gdata, shape1 = exp(heta1),
shape2 = exp(heta2),
scale1 = exp(ceta1),
scale2 = exp(ceta2))
gdata <- transform(gdata,
y1 = rgumbelII(nn, scale = scale1, shape = shape1),
y2 = rgumbelII(nn, scale = scale2, shape = shape2))

fit <- vglm(cbind(y1, y2) ~ x2,

gumbelII(zero = c(1, 2, 3)), data = gdata, trace = TRUE)
coef(fit, matrix = TRUE)
vcov(fit)
summary(fit)

gumbelUC The Gumbel Distribution

Description

Density, distribution function, quantile function and random generation for the Gumbel distribution
with location parameter location and scale parameter scale.
gumbelUC 341

Usage
dgumbel(x, location = 0, scale = 1, log = FALSE)
pgumbel(q, location = 0, scale = 1, lower.tail = TRUE, log.p = FALSE)
qgumbel(p, location = 0, scale = 1, lower.tail = TRUE, log.p = FALSE)
rgumbel(n, location = 0, scale = 1)

Arguments
x, q vector of quantiles.
p vector of probabilities.
n number of observations. If length(n) > 1 then the length is taken to be the
number required.
location the location parameter . This is not the mean of the Gumbel distribution (see
Details below).
scale the scale parameter . This is not the standard deviation of the Gumbel distri-
bution (see Details below).
log Logical. If log = TRUE then the logarithm of the density is returned.
lower.tail, log.p
Same meaning as in punif or qunif.

Details
The Gumbel distribution is a special case of the generalized extreme value (GEV) distribution where
the shape parameter = 0. The latter has 3 parameters, so the Gumbel distribution has two. The
Gumbel distribution function is

y
G(y) = exp exp

where < y < , < < and > 0. Its mean is

and its variance is
2 2 /6
where is Eulers constant (which can be obtained as -digamma(1)).
See gumbel, the VGAM family function for estimating the two parameters by maximum likelihood
estimation, for formulae and other details. Apart from n, all the above arguments may be vectors
and are recyled to the appropriate length if necessary.

Value
dgumbel gives the density, pgumbel gives the distribution function, qgumbel gives the quantile
function, and rgumbel generates random deviates.

Note
The VGAM family function gumbel can estimate the parameters of a Gumbel distribution using
maximum likelihood estimation.
342 guplot

Author(s)
T. W. Yee

References
Coles, S. (2001) An Introduction to Statistical Modeling of Extreme Values. London: Springer-
Verlag.

See Also
gumbel, gumbelff, gev, dgompertz.

Examples
mu <- 1; sigma <- 2;
y <- rgumbel(n = 100, loc = mu, scale = sigma)
c(mean(y), mu - sigma * digamma(1)) # Sample and population means
c(var(y), sigma^2 * pi^2 / 6) # Sample and population variances

## Not run: x <- seq(-2.5, 3.5, by = 0.01)

loc <- 0; sigma <- 1
plot(x, dgumbel(x, loc, sigma), type = "l", col = "blue", ylim = c(0, 1),
main = "Blue is density, red is cumulative distribution function",
sub = "Purple are 5,10,...,95 percentiles", ylab = "", las = 1)
abline(h = 0, col = "blue", lty = 2)
lines(qgumbel(seq(0.05, 0.95, by = 0.05), loc, sigma),
dgumbel(qgumbel(seq(0.05, 0.95, by = 0.05), loc, sigma), loc, sigma),
col = "purple", lty = 3, type = "h")
lines(x, pgumbel(x, loc, sigma), type = "l", col = "red")
abline(h = 0, lty = 2)
## End(Not run)

guplot Gumbel Plot

Description
Produces a Gumbel plot, a diagnostic plot for checking whether the data appears to be from a
Gumbel distribution.

Usage
guplot(object, ...)
guplot.default(y, main = "Gumbel Plot",
xlab = "Reduced data", ylab = "Observed data", type = "p", ...)
guplot.vlm(object, ...)
guplot 343

Arguments
y A numerical vector. NAs etc. are not allowed.
main Character. Overall title for the plot.
xlab Character. Title for the x axis.
ylab Character. Title for the y axis.
type Type of plot. The default means points are plotted.
object An object that inherits class "vlm", usually of class vglm-class or vgam-class.
... Graphical argument passed into plot. See par for an exhaustive list. The argu-
ments xlim and ylim are particularly useful.

Details
If Y has a Gumbel distribution then plotting the sorted values yi versus the reduced values ri should
appear linear. The reduced values are given by

ri = log( log(pi ))

where pi is the ith plotting position, taken here to be (i 0.5)/n. Here, n is the number of obser-
vations. Curvature upwards/downwards may indicate a Frechet/Weibull distribution, respectively.
Outliers may also be detected using this plot.
The function guplot is generic, and guplot.default and guplot.vlm are some methods functions
for Gumbel plots.

Value
A list is returned invisibly with the following components.
x The reduced data.
y The sorted y data.

Note
The Gumbel distribution is a special case of the GEV distribution with shape parameter equal to
zero.

Author(s)
T. W. Yee

References
Coles, S. (2001) An Introduction to Statistical Modeling of Extreme Values. London: Springer-
Verlag.
Gumbel, E. J. (1958) Statistics of Extremes. New York, USA: Columbia University Press.

See Also
gumbel, gumbelff, gev, venice.
344 has.interceptvlm

Examples
## Not run: guplot(rnorm(500), las = 1) -> ii
names(ii)

guplot(with(venice, r1), col = "blue") # Venice sea levels data

## End(Not run)

has.interceptvlm Has a Fitted VGLM Got an Intercept Term?

Description
Looks at the formula to see if it has an intercept term.

Usage
has.intercept(object, ...)
has.interceptvlm(object, form.number = 1, ...)

Arguments
object A fitted model object.
form.number Formula number, is 1 or 2. which correspond to the arguments formula and
form2 respectively.
... Arguments that are might be passed from one function to another.

Details
This methods function is a simple way to determine whether a fitted vglm object etc. has an intercept
term or not. It is not entirely foolproof because one might suppress the intercept from the formula
and then add in a variable in the formula that has a constant value.

Value
Returns a single logical.

Author(s)
Thomas W. Yee

See Also
formulavlm, termsvlm.
hatvalues 345

Examples
# Example: this is based on a glm example
counts <- c(18,17,15,20,10,20,25,13,12)
outcome <- gl(3, 1, 9); treatment <- gl(3, 3)
pdata <- data.frame(counts, outcome, treatment) # Better style
vglm.D93 <- vglm(counts ~ outcome + treatment, poissonff, data = pdata)
formula(vglm.D93)
term.names(vglm.D93)
responseName(vglm.D93)
has.intercept(vglm.D93)

hatvalues Hat Values and Regression Deletion Diagnostics

Description
When complete, a suite of functions that can be used to compute some of the regression (leave-one-
out deletion) diagnostics, for the VGLM class.

Usage
hatvalues(model, ...)
hatvaluesvlm(model, type = c("diagonal", "matrix", "centralBlocks"), ...)
hatplot(model, ...)
hatplot.vlm(model, multiplier = c(2, 3), lty = "dashed",
xlab = "Observation", ylab = "Hat values", ylim = NULL, ...)
dfbetavlm(model, maxit.new = 1,
trace.new = FALSE,
smallno = 1.0e-8, ...)

Arguments
model an R object, typically returned by vglm.
type Character. The default is the first choice, which is a nM nM matrix. If
type = "matrix" then the entire hat matrix is returned. If type = "centralBlocks"
then n central M M block matrices, in matrix-band format.
multiplier Numeric, the multiplier. The usual rule-of-thumb is that values greater than
two or three times the average leverage (at least for the linear model) should be
checked.
lty, xlab, ylab, ylim
Graphical parameters, see par etc. The default of ylim is c(0, max(hatvalues(model)))
which means that if the horizontal dashed lines cannot be seen then there are no
particularly influential observations.
maxit.new, trace.new, smallno
Having maxit.new = 1 will give a one IRLS step approximation from the
ordinary solution (and no warnings!). Else having maxit.new = 10, say, should
346 hatvalues

usually mean convergence will occur for all observations when they are removed
one-at-a-time. Else having maxit.new = 2, say, should usually mean some lack
of convergence will occur when observations are removed one-at-a-time. Setting
trace.new = TRUE will produce some running output at each IRLS iteration and
for each individual row of the model matrix. The argument smallno multiplies
each value of the original prior weight (often unity); setting it identically to zero
will result in an error, but setting a very small value effectively removes that
observation.
... further arguments, for example, graphical parameters for hatplot.vlm().

Details

The invocation hatvalues(vglmObject) should return a n M matrix of the diagonal elements

of the hat (projection) matrix of a vglm object. To do this, the QR decomposition of the object is
retrieved or reconstructed, and then straightforward calculations are performed.
The invocation hatplot(vglmObject) should plot the diagonal of the hat matrix for each of the M
linear/additive predictors. By default, two horizontal dashed lines are added; hat values higher than
these ought to be checked.

Note

It is hoped, soon, that the full suite of functions described at influence.measures will be written
for VGLMs. This will enable general regression deletion diagnostics to be available for the entire
VGLM class.

Author(s)

T. W. Yee.

See Also

vglm, cumulative, influence.measures.

Examples

# Proportional odds model, p.179, in McCullagh and Nelder (1989)

pneumo <- transform(pneumo, let = log(exposure.time))
fit <- vglm(cbind(normal, mild, severe) ~ let, cumulative, data = pneumo)
hatvalues(fit) # n x M matrix, with positive values
all.equal(sum(hatvalues(fit)), fit@rank) # Should be TRUE
## Not run: par(mfrow = c(1, 2))
hatplot(fit, ylim = c(0, 1), las = 1, col = "blue")
## End(Not run)
hormone 347

hormone Hormone Assay Data

Description

A hormone assay data set from Carroll and Ruppert (1988).

Usage

data(hormone)

Format

A data frame with 85 observations on the following 2 variables.

X a numeric vector, suitable as the x-axis in a scatter plot. The reference method.
Y a numeric vector, suitable as the y-axis in a scatter plot. The test method.

Details

The data is given in Table 2.4 of Carroll and Ruppert (1988), and was downloaded from http:
//www.stat.tamu.edu/~carroll. The book describes the data as follows. The data are the re-
sults of two assay methods for hormone data; the scale of the data as presented is not particularly
meaningful, and the original source of the data refused permission to divulge further information.
As in a similar example of Leurgans (1980), the old or reference method is being used to predict
the new or test method. The overall goal is to see whether we can reproduce the test-method mea-
surements with the reference-method measurements. Thus calibration might be of interest for the
data.

References

Carroll, R. J. and Ruppert, D. (1988) Transformation and Weighting in Regression. New York,
USA: Chapman & Hall.
Leurgans, S. (1980) Evaluating laboratory measurement techniques. Biostatistics Casebook. Eds.:
Miller, R. G. Jr., and Efron, B. and Brown, B. W. Jr., and Moses, L. New York, USA: Wiley.
Yee, T. W. (2014) Reduced-rank vector generalized linear models with two linear predictors. Com-
putational Statistics and Data Analysis, 71, 889902.

See Also

uninormal, rrvglm.
348 hormone

Examples

## Not run:
data(hormone)
summary(hormone)

modelI <-rrvglm(Y ~ 1 + X, data = hormone, trace = TRUE,

uninormal(zero = NULL, lsd = "identitylink", imethod = 2))

# Alternative way to fit modelI

modelI.other <- vglm(Y ~ 1 + X, data = hormone, trace = TRUE,
uninormal(zero = NULL, lsd = "identitylink"))

# Inferior to modelI
modelII <- vglm(Y ~ 1 + X, data = hormone, trace = TRUE,
family = uninormal(zero = NULL))

logLik(modelI)
logLik(modelII) # Less than logLik(modelI)

# Reproduce the top 3 equations on p.65 of Carroll and Ruppert (1988).

# They are called Equations (1)--(3) here.

# Equation (1)
hormone <- transform(hormone, rX = 1 / X)
clist <- list("(Intercept)" = diag(2), X = diag(2), rX = rbind(0, 1))
fit1 <- vglm(Y ~ 1 + X + rX, family = uninormal(zero = NULL),
constraints = clist, data = hormone, trace = TRUE)
coef(fit1, matrix = TRUE)
summary(fit1) # Actually, the intercepts do not seem significant
plot(Y ~ X, hormone, col = "blue")
lines(fitted(fit1) ~ X, hormone, col = "orange")

# Equation (2)
fit2 <- rrvglm(Y ~ 1 + X, uninormal(zero = NULL), hormone, trace = TRUE)
coef(fit2, matrix = TRUE)
plot(Y ~ X, hormone, col = "blue")
lines(fitted(fit2) ~ X, hormone, col = "red")
# Add +- 2 SEs
lines(fitted(fit2) + 2 * exp(predict(fit2)[, "loge(sd)"]) ~ X,
hormone, col = "orange")
lines(fitted(fit2) - 2 * exp(predict(fit2)[, "loge(sd)"]) ~ X,
hormone, col = "orange")

# Equation (3)
# Does not fit well because the loge link for the mean is not good.
fit3 <- rrvglm(Y ~ 1 + X, maxit = 300, data = hormone, trace = TRUE,
uninormal(lmean = "loge", zero = NULL))
coef(fit3, matrix = TRUE)
plot(Y ~ X, hormone, col = "blue") # Does not look okay.
lines(exp(predict(fit3)[, 1]) ~ X, hormone, col = "red")
# Add +- 2 SEs
hspider 349

lines(fitted(fit3) + 2 * exp(predict(fit3)[, "loge(sd)"]) ~ X,

hormone, col = "orange")
lines(fitted(fit3) - 2 * exp(predict(fit3)[, "loge(sd)"]) ~ X,
hormone, col = "orange")

## End(Not run)

hspider Hunting Spider Data

Description
Abundance of hunting spiders in a Dutch dune area.

Usage
data(hspider)

Format
A data frame with 28 observations (sites) on the following 18 variables.

WaterCon Log percentage of soil dry mass.

BareSand Log percentage cover of bare sand.
FallTwig Log percentage cover of fallen leaves and twigs.
CoveMoss Log percentage cover of the moss layer.
CoveHerb Log percentage cover of the herb layer.
ReflLux Reflection of the soil surface with cloudless sky.
Alopacce Abundance of Alopecosa accentuata.
Alopcune Abundance of Alopecosa cuneata.
Alopfabr Abundance of Alopecosa fabrilis.
Arctlute Abundance of Arctosa lutetiana.
Arctperi Abundance of Arctosa perita.
Auloalbi Abundance of Aulonia albimana.
Pardlugu Abundance of Pardosa lugubris.
Pardmont Abundance of Pardosa monticola.
Pardnigr Abundance of Pardosa nigriceps.
Pardpull Abundance of Pardosa pullata.
Trocterr Abundance of Trochosa terricola.
Zoraspin Abundance of Zora spinimana.
350 huber2

Details

The data, which originally came from Van der Aart and Smeek-Enserink (1975) consists of abun-
dances (numbers trapped over a 60 week period) and 6 environmental variables. There were 28
sites.
This data set has been often used to illustrate ordination, e.g., using canonical correspondence
analysis (CCA). In the example below, the data is used for constrained quadratic ordination (CQO;
formerly called canonical Gaussian ordination or CGO), a numerically intensive method that has
many superior qualities. See cqo for details.

References

Van der Aart, P. J. M. and Smeek-Enserink, N. (1975) Correlations between distributions of hunting
spiders (Lycosidae, Ctenidae) and environmental characteristics in a dune area. Netherlands Journal
of Zoology, 25, 145.

Examples
summary(hspider)

## Not run:
# Standardize the environmental variables:
hspider[, 1:6] <- scale(subset(hspider, select = WaterCon:ReflLux))

# Fit a rank-1 binomial CAO

hsbin <- hspider # Binary species data
hsbin[, -(1:6)] <- as.numeric(hsbin[, -(1:6)] > 0)
set.seed(123)
ahsb1 <- cao(cbind(Alopcune, Arctlute, Auloalbi, Zoraspin) ~
WaterCon + ReflLux,
family = binomialff(multiple.responses = TRUE),
df1.nl = 2.2, Bestof = 3, data = hsbin)
par(mfrow = 2:1, las = 1)
lvplot(ahsb1, type = "predictors", llwd = 2, ylab = "logit p", lcol = 1:9)
persp(ahsb1, rug = TRUE, col = 1:10, lwd = 2)
coef(ahsb1)

## End(Not run)

huber2 Hubers Least Favourable Distribution Family Function

Description

M-estimation of the two parameters of Hubers least favourable distribution. The one parameter
case is also implemented.
huber2 351

Usage
huber1(llocation = "identitylink", k = 0.862, imethod = 1)
huber2(llocation = "identitylink", lscale = "loge",
k = 0.862, imethod = 1, zero = "scale")

Arguments
llocation, lscale
Link functions applied to the location and scale parameters. See Links for more
choices.
k Tuning constant. See rhuber for more information.
imethod, zero See CommonVGAMffArguments for information. The default value of zero means
the scale parameter is modelled as intercept-only.

Details
Hubers least favourable distribution family function is popular for resistant/robust regression. The
center of the distribution is normal and its tails are double exponential.
By default, the mean is the first linear/additive predictor (returned as the fitted values; this is the
location parameter), and the log of the scale parameter is the second linear/additive predictor. The
Fisher information matrix is diagonal; Fisher scoring is implemented.
The VGAM family function huber1() estimates only the location parameter. It assumes a scale
parameter of unit value.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, and vgam.

Note
Warning: actually, huber2() may be erroneous since the first derivative is not continuous when
there are two parameters to estimate. huber1() is fine in this respect.
The response should be univariate.

Author(s)
T. W. Yee. Help was given by Arash Ardalan.

References
Huber, P. J. and Ronchetti, E. (2009) Robust Statistics, 2nd ed. New York: Wiley.

See Also
rhuber, uninormal, gaussianff, laplace, CommonVGAMffArguments.
352 Huggins89.t1

Examples
set.seed(1231); NN <- 30; coef1 <- 1; coef2 <- 10
hdata <- data.frame(x2 = sort(runif(NN)))
hdata <- transform(hdata, y = rhuber(NN, mu = coef1 + coef2 * x2))

hdata$x2[1] <- 0.0 # Add an outlier

hdata$y[1] <- 10

fit.huber2 <- vglm(y ~ x2, huber2(imethod = 3), data = hdata, trace = TRUE)
fit.huber1 <- vglm(y ~ x2, huber1(imethod = 3), data = hdata, trace = TRUE)

coef(fit.huber2, matrix = TRUE)

summary(fit.huber2)

## Not run: # Plot the results

plot(y ~ x2, data = hdata, col = "blue", las = 1)
lines(fitted(fit.huber2) ~ x2, data = hdata, col = "darkgreen", lwd = 2)

fit.lm <- lm(y ~ x2, hdata) # Compare to a LM:

lines(fitted(fit.lm) ~ x2, data = hdata, col = "lavender", lwd = 3)

# Compare to truth:
lines(coef1 + coef2 * x2 ~ x2, data = hdata, col = "orange", lwd = 2, lty = "dashed")

legend("bottomright", legend = c("truth", "huber", "lm"),

col = c("orange", "darkgreen", "lavender"),
lty = c("dashed", "solid", "solid"), lwd = c(2, 2, 3))
## End(Not run)

Huggins89.t1 Table 1 of Huggins (1989)

Description

Simulated capture data set for the linear logistic model depending on an occasion covariate and an
individual covariate for 10 trapping occasions and 20 individuals.

Usage

data(Huggins89table1)
data(Huggins89.t1)

Format

The format is a data frame.

Huggins89.t1 353

Details
Table 1 of Huggins (1989) gives this toy data set. Note that variables t1,. . . ,t10 are occasion-
specific variables. They correspond to the response variables y1,. . . ,y10 which have values 1 for
capture and 0 for not captured.
Both Huggins89table1 and Huggins89.t1 are identical. The latter used variables beginning with
z, not t, and may be withdrawn very soon.

References
Huggins, R. M. (1989) On the statistical analysis of capture experiments. Biometrika, 76, 133140.

Examples
Huggins89table1 <- transform(Huggins89table1, x3.tij = t01,
T02 = t02, T03 = t03, T04 = t04, T05 = t05, T06 = t06,
T07 = t07, T08 = t08, T09 = t09, T10 = t10)
small.table1 <- subset(Huggins89table1,
y01 + y02 + y03 + y04 + y05 + y06 + y07 + y08 + y09 + y10 > 0)
# fit.tbh is the bottom equation on p.133.
# It is a M_tbh model.
fit.tbh <-
vglm(cbind(y01, y02, y03, y04, y05, y06, y07, y08, y09, y10) ~ x2 + x3.tij,
xij = list(x3.tij ~ t01 + t02 + t03 + t04 + t05 + t06 + t07 + t08 + t09 + t10 +
T02 + T03 + T04 + T05 + T06 + T07 + T08 + T09 + T10 - 1),
posbernoulli.tb(parallel.t = TRUE ~ x2 + x3.tij),
data = small.table1, trace = TRUE,
form2 = ~ x2 + x3.tij +
t01 + t02 + t03 + t04 + t05 + t06 + t07 + t08 + t09 + t10 +
T02 + T03 + T04 + T05 + T06 + T07 + T08 + T09 + T10)

# These results differ a bit from Huggins (1989), probably because

# two animals had to be removed here (they were never caught):
coef(fit.tbh) # First element is the behavioural effect
sqrt(diag(vcov(fit.tbh))) # SEs
constraints(fit.tbh, matrix = TRUE)
summary(fit.tbh, presid = FALSE)
fit.tbh@extra$N.hat # Estimate of the population site N; cf. 20.86
fit.tbh@extra$SE.N.hat # Its standard error; cf. 1.87 or 4.51

fit.th <- vglm(cbind(y01, y02, y03, y04, y05, y06, y07, y08, y09, y10) ~ x2,
posbernoulli.t, data = small.table1, trace = TRUE)
coef(fit.th)
constraints(fit.th)
coef(fit.th, matrix = TRUE) # M_th model
summary(fit.th, presid = FALSE)
fit.th@extra$N.hat # Estimate of the population size N
fit.th@extra$SE.N.hat # Its standard error

fit.bh <- vglm(cbind(y01, y02, y03, y04, y05, y06, y07, y08, y09, y10) ~ x2,
posbernoulli.b(I2 = FALSE), data = small.table1, trace = TRUE)
coef(fit.bh)
354 hunua

constraints(fit.bh)
coef(fit.bh, matrix = TRUE) # M_bh model
summary(fit.bh, presid = FALSE)
fit.bh@extra$N.hat
fit.bh@extra$SE.N.hat

fit.h <- vglm(cbind(y01, y02, y03, y04, y05, y06, y07, y08, y09, y10) ~ x2,
posbernoulli.b, data = small.table1, trace = TRUE)
coef(fit.h, matrix = TRUE) # M_h model (version 1)
coef(fit.h)
summary(fit.h, presid = FALSE)
fit.h@extra$N.hat
fit.h@extra$SE.N.hat

Fit.h <- vglm(cbind(y01, y02, y03, y04, y05, y06, y07, y08, y09, y10) ~ x2,
posbernoulli.t(parallel.t = TRUE ~ x2),
data = small.table1, trace = TRUE)
coef(Fit.h)
coef(Fit.h, matrix = TRUE) # M_h model (version 2)
summary(Fit.h, presid = FALSE)
Fit.h@extra$N.hat
Fit.h@extra$SE.N.hat

hunua Hunua Ranges Data

Description
The hunua data frame has 392 rows and 18 columns. Altitude is explanatory, and there are binary
responses (presence/absence = 1/0 respectively) for 17 plant species.

Usage
data(hunua)

Format
This data frame contains the following columns:

agaaus Agathis australis, or Kauri

hedarb Hedycarya arborea

hohpop Species name unknown
kniexc Knightia excelsa, or Rewarewa
kuneri Kunzea ericoides
lepsco Leptospermum scoparium
metrob Metrosideros robusta
neslan Nestegis lanceolata
rhosap Rhopalostylis sapida
vitluc Vitex lucens, or Puriri
altitude meters above sea level

Details

These were collected from the Hunua Ranges, a small forest in southern Auckland, New Zealand.
At 392 sites in the forest, the presence/absence of 17 plant species was recorded, as well as the
altitude. Each site was of area size 200m2 .

Source

Dr Neil Mitchell, University of Auckland.

See Also

waitakere.

Examples

# Fit a GAM using vgam() and compare it with the Waitakere Ranges one
fit.h <- vgam(agaaus ~ s(altitude, df = 2), binomialff, data = hunua)
## Not run:
plot(fit.h, se = TRUE, lcol = "orange", scol = "orange",
llwd = 2, slwd = 2, main = "Orange is Hunua, Blue is Waitakere")
## End(Not run)
head(predict(fit.h, hunua, type = "response"))

fit.w <- vgam(agaaus ~ s(altitude, df = 2), binomialff, data = waitakere)

## Not run:
plot(fit.w, se = TRUE, lcol = "blue", scol = "blue", add = TRUE)
## End(Not run)
head(predict(fit.w, hunua, type = "response")) # Same as above?
356 hyperg

hyperg Hypergeometric Family Function

Description
Family function for a hypergeometric distribution where either the number of white balls or the total
number of white and black balls are unknown.

Usage
hyperg(N = NULL, D = NULL, lprob = "logit", iprob = NULL)

Arguments
N Total number of white and black balls in the urn. Must be a vector with positive
values, and is recycled, if necessary, to the same length as the response. One of
N and D must be specified.
D Number of white balls in the urn. Must be a vector with positive values, and is
recycled, if necessary, to the same length as the response. One of N and D must
be specified.
lprob Link function for the probabilities. See Links for more choices.
iprob Optional initial value for the probabilities. The default is to choose initial values
internally.

Details
Consider the scenario from dhyper where there are N = m + n balls in an urn, where m are
white and n are black. A simple random sample (i.e., without replacement) of k balls is taken. The
response here is the sample proportion of white balls. In this document, N is N = m + n, D is
m (for the number of defectives, in quality control terminology, or equivalently, the number of
marked individuals). The parameter to be estimated is the population proportion of white balls, viz.
prob = m/(m + n).
Depending on which one of N and D is inputted, the estimate of the other parameter can be obtained
from the equation prob = m/(m + n), or equivalently, prob = D/N. However, the log-factorials
are computed using lgamma and both m and n are not restricted to being integer. Thus if an integer
N is to be estimated, it will be necessary to evaluate the likelihood function at integer values about
the estimate, i.e., at trunc(Nhat) and ceiling(Nhat) where Nhat is the (real) estimate of N .

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, vgam, rrvglm, cqo, and cao.

Warning
No checking is done to ensure that certain values are within range, e.g., k N .
hypersecant 357

Note
The response can be of one of three formats: a factor (first level taken as success), a vector of
proportions of success, or a 2-column matrix (first column = successes) of counts. The argument
weights in the modelling function can also be specified. In particular, for a general vector of
proportions, you will need to specify weights because the number of trials is needed.

Author(s)
Thomas W. Yee

References
Forbes, C., Evans, M., Hastings, N. and Peacock, B. (2011) Statistical Distributions, Hoboken, NJ,
USA: John Wiley and Sons, Fourth edition.

# N is unknown, D is known. Both models are equivalent:

fit <- vglm(cbind(y,k-y) ~ 1, hyperg(D = m), trace = TRUE, crit = "c")
fit <- vglm(yprop ~ 1, hyperg(D = m), weight = k, trace = TRUE, crit = "c")

# N is known, D is unknown. Both models are equivalent:

fit <- vglm(cbind(y, k-y) ~ 1, hyperg(N = m+n), trace = TRUE, crit = "l")
fit <- vglm(yprop ~ 1, hyperg(N = m+n), weight = k, trace = TRUE, crit = "l")

coef(fit, matrix = TRUE)

Coef(fit) # Should be equal to the true population proportion
unique(m / (m+n)) # The true population proportion
fit@extra
head(fitted(fit))
summary(fit)

hypersecant Hyperbolic Secant Distribution Family Function

Description
Estimation of the parameter of the hyperbolic secant distribution.
358 hypersecant

Usage

hypersecant(link.theta = extlogit(min = -pi/2, max = pi/2), init.theta = NULL)

hypersecant01(link.theta = extlogit(min = -pi/2, max = pi/2), init.theta = NULL)

Arguments

link.theta Parameter link function applied to the parameter . See Links for more choices.
init.theta Optional initial value for . If failure to converge occurs, try some other value.
The default means an initial value is determined internally.

Details

The probability density function of the hyperbolic secant distribution is given by

f (y; ) = exp(y + log(cos()))/(2 cosh(y/2)),

for parameter /2 < < /2 and all real y. The mean of Y is tan() (returned as the fitted
values). Morris (1982) calls this model NEF-HS (Natural Exponential Family-Hyperbolic Secant).
It is used to generate NEFs, giving rise to the class of NEF-GHS (G for Generalized).
Another parameterization is used for hypersecant01(): let Y = (logitU )/. Then this uses

f (u; ) = (cos()/) u0.5+/ (1 u)0.5/ ,

for parameter /2 < < /2 and 0 < u < 1. Then the mean of U is 0.5 + / (returned as the
fitted values) and the variance is ( 2 42 )/(8 2 ).
For both parameterizations Newton-Raphson is same as Fisher scoring.

Value

An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, and vgam.

Author(s)

T. W. Yee

References

Jorgensen, B. (1997) The Theory of Dispersion Models. London: Chapman & Hall.
Morris, C. N. (1982) Natural exponential families with quadratic variance functions. The Annals of
Statistics, 10(1), 6580.

See Also

extlogit.
Hzeta 359

Examples
hdata <- data.frame(x2 = rnorm(nn <- 200))
hdata <- transform(hdata, y = rnorm(nn)) # Not very good data!
fit1 <- vglm(y ~ x2, hypersecant, data = hdata, trace = TRUE, crit = "coef")
coef(fit1, matrix = TRUE)
fit1@misc$earg

# Not recommended:
fit2 <- vglm(y ~ x2, hypersecant(link = "identitylink"), data = hdata, trace = TRUE)
coef(fit2, matrix = TRUE)
fit2@misc$earg

Hzeta Haights Zeta Distribution

Description
Density, distribution function, quantile function and random generation for Haights zeta distribu-
tion with parameter shape.

Usage
dhzeta(x, shape, log = FALSE)
phzeta(q, shape, log.p = FALSE)
qhzeta(p, shape)
rhzeta(n, shape)

Arguments
x, q, p, n Same meaning as runif.
shape The positive shape parameter. Called below.
log,log.p Same meaning as in pnorm or qnorm.

Details
The probability function is
f (x) = (2x 1)() (2x + 1)() ,
where > 0 and x = 1, 2, . . ..

Value
dhzeta gives the density, phzeta gives the distribution function, qhzeta gives the quantile function,
and rhzeta generates random deviates.

Note
Given some response data, the VGAM family function hzeta estimates the parameter shape.
360 hzeta

Author(s)

T. W. Yee and Kai Huang

See Also

hzeta, zeta, zetaff, simulate.vlm.

Examples

dhzeta(1:20, 2.1)
rhzeta(20, 2.1)

round(1000 * dhzeta(1:8, 2))

table(rhzeta(1000, 2))

## Not run: shape <- 1.1; x <- 1:10

plot(x, dhzeta(x, shape = shape), type = "h", ylim = 0:1, lwd = 2,
sub = paste("shape =", shape), las = 1, col = "blue", ylab = "Probability",
main = "Haight's zeta: blue = density; orange = distribution function")
lines(x+0.1, phzeta(x, shape = shape), col = "orange", lty = 3, lwd = 2,
type = "h")

## End(Not run)

hzeta Haights Zeta Family Function

Description

Estimating the parameter of Haights zeta distribution

Usage

hzeta(lshape = "loglog", ishape = NULL, nsimEIM = 100)

Arguments

lshape Parameter link function for the parameter, called below. See Links for more
choices. Here, a log-log link keeps the parameter greater than one, meaning the
mean is finite.
ishape,nsimEIM See CommonVGAMffArguments for more information.
iam 361

Details
The probability function is

f (y) = (2y 1)() (2y + 1)() ,

where the parameter > 0 and y = 1, 2, . . .. The function dhzeta computes this probability
function. The mean of Y , which is returned as fitted values, is (1 2 )() provided > 1,
where is Riemanns zeta function. The mean is a decreasing function of . The mean is infinite if
1, and the variance is infinite if 2.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm and vgam.

Author(s)
T. W. Yee

References
Pages 5334 of Johnson N. L., Kemp, A. W. and Kotz S. (2005) Univariate Discrete Distributions,
3rd edition, Hoboken, New Jersey: Wiley.

See Also
Hzeta, zeta, zetaff, loglog, simulate.vlm.

Examples
shape <- exp(exp(-0.1)) # The parameter
hdata <- data.frame(y = rhzeta(n = 1000, shape))
fit <- vglm(y ~ 1, hzeta, data = hdata, trace = TRUE, crit = "coef")
coef(fit, matrix = TRUE)
Coef(fit) # Useful for intercept-only models; should be same as shape
c(with(hdata, mean(y)), head(fitted(fit), 1))
summary(fit)

iam Index from Array to Matrix

Description
Maps the elements of an array containing symmetric positive-definite matrices to a matrix with
sufficient columns to hold them (called matrix-band format.)

Usage
iam(j, k, M, both = FALSE, diag = TRUE)
362 iam

Arguments

j An integer from the set {1:M} giving the row number of an element.
k An integer from the set {1:M} giving the column number of an element.
M The number of linear/additive predictors. This is the dimension of each positive-
definite symmetric matrix.
both Logical. Return both the row and column indices? See below for more details.
diag Logical. Return the indices for the diagonal elements? If FALSE then only the
strictly upper triangular part of the matrix elements are used.

Details

Suppose we have n symmetric positive-definite square matrices, each M by M , and these are stored
in an array of dimension c(n,M,M). Then these can be more compactly represented by a matrix
of dimension c(n,K) where K is an integer between M and M*(M+1)/2 inclusive. The mapping
between these two representations is given by this function. It firstly enumerates by the diagonal
elements, followed by the band immediately above the diagonal, then the band above that one, etc.
The last element is (1,M). This function performs the mapping from elements (j,k) of symmetric
positive-definite square matrices to the columns of another matrix representing such. This is called
the matrix-band format and is used by the VGAM package.

Value

This function has a dual purpose depending on the value of both. If both=FALSE then the column
number corresponding to the j-k element of the matrix is returned. If both = TRUE then j and k
are ignored and a list with the following components are returned.

row.index The row indices of the upper triangular part of the matrix (This may or may not
include the diagonal elements, depending on the argument diagonal).
col.index The column indices of the upper triangular part of the matrix (This may or may
not include the diagonal elements, depending on the argument diagonal).

Note

This function is used in the weight slot of many VGAM family functions (see vglmff-class),
especially those whose M is determined by the data, e.g., dirichlet, multinomial.

Author(s)

T. W. Yee

See Also

vglmff-class.
identitylink 363

Examples
iam(1, 2, M = 3) # The 4th column represents element (1,2) of a 3x3 matrix
iam(NULL, NULL, M = 3, both = TRUE) # Return the row and column indices

dirichlet()@weight

M <- 4
temp1 <- iam(NA, NA, M = M, both = TRUE)
mat1 <- matrix(NA, M, M)
mat1[cbind(temp1$row, temp1$col)] = 1:length(temp1$row)
mat1 # More commonly used

temp2 <- iam(NA, NA, M = M, both = TRUE, diag = FALSE)

mat2 <- matrix(NA, M, M)
mat2[cbind(temp2$row, temp2$col)] = 1:length(temp2$row)
mat2 # Rarely used

identitylink Identity Link Function

Description

Computes the identity transformation, including its inverse and the first two derivatives.

Usage

identitylink(theta, inverse = FALSE, deriv = 0, short = TRUE, tag = FALSE)

negidentity(theta, inverse = FALSE, deriv = 0, short = TRUE, tag = FALSE)

Arguments

theta Numeric or character. See below for further details.

inverse, deriv, short, tag
Details at Links.

Details

The identity link function g() = should be available to every parameter estimated by the VGAM
library. However, it usually results in numerical problems because the estimates lie outside the
permitted range. Consequently, the result may contain Inf, -Inf, NA or NaN.
The function negidentity is the negative-identity link function and corresponds to g() = .
This is useful for some models, e.g., in the literature supporting the gevff function it seems that
half of the authors use = k for the shape parameter and the other half use k instead of .
364 inv.binomial

Value
For identitylink(): for deriv = 0, the identity of theta, i.e., theta when inverse = FALSE,
and if inverse = TRUE then theta. For deriv = 1, then the function returns d eta / d theta as a
function of theta if inverse = FALSE, else if inverse = TRUE then it returns the reciprocal.
For negidentity(): the results are similar to identitylink() except for a sign change in most
cases.

Author(s)
Thomas W. Yee

References
McCullagh, P. and Nelder, J. A. (1989) Generalized Linear Models, 2nd ed. London: Chapman &
Hall.

See Also
Links, loge, logit, probit, powerlink.

Examples
identitylink((-5):5)
identitylink((-5):5, deriv = 1)
identitylink((-5):5, deriv = 2)
negidentity((-5):5)
negidentity((-5):5, deriv = 1)
negidentity((-5):5, deriv = 2)

inv.binomial Inverse Binomial Distribution Family Function

Description
Estimates the two parameters of an inverse binomial distribution by maximum likelihood estima-
tion.

Usage
inv.binomial(lrho = extlogit(min = 0.5, max = 1),
llambda = "loge", irho = NULL, ilambda = NULL, zero = NULL)

Arguments
lrho, llambda Link function for the and parameters. See Links for more choices.
irho, ilambda Numeric. Optional initial values for and .
zero See CommonVGAMffArguments.
inv.binomial 365

Details
The inverse binomial distribution of Yanagimoto (1989) has density function

(2y + )
f (y; , ) = {(1 )}y
(y + 1) (y + + 1)

where y = 0, 1, 2, . . . and 21 < < 1, and > 0. The first two moments exist for > 21 ; then the
mean is (1 )/(2 1) (returned as the fitted values) and the variance is (1 )/(2 1)3 .
The inverse binomial distribution is a special case of the generalized negative binomial distribution
of Jain and Consul (1971). It holds that V ar(Y ) > E(Y ) so that the inverse binomial distribution
is overdispersed compared with the Poisson distribution.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm and vgam.

Note
This VGAM family function only works reasonably well with intercept-only models. Good initial
values are needed; if convergence failure occurs use irho and/or ilambda.
Some elements of the working weight matrices use the expected information matrix while other
elements use the observed information matrix. Yet to do: using the mean and the reciprocal of
results in an EIM that is diagonal.

Author(s)
T. W. Yee

References
Yanagimoto, T. (1989) The inverse binomial distribution as a statistical model. Communications in
Statistics: Theory and Methods, 18, 36253633.
Jain, G. C. and Consul, P. C. (1971) A generalized negative binomial distribution. SIAM Journal on
Applied Mathematics, 21, 501513.
Jorgensen, B. (1997) The Theory of Dispersion Models. London: Chapman & Hall

sum(weights(fit)) # Sum of the prior weights

sum(weights(fit, type = "work")) # Sum of the working weights

Inv.gaussian The Inverse Gaussian Distribution

Description

Density, distribution function and random generation for the inverse Gaussian distribution.

Usage

dinv.gaussian(x, mu, lambda, log = FALSE)

pinv.gaussian(q, mu, lambda)
rinv.gaussian(n, mu, lambda)

Arguments

x, q vector of quantiles.
n number of observations. If length(n) > 1 then the length is taken to be the
number required.
mu the mean parameter.
lambda the parameter.
log Logical. If log = TRUE then the logarithm of the density is returned.

Details

See inv.gaussianff, the VGAM family function for estimating both parameters by maximum
likelihood estimation, for the formula of the probability density function.

Value

dinv.gaussian gives the density, pinv.gaussian gives the distribution function, and rinv.gaussian
generates random deviates.

Note

Currently qinv.gaussian is unavailable.

Author(s)

T. W. Yee
inv.gaussianff 367

References
Johnson, N. L. and Kotz, S. and Balakrishnan, N. (1994) Continuous Univariate Distributions, 2nd
edition, Volume 1, New York: Wiley.
Taraldsen, G. and Lindqvist, B. H. (2005) The multiple roots simulation algorithm, the inverse
Gaussian distribution, and the sufficient conditional Monte Carlo method. Preprint Statistics No.
4/2005, Norwegian University of Science and Technology, Trondheim, Norway.

inv.gaussianff Inverse Gaussian Distribution Family Function

Description
Estimates the two parameters of the inverse Gaussian distribution by maximum likelihood estima-
tion.

Usage
inv.gaussianff(lmu = "loge", llambda = "loge",
imethod = 1, ilambda = NULL,
parallel = FALSE, ishrinkage = 0.99, zero = NULL)

Arguments
lmu, llambda Parameter link functions for the and parameters. See Links for more
choices.
ilambda, parallel
See CommonVGAMffArguments for more information. If parallel = TRUE then
the constraint is not applied to the intercept.
imethod, ishrinkage, zero
See CommonVGAMffArguments for information.
368 inv.gaussianff

Details
The standard (canonical) form of the inverse Gaussian distribution has a density that can be
written as p
f (y; , ) = /(2y 3 ) exp (y )2 /(22 y)

where y > 0, > 0, and > 0. The mean of Y is and its variance is 3 /. By default,
1 = log() and 2 = log(). The mean is returned as the fitted values. This VGAM family
function can handle multiple responses (inputted as a matrix).

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, rrvglm and vgam.

Note
The inverse Gaussian distribution can be fitted (to a certain extent) using the usual GLM framework
involving a scale parameter. This family function is different from that approach in that it estimates
both parameters by full maximum likelihood estimation.

Author(s)
T. W. Yee

References
Johnson, N. L. and Kotz, S. and Balakrishnan, N. (1994) Continuous Univariate Distributions, 2nd
edition, Volume 1, New York: Wiley.
Forbes, C., Evans, M., Hastings, N. and Peacock, B. (2011) Statistical Distributions, Hoboken, NJ,
USA: John Wiley and Sons, Fourth edition.

See Also
Inv.gaussian, waldff, bisa.
The R package SuppDists has several functions for evaluating the density, distribution function,
quantile function and generating random numbers from the inverse Gaussian distribution.

Examples
idata <- data.frame(x2 = runif(nn <- 1000))
idata <- transform(idata, mymu = exp(2 + 1 * x2),
Lambda = exp(2 + 1 * x2))
idata <- transform(idata, y = rinv.gaussian(nn, mu = mymu, lambda = Lambda))
fit1 <- vglm(y ~ x2, inv.gaussianff, data = idata, trace = TRUE)
rrig <- rrvglm(y ~ x2, inv.gaussianff, data = idata, trace = TRUE)
coef(fit1, matrix = TRUE)
coef(rrig, matrix = TRUE)
Coef(rrig)
summary(fit1)
Inv.lomax 369

Inv.lomax The Inverse Lomax Distribution

Description
Density, distribution function, quantile function and random generation for the inverse Lomax dis-
tribution with shape parameter p and scale parameter scale.

Usage
dinv.lomax(x, scale = 1, shape2.p, log = FALSE)
pinv.lomax(q, scale = 1, shape2.p, lower.tail = TRUE, log.p = FALSE)
qinv.lomax(p, scale = 1, shape2.p, lower.tail = TRUE, log.p = FALSE)
rinv.lomax(n, scale = 1, shape2.p)

Arguments
x, q vector of quantiles.
p vector of probabilities.
n number of observations. If length(n) > 1, the length is taken to be the number
required.
shape2.p shape parameter.
scale scale parameter.
log Logical. If log = TRUE then the logarithm of the density is returned.
lower.tail, log.p
Same meaning as in pnorm or qnorm.

Details
See inv.lomax, which is the VGAM family function for estimating the parameters by maximum
likelihood estimation.

Value
dinv.lomax gives the density, pinv.lomax gives the distribution function, qinv.lomax gives the
quantile function, and rinv.lomax generates random deviates.

Note
The inverse Lomax distribution is a special case of the 4-parameter generalized beta II distribution.

Author(s)
T. W. Yee
370 inv.lomax

References
Kleiber, C. and Kotz, S. (2003) Statistical Size Distributions in Economics and Actuarial Sciences,
Hoboken, NJ, USA: Wiley-Interscience.

inv.lomax Inverse Lomax Distribution Family Function

Description
Maximum likelihood estimation of the 2-parameter inverse Lomax distribution.

Usage
inv.lomax(lscale = "loge", lshape2.p = "loge", iscale = NULL,
ishape2.p = NULL, imethod = 1, gscale = exp(-5:5),
gshape2.p = exp(-5:5), probs.y = c(0.25, 0.5, 0.75), zero = "shape2.p")

Arguments
lscale, lshape2.p
Parameter link functions applied to the (positive) parameters b, and p. See Links
for more choices.
iscale, ishape2.p, imethod, zero
See CommonVGAMffArguments for information. For imethod = 2 a good initial
value for ishape2.p is needed to obtain a good estimate for the other parameter.
gscale, gshape2.p
See CommonVGAMffArguments for information.
probs.y See CommonVGAMffArguments for information.

Details
The 2-parameter inverse Lomax distribution is the 4-parameter generalized beta II distribution with
shape parameters a = q = 1. It is also the 3-parameter Dagum distribution with shape parameter
a = 1, as well as the beta distribution of the second kind with q = 1. More details can be found in
Kleiber and Kotz (2003).
inv.lomax 371

The inverse Lomax distribution has density

f (y) = py p1 /[bp {1 + y/b}p+1 ]

for b > 0, p > 0, y 0. Here, b is the scale parameter scale, and p is a shape parameter. The mean
does not seem to exist; the median is returned as the fitted values. This family function handles
multiple responses.

Value

An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, and vgam.

Note

See the notes in genbetaII.

Author(s)

T. W. Yee

References

Kleiber, C. and Kotz, S. (2003) Statistical Size Distributions in Economics and Actuarial Sciences,
Hoboken, NJ, USA: Wiley-Interscience.

See Also

inv.lomax, genbetaII, betaII, dagum, sinmad, fisk, lomax, paralogistic, inv.paralogistic,

simulate.vlm.

Examples

idata <- data.frame(y = rinv.lomax(n = 2000, scale = exp(2), exp(1)))

fit <- vglm(y ~ 1, inv.lomax, data = idata, trace = TRUE)
fit <- vglm(y ~ 1, inv.lomax(iscale = exp(3)), data = idata,
trace = TRUE, epsilon = 1e-8, crit = "coef")
coef(fit, matrix = TRUE)
Coef(fit)
summary(fit)
372 Inv.paralogistic

Inv.paralogistic The Inverse Paralogistic Distribution

Description
Density, distribution function, quantile function and random generation for the inverse paralogistic
distribution with shape parameters a and p, and scale parameter scale.

Usage
dinv.paralogistic(x, scale = 1, shape1.a, log = FALSE)
pinv.paralogistic(q, scale = 1, shape1.a, lower.tail = TRUE, log.p = FALSE)
qinv.paralogistic(p, scale = 1, shape1.a, lower.tail = TRUE, log.p = FALSE)
rinv.paralogistic(n, scale = 1, shape1.a)

Arguments
x, q vector of quantiles.
p vector of probabilities.
n number of observations. If length(n) > 1, the length is taken to be the number
required.
shape1.a shape parameter.
scale scale parameter.
log Logical. If log = TRUE then the logarithm of the density is returned.
lower.tail, log.p
Same meaning as in pnorm or qnorm.

Details
See inv.paralogistic, which is the VGAM family function for estimating the parameters by
maximum likelihood estimation.

Value
dinv.paralogistic gives the density, pinv.paralogistic gives the distribution function, qinv.paralogistic
gives the quantile function, and rinv.paralogistic generates random deviates.

Note
The inverse paralogistic distribution is a special case of the 4-parameter generalized beta II distri-
bution.

Author(s)
T. W. Yee
inv.paralogistic 373

References

Kleiber, C. and Kotz, S. (2003) Statistical Size Distributions in Economics and Actuarial Sciences,
Hoboken, NJ, USA: Wiley-Interscience.

See Also

inv.paralogistic, genbetaII.

Examples
idata <- data.frame(y = rinv.paralogistic(n = 3000, exp(1), scale = exp(2)))
fit <- vglm(y ~ 1, inv.paralogistic(lss = FALSE, ishape1.a = 2.1),
data = idata, trace = TRUE, crit = "coef")
coef(fit, matrix = TRUE)
Coef(fit)

inv.paralogistic Inverse Paralogistic Distribution Family Function

Description

Maximum likelihood estimation of the 2-parameter inverse paralogistic distribution.

Usage

inv.paralogistic(lscale = "loge", lshape1.a = "loge", iscale = NULL,

ishape1.a = NULL, imethod = 1, lss = TRUE, gscale = exp(-5:5),
gshape1.a = seq(0.75, 4, by = 0.25), probs.y = c(0.25, 0.5, 0.75),
zero = "shape")

Arguments

lss See CommonVGAMffArguments for important information.

lshape1.a, lscale
Parameter link functions applied to the (positive) parameters a and scale. See
Links for more choices.
iscale, ishape1.a, imethod, zero
See CommonVGAMffArguments for information. For imethod = 2 a good initial
value for ishape1.a is needed to obtain a good estimate for the other parameter.
gscale, gshape1.a
See CommonVGAMffArguments for information.
probs.y See CommonVGAMffArguments for information.
374 inv.paralogistic

Details
The 2-parameter inverse paralogistic distribution is the 4-parameter generalized beta II distribution
with shape parameter q = 1 and a = p. It is the 3-parameter Dagum distribution with a = p. More
details can be found in Kleiber and Kotz (2003).
The inverse paralogistic distribution has density
2 2
1
f (y) = a2 y a /[ba {1 + (y/b)a }a+1 ]

for a > 0, b > 0, y 0. Here, b is the scale parameter scale, and a is the shape parameter. The
mean is
E(Y ) = b (a + 1/a) (1 1/a)/(a)
provided a > 1; these are returned as the fitted values. This family function handles multiple
responses.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, and vgam.

Note
See the notes in genbetaII.

Author(s)
T. W. Yee

References
Kleiber, C. and Kotz, S. (2003) Statistical Size Distributions in Economics and Actuarial Sciences,
Hoboken, NJ, USA: Wiley-Interscience.

See Also
Inv.paralogistic, genbetaII, betaII, dagum, sinmad, fisk, inv.lomax, lomax, paralogistic,
simulate.vlm.

Examples
idata <- data.frame(y = rinv.paralogistic(n = 3000, exp(1), scale = exp(2)))
fit <- vglm(y ~ 1, inv.paralogistic(lss = FALSE), data = idata, trace = TRUE)
fit <- vglm(y ~ 1, inv.paralogistic(imethod = 2, ishape1.a = 4),
data = idata, trace = TRUE, crit = "coef")
coef(fit, matrix = TRUE)
Coef(fit)
summary(fit)
is.buggy 375

is.buggy Does the fitted object suffer from a known bug?

Description
Checks to see if a fitted object suffers from some known bug.

Usage
is.buggy(object, ...)
is.buggy.vlm(object, each.term = FALSE, ...)

Arguments
object A fitted VGAM object, e.g., from vgam.
each.term Logical. If TRUE then a logical is returned for each term.
... Unused for now.

Details
It is known that vgam with s terms do not correctly handle constraint matrices (cmat, say) when
crossprod(cmat) is not diagonal. This function detects whether this is so or not. Note that proba-
bly all VGAM family functions have defaults where all crossprod(cmat)s are diagonal, therefore
do not suffer from this bug. It is more likely to occur if the user inputs constraint matrices using the
constraints argument (and setting zero = NULL if necessary).
Second-generation VGAMs based on sm.ps are a modern alternative to using s. It does not suffer
from this bug. However, G2-VGAMs require a reasonably large sample size in order to work more
reliably.

Value
The default is a single logical (TRUE if any term is TRUE), otherwise a vector of such with each
element corresponding to a term. If the value is TRUE then I suggest replacing the VGAM by a
similar model fitted by vglm and using regression splines, e.g., bs, ns.

Note
When the bug is fixed this function may be withdrawn, otherwise always return FALSEs!

Author(s)
T. W. Yee

See Also
vgam. vglm, s, sm.ps, bs, ns.
376 is.parallel

Examples
fit1 <- vgam(cbind(agaaus, kniexc) ~ s(altitude, df = c(3, 4)),
binomialff(multiple.responses = TRUE), data = hunua)
is.buggy(fit1) # Okay
is.buggy(fit1, each.term = TRUE) # No terms are buggy

fit2 <- vgam(cbind(agaaus, kniexc) ~ s(altitude, df = c(3, 4)),

binomialff(multiple.responses = TRUE), data = hunua,
constraints =
list("(Intercept)" = diag(2),
"s(altitude, df = c(3, 4))" = matrix(c(1, 1, 0, 1), 2, 2)))
is.buggy(fit2) # TRUE
is.buggy(fit2, each.term = TRUE)
constraints(fit2)

# fit2b is an approximate alternative to fit2:

fit2b <- vglm(cbind(agaaus, kniexc) ~ bs(altitude, df = 3) + bs(altitude, df = 4),
binomialff(multiple.responses = TRUE), data = hunua,
constraints =
list("(Intercept)" = diag(2),
"bs(altitude, df = 3)" = rbind(1, 1),
"bs(altitude, df = 4)" = rbind(0, 1)))
is.buggy(fit2b) # Okay
is.buggy(fit2b, each.term = TRUE)
constraints(fit2b)

is.parallel Parallelism Constraint Matrices

Description
Returns a logical vector from a test of whether an object such as a matrix or VGLM object corre-
sponds to a parallelism assumption.

Usage
is.parallel.matrix(object, ...)
is.parallel.vglm(object, type = c("term", "lm"), ...)

Arguments
object an object such as a constraint matrix or a vglm object.
type passed into constraints.
... additional optional arguments. Currently unused.

Details
These functions may be useful for categorical models such as propodds, cumulative, acat, cratio,
sratio, multinomial.
is.smart 377

Value
A vector of logicals, testing whether each constraint matrix is a one-column matrix of ones. Note
that parallelism can still be thought of as holding if the constraint matrix has a non-zero but constant
values, however, this is currently not implemented. No checking is done that the constraint matrices
have the same number of rows.

is.smart Test For a Smart Object

Description
Tests an object to see if it is smart.

Usage
is.smart(object)

Arguments
object a function or a fitted model.

Details
If object is a function then this function looks to see whether object has the logical attribute
"smart". If so then this is returned, else FALSE.
If object is a fitted model then this function looks to see whether [email protected] or
object$smart.prediction exists. If it does and it is not equal to list(smart.arg=FALSE) then
a TRUE is returned, else FALSE. The reason for this is because, e.g., lm(...,smart=FALSE) and
vglm(...,smart=FALSE), will return such a specific list.
Writers of smart functions manually have to assign this attribute to their smart function after it has
been written.
378 is.zero

Value
Returns TRUE or FALSE, according to whether the object is smart or not.

Examples
is.smart(sm.min1) # TRUE
is.smart(sm.poly) # TRUE
library(splines)
is.smart(sm.bs) # TRUE
is.smart(sm.ns) # TRUE
is.smart(tan) # FALSE
## Not run:
udata <- data.frame(x2 = rnorm(9))
fit1 <- vglm(rnorm(9) ~ x2, uninormal, data = udata)
is.smart(fit1) # TRUE
fit2 <- vglm(rnorm(9) ~ x2, uninormal, data = udata, smart = FALSE)
is.smart(fit2) # FALSE
[email protected]

## End(Not run)

is.zero Zero Constraint Matrices

Description
Returns a logical vector from a test of whether an object such as a matrix or VGLM object corre-
sponds to a zero assumption.

Usage
is.zero.matrix(object, ...)
is.zero.vglm(object, ...)

Arguments
object an object such as a coefficient matrix of a vglm object, or a vglm object.
... additional optional arguments. Currently unused.

Details
These functions test the effect of the zero argument on a vglm object or the coefficient matrix of a
vglm object. The latter is obtained by coef(vglmObject, matrix = TRUE).

Value
A vector of logicals, testing whether each linear/additive predictor has the zero argument applied
to it. It is TRUE if that linear/additive predictor is intercept-only, i.e., all other regression coefficients
are set to zero.
No checking is done for the intercept term at all, i.e., that it was estimated in the first place.
kendall.tau 379

kendall.tau Kendalls Tau Statistic

Description
Computes Kendalls Tau, which is a rank-based correlation measure, between two vectors.

Usage
kendall.tau(x, y, exact = FALSE, max.n = 3000)

Arguments
x, y Numeric vectors. Must be of equal length. Ideally their values are continuous
and not too discrete. Let length(x) be N , say.
exact Logical. If TRUE then the exact value is computed.
max.n Numeric. If exact = FALSE and length(x) is more than max.n then a random
sample of max.n pairs are chosen.

Details
Kendalls tau is a measure of dependency in a bivariate distribution. Loosely, two random vari-
ables are concordant if large values of one random variable are associated with large values of the
other random variable. Similarly, two random variables are disconcordant if large values of one
random variable are associated with small values of the other random variable. More formally,
if (x[i] - x[j])*(y[i] - y[j]) > 0 then that comparison is concordant (i 6= j). And if
(x[i] - x[j])*(y[i] - y[j]) < 0 then that comparison is disconcordant (i 6= j). Out of
choose(N, 2) comparisons, let c and d be the number of concordant and disconcordant pairs. Then
Kendalls tau can be estimated by (c d)/(c + d). If there are ties then half the ties are deemed
concordant and half disconcordant so that (c d)/(c + d + t) is used.

Value
Kendalls tau, which lies between 1 and 1.
380 Kumar

Warning
If length(x) is large then the cost is O(N 2 ), which is expensive! Under these circumstances it is
not advisable to set exact = TRUE or max.n to a very large number.

## Not run: plot(x, y, col = "blue")

kendall.tau(x, y) # A random sample is taken here

kendall.tau(x, y, exact = TRUE) # Costly if length(x) is large

kendall.tau(x, y, max.n = N) # Same as exact = TRUE

(rhohat <- sin(kendall.tau(x, y) * pi / 2)) # This formula holds for N_2 actually
true.rho # rhohat should be near this value

Kumar The Kumaraswamy Distribution

Description
Density, distribution function, quantile function and random generation for the Kumaraswamy dis-
tribution.

Usage
dkumar(x, shape1, shape2, log = FALSE)
pkumar(q, shape1, shape2, lower.tail = TRUE, log.p = FALSE)
qkumar(p, shape1, shape2, lower.tail = TRUE, log.p = FALSE)
rkumar(n, shape1, shape2)

Arguments
x, q vector of quantiles.
p vector of probabilities.
n number of observations. If length(n) > 1 then the length is taken to be the
number required.
Kumar 381

shape1, shape2 positive shape parameters.

log Logical. If log = TRUE then the logarithm of the density is returned.
lower.tail, log.p
Same meaning as in pnorm or qnorm.

Details

See kumar, the VGAM family function for estimating the parameters, for the formula of the proba-
bility density function and other details.

Value

dkumar gives the density, pkumar gives the distribution function, qkumar gives the quantile function,
and rkumar generates random deviates.

Author(s)

T. W. Yee and Kai Huang

See Also

kumar.

Examples

## Not run:
shape1 <- 2; shape2 <- 2; nn <- 201; # shape1 <- shape2 <- 0.5;
x <- seq(-0.05, 1.05, len = nn)
plot(x, dkumar(x, shape1, shape2), type = "l", las = 1, ylim = c(0,1.5),
ylab = paste("fkumar(shape1 = ", shape1, ", shape2 = ", shape2, ")"),
col = "blue", cex.main = 0.8,
main = "Blue is density, orange is cumulative distribution function",
sub = "Purple lines are the 10,20,...,90 percentiles")
lines(x, pkumar(x, shape1, shape2), col = "orange")
probs <- seq(0.1, 0.9, by = 0.1)
Q <- qkumar(probs, shape1, shape2)
lines(Q, dkumar(Q, shape1, shape2), col = "purple", lty = 3, type = "h")
lines(Q, pkumar(Q, shape1, shape2), col = "purple", lty = 3, type = "h")
abline(h = probs, col = "purple", lty = 3)
max(abs(pkumar(Q, shape1, shape2) - probs)) # Should be 0

## End(Not run)
382 kumar

kumar Kumaraswamy Distribution Family Function

Description
Estimates the two parameters of the Kumaraswamy distribution by maximum likelihood estimation.

Usage
kumar(lshape1 = "loge", lshape2 = "loge",
ishape1 = NULL, ishape2 = NULL, gshape1 = exp(2*ppoints(5) - 1),
tol12 = 1.0e-4, zero = NULL)

Arguments
lshape1, lshape2
Link function for the two positive shape parameters, respectively, called a and b
below. See Links for more choices.
ishape1, ishape2
Numeric. Optional initial values for the two positive shape parameters.
tol12 Numeric and positive. Tolerance for testing whether the second shape parameter
is either 1 or 2. If so then the working weights need to handle these singularities.
gshape1 Values for a grid search for the first shape parameter. See CommonVGAMffArguments
for more information.
zero See CommonVGAMffArguments.

Details
The Kumaraswamy distribution has density function

f (y; a = shape1, b = shape2) = aby a1 (1 y a )b1

where 0 < y < 1 and the two shape parameters, a and b, are positive. The mean is b Beta(1 +
1/a, b) (returned as the fitted values) and the variance is b Beta(1 + 2/a, b) (b Beta(1 +
1/a, b))2 . Applications of the Kumaraswamy distribution include the storage volume of a water
reservoir. Fisher scoring is implemented. Handles multiple responses (matrix input).

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm and vgam.

Author(s)
T. W. Yee
lakeO 383

References
Kumaraswamy, P. (1980). A generalized probability density function for double-bounded random
processes. Journal of Hydrology, 46, 7988.
Jones, M. C. (2009). Kumaraswamys distribution: A beta-type distribution with some tractability
advantages. Statistical Methodology, 6, 7081.

lakeO Annual catches on Lake Otamangakau from October 1974 to October

1989

Description
Rainbow and brown trout catches by a Mr Swainson at Lake Otamangakau in the central North
Island of New Zealand during the 1970s and 1980s.

Usage
data(lakeO)

Format
A data frame with 15 observations on the following 5 variables.

year a numeric vector, the season began on 1 October of the year and ended 12 months later.
total.fish a numeric vector, the total number of fish caught during the season. Simply the sum
of brown and rainbow trout.
brown a numeric vector, the number of brown trout (Salmo trutta) caught.
rainbow a numeric vector, the number of rainbow trout (Oncorhynchus mykiss) caught.
visits a numeric vector, the number of visits during the season that the angler made to the lake.
It is necessary to assume that the visits were of an equal time length in order to interpret the
usual Poisson regressions.
384 lambertW

Details

The data was extracted from the season summaries at Lake Otamangakau by Anthony Swainson for
the seasons 197475 to 198889.
Mr Swainson was one of a small group of regular fly fishing anglers and kept a diary of his catches.
Lake Otamangakau is a lake of area 1.8 squared km and has a maximum depth of about 12m,
and is located in the central North Island of New Zealand. It is trout-infested and known for its
trophy-sized fish.
See also trapO.

Source

Table 7.2 of the reference below. Thanks to Dr Michel Dedual for a copy of the report and for help
reading the final years data. The report is available from TWY on request.

References

Dedual, M. and MacLean, G. and Rowe, D. and Cudby, E., The Trout Population and Fishery
of Lake OtamangakauInterim Report. National Institute of Water and Atmospheric Research,
Hamilton, New Zealand. Consultancy Report Project No. ELE70207, (Dec 1996).

Examples
data(lakeO)
lakeO
summary(lakeO)

lambertW The Lambert W function

Description

Computes the Lambert W function for real values.

Usage

lambertW(x, tolerance = 1e-10, maxit = 50)

Arguments

x A vector of reals.
tolerance Accuracy desired.
maxit Maximum number of iterations of third-order Halleys method.
laplace 385

Details
The Lambert W function is the root of the equation W (z) exp(W (z)) = z for complex z. It is
multi-valued if z is real and z < 1/e. For real 1/e z < 0 it has two possible real values, and
currently only the upper branch is computed.

Value
This function returns the principal branch of the W function for real z. It returns W (z) 1, and
NA for z < 1/e.

Note
If convergence does not occur then increase the value of maxit and/or tolerance.
Yet to do: add an argument lbranch = TRUE to return the lower branch for real 1/e z < 0;
this would give W (z) 1.

Author(s)
T. W. Yee

References
Corless, R. M. and Gonnet, G. H. and Hare, D. E. G. and Jeffrey, D. J. and Knuth, D. E. (1996) On
the Lambert W function. Advances in Computational Mathematics, 5(4), 329359.

laplace Laplace Distribution

Description
Maximum likelihood estimation of the 2-parameter classical Laplace distribution.

Usage
laplace(llocation = "identitylink", lscale = "loge",
ilocation = NULL, iscale = NULL, imethod = 1, zero = "scale")
386 laplace

Arguments
llocation, lscale
Character. Parameter link functions for location parameter a and scale parameter
b. See Links for more choices.
ilocation, iscale
Optional initial values. If given, it must be numeric and values are recycled to
the appropriate length. The default is to choose the value internally.
imethod Initialization method. Either the value 1 or 2.
zero See CommonVGAMffArguments for information.

where < y < , < a < and b > 0. Its mean is a and its variance is 2b2 . This
parameterization is called the classical Laplace distribution by Kotz et al. (2001), and the density
is symmetric about a.
For y ~ 1 (where y is the response) the maximum likelihood estimate (MLE) for the location
parameter is the sample median, and the MLE for b is mean(abs(y-location)) (replace location
by its MLE if unknown).

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm and vgam.

Warning
This family function has not been fully tested. The MLE regularity conditions do not hold for this
distribution, therefore misleading inferences may result, e.g., in the summary and vcov of the object.

Note
This family function uses Fisher scoring. Convergence may be slow for non-intercept-only models;
half-stepping is frequently required.

Author(s)
T. W. Yee

See Also

rlaplace, alaplace2 (which differs slightly from this parameterization), exponential, median.

Examples
ldata <- data.frame(y = rlaplace(nn <- 100, loc = 2, scale = exp(1)))
fit <- vglm(y ~ 1, laplace, data = ldata, trace = TRUE, crit = "l")
coef(fit, matrix = TRUE)
Coef(fit)
with(ldata, median(y))

ldata <- data.frame(x = runif(nn <- 1001))

ldata <- transform(ldata, y = rlaplace(nn, loc = 2, scale = exp(-1 + 1*x)))
coef(vglm(y ~ x, laplace(iloc = 0.2, imethod = 2, zero = 1), data = ldata,
trace = TRUE), matrix = TRUE)

laplaceUC The Laplace Distribution

Description

Density, distribution function, quantile function and random generation for the Laplace distribution
with location parameter location and scale parameter scale.

Usage

dlaplace(x, location = 0, scale = 1, log = FALSE)

plaplace(q, location = 0, scale = 1, lower.tail = TRUE, log.p = FALSE)
qlaplace(p, location = 0, scale = 1, lower.tail = TRUE, log.p = FALSE)
rlaplace(n, location = 0, scale = 1)

Arguments

x, q vector of quantiles.
p vector of probabilities.
n number of observations. Same as in runif.
location the location parameter a, which is the mean.
scale the scale parameter b. Must consist of positive values.
log Logical. If log = TRUE then the logarithm of the density is returned.
lower.tail, log.p
Same meaning as in pnorm or qnorm.
388 laplaceUC

Details
The Laplace distribution is often known as the double-exponential distribution and, for modelling,
has heavier tail than the normal distribution. The Laplace density function is

1 |y a|
f (y) = exp
2b b
where < y < , < a < and b > 0. The mean is a and the variance is 2b2 .
See laplace, the VGAM family function for estimating the two parameters by maximum likelihood
estimation, for formulae and details. Apart from n, all the above arguments may be vectors and are
recyled to the appropriate length if necessary.

Value
dlaplace gives the density, plaplace gives the distribution function, qlaplace gives the quantile
function, and rlaplace generates random deviates.

Author(s)
T. W. Yee and Kai Huang

References
Forbes, C., Evans, M., Hastings, N. and Peacock, B. (2011) Statistical Distributions, Hoboken, NJ,
USA: John Wiley and Sons, Fourth edition.

## Not run: loc <- 0; b <- 1.5; x <- seq(-5, 5, by = 0.01)

plot(x, dlaplace(x, loc, b), type = "l", col = "blue", ylim = c(0,1),
main = "Blue is density, orange is cumulative distribution function",
sub = "Purple are 5,10,...,95 percentiles", las = 1, ylab = "")
abline(h = 0, col = "blue", lty = 2)
lines(qlaplace(seq(0.05,0.95,by = 0.05), loc, b),
dlaplace(qlaplace(seq(0.05, 0.95, by = 0.05), loc, b), loc, b),
col = "purple", lty = 3, type = "h")
lines(x, plaplace(x, loc, b), type = "l", col = "orange")
abline(h = 0, lty = 2)
## End(Not run)

plaplace(qlaplace(seq(0.05, 0.95, by = 0.05), loc, b), loc, b)

latvar 389

latvar Latent Variables

Description
Generic function for the latent variables of a model.

Usage
latvar(object, ...)
lv(object, ...)

Arguments
object An object for which the extraction of latent variables is meaningful.
... Other arguments fed into the specific methods function of the model. Sometimes
they are fed into the methods function for Coef.

Details
Latent variables occur in reduced-rank regression models, as well as in quadratic and additive ordi-
nation models. For the latter two, latent variable values are often called site scores by ecologists.
Latent variables are linear combinations of the explanatory variables.

Value
The value returned depends specifically on the methods function invoked.

Warning
latvar and lv are identical, but the latter will be deprecated soon.
Latent variables are not really applicable to vglm/vgam models.

Author(s)
Thomas W. Yee

See Also
latvar.qrrvglm, latvar.rrvglm, latvar.cao, lvplot.

Examples
## Not run:
hspider[, 1:6] <- scale(hspider[, 1:6]) # Standardized environmental vars
set.seed(123)
p1 <- cao(cbind(Pardlugu, Pardmont, Pardnigr, Pardpull, Zoraspin) ~
WaterCon + BareSand + FallTwig + CoveMoss + CoveHerb + ReflLux,
family = poissonff, data = hspider, Rank = 1, df1.nl =
c(Zoraspin = 2.5, 3), Bestof = 3, Crow1positive = TRUE)

var(latvar(p1)) # Scaled to unit variance # Scaled to unit variance

c(latvar(p1)) # Estimated site scores

## End(Not run)

leipnik Leipnik Distribution Family Function

Description
Estimates the two parameters of a (transformed) Leipnik distribution by maximum likelihood esti-
mation.

Usage
leipnik(lmu = "logit", llambda = logoff(offset = 1),
imu = NULL, ilambda = NULL)

Arguments
lmu, llambda Link function for the and parameters. See Links for more choices.
imu, ilambda Numeric. Optional initial values for and .

Details
The (transformed) Leipnik distribution has density function
1
{y(1 y)} 2 (y )2 2
f (y; , ) = 1+
Beta( +1 1
2 , 2)
y(1 y)

where 0 < y < 1 and > 1. The mean is (returned as the fitted values) and the variance is
1/.
Jorgensen (1997) calls the above the transformed Leipnik distribution, and if y = (x + 1)/2
and = ( + 1)/2, then the distribution of X as a function of x and is known as the the
(untransformed) Leipnik distribution. Here, both x and are in (1, 1).
lerch 391

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, rrvglm and vgam.

Note
Convergence may be slow or fail. Until better initial value estimates are forthcoming try assigning
the argument ilambda some numerical value if it fails to converge. Currently, Newton-Raphson
is implemented, not Fisher scoring. Currently, this family function probably only really works for
intercept-only models, i.e., y ~ 1 in the formula.

Author(s)
T. W. Yee

References
Jorgensen, B. (1997) The Theory of Dispersion Models. London: Chapman & Hall
Johnson, N. L. and Kotz, S. and Balakrishnan, N. (1995) Continuous Univariate Distributions, 2nd
edition, Volume 2, New York: Wiley. (pages 612617).

sum(weights(fit)) # Sum of the prior weights

sum(weights(fit, type = "work")) # Sum of the working weights

lerch Lerch Phi Function

Description
Computes the Lerch transcendental Phi function.

Usage
lerch(x, s, v, tolerance = 1.0e-10, iter = 100)
392 lerch

Arguments
x, s, v Numeric. This function recyles values of x, s, and v if necessary.
tolerance Numeric. Accuracy required, must be positive and less than 0.01.
iter Maximum number of iterations allowed to obtain convergence. If iter is too
small then a result of NA may occur; if so, try increasing its value.

Details
The Lerch transcendental function is defined by

X xn
(x, s, v) =
n=0
(n + v)s

where |x| < 1 and v 6= 0, 1, 2, . . .. Actually, x may be complex but this function only works for
real x. The algorithm used is based on the relation
m1
X xn
(x, s, v) = xm (x, s, v + m) + .
n=0
(n + v)s

See the URL below for more information. This function is a wrapper function for the C code
described below.

Value
Returns the value of the function evaluated at the values of x, s, v. If the above ranges of x and v are
not satisfied, or some numeric problems occur, then this function will return a NA for those values.

Warning
This function has not been thoroughly tested and contains bugs, for example, the zeta function can-
not be computed with this function even though (s) = (x = 1, s, v = 1). There are many sources
of problems such as lack of convergence, overflow and underflow, especially near singularities. If
any problems occur then a NA will be returned.

Note
There are a number of special cases, e.g., the Riemann zeta-function is given by (s) = (x =
1, s, v = 1). The special case of s = 1 corresponds to the hypergeometric 2F1, and this is imple-
mented in the gsl package. The Lerch transcendental Phi function should not be confused with the
Lerch zeta function though they are quite similar.

Author(s)
S. V. Aksenov and U. D. Jentschura wrote the C code. The R wrapper function was written by T.
W. Yee.

References
Originally the code was found at http://aksenov.freeshell.org/lerchphi/source/lerchphi.c.
Bateman, H. (1953) Higher Transcendental Functions. Volume 1. McGraw-Hill, NY, USA.
leukemia 393

s <- rnorm(n = 100)

max(abs(zeta(s) - lerch(x = 1, s = s, v = 1))) # This fails (a bug); should be 0

## End(Not run)

leukemia Acute Myelogenous Leukemia Survival Data

Description
Survival in patients with Acute Myelogenous Leukemia

Usage

data(leukemia)

Format

time: survival or censoring time

status: censoring status
x: maintenance chemotherapy given? (factor)

Note
This data set has been transferred from survival and renamed from aml to leukemia.

Source
Rupert G. Miller (1997), Survival Analysis. John Wiley & Sons. ISBN: 0-471-25218-2.
394 levy

levy Levy Distribution Family Function

Description
Estimates the scale parameter of the Levy distribution by maximum likelihood estimation.

Usage
levy(location = 0, lscale = "loge", iscale = NULL)

Arguments
location Location parameter. Must have a known value. Called a below.
lscale Parameter link function for the (positive) scale parameter b. See Links for more
choices.
iscale Initial value for the b parameter. By default, an initial value is chosen internally.

Details
The Levy distribution is one of three stable distributions whose density function has a tractable
form. The formula for the density is
r
b b
f (y; b) = exp /(y a)3/2
2 2(y a)

where a < y < and b > 0. Note that if a is very close to min(y) (where y is the response), then
numerical problem will occur. The mean does not exist. The median is returned as the fitted values.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, and vgam.

Author(s)
T. W. Yee

References
Nolan, J. P. (2005) Stable Distributions: Models for Heavy Tailed Data.

See Also
The Nolan article was at http://academic2.american.edu/~jpnolan/stable/chap1.pdf.
lgamma1 395

Examples
nn <- 1000; loc1 <- 0; loc2 <- 10
myscale <- 1 # log link ==> 0 is the answer
ldata <- data.frame(y1 = loc1 + myscale/rnorm(nn)^2, # Levy(myscale, a)
y2 = rlevy(nn, loc = loc2, scale = exp(+2)))

# Cf. Table 1.1 of Nolan for Levy(1,0)

with(ldata, sum(y1 > 1) / length(y1)) # Should be 0.6827
with(ldata, sum(y1 > 2) / length(y1)) # Should be 0.5205

fit1 <- vglm(y1 ~ 1, levy(location = loc1), data = ldata, trace = TRUE)

coef(fit1, matrix = TRUE)
Coef(fit1)
summary(fit1)
head(weights(fit1, type = "work"))

fit2 <- vglm(y2 ~ 1, levy(location = loc2), data = ldata, trace = TRUE)

coef(fit2, matrix = TRUE)
Coef(fit2)
c(median = with(ldata, median(y2)), fitted.median = head(fitted(fit2), 1))

lgamma1 Log-gamma Distribution Family Function

Description
Estimation of the parameter of the standard and nonstandard log-gamma distribution.

Usage
lgamma1(lshape = "loge", ishape = NULL)
lgamma3(llocation = "identitylink", lscale = "loge", lshape = "loge",
ilocation = NULL, iscale = NULL, ishape = 1,
zero = c("scale", "shape"))

Arguments
llocation, lscale
Parameter link function applied to the location parameter a and the positive scale
parameter b. See Links for more choices.
lshape Parameter link function applied to the positive shape parameter k. See Links
for more choices.
ishape Initial value for k. If given, it must be positive. If failure to converge occurs, try
some other value. The default means an initial value is determined internally.
ilocation, iscale
Initial value for a and b. The defaults mean an initial value is determined inter-
nally for each.
396 lgamma1

zero An integer-valued vector specifying which linear/additive predictors are mod-

elled as intercepts only. The values must be from the set {1,2,3}. The default
value means none are modelled as intercept-only terms. See CommonVGAMffArguments
for more information.

Details
The probability density function of the standard log-gamma distribution is given by

f (y; k) = exp[ky exp(y)]/(k),

for parameter k > 0 and all real y. The mean of Y is digamma(k) (returned as the fitted values)
and its variance is trigamma(k).
For the non-standard log-gamma distribution, one replaces y by (y a)/b, where a is the location
parameter and b is the positive scale parameter. Then the density function is

f (y) = exp[k(y a)/b exp((y a)/b)]/(b (k)).

The mean and variance of Y are a + b*digamma(k) (returned as the fitted values) and b^2 * trigamma(k),
respectively.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, and vgam.

Note
The standard log-gamma distribution can be viewed as a generalization of the standard type 1 ex-
treme value density: when k = 1 the distribution of Y is the standard type 1 extreme value
distribution.
The standard log-gamma distribution is fitted with lgamma1 and the non-standard (3-parameter)
log-gamma distribution is fitted with lgamma3.

Author(s)
T. W. Yee

References
Kotz, S. and Nadarajah, S. (2000) Extreme Value Distributions: Theory and Applications, pages
4849, London: Imperial College Press.
Johnson, N. L. and Kotz, S. and Balakrishnan, N. (1995) Continuous Univariate Distributions, 2nd
edition, Volume 2, p.89, New York: Wiley.

See Also
rlgamma, gengamma.stacy, prentice74, gamma1, lgamma.
lgammaUC 397

Examples
ldata <- data.frame(y = rlgamma(100, shape = exp(1)))
fit <- vglm(y ~ 1, lgamma1, data = ldata, trace = TRUE, crit = "coef")
summary(fit)
coef(fit, matrix = TRUE)
Coef(fit)

ldata <- data.frame(x2 = runif(nn <- 5000)) # Another example

ldata <- transform(ldata, loc = -1 + 2 * x2, Scale = exp(1))
ldata <- transform(ldata, y = rlgamma(nn, loc, scale = Scale, shape = exp(0)))
fit2 <- vglm(y ~ x2, lgamma3, data = ldata, trace = TRUE, crit = "c")
coef(fit2, matrix = TRUE)

lgammaUC The Log-Gamma Distribution

Description
Density, distribution function, quantile function and random generation for the log-gamma distri-
bution with location parameter location, scale parameter scale and shape parameter k.

Usage
dlgamma(x, location = 0, scale = 1, shape = 1, log = FALSE)
plgamma(q, location = 0, scale = 1, shape = 1,
lower.tail = TRUE, log.p = FALSE)
qlgamma(p, location = 0, scale = 1, shape = 1,
lower.tail = TRUE, log.p = FALSE)
rlgamma(n, location = 0, scale = 1, shape = 1)

Arguments
x, q vector of quantiles.
p vector of probabilities.
n number of observations. Same as runif.
location the location parameter a.
scale the (positive) scale parameter b.
shape the (positive) shape parameter k.
log Logical. If log = TRUE then the logarithm of the density is returned.
lower.tail, log.p
Same meaning as in pnorm or qnorm.

Details
See lgamma1, the VGAM family function for estimating the one parameter standard log-gamma
distribution by maximum likelihood estimation, for formulae and other details. Apart from n, all
the above arguments may be vectors and are recyled to the appropriate length if necessary.
398 Lindley

Value
dlgamma gives the density, plgamma gives the distribution function, qlgamma gives the quantile
function, and rlgamma generates random deviates.

Note
The VGAM family function lgamma3 is for the three parameter (nonstandard) log-gamma distribu-
tion.

Author(s)
T. W. Yee and Kai Huang

References
Kotz, S. and Nadarajah, S. (2000) Extreme Value Distributions: Theory and Applications, pages
4849, London: Imperial College Press.

Lindley The Lindley Distribution

Description
Density, cumulative distribution function, and random generation for the Lindley distribution.
Lindley 399

Usage

dlind(x, theta, log = FALSE)

plind(q, theta, lower.tail = TRUE, log.p = FALSE)
rlind(n, theta)

Arguments

x, q vector of quantiles.
n number of observations. Same as in runif.
log Logical. If log = TRUE then the logarithm of the density is returned.
theta positive parameter.
lower.tail, log.p
Same meaning as in pnorm or qnorm.

Details

See lindley for details.

Value

dlind gives the density, plind gives the cumulative distribution function, and rlind generates
random deviates.

Author(s)

T. W. Yee and Kai Huang

See Also

lindley.

Examples

theta <- exp(-1); x <- seq(0.0, 17, length = 700)

dlind(0:10, theta)
## Not run:
plot(x, dlind(x, theta), type = "l", las = 1, col = "blue",
main = "dlind(x, theta = exp(-1))")
abline(h = 1, col = "grey", lty = "dashed")
## End(Not run)
400 lindley

lindley 1-parameter Gamma Distribution

Description
Estimates the (1-parameter) Lindley distribution by maximum likelihood estimation.

Usage
lindley(link = "loge", itheta = NULL, zero = NULL)

Arguments
link Link function applied to the (positive) parameter. See Links for more choices.
itheta, zero See CommonVGAMffArguments for information.

Details
The density function is given by

f (y; ) = 2 (1 + y) exp(y)/(1 + )

for > 0 and y > 0. The mean of Y (returned as the fitted values) is = ( + 2)/(( + 1)). The
variance is (2 + 4 + 2)/(( + 1))2 .

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm and vgam.

Note
This VGAM family function can handle multiple responses (inputted as a matrix). Fisher scoring
is implemented.

Author(s)
T. W. Yee

References
Lindley, D. V. (1958) Fiducial distributions and Bayes theorem. Journal of the Royal Statistical
Society, Series B, Methodological, 20, 102107.
Ghitany, M. E. and Atieh, B. and Nadarajah, S. (2008) Lindley distribution and its application.
Math. Comput. Simul., 78, 493506.
linkfun 401

See Also

dlind, gammaR, simulate.vlm.

Examples
ldata <- data.frame(y = rlind(n = 1000, theta = exp(3)))
fit <- vglm(y ~ 1, lindley, data = ldata, trace = TRUE, crit = "coef")
coef(fit, matrix = TRUE)
Coef(fit)
summary(fit)

linkfun Link Functions

Description

Generic function for returning the link functions of a fitted object.

Usage

linkfun(object, ...)

Arguments

object An object which has parameter link functions.

... Other arguments fed into the specific methods function of the model.

Details

Fitted models in the VGAM have parameter link functions. This generic function returns these.

Value

The value returned depends specifically on the methods function invoked.

Author(s)

Thomas W. Yee

See Also

linkfun.vglm, multilogit, vglm.

402 linkfun.vglm

Examples
pneumo <- transform(pneumo, let = log(exposure.time))
fit1 <- vglm(cbind(normal, mild, severe) ~ let, propodds, data = pneumo)
coef(fit1, matrix = TRUE)
linkfun(fit1)
linkfun(fit1, earg = TRUE)

fit2 <- vglm(cbind(normal, mild, severe) ~ let, multinomial, data = pneumo)

coef(fit2, matrix = TRUE)
linkfun(fit2)
linkfun(fit2, earg = TRUE)

linkfun.vglm Link Functions for VGLMs

Description
Returns the link functions, and parameter names, for vector generalized linear models (VGLMs).

Usage
linkfun.vglm(object, earg = FALSE, ...)

Arguments
object Object of class "vglm", i.e., a VGLM object.
earg Logical. Return the extra arguments associated with each link function? If TRUE
then a list is returned.
... Arguments that might be used in the future.

Details
All fitted VGLMs have a link function applied to each parameter. This function returns these, and
optionally, the extra arguments associated with them.

Value
Usually just a (named) character string, with the link functions in order. It is named with the
parameter names. If earg = TRUE then a list with the following components.
link The default output.
earg The extra arguments, in order.

Note
Presently, the multinomial logit model has only one link function, multilogit, so a warning is not
issued for that link. For other models, if the number of link functions does not equal M then a
warning may be issued.
Links 403

Author(s)
Thomas W. Yee

Links Link functions for VGLM/VGAM/etc. families

Description
The VGAM package provides a number of (parameter) link functions which are described in gen-
eral here. Collectively, they offer the user considerable choice and flexibility for modelling data.

Usage
TypicalVGAMlink(theta, someParameter = 0, bvalue = NULL, inverse = FALSE,
deriv = 0, short = TRUE, tag = FALSE)

Arguments
theta Numeric or character. This is usually (default) but can sometimes be , de-
pending on the other arguments. If theta is character then inverse and deriv
are ignored. The name theta should always be the name of the first argument.
someParameter Some parameter, e.g., an offset.
bvalue Boundary value, positive if given. If 0 < theta then values of theta which
are less than or equal to 0 can be replaced by bvalue before computing the
link function value. Values of theta which are greater than or equal to 1 can
be replaced by 1 minus bvalue before computing the link function value. The
value bvalue = .Machine$double.eps is sometimes a reasonable value, or
something slightly higher.
inverse Logical. If TRUE and deriv = 0 then the inverse link value is returned, hence
the argument theta is really . In all other cases, the argument theta is really
.
deriv Integer. Either 0, 1, or 2, specifying the order of the derivative.
short, tag Logical. These are used for labelling the blurb slot of a vglmff-class object.
These arguments are used only if theta is character, and gives the formula for
the link in character form. If tag = TRUE then the result is preceeded by a little
more information.
404 Links

Details
Almost all VGAM link functions have something similar to the argument list as given above. In this
help file we have = g() where g is the link function, is the parameter and is the linear/additive
predictor. The link g must be strictly monotonic and twice-differentiable in its range.
The following is a brief enumeration of all VGAM link functions.
For parameters lying between 0 and 1 (e.g., probabilities): logit, probit, cloglog, cauchit,
foldsqrt, logc, golf, polf, nbolf.
For positive parameters (i.e., greater than 0): loge, negloge, powerlink.
For parameters greater than 1: loglog.
For parameters between 1 and 1: fisherz, rhobit.
For parameters between A and B: extlogit, logoff (B = ).
For unrestricted parameters (i.e., any value): identity, negidentity, reciprocal, negreciprocal.

Value
Returns one of: the link function value or its first or second derivative, the inverse link or its first or
second derivative, or a character description of the link.
Here are the general details. If inverse = FALSE and deriv = 0 (default) then the ordinary link
function = g() is returned.
If inverse = TRUE and deriv = 0 then the inverse link function value is returned, hence theta is
really (the only occasion this happens).
If inverse = FALSE and deriv = 1 then it is d/d as a function of . If inverse = FALSE and
deriv = 2 then it is d2 /d2 as a function of .
If inverse = TRUE and deriv = 1 then it is d/d as a function of . If inverse = TRUE and
deriv = 2 then it is d2 /d 2 as a function of .
It is only when deriv = 1 that linkfun(theta, deriv = 1, inverse = TRUE) and linkfun(theta, deriv = 1, inverse
are reciprocals of each other. In particular, linkfun(theta, deriv = 2, inverse = TRUE) and
linkfun(theta, deriv = 2, inverse = FALSE) are not reciprocals of each other in general.

Warning
The output of link functions changed at VGAM 0.9-9 (date was around 2015-07). Formerly,
linkfun(theta, deriv = 1) is now linkfun(theta, deriv = 1, inverse = TRUE), or equiva-
lently, 1 / linkfun(theta, deriv = 1, inverse = TRUE). Also, formerly, linkfun(theta, deriv = 2)
was 1 / linkfun(theta, deriv = 2, inverse = TRUE). This was a bug. Altogether, these are
big changes and the user should beware!
One day in the future, all VGAM link functions may be renamed to end in the characters "link".

Note
VGAM link functions are generally not compatible with other functions outside the package. In
particular, they wont work with glm or any other package for fitting GAMs.
From October 2006 onwards, all VGAM family functions will only contain one default value for
each link argument rather than giving a vector of choices. For example, rather than binomialff(link = c("logit", "probi
Links 405

it is now binomialff(link = "logit", ...). No checking will be done to see if the users choice
is reasonable. This means that the user can write his/her own VGAM link function and use it within
any VGAM family function. Altogether this provides greater flexibility. The downside is that the
user must specify the full name of the link function, by either assigning the link argument the full
name as a character string, or just the name itself. See the examples below.
From August 2012 onwards, a major change in link functions occurred. Argument esigma (and the
like such as earg) used to be in VGAM prior to version 0.9-0 (released during the 2nd half of 2012).
The major change is that arguments such as offset that used to be passed in via those arguments can
done directly through the link function. For example, gev(lshape = "logoff", eshape = list(offset = 0.5))
is replaced by gev(lshape = logoff(offset = 0.5)). The @misc slot no longer has link and
earg components, but two other components replace these. Functions such as dtheta.deta(),
d2theta.deta2(), eta2theta(), theta2eta() are modified.

Author(s)
T. W. Yee

References
McCullagh, P. and Nelder, J. A. (1989) Generalized Linear Models, 2nd ed. London: Chapman &
Hall.

See Also
TypicalVGAMfamilyFunction, linkfun, vglm, vgam, rrvglm. cqo, cao.

Examples
logit("a")
logit("a", short = FALSE)
logit("a", short = FALSE, tag = TRUE)

logoff(1:5, offset = 1) # Same as log(1:5 + 1)

powerlink(1:5, power = 2) # Same as (1:5)^2

## Not run: # This is old and no longer works:

logoff(1:5, earg = list(offset = 1))
powerlink(1:5, earg = list(power = 2))

## End(Not run)

fit1 <- vgam(agaaus ~ altitude, binomialff(link = "cloglog"), hunua) # best

fit2 <- vgam(agaaus ~ altitude, binomialff(link = cloglog ), hunua) # okay

## Not run:
# This no longer works since "clog" is not a valid VGAM link function:
fit3 <- vgam(agaaus ~ altitude, binomialff(link = "clog"), hunua) # not okay

# No matter what the link, the estimated var-cov matrix is the same
y <- rbeta(n = 1000, shape1 = exp(0), shape2 = exp(1))
406 Lino

fit1 <- vglm(y ~ 1, betaR(lshape1 = "identitylink", lshape2 = "identitylink"),

trace = TRUE, crit = "coef")
fit2 <- vglm(y ~ 1, betaR(lshape1 = logoff(offset = 1.1),
lshape2 = logoff(offset = 1.1)), trace = TRUE)
vcov(fit1, untransform = TRUE)
vcov(fit1, untransform = TRUE) - vcov(fit2, untransform = TRUE) # Should be all 0s
\dontrun{ # This is old:
fit1@misc$earg # Some 'special' parameters
fit2@misc$earg # Some 'special' parameters are here
}

par(mfrow = c(2, 2))

p <- seq(0.05, 0.95, len = 200) # A rather restricted range
x <- seq(-4, 4, len = 200)
plot(p, logit(p), type = "l", col = "blue")
plot(x, logit(x, inverse = TRUE), type = "l", col = "blue")
plot(p, logit(p, deriv = 1), type = "l", col = "blue") # 1 / (p*(1-p))
plot(p, logit(p, deriv = 2), type = "l", col = "blue") # (2*p-1)/(p*(1-p))^2

## End(Not run)

Lino The Generalized Beta Distribution (Libby and Novick, 1982)

Description
Density, distribution function, quantile function and random generation for the generalized beta
distribution, as proposed by Libby and Novick (1982).

Usage
dlino(x, shape1, shape2, lambda = 1, log = FALSE)
plino(q, shape1, shape2, lambda = 1, lower.tail = TRUE, log.p = FALSE)
qlino(p, shape1, shape2, lambda = 1, lower.tail = TRUE, log.p = FALSE)
rlino(n, shape1, shape2, lambda = 1)

Arguments
x, q vector of quantiles.
p vector of probabilities.
n number of observations. Same as in runif.
shape1, shape2, lambda
see lino.
log Logical. If log = TRUE then the logarithm of the density is returned.
lower.tail, log.p
Same meaning as in pnorm or qnorm.
lino 407

Details
See lino, the VGAM family function for estimating the parameters, for the formula of the proba-
bility density function and other details.

Value
dlino gives the density, plino gives the distribution function, qlino gives the quantile function,
and rlino generates random deviates.

Author(s)
T. W. Yee and Kai Huang

lino Generalized Beta Distribution Family Function

Description
Maximum likelihood estimation of the 3-parameter generalized beta distribution as proposed by
Libby and Novick (1982).

Usage
lino(lshape1 = "loge", lshape2 = "loge", llambda = "loge",
ishape1 = NULL, ishape2 = NULL, ilambda = 1, zero = NULL)
408 lino

Arguments
lshape1, lshape2
Parameter link functions applied to the two (positive) shape parameters a and b.
See Links for more choices.
llambda Parameter link function applied to the parameter . See Links for more choices.
ishape1, ishape2, ilambda
Initial values for the parameters. A NULL value means one is computed inter-
nally. The argument ilambda must be numeric, and the default corresponds to a
standard beta distribution.
zero Can be an integer-valued vector specifying which linear/additive predictors are
modelled as intercepts only. Here, the values must be from the set {1,2,3} which
correspond to a, b, , respectively. See CommonVGAMffArguments for more in-
formation.

Details
Proposed by Libby and Novick (1982), this distribution has density

a y a1 (1 y)b1
f (y; a, b, ) =
B(a, b){1 (1 )y}a+b
for a > 0, b > 0, > 0, 0 < y < 1. Here B is the beta function (see beta). The mean is a
complicated function involving the Gauss hypergeometric function. If X has a lino distribution
with parameters shape1, shape2, lambda, then Y = X/(1 (1 )X) has a standard beta
distribution with parameters shape1, shape2.
Since log() = 0 corresponds to the standard beta distribution, a summary of the fitted model
performs a t-test for whether the data belongs to a standard beta distribution (provided the loge link
for is used; this is the default).

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, and vgam.

Note
The fitted values, which is usually the mean, have not been implemented yet. Currently the median
is returned as the fitted values.
Although Fisher scoring is used, the working weight matrices are positive-definite only in a certain
region of the parameter space. Problems with this indicate poor initial values or an ill-conditioned
model or insufficient data etc.
This model is can be difficult to fit. A reasonably good value of ilambda seems to be needed so if
the self-starting initial values fail, try experimenting with the initial value arguments. Experience
suggests ilambda is better a little larger, rather than smaller, compared to the true value.

Author(s)
T. W. Yee
lirat 409

References
Libby, D. L. and Novick, M. R. (1982) Multivariate generalized beta distributions with applications
to utility assessment. Journal of Educational Statistics, 7, 271294.
Gupta, A. K. and Nadarajah, S. (2004) Handbook of Beta Distribution and Its Applications, NY:
Marcel Dekker, Inc.

# Nonstandard beta distribution

ldata <- transform(ldata, y2 = rlino(n = 1000, shape1 = exp(1),
shape2 = exp(2), lambda = exp(1)))
fit2 <- vglm(y2 ~ 1, lino(lshape1 = "identitylink", lshape2 = "identitylink",
ilamb = 10), data = ldata, trace = TRUE)
coef(fit2, matrix = TRUE)

lirat Low-iron Rat Teratology Data

Description
Low-iron rat teratology data.

Usage
data(lirat)

Format
A data frame with 58 observations on the following 4 variables.

N Litter size.
R Number of dead fetuses.
hb Hemoglobin level.
grp Group number. Group 1 is the untreated (low-iron) group, group 2 received injections on day
7 or day 10 only, group 3 received injections on days 0 and 7, and group 4 received injections
weekly.
410 lms.bcg

Details
The following description comes from Moore and Tsiatis (1991). The data comes from the experi-
mental setup from Shepard et al. (1980), which is typical of studies of the effects of chemical agents
or dietary regimens on fetal development in laboratory rats.
Female rats were put in iron-deficient diets and divided into 4 groups. One group of controls was
given weekly injections of iron supplement to bring their iron intake to normal levels, while another
group was given only placebo injections. Two other groups were given fewer iron-supplement
injections than the controls. The rats were made pregnant, sacrificed 3 weeks later, and the total
number of fetuses and the number of dead fetuses in each litter were counted.
For each litter the number of dead fetuses may be considered to be Binomial(N, p) where N is the
litter size and p is the probability of a fetus dying. The parameter p is expected to vary from litter to
litter, therefore the total variance of the proportions will be greater than that predicted by a binomial
model, even when the covariates for hemoglobin level and experimental group are accounted for.

Source
Moore, D. F. and Tsiatis, A. (1991) Robust Estimation of the Variance in Moment Methods for
Extra-binomial and Extra-Poisson Variation. Biometrics, 47, 383401.

References
Shepard, T. H., Mackler, B. and Finch, C. A. (1980) Reproductive studies in the iron-deficient rat.
Teratology, 22, 329334.

Examples
## Not run:
# cf. Figure 3 of Moore and Tsiatis (1991)
plot(R / N ~ hb, data = lirat, pch = as.character(grp), col = grp,
las = 1, xlab = "Hemoglobin level", ylab = "Proportion Dead")
## End(Not run)

lms.bcg LMS Quantile Regression with a Box-Cox transformation to a Gamma

Distribution

Description
LMS quantile regression with the Box-Cox transformation to the gamma distribution.

Usage
lms.bcg(percentiles = c(25, 50, 75), zero = c("lambda", "sigma"),
llambda = "identitylink", lmu = "identitylink", lsigma = "loge",
idf.mu = 4, idf.sigma = 2, ilambda = 1, isigma = NULL)
lms.bcg 411

Details
Given a value of the covariate, this function applies a Box-Cox transformation to the response to
best obtain a gamma distribution. The parameters chosen to do this are estimated by maximum
likelihood or penalized maximum likelihood. Similar details can be found at lms.bcn.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, rrvglm and vgam.

Warning
This VGAM family function comes with the same warnings as lms.bcn. Also, the expected value
of the second derivative with respect to lambda may be incorrect (my calculations do not agree with
the Lopatatzidis and Green manuscript.)

Note
Similar notes can be found at lms.bcn.

Author(s)
Thomas W. Yee

References
Lopatatzidis A. and Green, P. J. (unpublished manuscript) Semiparametric quantile regression using
the gamma distribution.
Yee, T. W. (2004) Quantile regression via vector generalized additive models. Statistics in Medicine,
23, 22952315.

See Also
lms.bcn, lms.yjn, qtplot.lmscreg, deplot.lmscreg, cdf.lmscreg, bmi.nz, amlexponential.
412 lms.bcn

Examples
# This converges, but deplot(fit) and qtplot(fit) do not work
fit0 <- vglm(BMI ~ sm.bs(age, df = 4), lms.bcg, data = bmi.nz, trace = TRUE)
coef(fit0, matrix = TRUE)
## Not run:
par(mfrow = c(1, 1))
plotvgam(fit0, se = TRUE) # Plot mu function (only)

## End(Not run)

# Use a trick: fit0 is used for initial values for fit1.

fit1 <- vgam(BMI ~ s(age, df = c(4, 2)), etastart = predict(fit0),
lms.bcg(zero = 1), bmi.nz, trace = TRUE)

# Difficult to get a model that converges.

# Here, we prematurely stop iterations because it fails near the solution.
fit2 <- vgam(BMI ~ s(age, df = c(4, 2)), maxit = 4,
lms.bcg(zero = 1, ilam = 3), bmi.nz, trace = TRUE)
summary(fit1)
head(predict(fit1))
head(fitted(fit1))
head(bmi.nz)
# Person 1 is near the lower quartile of BMI amongst people his age
head(cdf(fit1))

## Not run:
# Quantile plot
par(bty = "l", mar=c(5, 4, 4, 3) + 0.1, xpd = TRUE)
qtplot(fit1, percentiles=c(5, 50, 90, 99), main = "Quantiles",
xlim = c(15, 90), las = 1, ylab = "BMI", lwd = 2, lcol = 4)

# Density plot
ygrid <- seq(15, 43, len = 100) # BMI ranges
par(mfrow = c(1, 1), lwd = 2)
(aa <- deplot(fit1, x0 = 20, y = ygrid, xlab = "BMI", col = "black",
main = "Density functions at Age = 20 (black), 42 (red) and 55 (blue)"))
aa <- deplot(fit1, x0=42, y=ygrid, add=TRUE, llty=2, col="red")
aa <- deplot(fit1, x0=55, y=ygrid, add=TRUE, llty=4, col="blue", Attach=TRUE)
aa@post$deplot # Contains density function values

## End(Not run)

lms.bcn LMS Quantile Regression with a Box-Cox Transformation to Normal-

ity

Description
LMS quantile regression with the Box-Cox transformation to normality.
lms.bcn 413

Usage
lms.bcn(percentiles = c(25, 50, 75), zero = c("lambda", "sigma"),
llambda = "identitylink", lmu = "identitylink", lsigma = "loge",
idf.mu = 4, idf.sigma = 2, ilambda = 1,
isigma = NULL, tol0 = 0.001)

Arguments
percentiles A numerical vector containing values between 0 and 100, which are the quan-
tiles. They will be returned as fitted values.
zero Can be an integer-valued vector specifying which linear/additive predictors are
modelled as intercepts only. The values must be from the set {1,2,3}. The
default value usually increases the chance of successful convergence. Setting
zero = NULL means they all are functions of the covariates. For more informa-
tion see CommonVGAMffArguments.
llambda, lmu, lsigma
Parameter link functions applied to the first, second and third linear/additive
predictors. See Links for more choices, and CommonVGAMffArguments.
idf.mu Degrees of freedom for the cubic smoothing spline fit applied to get an initial
estimate of mu. See vsmooth.spline.
idf.sigma Degrees of freedom for the cubic smoothing spline fit applied to get an initial
estimate of sigma. See vsmooth.spline. This argument may be assigned NULL
to get an initial value using some other algorithm.
ilambda Initial value for lambda. If necessary, it is recycled to be a vector of length n
where n is the number of (independent) observations.
isigma Optional initial value for sigma. If necessary, it is recycled to be a vector of
length n. The default value, NULL, means an initial value is computed in the
@initialize slot of the family function.
tol0 Small positive number, the tolerance for testing if lambda is equal to zero.

Details
Given a value of the covariate, this function applies a Box-Cox transformation to the response to
best obtain normality. The parameters chosen to do this are estimated by maximum likelihood or
penalized maximum likelihood.
In more detail, the basic idea behind this method is that, for a fixed value of x, a Box-Cox transfor-
mation of the response Y is applied to obtain standard normality. The 3 parameters (, , , which
start with the letters L-M-S respectively, hence its name) are chosen to maximize a penalized
log-likelihood (with vgam). Then the appropriate quantiles of the standard normal distribution are
back-transformed onto the original scale to get the desired quantiles. The three parameters may
vary as a smooth function of x.
The Box-Cox power transformation here of the Y , given x, is

Z = [(Y /(x))(x) 1]/((x) (x))

for (x) 6= 0. (The singularity at (x) = 0 is handled by a simple function involving a logarithm.)
Then Z is assumed to have a standard normal distribution. The parameter (x) must be positive,
414 lms.bcn

therefore VGAM chooses (x)T = ((x), (x), log((x))) by default. The parameter is also
positive, but while log() is available, it is not the default because is more directly interpretable.
Given the estimated linear/additive predictors, the 100 percentile can be estimated by inverting the
Box-Cox power transformation at the 100 percentile of the standard normal distribution.
Of the three functions, it is often a good idea to allow (x) to be more flexible because the functions
(x) and (x) usually vary more smoothly with x. This is somewhat reflected in the default value
for the argument zero, viz. zero = c(1, 3).

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, rrvglm and vgam.

Warning
The computations are not simple, therefore convergence may fail. Set trace = TRUE to monitor
convergence if it isnt set already. Convergence failure will occur if, e.g., the response is bimodal
at any particular value of x. In case of convergence failure, try different starting values. Also, the
estimate may diverge quickly near the solution, in which case try prematurely stopping the iterations
by assigning maxits to be the iteration number corresponding to the highest likelihood value.
One trick is to fit a simple model and use it to provide initial values for a more complex model; see
in the examples below.

Note
The response must be positive because the Box-Cox transformation cannot handle negative values.
The LMS-Yeo-Johnson-normal method can handle both positive and negative values.
In general, the lambda and sigma functions should be more smoother than the mean function. Hav-
ing zero = 1, zero = 3 or zero = c(1, 3) is often a good idea. See the example below.

Author(s)
Thomas W. Yee

References
Cole, T. J. and Green, P. J. (1992) Smoothing Reference Centile Curves: The LMS Method and
Penalized Likelihood. Statistics in Medicine, 11, 13051319.
Green, P. J. and Silverman, B. W. (1994) Nonparametric Regression and Generalized Linear Mod-
els: A Roughness Penalty Approach, London: Chapman & Hall.
Yee, T. W. (2004) Quantile regression via vector generalized additive models. Statistics in Medicine,
23, 22952315.

See Also
lms.bcg, lms.yjn, qtplot.lmscreg, deplot.lmscreg, cdf.lmscreg, alaplace1, amlnormal,
denorm, CommonVGAMffArguments.
lms.yjn 415

Examples
## Not run: require("VGAMdata")
mysubset <- subset(xs.nz, sex == "M" & ethnicity == "Maori" & study1)
mysubset <- transform(mysubset, BMI = weight / height^2)
BMIdata <- na.omit(mysubset)
BMIdata <- subset(BMIdata, BMI < 80 & age < 65,
select = c(age, BMI)) # Delete an outlier
summary(BMIdata)

fit <- vgam(BMI ~ s(age, df = c(4, 2)), lms.bcn(zero = 1), data = BMIdata)

par(mfrow = c(1, 2))

plot(fit, scol = "blue", se = TRUE) # The two centered smooths

head(predict(fit))
head(fitted(fit))
head(BMIdata)
head(cdf(fit)) # Person 46 is probably overweight, given his age
100 * colMeans(depvar(fit, drop = TRUE) < fitted(fit)) # Empirical proportions

# Convergence problems? Try this trick: fit0 is a simpler model used for fit1
fit0 <- vgam(BMI ~ s(age, df = 4), lms.bcn(zero = c(1, 3)), data = BMIdata)
fit1 <- vgam(BMI ~ s(age, df = c(4, 2)), lms.bcn(zero = 1), data = BMIdata,
etastart = predict(fit0))

## End(Not run)

## Not run:
# Quantile plot
par(bty = "l", mar = c(5, 4, 4, 3) + 0.1, xpd = TRUE)
qtplot(fit, percentiles = c(5, 50, 90, 99), main = "Quantiles",
xlim = c(15, 66), las = 1, ylab = "BMI", lwd = 2, lcol = 4)

# Density plot
ygrid <- seq(15, 43, len = 100) # BMI ranges
par(mfrow = c(1, 1), lwd = 2)
(aa <- deplot(fit, x0 = 20, y = ygrid, xlab = "BMI", col = "black",
main = "Density functions at Age = 20 (black), 42 (red) and 55 (blue)"))
aa <- deplot(fit, x0 = 42, y = ygrid, add = TRUE, llty = 2, col = "red")
aa <- deplot(fit, x0 = 55, y = ygrid, add = TRUE, llty = 4, col = "blue",
Attach = TRUE)
aa@post$deplot # Contains density function values

## End(Not run)

lms.yjn LMS Quantile Regression with a Yeo-Johnson Transformation to Nor-

mality
416 lms.yjn

Description
LMS quantile regression with the Yeo-Johnson transformation to normality.

Usage
lms.yjn(percentiles = c(25, 50, 75), zero = c("lambda", "sigma"),
llambda = "identitylink", lsigma = "loge",
idf.mu = 4, idf.sigma = 2,
ilambda = 1, isigma = NULL, rule = c(10, 5),
yoffset = NULL, diagW = FALSE, iters.diagW = 6)
lms.yjn2(percentiles=c(25,50,75), zero = c("lambda", "sigma"),
llambda = "identitylink", lmu = "identitylink", lsigma = "loge",
idf.mu = 4, idf.sigma = 2, ilambda = 1.0,
isigma = NULL, yoffset = NULL, nsimEIM = 250)

Arguments
percentiles A numerical vector containing values between 0 and 100, which are the quan-
tiles. They will be returned as fitted values.
zero See lms.bcn.
llambda, lmu, lsigma
See lms.bcn.
idf.mu, idf.sigma
See lms.bcn.
ilambda, isigma
See lms.bcn.
rule Number of abscissae used in the Gaussian integration scheme to work out ele-
ments of the weight matrices. The values given are the possible choices, with
the first value being the default. The larger the value, the more accurate the
approximation is likely to be but involving more computational expense.
yoffset A value to be added to the response y, for the purpose of centering the response
before fitting the model to the data. The default value, NULL, means -median(y)
is used, so that the response actually used has median zero. The yoffset is
saved on the object and used during prediction.
diagW Logical. This argument is offered because the expected information matrix may
not be positive-definite. Using the diagonal elements of this matrix results in
a higher chance of it being positive-definite, however convergence will be very
slow. If TRUE, then the first iters.diagW iterations will use the diagonal of the
expected information matrix. The default is FALSE, meaning faster convergence.
iters.diagW Integer. Number of iterations in which the diagonal elements of the expected
information matrix are used. Only used if diagW = TRUE.
nsimEIM See CommonVGAMffArguments for more information.

Details
Given a value of the covariate, this function applies a Yeo-Johnson transformation to the response
to best obtain normality. The parameters chosen to do this are estimated by maximum likelihood or
lms.yjn 417

penalized maximum likelihood. The function lms.yjn2() estimates the expected information ma-
trices using simulation (and is consequently slower) while lms.yjn() uses numerical integration.
Try the other if one function fails.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm and vgam.

Warning
The computations are not simple, therefore convergence may fail. In that case, try different starting
values.
The generic function predict, when applied to a lms.yjn fit, does not add back the yoffset value.

Note
The response may contain both positive and negative values. In contrast, the LMS-Box-Cox-normal
and LMS-Box-Cox-gamma methods only handle a positive response because the Box-Cox trans-
formation cannot handle negative values.
Some other notes can be found at lms.bcn.

Author(s)
Thomas W. Yee

References
Yeo, I.-K. and Johnson, R. A. (2000) A new family of power transformations to improve normality
or symmetry. Biometrika, 87, 954959.
Yee, T. W. (2004) Quantile regression via vector generalized additive models. Statistics in Medicine,
23, 22952315.
Yee, T. W. (2002) An Implementation for Regression Quantile Estimation. Pages 314. In: Haerdle,
W. and Ronz, B., Proceedings in Computational Statistics COMPSTAT 2002. Heidelberg: Physica-
Verlag.

See Also
lms.bcn, lms.bcg, qtplot.lmscreg, deplot.lmscreg, cdf.lmscreg, bmi.nz, amlnormal.

Examples
fit <- vgam(BMI ~ s(age, df = 4), lms.yjn, bmi.nz, trace = TRUE)
head(predict(fit))
head(fitted(fit))
head(bmi.nz)
# Person 1 is near the lower quartile of BMI amongst people his age
head(cdf(fit))
418 Log

## Not run:
# Quantile plot
par(bty = "l", mar = c(5, 4, 4, 3) + 0.1, xpd = TRUE)
qtplot(fit, percentiles = c(5, 50, 90, 99), main = "Quantiles",
xlim = c(15, 90), las = 1, ylab = "BMI", lwd = 2, lcol = 4)

## End(Not run)

Log Logarithmic Distribution

Description
Density, distribution function, quantile function, and random generation for the logarithmic distri-
bution.

Usage
dlog(x, shape, log = FALSE)
plog(q, shape, log.p = FALSE)
qlog(p, shape)
rlog(n, shape)

Arguments
x, q, p, n Same interpretation as in runif.
shape The shape parameter value c described in in logff.
log, log.p Logical. If log.p = TRUE then all probabilities p are given as log(p).

Details
The details are given in logff.

Value
dlog gives the density, plog gives the distribution function, qlog gives the quantile function, and
rlog generates random deviates.
log1mexp 419

Note
Given some response data, the VGAM family function logff estimates the parameter shape. For
plog(), if argument q contains large values and/or q is long in length then the memory requirements
may be very high. Very large values in q are handled by an approximation by Owen (1965).

Author(s)
T. W. Yee

References
Forbes, C., Evans, M., Hastings, N. and Peacock, B. (2011) Statistical Distributions, Hoboken, NJ,
USA: John Wiley and Sons, Fourth edition.

## Not run: shape <- 0.8; x <- 1:10

plot(x, dlog(x, shape = shape), type = "h", ylim = 0:1,
sub = "shape=0.8", las = 1, col = "blue", ylab = "shape",
main = "Logarithmic distribution: blue=density; orange=distribution function")
lines(x + 0.1, plog(x, shape = shape), col = "orange", lty = 3, type = "h")
## End(Not run)

log1mexp Logarithms with an Unit Offset and Exponential Term

Description
Computes log(1 + exp(x)) and log(1 - exp(-x)) accurately.

Usage
log1mexp(x)
log1pexp(x)

Arguments
x A vector of reals (numeric). Complex numbers not allowed since expm1 and
log1p do not handle these.
420 logc

Details
Computes log(1 + exp(x)) and log(1 - exp(-x)) accurately. An adjustment is made when x
is away from 0 in value.

Value
log1mexp(x) gives the value of log(1 exp(x)).
log1pexp(x) gives the value of log(1 + exp(x)).

Note
If NA or NaN is present in the input, the corresponding output will be NA.

Author(s)
This is a direct translation of the function in Martin Maechlers (2012) paper by Xiangjie Xue and
T. W. Yee.

References
Maechler, Martin (2012). Accurately Computing log(1-exp(-|a|)). Assessed from the Rmpfr pack-
age.

logc Complementary-log Link Function

Description
Computes the Complementary-log Transformation, Including its Inverse and the First Two Deriva-
tives.
logc 421

Usage
logc(theta, bvalue = NULL, inverse = FALSE, deriv = 0,
short = TRUE, tag = FALSE)

Arguments
theta Numeric or character. See below for further details.
bvalue See Links.
inverse, deriv, short, tag
Details at Links.

Details
The complementary-log link function is suitable for parameters that are less than unity. Numerical
values of theta close to 1 or out of range result in Inf, -Inf, NA or NaN.

Value
For deriv = 0, the log of theta, i.e., log(1-theta) when inverse = FALSE, and if inverse = TRUE
then 1-exp(theta).
For deriv = 1, then the function returns d eta / d theta as a function of theta if inverse = FALSE,
else if inverse = TRUE then it returns the reciprocal.
Here, all logarithms are natural logarithms, i.e., to base e.

Note
Numerical instability may occur when theta is close to 1. One way of overcoming this is to use
bvalue.

Author(s)
Thomas W. Yee

References
McCullagh, P. and Nelder, J. A. (1989) Generalized Linear Models, 2nd ed. London: Chapman &
Hall.

See Also
Links, loge, cloglog, loglog, logoff.

Examples
## Not run:
logc(seq(-0.2, 1.1, by = 0.1)) # Has NAs

## End(Not run)
logc(seq(-0.2, 1.1, by = 0.1), bvalue = 1 - .Machine$double.eps) # Has no NAs
422 loge

loge Log Link Function, and Variants

Description
Computes the log transformation, including its inverse and the first two derivatives.

Usage
loge(theta, bvalue = NULL, inverse = FALSE, deriv = 0,
short = TRUE, tag = FALSE)
negloge(theta, bvalue = NULL, inverse = FALSE, deriv = 0,
short = TRUE, tag = FALSE)
logneg(theta, bvalue = NULL, inverse = FALSE, deriv = 0,
short = TRUE, tag = FALSE)

Arguments
theta Numeric or character. See below for further details.
bvalue See Links.
inverse, deriv, short, tag
Details at Links.

Details
The log link function is very commonly used for parameters that are positive. Here, all logarithms
are natural logarithms, i.e., to base e. Numerical values of theta close to 0 or out of range result in
Inf, -Inf, NA or NaN.
The function loge computes log() whereas negloge computes log() = log(1/).
The function logneg computes log(), hence is suitable for parameters that are negative, e.g., a
trap-shy effect in posbernoulli.b.

Value
The following concerns loge. For deriv = 0, the log of theta, i.e., log(theta) when inverse = FALSE,
and if inverse = TRUE then exp(theta). For deriv = 1, then the function returns d eta / d theta
as a function of theta if inverse = FALSE, else if inverse = TRUE then it returns the reciprocal.

Note
This function is called loge to avoid conflict with the log function.
Numerical instability may occur when theta is close to 0 unless bvalue is used.

Author(s)
Thomas W. Yee
logF 423

References
McCullagh, P. and Nelder, J. A. (1989) Generalized Linear Models, 2nd ed. London: Chapman &
Hall.

See Also
Links, explink, logit, logc, loglog, log, logoff, lambertW, posbernoulli.b.

Examples
## Not run: loge(seq(-0.2, 0.5, by = 0.1))
loge(seq(-0.2, 0.5, by = 0.1), bvalue = .Machine$double.xmin)
negloge(seq(-0.2, 0.5, by = 0.1))
negloge(seq(-0.2, 0.5, by = 0.1), bvalue = .Machine$double.xmin)
## End(Not run)
logneg(seq(-0.5, -0.2, by = 0.1))

logF Natural Exponential Family Generalized Hyperbolic Secant Distribu-

tion Family Function

Description
Maximum likelihood estimation of the 1-parameter log F distribution.

Usage
logF(lshape1 = "loge", lshape2 = "loge",
ishape1 = NULL, ishape2 = 1, imethod = 1)

Arguments
lshape1, lshape2
Parameter link functions for the shape parameters. Called and respectively.
See Links for more choices.
ishape1, ishape2
Optional initial values for the shape parameters. If given, it must be numeric
and values are recycled to the appropriate length. The default is to choose the
value internally. See CommonVGAMffArguments for more information.
imethod Initialization method. Either the value 1, 2, or . . . . See CommonVGAMffArguments
for more information.

Details
The density for this distribution is
f (y; , ) = exp(y)/[B(, )(1 + ey )+ ]
where y is real, > 0, > 0, B(., .) is the beta function beta.
424 logff

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm and vgam.

Author(s)
Thomas W. Yee

References
Jones, M. C. (2008). On a class of distributions with simple exponential tails. Statistica Sinica,
18(3), 11011110.

logff Logarithmic Distribution

Description
Estimating the (single) parameter of the logarithmic distribution.

Usage
logff(lshape = "logit", gshape = ppoints(8), zero = NULL)

Arguments
lshape Parameter link function for the parameter c, which lies between 0 and 1. See
Links for more choices and information. Soon logfflink() will hopefully be
available for event-rate data.
gshape, zero Details at CommonVGAMffArguments.
logff 425

Details
The logarithmic distribution is a generalized power series distribution that is based specifically on
the logarithmic series (scaled to a probability function). Its probability function is f (y) = acy /y,
for y = 1, 2, 3, . . ., where 0 < c < 1 (called shape), and a = 1/ log(1 c). The mean is
ac/(1 c) (returned as the fitted values) and variance is ac(1 ac)/(1 c)2 .

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, and vgam.

Note
The function log computes the natural logarithm. In the VGAM library, a link function with option
loge corresponds to this.
Multiple responses are permitted.
The logarithmic distribution is sometimes confused with the log-series distribution. The latter was
used by Fisher et al. for species abundance data and has two parameters.

Author(s)
T. W. Yee

References
Chapter 7 of Johnson N. L., Kemp, A. W. and Kotz S. (2005) Univariate Discrete Distributions, 3rd
edition, Hoboken, New Jersey: Wiley.
Forbes, C., Evans, M., Hastings, N. and Peacock, B. (2011) Statistical Distributions, Hoboken, NJ,
USA: John Wiley and Sons, Fourth edition.

See Also
rlog, oalog, oilog, otlog, log, loge, logoff, explogff, simulate.vlm.

Examples
ldata <- data.frame(y = rlog(n = 1000, shape = logit(0.2, inverse = TRUE)))
fit <- vglm(y ~ 1, logff, data = ldata, trace = TRUE, crit = "c")
coef(fit, matrix = TRUE)
Coef(fit)
## Not run: with(ldata,
hist(y, prob = TRUE, breaks = seq(0.5, max(y) + 0.5, by = 1),
border = "blue"))
x <- seq(1, with(ldata, max(y)), by = 1)
with(ldata, lines(x, dlog(x, Coef(fit)[1]), col = "orange", type = "h", lwd = 2))
## End(Not run)

# Example: Corbet (1943) butterfly Malaya data

426 logistic

corbet <- data.frame(nindiv = 1:24,

ofreq = c(118, 74, 44, 24, 29, 22, 20, 19, 20, 15, 12,
14, 6, 12, 6, 9, 9, 6, 10, 10, 11, 5, 3, 3))
fit <- vglm(nindiv ~ 1, logff, data = corbet, weights = ofreq)
coef(fit, matrix = TRUE)
shapehat <- Coef(fit)["shape"]
pdf2 <- dlog(x = with(corbet, nindiv), shape = shapehat)
print(with(corbet, cbind(nindiv, ofreq, fitted = pdf2 * sum(ofreq))), digits = 1)

logistic Logistic Distribution Family Function

Description
Estimates the location and scale parameters of the logistic distribution by maximum likelihood
estimation.

Usage
logistic1(llocation = "identitylink", scale.arg = 1, imethod = 1)
logistic(llocation = "identitylink", lscale = "loge",
ilocation = NULL, iscale = NULL, imethod = 1, zero = "scale")

Arguments
llocation, lscale
Parameter link functions applied to the location parameter l and scale param-
eter s. See Links for more choices, and CommonVGAMffArguments for more
information.
scale.arg Known positive scale parameter (called s below).
ilocation, iscale
See CommonVGAMffArguments for information.
imethod, zero See CommonVGAMffArguments for information.

Details
The two-parameter logistic distribution has a density that can be written as
exp[(y l)/s]
f (y; l, s) = 2
s (1 + exp[(y l)/s])
where s > 0 is the scale parameter, and l is the location parameter. The response < y < .
The mean of Y (which is the fitted value) is l and its variance is 2 s2 /3.
A logistic distribution with scale = 0.65 (see dlogis) resembles dt with df = 7; see logistic1
and studentt.
logistic1 estimates the location parameter only while logistic estimates both parameters. By
default, 1 = l and 2 = log(s) for logistic.
logistic can handle multiple responses.
logistic 427

Value

An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, rrvglm and vgam.

Note

Fisher scoring is used, and the Fisher information matrix is diagonal.

Author(s)

T. W. Yee

References

Johnson, N. L. and Kotz, S. and Balakrishnan, N. (1994) Continuous Univariate Distributions, 2nd
edition, Volume 1, New York: Wiley. Chapter 15.
Forbes, C., Evans, M., Hastings, N. and Peacock, B. (2011) Statistical Distributions, Hoboken, NJ,
USA: John Wiley and Sons, Fourth edition.
Castillo, E., Hadi, A. S., Balakrishnan, N. Sarabia, J. S. (2005) Extreme Value and Related Models
with Applications in Engineering and Science, Hoboken, NJ, USA: Wiley-Interscience, p.130.
deCani, J. S. and Stine, R. A. (1986) A Note on Deriving the Information Matrix for a Logistic
Distribution, The American Statistician, 40, 220222.

See Also

rlogis, CommonVGAMffArguments, logit, cumulative, bilogistic, simulate.vlm.

Examples

# Location unknown, scale known

ldata <- data.frame(x2 = runif(nn <- 500))
ldata <- transform(ldata, y1 = rlogis(nn, loc = 1 + 5*x2, scale = exp(2)))
fit1 <- vglm(y1 ~ x2, logistic1(scale = exp(2)), data = ldata, trace = TRUE)
coef(fit1, matrix = TRUE)

# Both location and scale unknown

ldata <- transform(ldata, y2 = rlogis(nn, loc = 1 + 5*x2, scale = exp(0 + 1*x2)))
fit2 <- vglm(cbind(y1, y2) ~ x2, logistic, data = ldata, trace = TRUE)
coef(fit2, matrix = TRUE)
vcov(fit2)
summary(fit2)
428 logit

logit Logit Link Function

Description
Computes the logit transformation, including its inverse and the first two derivatives.

Usage
logit(theta, bvalue = NULL, inverse = FALSE, deriv = 0,
short = TRUE, tag = FALSE)
extlogit(theta, min = 0, max = 1, bminvalue = NULL, bmaxvalue = NULL,
inverse = FALSE, deriv = 0, short = TRUE, tag = FALSE)

Arguments
theta Numeric or character. See below for further details.
bvalue, bminvalue, bmaxvalue
See Links. These are boundary values. For extlogit, values of theta less than
or equal to A or greater than or equal to B can be replaced by bminvalue and
bmaxvalue.
min, max For extlogit, min gives A, max gives B, and for out of range values, bminvalue
and bmaxvalue.
inverse, deriv, short, tag
Details at Links.

Details
The logit link function is very commonly used for parameters that lie in the unit interval. Numerical
values of theta close to 0 or 1 or out of range result in Inf, -Inf, NA or NaN.
The extended logit link function extlogit should be used more generally for parameters that lie in
the interval (A, B), say. The formula is

log(( A)/(B ))

and the default values for A and B correspond to the ordinary logit function. Numerical values of
theta close to A or B or out of range result in Inf, -Inf, NA or NaN. However these can be replaced
by values bminvalue and bmaxvalue first before computing the link function.

Value
For logit with deriv = 0, the logit of theta, i.e., log(theta/(1-theta)) when inverse = FALSE,
and if inverse = TRUE then exp(theta)/(1+exp(theta)).
For deriv = 1, then the function returns d eta / d theta as a function of theta if inverse = FALSE,
else if inverse = TRUE then it returns the reciprocal.
Here, all logarithms are natural logarithms, i.e., to base e.
logit 429

Note
Numerical instability may occur when theta is close to 1 or 0 (for logit), or close to A or B for
extlogit. One way of overcoming this is to use, e.g., bvalue.
In terms of the threshold approach with cumulative probabilities for an ordinal response this link
function corresponds to the univariate logistic distribution (see logistic).

Author(s)
Thomas W. Yee

References
McCullagh, P. and Nelder, J. A. (1989) Generalized Linear Models, 2nd ed. London: Chapman &
Hall.

See Also
Links, logitoffsetlink, probit, cloglog, cauchit, logistic1, loge, plogis, multilogit.

Examples
p <- seq(0.01, 0.99, by = 0.01)
logit(p)
max(abs(logit(logit(p), inverse = TRUE) - p)) # Should be 0

p <- c(seq(-0.02, 0.02, by = 0.01), seq(0.97, 1.02, by = 0.01))

logit(p) # Has NAs
logit(p, bvalue = .Machine$double.eps) # Has no NAs

p <- seq(0.9, 2.2, by = 0.1)

extlogit(p, min = 1, max = 2,
bminvalue = 1 + .Machine$double.eps,
bmaxvalue = 2 - .Machine$double.eps) # Has no NAs

## Not run: par(mfrow = c(2,2), lwd = (mylwd <- 2))

y <- seq(-4, 4, length = 100)
p <- seq(0.01, 0.99, by = 0.01)
for (d in 0:1) {
myinv <- (d > 0)
matplot(p, cbind( logit(p, deriv = d, inverse = myinv),
probit(p, deriv = d, inverse = myinv)),
type = "n", col = "purple", ylab = "transformation", las = 1,
main = if (d == 0) "Some probability link functions"
else "1 / first derivative")
lines(p, logit(p, deriv = d, inverse = myinv), col = "limegreen")
lines(p, probit(p, deriv = d, inverse = myinv), col = "purple")
lines(p, cloglog(p, deriv = d, inverse = myinv), col = "chocolate")
lines(p, cauchit(p, deriv = d, inverse = myinv), col = "tan")
if (d == 0) {
abline(v = 0.5, h = 0, lty = "dashed")
legend(0, 4.5, c("logit", "probit", "cloglog", "cauchit"),
430 logitoffsetlink

col = c("limegreen", "purple", "chocolate", "tan"), lwd = mylwd)

} else
abline(v = 0.5, lty = "dashed")
}

for (d in 0) {
matplot(y, cbind(logit(y, deriv = d, inverse = TRUE),
probit(y, deriv = d, inverse = TRUE)), las = 1,
type = "n", col = "purple", xlab = "transformation", ylab = "p",
main = if (d == 0) "Some inverse probability link functions"
else "First derivative")
lines(y, logit(y, deriv = d, inverse = TRUE), col = "limegreen")
lines(y, probit(y, deriv = d, inverse = TRUE), col = "purple")
lines(y, cloglog(y, deriv = d, inverse = TRUE), col = "chocolate")
lines(y, cauchit(y, deriv = d, inverse = TRUE), col = "tan")
if (d == 0) {
abline(h = 0.5, v = 0, lty = "dashed")
legend(-4, 1, c("logit", "probit", "cloglog", "cauchit"),
col = c("limegreen", "purple", "chocolate", "tan"), lwd = mylwd)
}
}

p <- seq(0.21, 0.59, by = 0.01)

plot(p, extlogit(p, min = 0.2, max = 0.6),
type = "l", col = "black", ylab = "transformation", xlim = c(0, 1),
las = 1, main = "extlogit(p, min = 0.2, max = 0.6)")
par(lwd = 1)

## End(Not run)

logitoffsetlink Logit-with-an-Offset Link Function

Description
Computes the logitoffsetlink transformation, including its inverse and the first two derivatives.

Usage
logitoffsetlink(theta, offset = 0, inverse = FALSE, deriv = 0,
short = TRUE, tag = FALSE)

Arguments
theta Numeric or character. See below for further details.
offset The offset value(s), which must be non-negative. It is called K below.
inverse, deriv, short, tag
Details at Links.
logitoffsetlink 431

Details

This link function allows for some asymmetry compared to the ordinary logit link. The formula is

log(/(1 ) K)

and the default value for the offset K is corresponds to the ordinary logit link. When inverse = TRUE
will mean that the value will lie in the interval (K/(1 + K), 1).

Value

For logitoffsetlink with deriv = 0, the logitoffsetlink of theta, i.e., log(theta/(1-theta) - K)

when inverse = FALSE, and if inverse = TRUE then (K + exp(theta))/(1 + exp(theta) + K).
For deriv = 1, then the function returns d eta / d theta as a function of theta if inverse = FALSE,
else if inverse = TRUE then it returns the reciprocal.
Here, all logarithms are natural logarithms, i.e., to base e.

Note

This function is numerical less stability than logit.

Author(s)

Thomas W. Yee

References

Komori, O. and Eguchi, S. et al., 2016. An asymmetric logistic model for ecological data. Methods
in Ecology and Evolution, 7.

See Also

Links, logit.

Examples

p <- seq(0.05, 0.99, by = 0.01); myoff <- 0.05

logitoffsetlink(p, myoff)
max(abs(logitoffsetlink(logitoffsetlink(p, myoff),
myoff, inverse = TRUE) - p)) # Should be 0
432 loglaplace

loglaplace Log-Laplace and Logit-Laplace Distribution Family Functions

Description
Maximum likelihood estimation of the 1-parameter log-Laplace and the 1-parameter logit-Laplace
distributions. These may be used for quantile regression for counts and proportions respectively.

Usage
loglaplace1(tau = NULL, llocation = "loge",
ilocation = NULL, kappa = sqrt(tau/(1 - tau)), Scale.arg = 1,
ishrinkage = 0.95, parallel.locat = FALSE, digt = 4,
idf.mu = 3, rep0 = 0.5, minquantile = 0, maxquantile = Inf,
imethod = 1, zero = NULL)
logitlaplace1(tau = NULL, llocation = "logit",
ilocation = NULL, kappa = sqrt(tau/(1 - tau)),
Scale.arg = 1, ishrinkage = 0.95, parallel.locat = FALSE,
digt = 4, idf.mu = 3, rep01 = 0.5, imethod = 1, zero = NULL)

Arguments
tau, kappa See alaplace1.
llocation Character. Parameter link functions for location parameter . See Links for
more choices. However, this argument should be left unchanged with count data
because it restricts the quantiles to be positive. With proportions data llocation
can be assigned a link such as logit, probit, cloglog, etc.
ilocation Optional initial values. If given, it must be numeric and values are recycled to
the appropriate length. The default is to choose the value internally.
parallel.locat Logical. Should the quantiles be parallel on the transformed scale (argument
llocation)? Assigning this argument to TRUE circumvents the seriously em-
barrassing quantile crossing problem.
imethod Initialization method. Either the value 1, 2, or . . . .
idf.mu, ishrinkage, Scale.arg, digt, zero
See alaplace1.
rep0, rep01 Numeric, positive. Replacement values for 0s and 1s respectively. For count
data, values of the response whose value is 0 are replaced by rep0; it avoids
computing log(0). For proportions data values of the response whose value is 0
or 1 are replaced by min(rangey01[1]/2, rep01/w[y< = 0]) and max((1 + rangey01[2])/2, 1-rep0
respectively; e.g., it avoids computing logit(0) or logit(1). Here, rangey01
is the 2-vector range(y[(y > 0) & (y < 1)]) of the response.
minquantile, maxquantile
Numeric. The minimum and maximum values possible in the quantiles. These
argument are effectively ignored by default since loge keeps all quantiles pos-
itive. However, if llocation = logoff(offset = 1) then it is possible that
the fitted quantiles have value 0 because minquantile = 0.
loglaplace 433

Details
These VGAM family functions implement translations of the asymmetric Laplace distribution
(ALD). The resulting variants may be suitable for quantile regression for count data or sample
proportions. For example, a log link applied to count data is assumed to follow an ALD. Another
example is a logit link applied to proportions data so as to follow an ALD. A positive random vari-
able Y is said to have a log-Laplace distribution if Y = eW where W has an ALD. There are many
variants of ALDs and the one used here is described in alaplace1.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm and vgam.
In the extra slot of the fitted object are some list components which are useful. For example, the
sample proportion of values which are less than the fitted quantile curves, which is sum(wprior[y <= location]) / sum(wpr
internally. Here, wprior are the prior weights (called ssize below), y is the response and location
is a fitted quantile curve. This definition comes about naturally from the transformed ALD data.

Warning
The VGAM family function logitlaplace1 will not handle a vector of just 0s and 1s as the re-
sponse; it will only work satisfactorily if the number of trials is large.
See alaplace1 for other warnings. Care is needed with tau values which are too small, e.g., for
count data the sample proportion of zeros must be less than all values in tau. Similarly, this also
holds with logitlaplace1, which also requires all tau values to be less than the sample proportion
of ones.

Note
The form of input for logitlaplace1 as response is a vector of proportions (values in [0, 1]) and
the number of trials is entered into the weights argument of vglm/vgam. See Example 2 below. See
alaplace1 for other notes in general.

Author(s)
Thomas W. Yee

References
Kotz, S., Kozubowski, T. J. and Podgorski, K. (2001) The Laplace distribution and generaliza-
tions: a revisit with applications to communications, economics, engineering, and finance, Boston:
Birkhauser.
Kozubowski, T. J. and Podgorski, K. (2003) Log-Laplace distributions. International Mathematical
Journal, 3, 467495.
Yee, T. W. (2012) Quantile regression for counts and proportions. In preparation.

See Also
alaplace1, dloglap.
434 loglaplace

Examples

# Example 1: quantile regression of counts with regression splines

set.seed(123); my.k <- exp(0)
adata <- data.frame(x2 = sort(runif(n <- 500)))
mymu <- function(x) exp( 1 + 3*sin(2*x) / (x+0.5)^2)
adata <- transform(adata, y = rnbinom(n, mu = mymu(x2), size = my.k))
mytau <- c(0.1, 0.25, 0.5, 0.75, 0.9); mydof = 3
# halfstepping is usual:
fitp <- vglm(y ~ sm.bs(x2, df = mydof), data = adata, trace = TRUE,
loglaplace1(tau = mytau, parallel.locat = TRUE))

## Not run:
par(las = 1) # Plot on a log1p() scale
mylwd <- 1.5

plot(jitter(log1p(y), factor = 1.5) ~ x2, adata, col = "red", pch = "o",

main = "Example 1; darkgreen=truth, blue=estimated", cex = 0.75)
with(adata, matlines(x2, log1p(fitted(fitp)), col = "blue",
lty = 1, lwd = mylwd))
finexgrid <- seq(0, 1, len = 201)
for (ii in 1:length(mytau))
lines(finexgrid, col = "darkgreen", lwd = mylwd,
log1p(qnbinom(p = mytau[ii], mu = mymu(finexgrid), si = my.k)))

## End(Not run)
fitp@extra # Contains useful information

# Example 2: sample proportions

set.seed(123); nnn <- 1000; ssize <- 100 # ssize = 1 will not work!
adata <- data.frame(x2 = sort(runif(nnn)))
mymu <- function(x) logit( 1.0 + 4*x, inv = TRUE)
adata <- transform(adata, ssize = ssize,
y2 = rbinom(nnn, size = ssize, prob = mymu(x2)) / ssize)

mytau <- c(0.25, 0.50, 0.75)

fit1 <- vglm(y2 ~ sm.bs(x2, df = 3), data = adata,
weights = ssize, trace = TRUE,
logitlaplace1(tau = mytau, lloc = "cloglog", paral = TRUE))

## Not run:
# Check the solution. Note: this may be like comparing apples with oranges.
plotvgam(fit1, se = TRUE, scol = "red", lcol = "blue",
main = "Truth = 'darkgreen'")
# Centered approximately !
linkFunctionChar <- as.character(fit1@misc$link)
adata <- transform(adata, trueFunction=
theta2eta(theta = mymu(x2), link=linkFunctionChar))
with(adata, lines(x2, trueFunction - mean(trueFunction), col = "darkgreen"))
loglapUC 435

# Plot the data + fitted quantiles (on the original scale)

myylim <- with(adata, range(y2))
plot(y2 ~ x2, adata, col = "blue", ylim = myylim, las = 1,
pch = ".", cex = 2.5)
with(adata, matplot(x2, fitted(fit1), add = TRUE, lwd = 3, type = "l"))
truecol <- rep(1:3, len = fit1@misc$M) # Add the 'truth'
smallxgrid <- seq(0, 1, len = 501)
for (ii in 1:length(mytau))
lines(smallxgrid, col = truecol[ii], lwd = 2,
qbinom(p = mytau[ii], prob = mymu(smallxgrid), size = ssize) / ssize)

# Plot on the eta (== logit()/probit()/...) scale

with(adata, matplot(x2, predict(fit1), add = FALSE, lwd = 3, type = "l"))
# Add the 'truth'
for (ii in 1:length(mytau)) {
true.quant <- qbinom(p = mytau[ii], pr = mymu(smallxgrid), si = ssize) / ssize
lines(smallxgrid, theta2eta(theta = true.quant, link = linkFunctionChar),
col = truecol[ii], lwd = 2)
}

## End(Not run)

loglapUC The Log-Laplace Distribution

Description
Density, distribution function, quantile function and random generation for the 3-parameter log-
Laplace distribution with location parameter location.ald, scale parameter scale.ald (on the
log scale), and asymmetry parameter kappa.

Usage
dloglap(x, location.ald = 0, scale.ald = 1,
tau = 0.5, kappa = sqrt(tau/(1-tau)), log = FALSE)
ploglap(q, location.ald = 0, scale.ald = 1,
tau = 0.5, kappa = sqrt(tau/(1-tau)), lower.tail = TRUE, log.p = FALSE)
qloglap(p, location.ald = 0, scale.ald = 1,
tau = 0.5, kappa = sqrt(tau/(1-tau)), lower.tail = TRUE, log.p = FALSE)
rloglap(n, location.ald = 0, scale.ald = 1,
tau = 0.5, kappa = sqrt(tau/(1-tau)))

Arguments
x, q vector of quantiles.
p vector of probabilities.
n number of observations. If length(n) > 1 then the length is taken to be the
number required.
436 loglapUC

location.ald, scale.ald
the location parameter and the (positive) scale parameter , on the log scale.
tau the quantile parameter . Must consist of values in (0, 1). This argument is used
to specify kappa and is ignored if kappa is assigned.
kappa the asymmetry parameter . Must consist of positive values.
log if TRUE, probabilities p are given as log(p).
lower.tail, log.p
Same meaning as in pnorm or qnorm.

Details
A positive random variable Y is said to have a log-Laplace distribution if log(Y ) has an asymmetric
Laplace distribution (ALD). There are many variants of ALDs and the one used here is described
in alaplace3.

Value
dloglap gives the density, ploglap gives the distribution function, qloglap gives the quantile
function, and rloglap generates random deviates.

Author(s)
T. W. Yee and Kai Huang

References
Kozubowski, T. J. and Podgorski, K. (2003) Log-Laplace distributions. International Mathematical
Journal, 3, 467495.

logLik.vlm Extract Log-likelihood for VGLMs/VGAMs/etc.

Description
Calculates the log-likelihood value or the element-by-element contributions of the log-likelihood.

Usage
## S3 method for class 'vlm'
logLik(object, summation = TRUE, ...)

Arguments
object Some VGAM object, for example, having class vglmff-class.
summation Logical, apply sum? If FALSE then a n-vector or n-row matrix (with the num-
ber of responses as the number of columns) is returned. Each element is the
contribution to the log-likelihood.
... Currently unused. In the future: other possible arguments fed into logLik in
order to compute the log-likelihood.

Details
By default, this function returns the log-likelihood of the object. Thus this code relies on the log-
likelihood being defined, and computed, for the object.

Value
Returns the log-likelihood of the object. If summation = FALSE then a n-vector or n-row ma-
trix (with the number of responses as the number of columns) is returned. Each element is the
contribution to the log-likelihood. The prior weights are assimulated within the answer.

Warning
Not all VGAM family functions have had the summation checked.

Note
Not all VGAM family functions currently have the summation argument implemented.

Author(s)
T. W. Yee.

See Also
VGLMs are described in vglm-class; VGAMs are described in vgam-class; RR-VGLMs are
described in rrvglm-class; AIC.
438 loglinb2

Examples
zdata <- data.frame(x2 = runif(nn <- 50))
zdata <- transform(zdata, Ps01 = logit(-0.5 , inverse = TRUE),
Ps02 = logit( 0.5 , inverse = TRUE),
lambda1 = loge(-0.5 + 2*x2, inverse = TRUE),
lambda2 = loge( 0.5 + 2*x2, inverse = TRUE))
zdata <- transform(zdata, y1 = rzipois(nn, lambda = lambda1, pstr0 = Ps01),
y2 = rzipois(nn, lambda = lambda2, pstr0 = Ps02))

with(zdata, table(y1)) # Eyeball the data

with(zdata, table(y2))
fit2 <- vglm(cbind(y1, y2) ~ x2, zipoisson(zero = NULL), data = zdata)

logLik(fit2) # Summed over the two responses

sum(logLik(fit2, sum = FALSE)) # For checking purposes
(ll.matrix <- logLik(fit2, sum = FALSE)) # nn x 2 matrix
colSums(ll.matrix) # log-likelihood for each response

loglinb2 Loglinear Model for Two Binary Responses

Description
Fits a loglinear model to two binary responses.

Usage
loglinb2(exchangeable = FALSE, zero = "u12")

Arguments
exchangeable Logical. If TRUE, the two marginal probabilities are constrained to be equal.
Should be set TRUE for ears, eyes, etc. data.
zero Which linear/additive predictors are modelled as intercept-only? A NULL means
none of them. See CommonVGAMffArguments for more information.

Details
The model is
P (Y1 = y1 , Y2 = y2 ) = exp(u0 + u1 y1 + u2 y2 + u12 y1 y2 )
where y1 and y2 are 0 or 1, and the parameters are u1 , u2 , u12 . The normalizing parameter u0 can
be expressed as a function of the other parameters, viz.,

u0 = log[1 + exp(u1 ) + exp(u2 ) + exp(u1 + u2 + u12 )].

The linear/additive predictors are (1 , 2 , 3 )T = (u1 , u2 , u12 )T .

loglinb2 439

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, rrvglm and vgam.
When fitted, the fitted.values slot of the object contains the four joint probabilities, labelled as
(Y1 , Y2 ) = (0,0), (0,1), (1,0), (1,1), respectively.

Note
The response must be a two-column matrix of ones and zeros only. This is more restrictive than
binom2.or, which can handle more types of input formats. Note that each of the 4 combinations of
the multivariate response need to appear in the data set.

Author(s)
Thomas W. Yee

References
Yee, T. W. and Wild, C. J. (2001) Discussion to: Smoothing spline ANOVA for multivariate
Bernoulli observations, with application to ophthalmology data (with discussion) by Gao, F.,
Wahba, G., Klein, R., Klein, B. Journal of the American Statistical Association, 96, 127160.
McCullagh, P. and Nelder, J. A. (1989) Generalized Linear Models, 2nd ed. London: Chapman &
Hall.

See Also
binom2.or, binom2.rho, loglinb3.

Examples
coalminers <- transform(coalminers, Age = (age - 42) / 5)

# Get the n x 4 matrix of counts

fit0 <- vglm(cbind(nBnW,nBW,BnW,BW) ~ Age, binom2.or, data = coalminers)
counts <- round(c(weights(fit0, type = "prior")) * depvar(fit0))

# Create a n x 2 matrix response for loglinb2()

# bwmat <- matrix(c(0,0, 0,1, 1,0, 1,1), 4, 2, byrow = TRUE)
bwmat <- cbind(bln = c(0,0,1,1), wheeze = c(0,1,0,1))
matof1 <- matrix(1, nrow(counts), 1)
newminers <- data.frame(bln = kronecker(matof1, bwmat[, 1]),
wheeze = kronecker(matof1, bwmat[, 2]),
wt = c(t(counts)),
Age = with(coalminers, rep(age, rep(4, length(age)))))
newminers <- newminers[with(newminers, wt) > 0,]

fit <- vglm(cbind(bln,wheeze) ~ Age, loglinb2(zero = NULL),

weight = wt, data = newminers)
coef(fit, matrix = TRUE) # Same! (at least for the log odds-ratio)
summary(fit)
440 loglinb3

# Try reconcile this with McCullagh and Nelder (1989), p.234

(0.166-0.131) / 0.027458 # 1.275 is approximately 1.25

loglinb3 Loglinear Model for Three Binary Responses

Description
Fits a loglinear model to three binary responses.

Usage
loglinb3(exchangeable = FALSE, zero = c("u12", "u13", "u23"))

Arguments
exchangeable Logical. If TRUE, the three marginal probabilities are constrained to be equal.
zero Which linear/additive predictors are modelled as intercept-only? A NULL means
none. See CommonVGAMffArguments for further information.

Details
The model is P (Y1 = y1 , Y2 = y2 , Y3 = y3 ) =

exp(u0 + u1 y1 + u2 y2 + u3 y3 + u12 y1 y2 + u13 y1 y3 + u23 y2 y3 )

where y1 , y2 and y3 are 0 or 1, and the parameters are u1 , u2 , u3 , u12 , u13 , u23 . The normaliz-
ing parameter u0 can be expressed as a function of the other parameters. Note that a third-order
association parameter, u123 for the product y1 y2 y3 , is assumed to be zero for this family function.
The linear/additive predictors are (1 , 2 , . . . , 6 )T = (u1 , u2 , u3 , u12 , u13 , u23 )T .

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, rrvglm and vgam.
When fitted, the fitted.values slot of the object contains the eight joint probabilities, labelled as
(Y1 , Y2 , Y3 ) = (0,0,0), (0,0,1), (0,1,0), (0,1,1), (1,0,0), (1,0,1), (1,1,0), (1,1,1), respectively.

Note
The response must be a three-column matrix of ones and zeros only. Note that each of the 8 com-
binations of the multivariate response need to appear in the data set, therefore data sets will need to
be large in order for this family function to work.

Author(s)
Thomas W. Yee
loglog 441

loglog Log-log Link Function

Description
Computes the log-log transformation, including its inverse and the first two derivatives.

Usage
loglog(theta, bvalue = NULL, inverse = FALSE, deriv = 0,
short = TRUE, tag = FALSE)

Arguments
theta Numeric or character. See below for further details.
bvalue Values of theta which are less than or equal to 1 can be replaced by bvalue
before computing the link function value. The component name bvalue stands
for boundary value. See Links for more information.
inverse, deriv, short, tag
Details at Links.

Details
The log-log link function is commonly used for parameters that are greater than unity. Numerical
values of theta close to 1 or out of range result in Inf, -Inf, NA or NaN.
442 lognormal

Value
For deriv = 0, the log of theta, i.e., log(log(theta)) when inverse = FALSE, and if
inverse = TRUE then exp(exp(theta)).
For deriv = 1, then the function returns d theta / d eta as a function of theta if inverse = FALSE,
else if inverse = TRUE then it returns the reciprocal.
Here, all logarithms are natural logarithms, i.e., to base e.

Note
Numerical instability may occur when theta is close to 1 unless bvalue is used.

Author(s)
Thomas W. Yee

References
McCullagh, P. and Nelder, J. A. (1989) Generalized Linear Models, 2nd ed. London: Chapman &
Hall.

x <- seq(1.01, 10, len = 100)

loglog(x)
max(abs(loglog(loglog(x), inverse = TRUE) - x)) # Should be 0

lognormal Lognormal Distribution

Description
Maximum likelihood estimation of the (univariate) lognormal distribution.

Usage
lognormal(lmeanlog = "identitylink", lsdlog = "loge", zero = "sdlog")
lognormal 443

Arguments
lmeanlog, lsdlog
Parameter link functions applied to the mean and (positive) (standard devia-
tion) parameter. Both of these are on the log scale. See Links for more choices.
zero Specifies which linear/additive predictor is modelled as intercept-only. For lognormal(),
the values can be from the set {1,2} which correspond to mu, sigma, respectively.
See CommonVGAMffArguments for more information.

Details
A random variable Y has a 2-parameter lognormal distribution if log(Y ) is distributed N (, 2 ).
The expected value of Y , which is

E(Y ) = exp( + 0.5 2 )

and not , make up the fitted values. The variance of Y is

V ar(Y ) = [exp( 2 ) 1] exp(2 + 2 ).

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, and vgam.

Author(s)
T. W. Yee

References
Kleiber, C. and Kotz, S. (2003) Statistical Size Distributions in Economics and Actuarial Sciences,
Hoboken, NJ, USA: Wiley-Interscience.

See Also
rlnorm, uninormal, CommonVGAMffArguments, simulate.vlm.

Examples
ldata2 <- data.frame(x2 = runif(nn <- 1000))
ldata2 <- transform(ldata2, y1 = rlnorm(nn, mean = 1 + 2 * x2, sd = exp(-1)),
y2 = rlnorm(nn, mean = 1, sd = exp(-1 + x2)))
fit1 <- vglm(y1 ~ x2, lognormal(zero = 2), data = ldata2, trace = TRUE)
fit2 <- vglm(y2 ~ x2, lognormal(zero = 1), data = ldata2, trace = TRUE)
coef(fit1, matrix = TRUE)
coef(fit2, matrix = TRUE)
444 logoff

logoff Log Link Function with an Offset

Description
Computes the log transformation with an offset, including its inverse and the first two derivatives.

Usage
logoff(theta, offset = 0, inverse = FALSE, deriv = 0,
short = TRUE, tag = FALSE)

Arguments
theta Numeric or character. See below for further details.
offset Offset value. See Links.
inverse, deriv, short, tag
Details at Links.

Details
The log-offset link function is very commonly used for parameters that are greater than a certain
value. In particular, it is defined by log(theta + offset) where offset is the offset value. For
example, if offset = 0.5 then the value of theta is restricted to be greater than 0.5.
Numerical values of theta close to -offset or out of range result in Inf, -Inf, NA or NaN.

Value
For deriv = 0, the log of theta+offset, i.e., log(theta+offset) when inverse = FALSE, and
if inverse = TRUE then exp(theta)-offset.
For deriv = 1, then the function returns d theta / d eta as a function of theta if inverse = FALSE,
else if inverse = TRUE then it returns the reciprocal.
Here, all logarithms are natural logarithms, i.e., to base e.

Note
The default means this function is identical to loge.
Numerical instability may occur when theta is close to -offset.

Author(s)
Thomas W. Yee

References
McCullagh, P. and Nelder, J. A. (1989) Generalized Linear Models, 2nd ed. London: Chapman &
Hall.
Lomax 445

Lomax The Lomax Distribution

Description
Density, distribution function, quantile function and random generation for the Lomax distribution
with scale parameter scale and shape parameter q.

Usage
dlomax(x, scale = 1, shape3.q, log = FALSE)
plomax(q, scale = 1, shape3.q, lower.tail = TRUE, log.p = FALSE)
qlomax(p, scale = 1, shape3.q, lower.tail = TRUE, log.p = FALSE)
rlomax(n, scale = 1, shape3.q)

Arguments
x, q vector of quantiles.
p vector of probabilities.
n number of observations. If length(n) > 1, the length is taken to be the number
required.
scale scale parameter.
shape3.q shape parameter.
log Logical. If log = TRUE then the logarithm of the density is returned.
lower.tail, log.p
Same meaning as in pnorm or qnorm.

Details
See lomax, which is the VGAM family function for estimating the parameters by maximum likeli-
hood estimation.

Value
dlomax gives the density, plomax gives the distribution function, qlomax gives the quantile function,
and rlomax generates random deviates.
446 lomax

Note
The Lomax distribution is a special case of the 4-parameter generalized beta II distribution.

Author(s)
T. W. Yee and Kai Huang

References
Kleiber, C. and Kotz, S. (2003) Statistical Size Distributions in Economics and Actuarial Sciences,
Hoboken, NJ, USA: Wiley-Interscience.

## Not run: par(mfrow = c(1, 2))

x <- seq(-0.01, 5, len = 401)
plot(x, dexp(x), type = "l", col = "black", ylab = "", ylim = c(0, 3),
main = "Black is standard exponential, others are dlomax(x, shape3.q)")
lines(x, dlomax(x, shape3.q = 1), col = "orange")
lines(x, dlomax(x, shape3.q = 2), col = "blue")
lines(x, dlomax(x, shape3.q = 5), col = "green")
legend("topright", col = c("orange","blue","green"), lty = rep(1, 3),
legend = paste("shape3.q =", c(1, 2, 5)))

plot(x, pexp(x), type = "l", col = "black", ylab = "", las = 1,

main = "Black is standard exponential, others are plomax(x, shape3.q)")
lines(x, plomax(x, shape3.q = 1), col = "orange")
lines(x, plomax(x, shape3.q = 2), col = "blue")
lines(x, plomax(x, shape3.q = 5), col = "green")
legend("bottomright", col = c("orange","blue","green"), lty = rep(1, 3),
legend = paste("shape3.q =", c(1, 2, 5)))

## End(Not run)

lomax Lomax Distribution Family Function

Description
Maximum likelihood estimation of the 2-parameter Lomax distribution.
lomax 447

Usage
lomax(lscale = "loge", lshape3.q = "loge", iscale = NULL,
ishape3.q = NULL, imethod = 1, gscale = exp(-5:5),
gshape3.q = seq(0.75, 4, by = 0.25),
probs.y = c(0.25, 0.5, 0.75), zero = "shape")

Arguments
lscale, lshape3.q
Parameter link function applied to the (positive) parameters scale and q. See
Links for more choices.
iscale, ishape3.q, imethod
See CommonVGAMffArguments for information. For imethod = 2 a good initial
value for iscale is needed to obtain a good estimate for the other parameter.
gscale, gshape3.q, zero, probs.y
See CommonVGAMffArguments.

Details
The 2-parameter Lomax distribution is the 4-parameter generalized beta II distribution with shape
parameters a = p = 1. It is probably more widely known as the Pareto (II) distribution. It is
also the 3-parameter Singh-Maddala distribution with shape parameter a = 1, as well as the beta
distribution of the second kind with p = 1. More details can be found in Kleiber and Kotz (2003).
The Lomax distribution has density

f (y) = q/[b{1 + y/b}1+q ]

for b > 0, q > 0, y 0. Here, b is the scale parameter scale, and q is a shape parameter. The
cumulative distribution function is

F (y) = 1 [1 + (y/b)]q .

The mean is
E(Y ) = b/(q 1)
provided q > 1; these are returned as the fitted values. This family function handles multiple
responses.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, and vgam.

Note
See the notes in genbetaII.

Author(s)
T. W. Yee
448 lqnorm

References
Kleiber, C. and Kotz, S. (2003) Statistical Size Distributions in Economics and Actuarial Sciences,
Hoboken, NJ, USA: Wiley-Interscience.

See Also
Lomax, genbetaII, betaII, dagum, sinmad, fisk, inv.lomax, paralogistic, inv.paralogistic,
simulate.vlm.

Examples
ldata <- data.frame(y = rlomax(n = 1000, scale = exp(1), exp(2)))
fit <- vglm(y ~ 1, lomax, data = ldata, trace = TRUE)
coef(fit, matrix = TRUE)
Coef(fit)
summary(fit)

lqnorm Minimizing the L-q norm Family Function

Description
Minimizes the L-q norm of residuals in a linear model.

Usage
lqnorm(qpower = 2, link = "identitylink",
imethod = 1, imu = NULL, ishrinkage = 0.95)

Arguments
qpower A single numeric, must be greater than one, called q below. The absolute value
of residuals are raised to the power of this argument, and then summed. This
quantity is minimized with respect to the regression coefficients.
link Link function applied to the mean . See Links for more details.
imethod Must be 1, 2 or 3. See CommonVGAMffArguments for more information. Ignored
if imu is specified.
imu Numeric, optional initial values used for the fitted values. The default is to use
imethod = 1.
ishrinkage How much shrinkage is used when initializing the fitted values. The value must
be between 0 and 1 inclusive, and a value of 0 means the individual response
values are used, and a value of 1 means the median or mean is used. This
argument is used in conjunction with imethod = 3.
lqnorm 449

Details

This function minimizes the objective function

n
X
wi (|yi i |)q
i=1

where q is the argument qpower, i = g(i ) where g is the link function, and i is the vector of
linear/additive predictors. The prior weights wi can be inputted using the weights argument of
vlm/vglm/vgam etc.; it should be just a vector here since this function handles only a single vector
or one-column response.
Numerical problem will occur when q is too close to one. Probably reasonable values range from
1.5 and up, say. The value q = 2 corresponds to ordinary least squares while q = 1 corresponds to
the MLE of a double exponential (Laplace) distibution. The procedure becomes more sensitive to
outliers the larger the value of q.

Value

An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, and vgam.

Warning

Convergence failure is common, therefore the user is advised to be cautious and monitor conver-
gence!

Note

This VGAM family function is an initial attempt to provide a more robust alternative for regression
and/or offer a little more flexibility than least squares. The @misc slot of the fitted object contains a
list component called objectiveFunction which is the value of the objective function at the final
iteration.

Author(s)

Thomas W. Yee

References

Yee, T. W. and Wild, C. J. (1996) Vector generalized additive models. Journal of the Royal Statisti-
cal Society, Series B, Methodological, 58, 481493.

See Also

gaussianff.
450 lrtest

Examples
set.seed(123)
ldata <- data.frame(x = sort(runif(nn <- 10 )))
realfun <- function(x) 4 + 5*x
ldata <- transform(ldata, y = realfun(x) + rnorm(nn, sd = exp(-1)))
# Make the first observation an outlier
ldata <- transform(ldata, y = c(4*y[1], y[-1]), x = c(-1, x[-1]))
fit <- vglm(y ~ x, lqnorm(qpower = 1.2), data = ldata)
coef(fit, matrix = TRUE)
head(fitted(fit))
fit@misc$qpower
fit@misc$objectiveFunction

## Not run:
# Graphical check
with(ldata, plot(x, y, main = paste("LS = red, lqnorm = blue (qpower = ",
fit@misc$qpower, "), truth = black", sep = ""), col = "blue"))
lmfit <- lm(y ~ x, data = ldata)
with(ldata, lines(x, fitted(fit), col = "blue"))
with(ldata, lines(x, lmfit$fitted, col = "red"))
with(ldata, lines(x, realfun(x), col = "black"))
## End(Not run)

lrtest Likelihood Ratio Test of Nested Models

Description
lrtest is a generic function for carrying out likelihood ratio tests. The default method can be
employed for comparing nested VGLMs (see details below).

Usage
lrtest(object, ...)

lrtest_vglm(object, ..., no.warning = FALSE, name = NULL)

Arguments
object a vglm object. See below for details.
... further object specifications passed to methods. See below for details.
no.warning logical; if TRUE then no warning is issued. For example, setting TRUE might be a
good idea when testing for linearity of a variable for a "pvgam" object.
name a function for extracting a suitable name/description from a fitted model object.
By default the name is queried by calling formula.
lrtest 451

Details
lrtest is intended to be a generic function for comparisons of models via asymptotic likelihood
ratio tests. The default method consecutively compares the fitted model object object with the
models passed in .... Instead of passing the fitted model objects in ..., several other specifications
are possible. The updating mechanism is the same as for waldtest: the models in ... can be
specified as integers, characters (both for terms that should be eliminated from the previous model),
update formulas or fitted model objects. Except for the last case, the existence of an update method
is assumed. See waldtest for details.
Subsequently, an asymptotic likelihood ratio test for each two consecutive models is carried out:
Twice the difference in log-likelihoods (as derived by the logLik methods) is compared with a
Chi-squared distribution.

Value
An object of class "VGAManova" which contains a slot with the log-likelihood, degrees of freedom,
the difference in degrees of freedom, likelihood ratio Chi-squared statistic and corresponding p
value. These are printed by stats:::print.anova(); see anova.

Warning
Several VGAM family functions implement distributions which do not satisfying the usual regular-
ity conditions needed for the LRT to work. No checking or warning is given for these.

Note
The code was adapted directly from lmtest (written by T. Hothorn, A. Zeileis, G. Millo, D. Mitchell)
and made to work for VGLMs and S4. This help file also was adapted from lmtest.
Approximate LRTs might be applied to VGAMs, as produced by vgam, but it is probably better in
inference to use vglm with regression splines (bs and ns). This methods function should not be
applied to other models such as those produced by rrvglm, by cqo, by cao.

(ans4 <- lrtest(fit3, fit1)) # Test proportional odds (parallelism) assumption

lvplot Latent Variable Plot

Description
Generic function for a latent variable plot (also known as an ordination diagram by ecologists).

Usage
lvplot(object, ...)

Arguments
object An object for a latent variable plot is meaningful.
... Other arguments fed into the specific methods function of the model. They
usually are graphical parameters, and sometimes they are fed into the methods
function for Coef.

Details
Latent variables occur in reduced-rank regression models, as well as in quadratic and additive or-
dination. For the latter, latent variables are often called the site scores. Latent variable plots were
coined by Yee (2004), and have the latent variable as at least one of its axes.

Value
The value returned depends specifically on the methods function invoked.

Note
Latent variables are not really applicable to vglm/vgam models.

Author(s)
Thomas W. Yee

See Also
lvplot.qrrvglm, lvplot.cao, latvar, trplot.

Examples
## Not run:
hspider[,1:6] <- scale(hspider[,1:6]) # Standardized environmental vars
set.seed(123)
p1 <- cao(cbind(Pardlugu, Pardmont, Pardnigr, Pardpull, Zoraspin) ~
WaterCon + BareSand + FallTwig +
CoveMoss + CoveHerb + ReflLux,
family = poissonff, data = hspider, Bestof = 3,
df1.nl = c(Zoraspin = 2.5, 3), Crow1positive = TRUE)
index <- 1:ncol(depvar(p1))
lvplot(p1, lcol = index, pcol = index, y = TRUE, las = 1)

## End(Not run)

lvplot.qrrvglm Latent Variable Plot for QO models

Description
Produces an ordination diagram (latent variable plot) for quadratic ordination (QO) models. For
rank-1 models, the x-axis is the first ordination/constrained/canonical axis. For rank-2 models, the
x- and y-axis are the first and second ordination axes respectively.

Usage
lvplot.qrrvglm(object, varI.latvar = FALSE, refResponse = NULL,
add = FALSE, show.plot = TRUE,
rug = TRUE, y = FALSE, type = c("fitted.values", "predictors"),
xlab = paste("Latent Variable", if (Rank == 1) "" else " 1", sep = ""),
ylab = if (Rank == 1) switch(type, predictors = "Predictors",
fitted.values = "Fitted values") else "Latent Variable 2",
pcex = par()$cex, pcol = par()$col, pch = par()$pch,
llty = par()$lty, lcol = par()$col, llwd = par()$lwd,
label.arg = FALSE, adj.arg = -0.1,
ellipse = 0.95, Absolute = FALSE,
elty = par()$lty, ecol = par()$col, elwd = par()$lwd, egrid = 200,
chull.arg = FALSE, clty = 2, ccol = par()$col, clwd = par()$lwd,
cpch = " ",
C = FALSE, OriginC = c("origin", "mean"),
Clty = par()$lty, Ccol = par()$col, Clwd = par()$lwd,
Ccex = par()$cex, Cadj.arg = -0.1, stretchC = 1,
sites = FALSE, spch = NULL, scol = par()$col, scex = par()$cex,
sfont = par()$font, check.ok = TRUE, ...)
454 lvplot.qrrvglm

Arguments
object A CQO object.
varI.latvar Logical that is fed into Coef.qrrvglm.
refResponse Integer or character that is fed into Coef.qrrvglm.
add Logical. Add to an existing plot? If FALSE, a new plot is made.
show.plot Logical. Plot it?
rug Logical. If TRUE, a rug plot is plotted at the foot of the plot (applies to rank-1
models only). These values are jittered to expose ties.
y Logical. If TRUE, the responses will be plotted (applies only to rank-1 models
and if type = "fitted.values".)
type Either "fitted.values" or "predictors", specifies whether the y-axis is on
the response or eta-scales respectively.
xlab Caption for the x-axis. See par.
ylab Caption for the y-axis. See par.
pcex Character expansion of the points. Here, for rank-1 models, points are the re-
sponse y data. For rank-2 models, points are the optimums. See the cex argu-
ment in par.
pcol Color of the points. See the col argument in par.
pch Either an integer specifying a symbol or a single character to be used as the
default in plotting points. See par. The pch argument can be of length M , the
number of species.
llty Line type. Rank-1 models only. See the lty argument of par.
lcol Line color. Rank-1 models only. See the col argument of par.
llwd Line width. Rank-1 models only. See the lwd argument of par.
label.arg Logical. Label the optimums and C? (applies only to rank-2 models only).
adj.arg Justification of text strings for labelling the optimums (applies only to rank-2
models only). See the adj argument of par.
ellipse Numerical, of length 0 or 1 (applies only to rank-2 models only). If Absolute
is TRUE then ellipse should be assigned a value that is used for the elliptical
contouring. If Absolute is FALSE then ellipse should be assigned a value
between 0 and 1, for example, setting ellipse = 0.9 means an ellipse with
contour = 90% of the maximum will be plotted about each optimum. If ellipse
is a negative value, then the function checks that the model is an equal-tolerances
model and varI.latvar = FALSE, and if so, plots circles with radius -ellipse.
For example, setting ellipse = -1 will result in circular contours that have unit
radius (in latent variable units). If ellipse is NULL or FALSE then no ellipse is
drawn around the optimums.
Absolute Logical. If TRUE, the contours corresponding to ellipse are on an absolute
scale. If FALSE, the contours corresponding to ellipse are on a relative scale.
elty Line type of the ellipses. See the lty argument of par.
ecol Line color of the ellipses. See the col argument of par.
lvplot.qrrvglm 455

elwd Line width of the ellipses. See the lwd argument of par.
egrid Numerical. Line resolution of the ellipses. Choosing a larger value will result in
smoother ellipses. Useful when ellipses are large.
chull.arg Logical. Add a convex hull around the site scores?
clty Line type of the convex hull. See the lty argument of par.
ccol Line color of the convex hull. See the col argument of par.
clwd Line width of the convex hull. See the lwd argument of par.
cpch Character to be plotted at the intersection points of the convex hull. Having
white spaces means that site labels are not obscured there. See the pch argument
of par.
C Logical. Add C (represented by arrows emanating from OriginC) to the plot?
OriginC Character or numeric. Where the arrows representing C emanate from. If char-
acter, it must be one of the choices given. By default the first is chosen. The
value "origin" means c(0,0). The value "mean" means the sample mean of
the latent variables (centroid). Alternatively, the user may specify a numerical
vector of length 2.
Clty Line type of the arrows representing C. See the lty argument of par.
Ccol Line color of the arrows representing C. See the col argument of par.
Clwd Line width of the arrows representing C. See the lwd argument of par.
Ccex Numeric. Character expansion of the labelling of C. See the cex argument of
par.
Cadj.arg Justification of text strings when labelling C. See the adj argument of par.
stretchC Numerical. Stretching factor for C. Instead of using C, stretchC * C is used.
sites Logical. Add the site scores (aka latent variable values, nus) to the plot? (ap-
plies only to rank-2 models only).
spch Plotting character of the site scores. The default value of NULL means the row
labels of the data frame are used. They often are the site numbers. See the pch
argument of par.
scol Color of the site scores. See the col argument of par.
scex Character expansion of the site scores. See the cex argument of par.
sfont Font used for the site scores. See the font argument of par.
check.ok Logical. Whether a check is performed to see that noRRR = ~ 1 was used. It
doesnt make sense to have a latent variable plot unless this is so.
... Arguments passed into the plot function when setting up the entire plot. Useful
arguments here include xlim and ylim.

Details
This function only works for rank-1 and rank-2 QRR-VGLMs with argument noRRR = ~ 1.
For unequal-tolerances models, the latent variable axes can be rotated so that at least one of the
tolerance matrices is diagonal; see Coef.qrrvglm for details.
Arguments beginning with p correspond to the points e.g., pcex and pcol correspond to the size
and color of the points. Such p arguments should be vectors of length 1, or n, the number of sites.
For the rank-2 model, arguments beginning with p correspond to the optimums.
456 lvplot.qrrvglm

Value
Returns a matrix of latent variables (site scores) regardless of whether a plot was produced or not.

Warning
Interpretation of a latent variable plot (CQO diagram) is potentially very misleading in terms of dis-
tances if (i) the tolerance matrices of the species are unequal and (ii) the contours of these tolerance
matrices are not included in the ordination diagram.

Note
A species which does not have an optimum will not have an ellipse drawn even if requested, i.e., if
its tolerance matrix is not positive-definite.
Plotting C gives a visual display of the weights (loadings) of each of the variables used in the linear
combination defining each latent variable.
The arguments elty, ecol and elwd, may be replaced in the future by llty, lcol and llwd,
respectively.
For rank-1 models, a similar function to this one is perspqrrvglm. It plots the fitted values on a
more fine grid rather than at the actual site scores here. The result is a collection of smooth bell-
shaped curves. However, it has the weakness that the plot is more divorced from the data; the user
thinks it is the truth without an appreciation of the statistical variability in the estimates.
In the example below, the data comes from an equal-tolerances model. The species tolerance
matrices are all the identity matrix, and the optimums are at (0,0), (1,1) and (-2,0) for species 1, 2,
3 respectively.

Author(s)
Thomas W. Yee

References
Yee, T. W. (2004) A new technique for maximum-likelihood canonical Gaussian ordination. Eco-
logical Monographs, 74, 685701.

See Also
lvplot, perspqrrvglm, Coef.qrrvglm, par, cqo.

Examples
set.seed(123); nn <- 200
cdata <- data.frame(x2 = rnorm(nn), # Has mean 0 (needed when I.tol=TRUE)
x3 = rnorm(nn), # Has mean 0 (needed when I.tol=TRUE)
x4 = rnorm(nn)) # Has mean 0 (needed when I.tol=TRUE)
cdata <- transform(cdata, latvar1 = x2 + x3 - 2*x4,
latvar2 = -x2 + x3 + 0*x4)
# Nb. latvar2 is weakly correlated with latvar1
cdata <- transform(cdata,
lambda1 = exp(6 - 0.5 * (latvar1-0)^2 - 0.5 * (latvar2-0)^2),
lvplot.rrvglm 457

lambda2 = exp(5 - 0.5 * (latvar1-1)^2 - 0.5 * (latvar2-1)^2),

lambda3 = exp(5 - 0.5 * (latvar1+2)^2 - 0.5 * (latvar2-0)^2))
cdata <- transform(cdata,
spp1 = rpois(nn, lambda1),
spp2 = rpois(nn, lambda2),
spp3 = rpois(nn, lambda3))
set.seed(111)
## Not run:
p2 <- cqo(cbind(spp1, spp2, spp3) ~ x2 + x3 + x4, poissonff,
data = cdata, Rank = 2, I.tolerances = TRUE,
Crow1positive = c(TRUE, FALSE)) # deviance = 505.81
if (deviance(p2) > 506) stop("suboptimal fit obtained")
sort(deviance(p2, history = TRUE)) # A history of all the iterations
Coef(p2)

## End(Not run)

## Not run:
lvplot(p2, sites = TRUE, spch = "*", scol = "darkgreen", scex = 1.5,
chull = TRUE, label = TRUE, Absolute = TRUE, ellipse = 140,
adj = -0.5, pcol = "blue", pcex = 1.3, las = 1, Ccol = "orange",
C = TRUE, Cadj = c(-0.3, -0.3, 1), Clwd = 2, Ccex = 1.4,
main = paste("Contours at Abundance = 140 with",
"convex hull of the site scores"))
## End(Not run)
## Not run:
var(latvar(p2)) # A diagonal matrix, i.e., uncorrelated latent vars
var(latvar(p2, varI.latvar = TRUE)) # Identity matrix
Tol(p2)[, , 1:2] # Identity matrix
Tol(p2, varI.latvar = TRUE)[, , 1:2] # A diagonal matrix

## End(Not run)

lvplot.rrvglm Latent Variable Plot for RR-VGLMs

Description
Produces an ordination diagram (also known as a biplot or latent variable plot) for reduced-rank
vector generalized linear models (RR-VGLMs). For rank-2 models only, the x- and y-axis are the
first and second canonical axes respectively.

Usage
lvplot.rrvglm(object,
A = TRUE, C = TRUE, scores = FALSE, show.plot = TRUE,
groups = rep(1, n), gapC = sqrt(sum(par()$cxy^2)),
scaleA = 1,
xlab = "Latent Variable 1", ylab = "Latent Variable 2",
Alabels = if (length(object@misc$predictors.names))
458 lvplot.rrvglm

object@misc$predictors.names else paste("LP", 1:M, sep = ""),

Aadj = par()$adj, Acex = par()$cex, Acol = par()$col,
Apch = NULL,
Clabels = rownames(Cmat), Cadj = par()$adj,
Ccex = par()$cex, Ccol = par()$col, Clty = par()$lty,
Clwd = par()$lwd,
chull.arg = FALSE, ccex = par()$cex, ccol = par()$col,
clty = par()$lty, clwd = par()$lwd,
spch = NULL, scex = par()$cex, scol = par()$col,
slabels = rownames(x2mat), ...)

Arguments
object Object of class "rrvglm".
A Logical. Allow the plotting of A?
C Logical. Allow the plotting of C? If TRUE then C is represented by arrows eme-
nating from the origin.
scores Logical. Allow the plotting of the n scores? The scores are the values of the
latent variables for each observation.
show.plot Logical. Plot it? If FALSE, no plot is produced and the matrix of scores (n latent
variable values) is returned. If TRUE, the rank of object need not be 2.
groups A vector whose distinct values indicate which group the observation belongs
to. By default, all the observations belong to a single group. Useful for the
multinomial logit model (see multinomial.
gapC The gap between the end of the arrow and the text labelling of C, in latent
variable units.
scaleA Numerical value that is multiplied by A, so that C is divided by this value.
xlab Caption for the x-axis. See par.
ylab Caption for the y-axis. See par.
Alabels Character vector to label A. Must be of length M .
Aadj Justification of text strings for labelling A. See the adj argument of par.
Acex Numeric. Character expansion of the labelling of A. See the cex argument of
par.
Acol Line color of the arrows representing C. See the col argument of par.
Apch Either an integer specifying a symbol or a single character to be used as the
default in plotting points. See par. The pch argument can be of length M , the
number of species.
Clabels Character vector to label C. Must be of length p2.
Cadj Justification of text strings for labelling C. See the adj argument of par.
Ccex Numeric. Character expansion of the labelling of C. See the cex argument of
par.
Ccol Line color of the arrows representing C. See the col argument of par.
Clty Line type of the arrows representing C. See the lty argument of par.
lvplot.rrvglm 459

Clwd Line width of the arrows representing C. See the lwd argument of par.
chull.arg Logical. Plot the convex hull of the scores? This is done for each group (see the
group argument).
ccex Numeric. Character expansion of the labelling of the convex hull. See the cex
argument of par.
ccol Line color of the convex hull. See the col argument of par.
clty Line type of the convex hull. See the lty argument of par.
clwd Line width of the convex hull. See the lwd argument of par.
spch Either an integer specifying a symbol or a single character to be used as the
default in plotting points. See par. The spch argument can be of length M , the
number of species.
scex Numeric. Character expansion of the labelling of the scores. See the cex argu-
ment of par.
scol Line color of the arrows representing C. See the col argument of par.
slabels Character vector to label the scores. Must be of length n.
... Arguments passed into the plot function when setting up the entire plot. Useful
arguments here include xlim and ylim.

Details
For RR-VGLMs, a biplot and a latent variable plot coincide. In general, many of the arguments
starting with A refer to A (of length M ), C to C (of length p2), c to the convex hull (of length
length(unique(groups))), and s to scores (of length n).
As the result is a biplot, its interpretation is based on the inner product.

Value
The matrix of scores (n latent variable values) is returned regardless of whether a plot was produced
or not.

Note
The functions lvplot.rrvglm and biplot.rrvglm are equivalent.
In the example below the predictor variables are centered, which is a good idea.

Author(s)
Thomas W. Yee

References
Yee, T. W. and Hastie, T. J. (2003) Reduced-rank vector generalized linear models. Statistical
Modelling, 3, 1541.

See Also
lvplot, par, rrvglm, Coef.rrvglm, rrvglm.control.
460 machinists

Examples
nn <- nrow(pneumo) # x1, x2 and x3 are some unrelated covariates
pneumo <- transform(pneumo, slet = scale(log(exposure.time)),
x1 = rnorm(nn), x2 = rnorm(nn), x3 = rnorm(nn))
fit <- rrvglm(cbind(normal, mild, severe) ~ slet + x1 + x2 + x3,
multinomial, data = pneumo, Rank = 2,
Corner = FALSE, Uncorrel = TRUE)
## Not run:
lvplot(fit, chull = TRUE, scores = TRUE, clty = 2, ccol = "blue",
scol = "red", Ccol = "darkgreen", Clwd = 2, Ccex = 2,
main = "Biplot of some fictitional data")
## End(Not run)

machinists Machinists Accidents

Description
A small count data set involving 414 machinists from a three months study, of accidents around the
end of WWI.

Usage
data(machinists)

Format
A data frame with the following variables.

accidents The number of accidents

ofreq Observed frequency, i.e., the number of machinists with that many accidents

Details
The data was collected over a period of three months. There were 414 machinists in total. Also,
there were data collected over six months, but it is not given here.

Source
Incidence of Industrial Accidents. Report No. 4 (Industrial Fatigue Research Board), Stationery
Office, London, 1919.

References
Greenwood, M. and Yule, G. U. (1920). An Inquiry into the Nature of Frequency Distributions
Representative of Multiple Happenings with Particular Reference to the Occurrence of Multiple
Attacks of Disease or of Repeated Accidents. Journal of the Royal Statistical Society, 83, 255279.
Makeham 461

Makeham The Makeham Distribution

Description
Density, cumulative distribution function, quantile function and random generation for the Make-
ham distribution.

Usage
dmakeham(x, scale = 1, shape, epsilon = 0, log = FALSE)
pmakeham(q, scale = 1, shape, epsilon = 0, lower.tail = TRUE, log.p = FALSE)
qmakeham(p, scale = 1, shape, epsilon = 0, lower.tail = TRUE, log.p = FALSE)
rmakeham(n, scale = 1, shape, epsilon = 0)

Details
See makeham for details. The default value of epsilon = 0 corresponds to the Gompertz distribu-
tion. The function pmakeham uses lambertW.
462 makeham

Value

dmakeham gives the density, pmakeham gives the cumulative distribution function, qmakeham gives
the quantile function, and rmakeham generates random deviates.

Author(s)

T. W. Yee and Kai Huang

References

Jodra, P. (2009) A closed-form expression for the quantile function of the Gompertz-Makeham
distribution. Mathematics and Computers in Simulation, 79, 30693075.

See Also

makeham, lambertW.

Examples
probs <- seq(0.01, 0.99, by = 0.01)
Shape <- exp(-1); Scale <- exp(1); eps = Epsilon <- exp(-1)
max(abs(pmakeham(qmakeham(p = probs, sca = Scale, Shape, eps = Epsilon),
sca = Scale, Shape, eps = Epsilon) - probs)) # Should be 0

## Not run: x <- seq(-0.1, 2.0, by = 0.01);

plot(x, dmakeham(x, sca = Scale, Shape, eps = Epsilon), type = "l",
main = "Blue is density, orange is cumulative distribution function",
sub = "Purple lines are the 10,20,...,90 percentiles",
col = "blue", las = 1, ylab = "")
abline(h = 0, col = "blue", lty = 2)
lines(x, pmakeham(x, sca = Scale, Shape, eps = Epsilon), col = "orange")
probs <- seq(0.1, 0.9, by = 0.1)
Q <- qmakeham(probs, sca = Scale, Shape, eps = Epsilon)
lines(Q, dmakeham(Q, sca = Scale, Shape, eps = Epsilon),
col = "purple", lty = 3, type = "h")
pmakeham(Q, sca = Scale, Shape, eps = Epsilon) - probs # Should be all zero
abline(h = probs, col = "purple", lty = 3)
## End(Not run)

makeham Makeham Distribution Family Function

Description

Maximum likelihood estimation of the 3-parameter Makeham distribution.

makeham 463

Usage
makeham(lscale = "loge", lshape = "loge", lepsilon = "loge",
iscale = NULL, ishape = NULL, iepsilon = NULL,
gscale = exp(-5:5),gshape = exp(-5:5), gepsilon = exp(-4:1),
nsimEIM = 500, oim.mean = TRUE, zero = NULL, nowarning = FALSE)

Arguments
nowarning Logical. Suppress a warning? Ignored for VGAM 0.9-7 and higher.
lshape, lscale, lepsilon
Parameter link functions applied to the shape parameter shape, scale parameter
scale, and other parameter epsilon. All parameters are treated as positive here
(cf. dmakeham allows epsilon = 0, etc.). See Links for more choices.
ishape, iscale, iepsilon
Optional initial values. A NULL means a value is computed internally. A value
must be given for iepsilon currently, and this is a sensitive parameter!
gshape, gscale, gepsilon
See CommonVGAMffArguments.
nsimEIM, zero See CommonVGAMffArguments. Argument probs.y is used only when imethod = 2.
oim.mean To be currently ignored.

Details
The Makeham distribution, which adds another parameter to the Gompertz distribution, has cumu-
lative distribution function

y

F (y; , , ) = 1 exp y + 1e

which leads to a probability density function

y
y

f (y; , , ) = + e exp y + 1e ,

for > 0, > 0, 0, y > 0. Here, is called the scale parameter scale, and is called a
shape parameter. The moments for this distribution do not appear to be available in closed form.
Simulated Fisher scoring is used and multiple responses are handled.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, and vgam.

Warning
A lot of care is needed because this is a rather difficult distribution for parameter estimation, espe-
cially when the shape parameter is large relative to the scale parameter. If the self-starting initial
values fail then try experimenting with the initial value arguments, especially iepsilon. Successful
464 margeff

convergence depends on having very good initial values. More improvements could be made here.
Also, monitor convergence by setting trace = TRUE.
A trick is to fit a gompertz distribution and use it for initial values; see below. However, this family
function is currently numerically fraught.

Author(s)
T. W. Yee

See Also
dmakeham, gompertz, simulate.vlm.

Examples
## Not run: set.seed(123)
mdata <- data.frame(x2 = runif(nn <- 1000))
mdata <- transform(mdata, eta1 = -1,
ceta1 = 1,
eeta1 = -2)
mdata <- transform(mdata, shape1 = exp(eta1),
scale1 = exp(ceta1),
epsil1 = exp(eeta1))
mdata <- transform(mdata,
y1 = rmakeham(nn, shape = shape1, scale = scale1, eps = epsil1))

# A trick is to fit a Gompertz distribution first

fit0 <- vglm(y1 ~ 1, gompertz, data = mdata, trace = TRUE)
fit1 <- vglm(y1 ~ 1, makeham, data = mdata,
etastart = cbind(predict(fit0), log(0.1)), trace = TRUE)

coef(fit1, matrix = TRUE)

summary(fit1)

## End(Not run)

margeff Marginal effects for several categorical response models

Description
Marginal effects for the multinomial logit model and cumulative logit/probit/... models and contin-
uation ratio models and stopping ratio models and adjacent categories models: the derivative of the
fitted probabilities with respect to each explanatory variable.

Usage
margeff(object, subset = NULL, ...)
margeff 465

Arguments
object A vglm object, with one of the following family functions: multinomial, cumulative,
cratio, sratio or acat.
subset Numerical or logical vector, denoting the required observation(s). Recycling is
used if possible. The default means all observations.
... further arguments passed into the other methods functions.

Details
Computes the derivative of the fitted probabilities of the categorical response model with respect to
each explanatory variable. Formerly one big function, this function now uses S4 dispatch to break
up the computations.
The function margeff() is not generic. However, it calls the function margeffS4VGAM() which
is. This is based on the class of the VGAMff argument, and it uses the S4 function setMethod to
correctly dispatch to the required methods function. The inheritance is given by the vfamily slot
of the VGAM family function.

Value
A p by M + 1 by n array, where p is the number of explanatory variables and the (hopefully)
nominal response has M + 1 levels, and there are n observations.
In general, if is.numeric(subset) and length(subset) == 1 then a p by M + 1 matrix is
returned.

Warning
Care is needed in interpretation, e.g., the change is not universally accurate for a unit change in each
explanatory variable because eventually the new probabilities may become negative or greater than
unity. Also, the new probabilities will not sum to one.
This function is not applicable for models with data-dependent terms such as bs and poly. Also the
function should not be applied to models with any terms that have generated more than one column
of the LM model matrix, such as bs and poly. For such try using numerical methods such as finite-
differences. The formula in object should comprise of simple terms of the form ~ x2 + x3 + x4,
etc.
Some numerical problems may occur if the fitted values are close to 0 or 1 for the cratio and
sratio models. Models with offsets may result in an incorrect answer.

Note
For multinomial this function should handle any value of refLevel and also any constraint matri-
ces. However, it does not currently handle the xij or form2 arguments, nor vgam objects.
Some other limitations are imposed, e.g., for acat models only a loge link is allowed.

Author(s)
T. W. Yee, with some help and motivation from Stasha Rmandic.
466 marital.nz

See Also
multinomial, cumulative, propodds, acat, cratio, sratio, vglm.

Examples
# Not a good example for multinomial() because the response is ordinal!!
ii <- 3; hh <- 1/100
pneumo <- transform(pneumo, let = log(exposure.time))
fit <- vglm(cbind(normal, mild, severe) ~ let, multinomial, data = pneumo)
fit <- vglm(cbind(normal, mild, severe) ~ let,
cumulative(reverse = TRUE, parallel = TRUE),
data = pneumo)
fitted(fit)[ii, ]

mynewdata <- with(pneumo, data.frame(let = let[ii] + hh))

(newp <- predict(fit, newdata = mynewdata, type = "response"))

# Compare the difference. Should be the same as hh --> 0.

round(digits = 3, (newp-fitted(fit)[ii, ])/hh) # Finite-difference approxn
round(digits = 3, margeff(fit, subset = ii)["let",])

# Other examples
round(digits = 3, margeff(fit))
round(digits = 3, margeff(fit, subset = 2)["let",])
round(digits = 3, margeff(fit, subset = c(FALSE, TRUE))["let",,]) # recycling
round(digits = 3, margeff(fit, subset = c(2, 4, 6, 8))["let",,])

marital.nz New Zealand Marital Data

Description
Some marital data mainly from a large NZ company collected in the early 1990s.

Usage
data(marital.nz)

Format
A data frame with 6053 observations on the following 3 variables.

age a numeric vector, age in years

ethnicity a factor with levels European Maori Other Polynesian. Only Europeans are included
in the data set.
mstatus a factor with levels Divorced/Separated, Married/Partnered, Single, Widowed.
Max 467

Details
This is a subset of a data set collected from a self-administered questionnaire administered in a large
New Zealand workforce observational study conducted during 19923. The data were augmented
by a second study consisting of retirees. The data can be considered a reasonable representation of
the white male New Zealand population in the early 1990s.

Source
Clinical Trials Research Unit, University of Auckland, New Zealand.

References
See bmi.nz and chest.nz.

Examples
summary(marital.nz)

Max Maximums

Description
Generic function for the maximums (maxima) of a model.

Usage
Max(object, ...)

Arguments
object An object for which the computation or extraction of a maximum (or maxi-
mums) is meaningful.
... Other arguments fed into the specific methods function of the model. Sometimes
they are fed into the methods function for Coef.

Details
Different models can define a maximum in different ways. Many models have no such notion or
definition.
Maximums occur in quadratic and additive ordination, e.g., CQO or CAO. For these models the
maximum is the fitted value at the optimum. For quadratic ordination models there is a formula for
the optimum but for additive ordination models the optimum must be searched for numerically. If
it occurs on the boundary, then the optimum is undefined. For a valid optimum, the fitted value at
the optimum is the maximum.
468 Maxwell

Value

The value returned depends specifically on the methods function invoked.

Author(s)

Thomas W. Yee

References

Yee, T. W. (2004) A new technique for maximum-likelihood canonical Gaussian ordination. Eco-
logical Monographs, 74, 685701.
Yee, T. W. (2006) Constrained additive ordination. Ecology, 87, 203213.

See Also

Max.qrrvglm, Tol, Opt.

Examples
## Not run:
set.seed(111) # This leads to the global solution
hspider[,1:6] <- scale(hspider[,1:6]) # Standardized environmental vars
p1 <- cqo(cbind(Alopacce, Alopcune, Alopfabr, Arctlute, Arctperi,
Auloalbi, Pardlugu, Pardmont, Pardnigr, Pardpull,
Trocterr, Zoraspin) ~
WaterCon + BareSand + FallTwig + CoveMoss + CoveHerb + ReflLux,
quasipoissonff, Bestof = 2, data = hspider, Crow1positive = FALSE)

Max(p1)

index <- 1:ncol(depvar(p1))

persp(p1, col = index, las = 1, llwd = 2)
abline(h = Max(p1), lty = 2, col = index)

## End(Not run)

Maxwell The Maxwell Distribution

Description

Density, distribution function, quantile function and random generation for the Maxwell distribu-
tion.
Maxwell 469

Usage
dmaxwell(x, rate, log = FALSE)
pmaxwell(q, rate, lower.tail = TRUE, log.p = FALSE)
qmaxwell(p, rate, lower.tail = TRUE, log.p = FALSE)
rmaxwell(n, rate)

Arguments
x, q, p, n Same as Uniform.
rate the (rate) parameter.
log Logical. If log = TRUE then the logarithm of the density is returned.
lower.tail, log.p
Same meaning as in pnorm or qnorm.

Details
See maxwell, the VGAM family function for estimating the (rate) parameter a by maximum likeli-
hood estimation, for the formula of the probability density function.

Value
dmaxwell gives the density, pmaxwell gives the distribution function, qmaxwell gives the quantile
function, and rmaxwell generates random deviates.

Note
The Maxwell distribution is related to the Rayleigh distribution.

Author(s)
T. W. Yee and Kai Huang

References
Balakrishnan, N. and Nevzorov, V. B. (2003) A Primer on Statistical Distributions. Hoboken, New
Jersey: Wiley.

probs <- seq(0.1, 0.9, by = 0.1)

Q <- qmaxwell(probs, rate = rate)
lines(Q, dmaxwell(Q, rate), col = "purple", lty = 3, type = "h")
lines(Q, pmaxwell(Q, rate), col = "purple", lty = 3, type = "h")
abline(h = probs, col = "purple", lty = 3)
max(abs(pmaxwell(Q, rate) - probs)) # Should be zero

## End(Not run)

maxwell Maxwell Distribution Family Function

Description
Estimating the parameter of the Maxwell distribution by maximum likelihood estimation.

Usage
maxwell(link = "loge", zero = NULL)

Arguments
link, zero Parameter link function applied to a, which is called the parameter rate. See
Links for more choices and information; a log link is the default because the
parameter is positive. More information is at CommonVGAMffArguments.

Details
The Maxwell distribution, which is used in the area of thermodynamics, has a probability density
function that can be written
p
f (y; a) = 2/a3/2 y 2 exp(0.5ay 2 )
p
for y > 0 and a > 0. The mean of Y is 8/(a) (returned as the fitted values), and its variance is
(3 8)/(a).

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, rrvglm and vgam.

Note
Fisher-scoring and Newton-Raphson are the same here. A related distribution is the Rayleigh dis-
tribution. This VGAM family function handles multiple responses. This VGAM family function
can be mimicked by poisson.points(ostatistic = 1.5, dimension = 2).

Author(s)
T. W. Yee
mccullagh89 471

References
von Seggern, D. H. (1993) CRC Standard Curves and Surfaces, Boca Raton, FL.: CRC Press.

See Also
Maxwell, rayleigh, poisson.points.

Examples
mdata <- data.frame(y = rmaxwell(1000, rate = exp(2)))
fit <- vglm(y ~ 1, maxwell, data = mdata, trace = TRUE, crit = "coef")
coef(fit, matrix = TRUE)
Coef(fit)

mccullagh89 McCullagh (1989) Distribution Family Function

Description
Estimates the two parameters of the McCullagh (1989) distribution by maximum likelihood estima-
tion.

Usage
mccullagh89(ltheta = "rhobit", lnu = logoff(offset = 0.5),
itheta = NULL, inu = NULL, zero = NULL)

Arguments
ltheta, lnu Link functions for the and parameters. See Links for general information.
itheta, inu Numeric. Optional initial values for and . The default is to internally compute
them.
zero See CommonVGAMffArguments for information.

Details
The McCullagh (1989) distribution has density function
1
{1 y 2 } 2
f (y; , ) =
(1 2y + 2 ) Beta( + 21 , 12 )
where 1 < y < 1 and 1 < < 1. This distribution is equation (1) in that paper. The parameter
satisfies > 1/2, therefore the default is to use an log-offset link with offset equal to 0.5, i.e.,
2 = log( + 0.5). The mean is of Y is /(1 + ), and these are returned as the fitted values.
This distribution is related to the Leipnik distribution (see Johnson et al. (1995)), is related to ultra-
spherical functions, and under certain conditions, arises as exit distributions for Brownian motion.
Fisher scoring is implemented here and it uses a diagonal matrix so the parameters are globally
orthogonal in the Fisher information sense. McCullagh (1989) also states that, to some extent,
and have the properties of a location parameter and a precision parameter, respectively.
472 melbmaxtemp

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, rrvglm and vgam.

Note
Convergence may be slow or fail unless the initial values are reasonably close. If a failure occurs,
try assigning the argument inu and/or itheta. Figure 1 of McCullagh (1989) gives a broad range
of densities for different values of and , and this could be consulted for obtaining reasonable
initial values if all else fails.

Author(s)
T. W. Yee

References
McCullagh, P. (1989) Some statistical properties of a family of continuous univariate distributions.
Journal of the American Statistical Association, 84, 125129.
Johnson, N. L. and Kotz, S. and Balakrishnan, N. (1995) Continuous Univariate Distributions, 2nd
edition, Volume 2, New York: Wiley. (pages 612617).

melbmaxtemp Melbourne Daily Maximum Temperatures

Description
Melbourne daily maximum temperatures in degrees Celsius over the ten-year period 19811990.

Usage
data(melbmaxtemp)
meplot 473

Format
A vector with 3650 observations.

Details
This is a time series data from Melbourne, Australia. It is commonly used to give a difficult quantile
regression problem since the data is bimodal. That is, a hot day is likely to be followed by either an
equally hot day or one much cooler. However, an independence assumption is typically made.

References
Hyndman, R. J. and Bashtannyk, D. M. and Grunwald, G. K. (1996). Estimating and visualizing
conditional densities. J. Comput. Graph. Statist., 5(4), 315336.

meplot Mean Excess Plot

Description
Mean excess plot (also known as a mean residual life plot), a diagnostic plot for the generalized
Pareto distribution (GPD).

Usage
meplot(object, ...)
meplot.default(y, main = "Mean Excess Plot",
xlab = "Threshold", ylab = "Mean Excess", lty = c(2, 1:2),
conf = 0.95, col = c("blue", "black", "blue"), type = "l", ...)
meplot.vlm(object, ...)
474 meplot

Arguments
y A numerical vector. NAs etc. are not allowed.
main, xlab, ylab
Character. Overall title for the plot, and titles for the x- and y-axes.
lty Line type. The second value is for the mean excess value, the first and third
values are for the envelope surrounding the confidence interval.
conf Confidence level. The default results in approximate 95 percent confidence in-
tervals for each mean excess value.
col Colour of the three lines.
type Type of plot. The default means lines are joined between the mean excesses and
also the upper and lower limits of the confidence intervals.
object An object that inherits class "vlm", usually of class vglm-class or vgam-class.
... Graphical argument passed into plot. See par for an exhaustive list. The argu-
ments xlim and ylim are particularly useful.

Details
If Y has a GPD with scale parameter and shape parameter < 1, and if y > 0, then

+ u
E(Y u|Y > u) = .
1

It is a linear function in u, the threshold. Note that Y u is called the excess and values of Y
greater than u are called exceedances. The empirical versions used by these functions is to use
sample means to estimate the left hand side of the equation. Values of u in the plot are the values
of y itself. If the plot is roughly a straight line then the GPD is a good fit; this plot can be used
to select an appropriate threshold value. See gpd for more details. If the plot is flat then the data
may be exponential, and if it is curved then it may be Weibull or gamma. There is often a lot of
variance/fluctuation at the RHS of the plot due to fewer observations.
The function meplot is generic, and meplot.default and meplot.vlm are some methods functions
for mean excess plots.

Value
A list is returned invisibly with the following components.

threshold The x axis values.

meanExcess The y axis values. Each value is a sample mean minus a value u.
plusminus The amount which is added or subtracted from the mean excess to give the
confidence interval. The last value is a NA because it is based on one observation.

Note
The function is designed for speed and not accuracy, therefore huge data sets with extremely large
values may cause failure (the function cumsum is used.) Ties may not be well handled.
micmen 475

Author(s)
T. W. Yee

References
Davison, A. C. and Smith, R. L. (1990) Models for exceedances over high thresholds (with discus-
sion). Journal of the Royal Statistical Society, Series B, Methodological, 52, 393442.
Coles, S. (2001) An Introduction to Statistical Modeling of Extreme Values. London: Springer-
Verlag.

par(mfrow = c(2, 2))

for (ii in 1:4)
meplot(rgpd(1000), col = c("orange", "blue", "orange"))

## End(Not run)

micmen Michaelis-Menten Model

Description
Fits a Michaelis-Menten nonlinear regression model.

Usage
micmen(rpar = 0.001, divisor = 10, init1 = NULL, init2 = NULL,
imethod = 1, oim = TRUE, link1 = "identitylink", link2 = "identitylink",
firstDeriv = c("nsimEIM", "rpar"), probs.x = c(0.15, 0.85),
nsimEIM = 500, dispersion = 0, zero = NULL)

Arguments
rpar Numeric. Initial positive ridge parameter. This is used to create positive-definite
weight matrices.
divisor Numerical. The divisor used to divide the ridge parameter at each iteration until
it is very small but still positive. The value of divisor should be greater than
one.
476 micmen

init1, init2 Numerical. Optional initial value for the first and second parameters, respec-
tively. The default is to use a self-starting value.
link1, link2 Parameter link function applied to the first and second parameters, respectively.
See Links for more choices.
dispersion Numerical. Dispersion parameter.
firstDeriv Character. Algorithm for computing the first derivatives and working weights.
The first is the default.
imethod, probs.x
See CommonVGAMffArguments for information.
nsimEIM, zero See CommonVGAMffArguments for information.
oim Use the OIM? See CommonVGAMffArguments for information.

Details
The Michaelis-Menten model is given by

E(Yi ) = (1 ui )/(2 + ui )

where 1 and 2 are the two parameters.

The relationship between iteratively reweighted least squares and the Gauss-Newton algorithm is
given in Wedderburn (1974). However, the algorithm used by this family function is different.
Details are given at the Authors web site.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, and vgam.

Warning
This function is not (nor could ever be) entirely reliable. Plotting the fitted function and monitoring
convergence is recommended.

Note
The regressor values ui are inputted as the RHS of the form2 argument. It should just be a simple
term; no smart prediction is used. It should just a single vector, therefore omit the intercept term.
The LHS of the formula form2 is ignored.
To predict the response at new values of ui one must assign the @extra$Xm2 slot in the fitted object
these values, e.g., see the example below.
Numerical problems may occur. If so, try setting some initial values for the parameters. In the
future, several self-starting initial values will be implemented.

Author(s)
T. W. Yee
mix2exp 477

References
Seber, G. A. F. and Wild, C. J. (1989) Nonlinear Regression, New York: Wiley.
Wedderburn, R. W. M. (1974) Quasi-likelihood functions, generalized linear models, and the Gauss-
Newton method. Biometrika, 61, 439447.
Bates, D. M. and Watts, D. G. (1988) Nonlinear Regression Analysis and Its Applications, New
York: Wiley.

## Not run: plot(velocity ~ conc, enzyme, xlab = "concentration", las = 1,

col = "blue", main = "Michaelis-Menten equation for the enzyme data",
ylim = c(0, max(velocity)), xlim = c(0, max(conc)))
points(fitted(fit) ~ conc, enzyme, col = "red", pch = "+", cex = 1.5)

# This predicts the response at a finer grid:

newenzyme <- data.frame(conc = seq(0, max(with(enzyme, conc)), len = 200))
fit@extra$Xm2 <- newenzyme$conc # This assignment is needed for prediction
lines(predict(fit, newenzyme, "response") ~ conc, newenzyme, col = "red")
## End(Not run)

mix2exp Mixture of Two Exponential Distributions

Description
Estimates the three parameters of a mixture of two exponential distributions by maximum likelihood
estimation.

Usage
mix2exp(lphi = "logit", llambda = "loge", iphi = 0.5, il1 = NULL,
il2 = NULL, qmu = c(0.8, 0.2), nsimEIM = 100, zero = "phi")

Arguments
lphi, llambda Link functions for the parameters and . The latter is the rate parameter and
note that the mean of an ordinary exponential distribution is 1/. See Links for
more choices.
478 mix2exp

iphi, il1, il2 Initial value for , and optional initial value for 1 and 2 . The last two have
values that must be positive. The default is to compute initial values internally
using the argument qmu.
qmu Vector with two values giving the probabilities relating to the sample quantiles
for obtaining initial values for 1 and 2 . The two values are fed in as the probs
argument into quantile.
nsimEIM, zero See CommonVGAMffArguments.

Details

The probability density function can be loosely written as

f (y) = Exponential(1 ) + (1 ) Exponential(2 )

where is the probability an observation belongs to the first group, and y > 0. The parameter
satisfies 0 < < 1. The mean of Y is /1 + (1 )/2 and this is returned as the fitted values.
By default, the three linear/additive predictors are (logit(), log(1 ), log(2 ))T .

Value

An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm and vgam.

Warning

This VGAM family function requires care for a successful application. In particular, good initial
values are required because of the presence of local solutions. Therefore running this function with
several different combinations of arguments such as iphi, il1, il2, qmu is highly recommended.
Graphical methods such as hist can be used as an aid.
This VGAM family function is experimental and should be used with care.

Note

Fitting this model successfully to data can be difficult due to local solutions, uniqueness problems
and ill-conditioned data. It pays to fit the model several times with different initial values and check
that the best fit looks reasonable. Plotting the results is recommended. This function works better
as 1 and 2 become more different. The default control argument trace = TRUE is to encourage
monitoring convergence.

Author(s)

T. W. Yee

See Also

rexp, exponential, mix2poisson.

mix2normal 479

Examples
## Not run: lambda1 <- exp(1); lambda2 <- exp(3)
(phi <- logit(-1, inverse = TRUE))
mdata <- data.frame(y1 = rexp(nn <- 1000, lambda1))
mdata <- transform(mdata, y2 = rexp(nn, lambda2))
mdata <- transform(mdata, Y = ifelse(runif(nn) < phi, y1, y2))
fit <- vglm(Y ~ 1, mix2exp, data = mdata, trace = TRUE)
coef(fit, matrix = TRUE)

# Compare the results with the truth

round(rbind('Estimated' = Coef(fit),
'Truth' = c(phi, lambda1, lambda2)), digits = 2)

with(mdata, hist(Y, prob = TRUE, main = "Orange = estimate, blue = truth"))

abline(v = 1 / Coef(fit)[c(2, 3)], lty = 2, col = "orange", lwd = 2)
abline(v = 1 / c(lambda1, lambda2), lty = 2, col = "blue", lwd = 2)

## End(Not run)

mix2normal Mixture of Two Univariate Normal Distributions

Description
Estimates the five parameters of a mixture of two univariate normal distributions by maximum
likelihood estimation.

Usage
mix2normal(lphi = "logit", lmu = "identitylink", lsd = "loge",
iphi = 0.5, imu1 = NULL, imu2 = NULL, isd1 = NULL, isd2 = NULL,
qmu = c(0.2, 0.8), eq.sd = TRUE, nsimEIM = 100, zero = "phi")

Arguments
lphi,lmu,lsd Link functions for the parameters , , and . See Links for more choices.
iphi Initial value for , whose value must lie between 0 and 1.
imu1, imu2 Optional initial value for 1 and 2 . The default is to compute initial values
internally using the argument qmu.
isd1, isd2 Optional initial value for 1 and 2 . The default is to compute initial values
internally based on the argument qmu. Currently these are not great, therefore
using these arguments where practical is a good idea.
qmu Vector with two values giving the probabilities relating to the sample quantiles
for obtaining initial values for 1 and 2 . The two values are fed in as the probs
argument into quantile.
eq.sd Logical indicating whether the two standard deviations should be constrained to
be equal. If TRUE then the appropriate constraint matrices will be used.
480 mix2normal

nsimEIM See CommonVGAMffArguments.

zero May be an integer vector specifying which linear/additive predictors are mod-
elled as intercept-only. If given, the value or values can be from the set {1, 2, . . . , 5}.
The default is the first one only, meaning is a single parameter even when there
are explanatory variables. Set zero = NULL to model all linear/additive predic-
tors as functions of the explanatory variables. See CommonVGAMffArguments for
more information.

Details
The probability density function can be loosely written as

f (y) = N (1 , 1 ) + (1 ) N (2 , 2 )

where is the probability an observation belongs to the first group. The parameters 1 and 2
are the means, and 1 and 2 are the standard deviations. The parameter satisfies 0 < < 1.
The mean of Y is 1 + (1 )2 and this is returned as the fitted values. By default, the five
linear/additive predictors are (logit(), 1 , log(1 ), 2 , log(2 ))T . If eq.sd = TRUE then 1 = 2
is enforced.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, and vgam.

Warning
Numerical problems can occur and half-stepping is not uncommon. If failure to converge occurs,
try inputting better initial values, e.g., by using iphi, qmu, imu1, imu2, isd1, isd2, etc.
This VGAM family function is experimental and should be used with care.

Note
Fitting this model successfully to data can be difficult due to numerical problems and ill-conditioned
data. It pays to fit the model several times with different initial values and check that the best fit
looks reasonable. Plotting the results is recommended. This function works better as 1 and 2
become more different.
Convergence can be slow, especially when the two component distributions are not well separated.
The default control argument trace = TRUE is to encourage monitoring convergence. Having
eq.sd = TRUE often makes the overall optimization problem easier.

Author(s)
T. W. Yee

References
McLachlan, G. J. and Peel, D. (2000) Finite Mixture Models. New York: Wiley.
Everitt, B. S. and Hand, D. J. (1981) Finite Mixture Distributions. London: Chapman & Hall.
mix2poisson 481

See Also

uninormal, Normal, mix2poisson.

Examples
## Not run: mu1 <- 99; mu2 <- 150; nn <- 1000
sd1 <- sd2 <- exp(3)
(phi <- logit(-1, inverse = TRUE))
mdata <- data.frame(y = ifelse(runif(nn) < phi, rnorm(nn, mu1, sd1),
rnorm(nn, mu2, sd2)))
fit <- vglm(y ~ 1, mix2normal(eq.sd = TRUE), data = mdata)

# Compare the results

cfit <- coef(fit)
round(rbind('Estimated' = c(logit(cfit[1], inverse = TRUE),
cfit[2], exp(cfit[3]), cfit[4]),
'Truth' = c(phi, mu1, sd1, mu2)), digits = 2)

# Plot the results

xx <- with(mdata, seq(min(y), max(y), len = 200))
plot(xx, (1-phi) * dnorm(xx, mu2, sd2), type = "l", xlab = "y",
main = "Orange = estimate, blue = truth",
col = "blue", ylab = "Density")
phi.est <- logit(coef(fit)[1], inverse = TRUE)
sd.est <- exp(coef(fit)[3])
lines(xx, phi*dnorm(xx, mu1, sd1), col = "blue")
lines(xx, phi.est * dnorm(xx, Coef(fit)[2], sd.est), col = "orange")
lines(xx, (1-phi.est) * dnorm(xx, Coef(fit)[4], sd.est), col = "orange")
abline(v = Coef(fit)[c(2,4)], lty = 2, col = "orange")
abline(v = c(mu1, mu2), lty = 2, col = "blue")

## End(Not run)

mix2poisson Mixture of Two Poisson Distributions

Description

Estimates the three parameters of a mixture of two Poisson distributions by maximum likelihood
estimation.

Usage

mix2poisson(lphi = "logit", llambda = "loge",

iphi = 0.5, il1 = NULL, il2 = NULL,
qmu = c(0.2, 0.8), nsimEIM = 100, zero = "phi")
482 mix2poisson

Arguments
lphi, llambda Link functions for the parameter and . See Links for more choices.
iphi Initial value for , whose value must lie between 0 and 1.
il1, il2 Optional initial value for 1 and 2 . These values must be positive. The default
is to compute initial values internally using the argument qmu.
qmu Vector with two values giving the probabilities relating to the sample quantiles
for obtaining initial values for 1 and 2 . The two values are fed in as the probs
argument into quantile.
nsimEIM, zero See CommonVGAMffArguments.

Details
The probability function can be loosely written as

P (Y = y) = P oisson(1 ) + (1 ) P oisson(2 )

where is the probability an observation belongs to the first group, and y = 0, 1, 2, . . .. The
parameter satisfies 0 < < 1. The mean of Y is 1 + (1 )2 and this is returned as the
fitted values. By default, the three linear/additive predictors are (logit(), log(1 ), log(2 ))T .

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm and vgam.

Warning
This VGAM family function requires care for a successful application. In particular, good initial
values are required because of the presence of local solutions. Therefore running this function with
several different combinations of arguments such as iphi, il1, il2, qmu is highly recommended.
Graphical methods such as hist can be used as an aid.
With grouped data (i.e., using the weights argument) one has to use a large value of nsimEIM; see
the example below.
This VGAM family function is experimental and should be used with care.

Note
The response must be integer-valued since dpois is invoked.
Fitting this model successfully to data can be difficult due to local solutions and ill-conditioned
data. It pays to fit the model several times with different initial values, and check that the best fit
looks reasonable. Plotting the results is recommended. This function works better as 1 and 2
become more different. The default control argument trace = TRUE is to encourage monitoring
convergence.

Author(s)
T. W. Yee
MNSs 483

# Compare the results with the truth

round(rbind('Estimated' = Coef(mfit), 'Truth' = c(phi, mu1, mu2)), digits = 2)

ty <- with(mdata, table(y))

plot(names(ty), ty, type = "h", main = "Orange=estimate, blue=truth",
ylab = "Frequency", xlab = "y")
abline(v = Coef(mfit)[-1], lty = 2, col = "orange", lwd = 2)
abline(v = c(mu1, mu2), lty = 2, col = "blue", lwd = 2)

# Example 2: London Times data (Lange, 1997, p.31)

ltdata1 <- data.frame(deaths = 0:9,
freq = c(162, 267, 271, 185, 111, 61, 27, 8, 3, 1))
ltdata2 <- data.frame(y = with(ltdata1, rep(deaths, freq)))

# Usually this does not work well unless nsimEIM is large

Mfit <- vglm(deaths ~ 1, weight = freq, data = ltdata1,
mix2poisson(iphi = 0.3, il1 = 1, il2 = 2.5, nsimEIM = 5000))

# This works better in general

Mfit <- vglm(y ~ 1, mix2poisson(iphi = 0.3, il1 = 1, il2 = 2.5), data = ltdata2)
coef(Mfit, matrix = TRUE)
Coef(Mfit)

## End(Not run)

MNSs The MNSs Blood Group System

Description
Estimates the three independent parameters of the the MNSs blood group system.

Usage
MNSs(link = "logit", imS = NULL, ims = NULL, inS = NULL)
484 MNSs

Arguments

link Link function applied to the three parameters. See Links for more choices.
imS, ims, inS Optional initial value for mS, ms and nS respectively. A NULL means they are
computed internally.

Details

There are three independent parameters: m_S, m_s, n_S, say, so that n_s = 1 - m_S - m_s - n_S.
We let the eta vector (transposed) be (g(m_S), g(m_s), g(n_S)) where g is the link function.

Value

An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm and vgam.

Note

The input can be a 6-column matrix of counts, where the columns are MS, Ms, MNS, MNs, NS, Ns
(in order). Alternatively, the input can be a 6-column matrix of proportions (so each row adds to 1)
and the weights argument is used to specify the total number of counts for each row.

Author(s)

T. W. Yee

References

Elandt-Johnson, R. C. (1971) Probability Models and Statistical Methods in Genetics, New York:
Wiley.

See Also

AA.Aa.aa, AB.Ab.aB.ab, ABO, A1A2A3.

Examples
# Order matters only:
y <- cbind(MS = 295, Ms = 107, MNS = 379, MNs = 322, NS = 102, Ns = 214)
fit <- vglm(y ~ 1, MNSs("logit", .25, .28, .08), trace = TRUE)
fit <- vglm(y ~ 1, MNSs(link = logit), trace = TRUE, crit = "coef")
Coef(fit)
rbind(y, sum(y)*fitted(fit))
sqrt(diag(vcov(fit)))
model.framevlm 485

model.framevlm Construct the Model Frame of a VLM Object

Description

This function returns a data.frame with the variables. It is applied to an object which inherits from
class "vlm" (e.g., a fitted model of class "vglm").

Usage

model.framevlm(object, setupsmart = TRUE, wrapupsmart = TRUE, ...)

Arguments

object a model object from the VGAM R package that inherits from a vector linear
model (VLM), e.g., a model of class "vglm".
... further arguments such as data, na.action, subset. See model.frame for
more information on these.
setupsmart, wrapupsmart
Logical. Arguments to determine whether to use smart prediction.

Details

Since object is an object which inherits from class "vlm" (e.g., a fitted model of class "vglm"),
the method will either returned the saved model frame used when fitting the model (if any, selected
by argument model = TRUE) or pass the call used when fitting on to the default method.
This code implements smart prediction (see smartpred).

Value

A data.frame containing the variables used in the object plus those specified in ....

References

Chambers, J. M. (1992) Data for models. Chapter 3 of Statistical Models in S eds J. M. Chambers
and T. J. Hastie, Wadsworth & Brooks/Cole.

See Also

model.frame, model.matrixvlm, predictvglm, smartpred.

486 model.matrixvlm

Examples
# Illustrates smart prediction
pneumo <- transform(pneumo, let = log(exposure.time))
fit <- vglm(cbind(normal,mild, severe) ~ poly(c(scale(let)), 2),
multinomial, data = pneumo, trace = TRUE, x = FALSE)
class(fit)

check1 <- head(model.frame(fit))

check1
check2 <- model.frame(fit, data = head(pneumo))
check2
all.equal(unlist(check1), unlist(check2)) # Should be TRUE

q0 <- head(predict(fit))
q1 <- head(predict(fit, newdata = pneumo))
q2 <- predict(fit, newdata = head(pneumo))
all.equal(q0, q1) # Should be TRUE
all.equal(q1, q2) # Should be TRUE

model.matrixvlm Construct the Design Matrix of a VLM Object

Description
Creates a design matrix. Two types can be returned: a large one (class "vlm" or one that inherits
from this such as "vglm") or a small one (such as returned if it were of class "lm").

Usage
model.matrixvlm(object, type = c("vlm", "lm", "lm2", "bothlmlm2"),
linpred.index = NULL, ...)

Arguments
object an object of a class that inherits from the vector linear model (VLM).
type Type of design matrix returned. The first is the default. The value "vlm" is the
VLM model matrix corresponding to the formula argument. The value "lm" is
the LM model matrix corresponding to the formula argument. The value "lm2"
is the second (LM) model matrix corresponding to the form2 argument. The
value "bothlmlm2" means both LM and VLM model matrices.
linpred.index Single integer. The index for a linear/additive predictor, it must have a value
from the set 1:M, and type = "lm" must be assigned. Then it returns a sub-
set of the VLM matrix corresponding to the linpred.indexth linear/additive
predictor; this is a LM-type matrix.
... further arguments passed to or from other methods. These include data (which
is a data frame created with model.framevlm), contrasts.arg, and xlev. See
model.matrix for more information.
model.matrixvlm 487

Details

This function creates a design matrix from object. This can be a small LM object or a big VLM
object (default). The latter is constructed from the former and the constraint matrices.
This code implements smart prediction (see smartpred).

Value

The design matrix for a regression model with the specified formula and data. If type = "bothlmlm2"
then a list is returned with components "X" and "Xm2".

References

Yee, T. W. and Hastie, T. J. (2003) Reduced-rank vector generalized linear models. Statistical
Modelling, 3, 1541.
Chambers, J. M. (1992) Data for models. Chapter 3 of Statistical Models in S eds J. M. Chambers
and T. J. Hastie, Wadsworth & Brooks/Cole.

See Also

model.matrix, model.framevlm, predictvglm, smartpred.

Examples

# Illustrates smart prediction

pneumo <- transform(pneumo, let = log(exposure.time))
fit <- vglm(cbind(normal, mild, severe) ~ poly(c(scale(let)), 2),
multinomial, data = pneumo, trace = TRUE, x = FALSE)
class(fit)
fit@x # Not saved on the object
model.matrix(fit)
model.matrix(fit, linpred.index = 1, type = "lm")
model.matrix(fit, linpred.index = 2, type = "lm")

(Check1 <- head(model.matrix(fit, type = "lm")))

(Check2 <- model.matrix(fit, data = head(pneumo), type = "lm"))
all.equal(c(Check1), c(Check2))

q0 <- head(predict(fit))
q1 <- head(predict(fit, newdata = pneumo))
q2 <- predict(fit, newdata = head(pneumo))
all.equal(q0, q1) # Should be TRUE
all.equal(q1, q2) # Should be TRUE
488 moffset

moffset Matrix Offset

Description
Modify a matrix by shifting successive elements.

Usage
moffset(mat, roffset = 0, coffset = 0, postfix = "",
rprefix = "Row.", cprefix = "Col.")

Arguments
mat Data frame or matrix. This ought to have at least three rows and three columns.
The elements are shifted in the order of c(mat), i.e., going down successive
columns, as the columns go from left to right. Wrapping of values is done.
roffset, coffset
Numeric or character. If numeric, the amount of shift (offset) for each row and
column. The default is no change to mat. If character, the offset is computed
by matching with the row or column names. For example, for the alcoff, put
roffset = "6" means that we make an effective days dataset start from 6:00
am, and this wraps around to include midnight to 05.59 am on the next day.
postfix Character. Modified rows and columns are renamed by pasting this argument to
the end of each name. The default is no change.
rprefix, cprefix
Same as rcim.

Details
This function allows a matrix to be rearranged so that element (roffset + 1, coffset + 1) becomes
the (1, 1) element. The elements are assumed to be ordered in the same way as the elements of
c(mat),
This function is applicable to, e.g., alcoff, where it is useful to define the effective day as starting at
some other hour than midnight, e.g., 6.00am. This is because partying on Friday night continues on
into Saturday morning, therefore it is more interpretable to use the effective day when considering
a daily effect.
This is a data preprocessing function for rcim and plotrcim0. The differences between Rcim and
moffset is that Rcim only reorders the level of the rows and columns so that the data is shifted but
not moved. That is, a value in one row stays in that row, and ditto for column. But in moffset
values in one column can be moved to a previous column. See the examples below.

Value
A matrix of the same dimensional as its input.
multilogit 489

Note

The input mat should have row names and column names.

Author(s)

T. W. Yee, Alfian F. Hadi.

See Also

Rcim, rcim, plotrcim0, alcoff, crashi.

Examples
moffset(alcoff, 3, 2, "*") # Some day's data is moved to previous day.
Rcim(alcoff, 3 + 1, 2 + 1) # Data does not move as much.
alcoff # Original data
moffset(alcoff, 3, 2, "*") - Rcim(alcoff, 3+1, 2+1) # Note the differences

# An 'effective day' data set:

alcoff.e <- moffset(alcoff, roffset = "6", postfix = "*")
fit.o <- rcim(alcoff) # default baselines are first row and col
fit.e <- rcim(alcoff.e) # default baselines are first row and col

## Not run: par(mfrow = c(2, 2), mar = c(9, 4, 2, 1))

plot(fit.o, rsub = "Not very interpretable", csub = "Not very interpretable")
plot(fit.e, rsub = "More interpretable", csub = "More interpretable")

## End(Not run)

# Some checking
all.equal(moffset(alcoff), alcoff) # Should be no change
moffset(alcoff, 1, 1, "*")
moffset(alcoff, 2, 3, "*")
moffset(alcoff, 1, 0, "*")
moffset(alcoff, 0, 1, "*")
moffset(alcoff, "6", "Mon", "*") # This one is good

# Customise row and column baselines

fit2 <- rcim(Rcim(alcoff.e, rbaseline = "11", cbaseline = "Mon*"))

multilogit Multi-logit Link Function

Description

Computes the multilogit transformation, including its inverse and the first two derivatives.
490 multilogit

Usage
multilogit(theta, refLevel = "(Last)", M = NULL, whitespace = FALSE,
bvalue = NULL, inverse = FALSE, deriv = 0,
short = TRUE, tag = FALSE)

Arguments
theta Numeric or character. See below for further details.
refLevel, M, whitespace
See multinomial.
bvalue See Links.
inverse, deriv, short, tag
Details at Links.

Details
The multilogit() link function is a generalization of the logit link to M levels/classes. It forms
the basis of the multinomial logit model. It is sometimes called the multi-logit link or the multino-
mial logit link. When its inverse function is computed it returns values which are positive and add
to unity.

Value
For multilogit with deriv = 0, the multilogit of theta, i.e., log(theta[, j]/theta[, M+1])
when inverse = FALSE, and if inverse = TRUE then exp(theta[, j])/(1+rowSums(exp(theta))).
For deriv = 1, then the function returns d eta / d theta as a function of theta if inverse = FALSE,
else if inverse = TRUE then it returns the reciprocal.
Here, all logarithms are natural logarithms, i.e., to base e.

Note
Numerical instability may occur when theta is close to 1 or 0 (for multilogit). One way of
overcoming this is to use, e.g., bvalue. Currently care.exp() is used to avoid NAs being returned
if the probability is too close to 1.

Author(s)
Thomas W. Yee

References
McCullagh, P. and Nelder, J. A. (1989) Generalized Linear Models, 2nd ed. London: Chapman &
Hall.

See Also
Links, multinomial, logit, normal.vcm, CommonVGAMffArguments.
multinomial 491

Examples
pneumo <- transform(pneumo, let = log(exposure.time))
fit <- vglm(cbind(normal, mild, severe) ~ let,
multinomial, trace = TRUE, data = pneumo) # For illustration only!
fitted(fit)
predict(fit)

multilogit(fitted(fit))
multilogit(fitted(fit)) - predict(fit) # Should be all 0s

multilogit(predict(fit), inverse = TRUE) # rowSums() add to unity

multilogit(predict(fit), inverse = TRUE, refLevel = 1) # For illustration only
multilogit(predict(fit), inverse = TRUE) - fitted(fit) # Should be all 0s

multilogit(fitted(fit), deriv = 1)
multilogit(fitted(fit), deriv = 2)

multinomial Multinomial Logit Model

Description
Fits a multinomial logit model to a (preferably unordered) factor response.

Usage
multinomial(zero = NULL, parallel = FALSE, nointercept = NULL,
refLevel = "(Last)", whitespace = FALSE)

Arguments
zero Can be an integer-valued vector specifying which linear/additive predictors are
modelled as intercepts only. Any values must be from the set {1,2,. . . ,M }. The
default value means none are modelled as intercept-only terms. See CommonVGAMffArguments
for more information.
parallel A logical, or formula specifying which terms have equal/unequal coefficients.
nointercept, whitespace
See CommonVGAMffArguments for more details.
refLevel Either a (1) single positive integer or (2) a value of the factor or (3) a character
string. If inputted as an integer then it specifies which column of the response
matrix is the reference or baseline level. The default is the last one (the (M +1)th
one). If used, this argument will be usually assigned the value 1. If inputted as
a value of a factor then beware of missing values of certain levels of the factor
(drop.unused.levels = TRUE or drop.unused.levels = FALSE). See the ex-
ample below. If inputted as a character string then this should be equal to (A) one
of the levels of the factor response, else (B) one of the column names of the ma-
trix response of counts; e.g., vglm(cbind(normal, mild, severe) ~ let, multinomial(refLevel =
492 multinomial

if it was (incorrectly because the response is ordinal) applied to the pneumo data
set. Another example is vglm(ethnicity ~ age, multinomial(refLevel = "European"), data = xs
if it was applied to the xs.nz data set.

Details
In this help file the response Y is assumed to be a factor with unordered values 1, 2, . . . , M + 1, so
that M is the number of linear/additive predictors j .
The default model can be written

j = log(P [Y = j]/P [Y = M + 1])

where j is the jth linear/additive predictor. Here, j = 1, . . . , M , and M +1 is 0 by definition.

That is, the last level of the factor, or last column of the response matrix, is taken as the reference
level or baselinethis is for identifiability of the parameters. The reference or baseline level can be
changed with the refLevel argument.
In almost all the literature, the constraint matrices associated with this family of models are known.
For example, setting parallel = TRUE will make all constraint matrices (except for the intercept)
equal to a vector of M 1s. If the constraint matrices are unknown and to be estimated, then this can
be achieved by fitting the model as a reduced-rank vector generalized linear model (RR-VGLM; see
rrvglm). In particular, a multinomial logit model with unknown constraint matrices is known as a
stereotype model (Anderson, 1984), and can be fitted with rrvglm.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, rrvglm and vgam.

Warning
No check is made to verify that the response is nominal.
See CommonVGAMffArguments for more warnings.

Note
The response should be either a matrix of counts (with row sums that are all positive), or a factor.
In both cases, the y slot returned by vglm/vgam/rrvglm is the matrix of sample proportions.
The multinomial logit model is more appropriate for a nominal (unordered) factor response than
for an ordinal (ordered) factor response. Models more suited for the latter include those based on
cumulative probabilities, e.g., cumulative.
multinomial is prone to numerical difficulties if the groups are separable and/or the fitted proba-
bilities are close to 0 or 1. The fitted values returned are estimates of the probabilities P [Y = j] for
j = 1, . . . , M + 1. See safeBinaryRegression for the logistic regression case.
Here is an example of the usage of the parallel argument. If there are covariates x2, x3 and x4,
then parallel = TRUE ~ x2 + x3 - 1 and parallel = FALSE ~ x4 are equivalent. This
would constrain the regression coefficients for x2 and x3 to be equal; those of the intercepts and x4
would be different.
multinomial 493

In Example 4 below, a conditional logit model is fitted to an artificial data set that explores how cost
and travel time affect peoples decision about how to travel to work. Walking is the baseline group.
The variable Cost.car is the difference between the cost of travel to work by car and walking,
etc. The variable Time.car is the difference between the travel duration/time to work by car and
walking, etc. For other details about the xij argument see vglm.control and fill.
The multinom function in the nnet package uses the first level of the factor as baseline, whereas the
last level of the factor is used here. Consequently the estimated regression coefficients differ.

Author(s)
Thomas W. Yee

References
Yee, T. W. (2010) The VGAM package for categorical data analysis. Journal of Statistical Software,
32, 134. http://www.jstatsoft.org/v32/i10/.
Yee, T. W. and Hastie, T. J. (2003) Reduced-rank vector generalized linear models. Statistical
Modelling, 3, 1541.
McCullagh, P. and Nelder, J. A. (1989) Generalized Linear Models, 2nd ed. London: Chapman &
Hall.
Agresti, A. (2013) Categorical Data Analysis, 3rd ed. Hoboken, NJ, USA: Wiley.
Hastie, T. J., Tibshirani, R. J. and Friedman, J. H. (2009) The Elements of Statistical Learning:
Data Mining, Inference and Prediction, 2nd ed. New York, USA: Springer-Verlag.
Simonoff, J. S. (2003) Analyzing Categorical Data, New York, USA: Springer-Verlag.
Anderson, J. A. (1984) Regression and ordered categorical variables. Journal of the Royal Statisti-
cal Society, Series B, Methodological, 46, 130.
Tutz, G. (2012) Regression for Categorical Data, Cambridge University Press.

See Also
margeff, cumulative, acat, cratio, sratio, dirichlet, dirmultinomial, rrvglm, fill1,
Multinomial, multilogit, iris. The authors homepage has further documentation about cat-
egorical data analysis using VGAM.

Examples
# Example 1: fit a multinomial logit model to Edgar Anderson's iris data
data(iris)
## Not run: fit <- vglm(Species ~ ., multinomial, iris)
coef(fit, matrix = TRUE)
## End(Not run)

# Example 2a: a simple example

ycounts <- t(rmultinom(10, size = 20, prob = c(0.1, 0.2, 0.8))) # Counts
fit <- vglm(ycounts ~ 1, multinomial)
head(fitted(fit)) # Proportions
[email protected] # NOT recommended for extraction of prior weights
weights(fit, type = "prior", matrix = FALSE) # The better method
494 multinomial

depvar(fit) # Sample proportions; same as fit@y

constraints(fit) # Constraint matrices

# Example 2b: Different reference level used as the baseline

fit2 <- vglm(ycounts ~ 1, multinomial(refLevel = 2))
coef(fit2, matrix = TRUE)
coef(fit , matrix = TRUE) # Easy to reconcile this output with fit2

# Example 3: The response is a factor.

nn <- 10
dframe3 <- data.frame(yfactor = gl(3, nn, labels = c("Control", "Trt1", "Trt2")),
x2 = runif(3 * nn))
myrefLevel <- with(dframe3, yfactor[12])
fit3a <- vglm(yfactor ~ x2, multinomial(refLevel = myrefLevel), dframe3)
fit3b <- vglm(yfactor ~ x2, multinomial(refLevel = 2), dframe3)
coef(fit3a, matrix = TRUE) # "Treatment1" is the reference level
coef(fit3b, matrix = TRUE) # "Treatment1" is the reference level
margeff(fit3b)

# Example 4: Fit a rank-1 stereotype model

fit4 <- rrvglm(Country ~ Width + Height + HP, multinomial, data = car.all)
coef(fit4) # Contains the C matrix
constraints(fit4)$HP # The A matrix
coef(fit4, matrix = TRUE) # The B matrix
Coef(fit4)@C # The C matrix
concoef(fit4) # Better to get the C matrix this way
Coef(fit4)@A # The A matrix
svd(coef(fit4, matrix = TRUE)[-1, ])$d # This has rank 1; = C %*% t(A)
# Classification (but watch out for NAs in some of the variables):
apply(fitted(fit4), 1, which.max) # Classification
colnames(fitted(fit4))[apply(fitted(fit4), 1, which.max)] # Classification
apply(predict(fit4, car.all, type = "response"), 1, which.max) # Ditto

# Example 5: The use of the xij argument (aka conditional logit model)
set.seed(111)
nn <- 100 # Number of people who travel to work
M <- 3 # There are M+1 models of transport to go to work
ycounts <- matrix(0, nn, M+1)
ycounts[cbind(1:nn, sample(x = M+1, size = nn, replace = TRUE))] = 1
dimnames(ycounts) <- list(NULL, c("bus","train","car","walk"))
gotowork <- data.frame(cost.bus = runif(nn), time.bus = runif(nn),
cost.train= runif(nn), time.train= runif(nn),
cost.car = runif(nn), time.car = runif(nn),
cost.walk = runif(nn), time.walk = runif(nn))
gotowork <- round(gotowork, digits = 2) # For convenience
gotowork <- transform(gotowork,
Cost.bus = cost.bus - cost.walk,
Cost.car = cost.car - cost.walk,
Cost.train = cost.train - cost.walk,
Cost = cost.train - cost.walk, # for labelling
Time.bus = time.bus - time.walk,
Time.car = time.car - time.walk,
Nakagami 495

Time.train = time.train - time.walk,

Time = time.train - time.walk) # for labelling
fit <- vglm(ycounts ~ Cost + Time,
multinomial(parall = TRUE ~ Cost + Time - 1),
xij = list(Cost ~ Cost.bus + Cost.train + Cost.car,
Time ~ Time.bus + Time.train + Time.car),
form2 = ~ Cost + Cost.bus + Cost.train + Cost.car +
Time + Time.bus + Time.train + Time.car,
data = gotowork, trace = TRUE)
head(model.matrix(fit, type = "lm")) # LM model matrix
head(model.matrix(fit, type = "vlm")) # Big VLM model matrix
coef(fit)
coef(fit, matrix = TRUE)
constraints(fit)
summary(fit)
max(abs(predict(fit) - predict(fit, new = gotowork))) # Should be 0

Nakagami Nakagami Distribution

Description
Density, cumulative distribution function, quantile function and random generation for the Nak-
agami distribution.

Usage
dnaka(x, scale = 1, shape, log = FALSE)
pnaka(q, scale = 1, shape, lower.tail = TRUE, log.p = FALSE)
qnaka(p, scale = 1, shape, ...)
rnaka(n, scale = 1, shape, Smallno = 1.0e-6)

Arguments
x, q vector of quantiles.
p vector of probabilities.
n number of observations. Same as in runif.
scale, shape arguments for the parameters of the distribution. See nakagami for more details.
For rnaka, arguments shape and scale must be of length 1.
Smallno Numeric, a small value used by the rejection method for determining the upper
limit of the distribution. That is, pnaka(U) > 1-Smallno where U is the upper
limit.
... Arguments that can be passed into uniroot.
log Logical. If log = TRUE then the logarithm of the density is returned.
lower.tail, log.p
Same meaning as in pnorm or qnorm.
496 nakagami

Details
See nakagami for more details.

Value
dnaka gives the density, pnaka gives the cumulative distribution function, qnaka gives the quantile
function, and rnaka generates random deviates.

Author(s)
T. W. Yee and Kai Huang

plot(x, pnorm(x), type = "n", col = "black", ylab = "",

ylim = 0:1, main = "pnaka(x, shape = shape)")
lines(x, pnaka(x, shape = 1), col = "orange")
lines(x, pnaka(x, shape = 2), col = "blue")
lines(x, pnaka(x, shape = 3), col = "green")
legend(2, 0.6, col = c("orange","blue","green"), lty = rep(1, len = 3),
legend = paste("shape =", c(1, 2, 3)))
## End(Not run)

probs <- seq(0.1, 0.9, by = 0.1)

pnaka(qnaka(p = probs, shape = 2), shape = 2) - probs # Should be all 0

nakagami Nakagami Distribution Family Function

Description
Estimation of the two parameters of the Nakagami distribution by maximum likelihood estimation.

Usage
nakagami(lscale = "loge", lshape = "loge", iscale = 1, ishape = NULL,
nowarning = FALSE)
nakagami 497

Arguments
nowarning Logical. Suppress a warning?
lscale, lshape Parameter link functions applied to the scale and shape parameters. Log links
ensure they are positive. See Links for more choices and information.
iscale, ishape Optional initial values for the shape and scale parameters. For ishape, a NULL
value means it is obtained in the initialize slot based on the value of iscale.
For iscale, assigning a NULL means a value is obtained in the initialize slot,
however, setting another numerical value is recommended if convergence fails
or is too slow.

Details
The Nakagami distribution, which is useful for modelling wireless systems such as radio links, can
be written

f (y) = 2(shape/scale)shape y 2shape1 exp(shape y 2 /scale)/(shape)

p
for y > 0, shape > 0, scale > 0. The mean of Y is scale/shape (shape + 0.5)/(shape)
and these are returned as the fitted values. By default, the linear/additive predictors are 1 =
log(scale) and 2 = log(shape). Fisher scoring is implemented.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, and vgam.

Note
The Nakagami distribution is also known as the Nakagami-m distribution, where m = shape here.
Special cases: m = 0.5 is a one-sided Gaussian distribution and m = 1 is a Rayleigh distribution.
The second moment is E(Y 2 ) = m.
If Y has a Nakagami distribution with parameters shape and scale then Y 2 has a gamma distribution
with shape parameter shape and scale parameter scale/shape.

Author(s)
T. W. Yee

References
Nakagami, M. (1960) The m-distribution: a general formula of intensity distribution of rapid fading,
pp.336 in: Statistical Methods in Radio Wave Propagation. W. C. Hoffman, Ed., New York:
Pergamon.

See Also
rnaka, gamma2, rayleigh.
498 nbcanlink

Examples
nn <- 1000; shape <- exp(0); Scale <- exp(1)
ndata <- data.frame(y1 = sqrt(rgamma(nn, shape = shape, scale = Scale/shape)))
fit <- vglm(y1 ~ 1, nakagami, data = ndata, trace = TRUE, crit = "coef")
ndata <- transform(ndata, y2 = rnaka(nn, scale = Scale, shape = shape))
fit <- vglm(y2 ~ 1, nakagami(iscale = 3), data = ndata, trace = TRUE)
head(fitted(fit))
with(ndata, mean(y2))
coef(fit, matrix = TRUE)
(Cfit <- Coef(fit))
## Not run: sy <- with(ndata, sort(y2))
hist(with(ndata, y2), prob = TRUE, main = "", xlab = "y", ylim = c(0, 0.6),
col = "lightblue")
lines(dnaka(sy, scale = Cfit["scale"], shape = Cfit["shape"]) ~ sy,
data = ndata, col = "orange")
## End(Not run)

nbcanlink Negative Binomial Canonical Link Function

Description
Computes the negative binomial canonical link transformation, including its inverse and the first
two derivatives.

Usage
nbcanlink(theta, size = NULL, wrt.param = NULL, bvalue = NULL,
inverse = FALSE, deriv = 0, short = TRUE, tag = FALSE)

Arguments
theta Numeric or character. Typically the mean of a negative binomial (NB) distribu-
tion. See below for further details.
size, wrt.param
size contains the k matrix which must be of a conformable dimension as theta.
Also, if deriv > 0 then wrt.param is either 1 or 2 (1 for with respect to the
first parameter, and 2 for with respect to the second parameter (size)).
bvalue Details at Links.
inverse, deriv, short, tag
Details at Links.

Details
The negative binomial (NB) canonical link is log(/( + k)) where is the mean of a NB distribu-
tion. The canonical link is used for theoretically relating the NB to GLM class.
This link function was specifically written for negbinomial and negbinomial.size, and should
not be used elsewhere (these VGAM family functions have code that specifically handles nbcanlink().)
nbcanlink 499

Value
For deriv = 0, the above equation when inverse = FALSE, and if inverse = TRUE then
kmatrix / expm1(-theta). For deriv = 1, then the function returns d eta / d theta as a
function of theta if inverse = FALSE, else if inverse = TRUE then it returns the reciprocal.

Warning
This function currently does not work very well with negbinomial! The NB-C model is sensitive
to the initial values and may converge to a local solution. Pages 210 and 309 of Hilbe (2011) notes
convergence difficulties (of Newton-Raphson type algorithms), and this applies here. This function
should work okay with negbinomial.size. Currently trying something like imethod = 3 or imu,
stepsize = 0.5, maxit = 100, zero = -2 should help; see the example below.
Standard errors may be unreliable.

Note
While theoretically nice, this function is not recommended in general since its value is always
negative (linear predictors ought to be unbounded in general). A loge link for argument lmu is
recommended instead.
Numerical instability may occur when theta is close to 0 or 1. Values of theta which are less than
or equal to 0 can be replaced by bvalue before computing the link function value. See Links.

Author(s)
Thomas W. Yee

References
Yee, T. W. (2014) Reduced-rank vector generalized linear models with two linear predictors. Com-
putational Statistics and Data Analysis, 71, 889902.
Hilbe, J. M. (2011) Negative Binomial Regression, 2nd Edition. Cambridge: Cambridge University
Press.

mymu <- 1:10 # Test some basic operations:

kmatrix <- matrix(runif(length(mymu)), length(mymu), 1)
eta1 <- nbcanlink(mymu, size = kmatrix)
ans2 <- nbcanlink(eta1, size = kmatrix, inverse = TRUE)
max(abs(ans2 - mymu)) # Should be 0

## Not run: mymu <- c(seq(0.5, 10, length = 101))

kmatrix <- matrix(10, length(mymu), 1)
500 nbolf

plot(nbcanlink(mymu, size = kmatrix) ~ mymu, las = 1,

type = "l", col = "blue", lwd = 1.5, xlab = expression({mu}))

# Estimate the parameters from some simulated data (see Warning section)
set.seed(123)
ndata <- data.frame(x2 = runif(nn <- 1000 ))
size1 <- exp(1); size2 <- exp(2)
ndata <- transform(ndata, eta1 = -1 - 1 * x2, # eta1 < 0
size1 = size1,
size2 = size2)
ndata <- transform(ndata,
mu1 = nbcanlink(eta1, size = size1, inv = TRUE),
mu2 = nbcanlink(eta1, size = size2, inv = TRUE))
ndata <- transform(ndata, y1 = rnbinom(nn, mu = mu1, size = size1),
y2 = rnbinom(nn, mu = mu2, size = size2))
head(ndata)
summary(ndata)

fit <- vglm(cbind(y1, y2) ~ x2,

# negbinomial("nbcanlink", imethod = 1, max.chunk.MB = 9),
negbinomial("nbcanlink", imethod = 2),
stepsize = 0.25, data = ndata, # Deliberately slow the convergence rate
maxit = 100, trace = TRUE) # Warning: may converge to a local soln
coef(fit, matrix = TRUE)
summary(fit)

## End(Not run)

nbolf Negative Binomial-Ordinal Link Function

Description
Computes the negative binomial-ordinal transformation, including its inverse and the first two
derivatives.

Usage
nbolf(theta, cutpoint = NULL, k = NULL,
inverse = FALSE, deriv = 0, short = TRUE, tag = FALSE)

Arguments
theta Numeric or character. See below for further details.
cutpoint, k Here, k is the k parameter associated with the negative binomial distribution;
see negbinomial. The cutpoints should be non-negative integers. If nbolf() is
used as the link function in cumulative then one should choose reverse = TRUE, parallel = TRUE.
inverse, deriv, short, tag
Details at Links.
nbolf 501

Details
The negative binomial-ordinal link function (NBOLF) can be applied to a parameter lying in the
unit interval. Its purpose is to link cumulative probabilities associated with an ordinal response
coming from an underlying negative binomial distribution.
See Links for general information about VGAM link functions.

Value
See Yee (2012) for details.

Warning
Prediction may not work on vglm or vgam etc. objects if this link function is used.

Note
Numerical values of theta too close to 0 or 1 or out of range result in large positive or negative
values, or maybe 0 depending on the arguments. Although measures have been taken to handle
cases where theta is too close to 1 or 0, numerical instabilities may still arise.
In terms of the threshold approach with cumulative probabilities for an ordinal response this link
function corresponds to the negative binomial distribution (see negbinomial) that has been recorded
as an ordinal response using known cutpoints.

Author(s)
Thomas W. Yee

References
Yee, T. W. (2012) Ordinal ordination with normalizing link functions for count data, (in prepara-
tion).

See Also
Links, negbinomial, polf, golf, nbolf2, cumulative, CommonVGAMffArguments.

Examples
## Not run:
nbolf("p", cutpoint = 2, k = 1, short = FALSE)
nbolf("p", cutpoint = 2, k = 1, tag = TRUE)

p <- seq(0.02, 0.98, by = 0.01)

y <- nbolf(p,cutpoint = 2, k = 1)
y. <- nbolf(p,cutpoint = 2, k = 1, deriv = 1)
max(abs(nbolf(y,cutpoint = 2, k = 1, inv = TRUE) - p)) # Should be 0

#\ dontrun{ par(mfrow = c(2, 1), las = 1)

#plot(p, y, type = "l", col = "blue", main = "nbolf()")
#abline(h = 0, v = 0.5, col = "red", lty = "dashed")
502 negbinomial

#
#plot(p, y., type = "l", col = "blue",
# main = "(Reciprocal of) first NBOLF derivative") }

# Another example
nn <- 1000
x2 <- sort(runif(nn))
x3 <- runif(nn)
mymu <- exp( 3 + 1 * x2 - 2 * x3)
k <- 4
y1 <- rnbinom(nn, mu = mymu, size = k)
cutpoints <- c(-Inf, 10, 20, Inf)
cuty <- Cut(y1, breaks = cutpoints)
#\ dontrun{ plot(x2, x3, col = cuty, pch = as.character(cuty)) }
table(cuty) / sum(table(cuty))
fit <- vglm(cuty ~ x2 + x3, trace = TRUE,
cumulative(reverse = TRUE, multiple.responses = TRUE,
parallel = TRUE,
link = nbolf(cutpoint = cutpoints[2:3], k = k)))
head(depvar(fit))
head(fitted(fit))
head(predict(fit))
coef(fit)
coef(fit, matrix = TRUE)
constraints(fit)
fit@misc

## End(Not run)

negbinomial Negative Binomial Distribution Family Function

Description
Maximum likelihood estimation of the two parameters of a negative binomial distribution.

Usage
negbinomial(zero = "size", parallel = FALSE, deviance.arg = FALSE,
type.fitted = c("mean", "quantiles"),
percentiles = c(25, 50, 75),
mds.min = 1e-3, nsimEIM = 500, cutoff.prob = 0.999,
eps.trig = 1e-7, max.support = 4000, max.chunk.MB = 30,
lmu = "loge", lsize = "loge",
imethod = 1, imu = NULL, iprobs.y = NULL,
gprobs.y = ppoints(6), isize = NULL,
gsize.mux = exp(c(-30, -20, -15, -10, -6:3)))
polya(zero = "size", type.fitted = c("mean", "prob"),
mds.min = 1e-3, nsimEIM = 500, cutoff.prob = 0.999,
negbinomial 503

eps.trig = 1e-7, max.support = 4000, max.chunk.MB = 30,

lprob = "logit", lsize = "loge", imethod = 1, iprob = NULL,
iprobs.y = NULL, gprobs.y = ppoints(6), isize = NULL,
gsize.mux = exp(c(-30, -20, -15, -10, -6:3)), imunb = NULL)
polyaR(zero = "size", type.fitted = c("mean", "prob"),
mds.min = 1e-3, nsimEIM = 500, cutoff.prob = 0.999,
eps.trig = 1e-7, max.support = 4000, max.chunk.MB = 30,
lsize = "loge", lprob = "logit", imethod = 1, iprob = NULL,
iprobs.y = NULL, gprobs.y = ppoints(6), isize = NULL,
gsize.mux = exp(c(-30, -20, -15, -10, -6:3)), imunb = NULL)

Arguments
lmu, lsize, lprob
Link functions applied to the , k and p parameters. See Links for more
choices. Note that the , k and p parameters are the mu, size and prob ar-
guments of rnbinom respectively. Common alternatives for lsize are negloge
and reciprocal, and loglog (if k > 1).
imu, imunb, isize, iprob
Optional initial values for the mean and k and p. For k, if failure to con-
verge occurs then try different values (and/or use imethod). For a S-column
response, isize can be of length S. A value NULL means an initial value for
each response is computed internally using a gridsearch based on gsize.mux.
The last argument is ignored if used within cqo; see the iKvector argument of
qrrvglm.control instead. In the future isize and iprob might be depreciated.
nsimEIM This argument is used for computing the diagonal element of the expected infor-
mation matrix (EIM) corresponding to k based on the simulated Fisher scoring
(SFS) algorithm. See CommonVGAMffArguments for more information and the
notes below. SFS is one of two algorithms for computing the EIM elements (so
that both algorithms may be used on a given data set). SFS is faster than the
exact method when Qmax is large.
cutoff.prob Fed into the p argument of qnbinom in order to obtain an upper limit for the
approximate support of the distribution, called Qmax, say. Similarly, the value
1-p is fed into the p argument of qnbinom in order to obtain a lower limit for
the approximate support of the distribution, called Qmin, say. Hence the approx-
imate support is Qmin:Qmax. This argument should be a numeric and close to
1 but never exactly 1. Used to specify how many terms of the infinite series
for computing the second diagonal element of the EIM are actually used. The
closer this argument is to 1, the more accurate the standard errors of the regres-
sion coefficients will be. If this argument is too small, convergence will take
longer.
max.chunk.MB, max.support
max.support is used to describe the eligibility of individual observations to
have their EIM computed by the exact method. Here, we are concerned about
computing the EIM wrt k. The exact method algorithm operates separately on
each response variable, and it constructs a large matrix provided that the num-
ber of columns is less than max.support. If so, then the computations are done
in chunks, so that no more than about max.chunk.MB megabytes of memory is
504 negbinomial

used at a time (actually, it is proportional to this amount). Regarding eligibility

of this algorithm, each observation must have the length of the vector, starting
from the 1-cutoff.prob quantile and finishing up at the cutoff.prob quantile,
less than max.support (as its approximate support). If you have abundant mem-
ory then you might try setting max.chunk.MB = Inf, but then the computations
might take a very long time. Setting max.chunk.MB = 0 or max.support = 0
will force the EIM to be computed using the SFS algorithm only (this used
to be the default method for all the observations). When the fitted values of
the model are large and k is small, the computation of the EIM will be costly
with respect to time and memory if the exact method is used. Hence the argu-
ment max.support limits the cost in terms of time. For intercept-only models
max.support is multiplied by a number (such as 10) because only one inner
product needs be computed. Note: max.support is an upper bound and limits
the number of terms dictated by the eps.trig argument.
mds.min Numeric. Minimum value of the NBD mean divided by size parameter. The
closer this ratio is to 0, the closer the distribution is to a Poisson. Iterations will
stop when an estimate of k is so large, relative to the mean, than it is below this
threshold (this is treated as a boundary of the parameter space).
eps.trig Numeric. A small positive value used in the computation of the EIMs. It
focusses on the denominator of the terms of a series. Each term in the se-
ries (that is used to approximate an infinite series) has a value greater than
size / sqrt(eps.trig), thus very small terms are ignored. Its a good
idea to set a smaller value that will result in more accuracy, but it will re-
quire a greater computing time (when k is close to 0). And adjustment to
max.support may be needed. In particular, the quantity computed by special
means is (k) E[(Y + k)], which is the difference between two trigamma.
functions. It is part of the calculation of the EIM with respect to the size pa-
rameter.
gsize.mux Similar to gsigma in CommonVGAMffArguments. However, this grid is multi-
plied by the initial estimates of the NBD mean parameter. That is, it is on a
relative scale rather than on an absolute scale. If the counts are very large in
value then convergence fail might occur; if so, then try a smaller value such as
gsize.mux = exp(-40).
type.fitted, percentiles
See CommonVGAMffArguments for more information.
deviance.arg Logical. If TRUE, the deviance is computed after convergence. It only works
in the NB-2 model. It is also necessary to set criterion = "coefficients"
or half.step = FALSE since one cannot use that criterion properly for the
minimization within the IRLS algorithm. It should be set TRUE when used with
cqo under the fast algorithm.
imethod An integer with value 1 or 2 etc. which specifies the initialization method for the
parameter. If failure to converge occurs try another value and/or else specify
a value for iprobs.y and/or else specify a value for isize.
parallel See CommonVGAMffArguments for more information. Setting parallel = TRUE
is useful in order to get something similar to quasipoissonff or what is known
as NB-1. If parallel = TRUE then the parallelism constraint does not apply to
negbinomial 505

any intercept term. You should set zero = NULL too if parallel = TRUE to
avoid a conflict.
gprobs.y A vector representing a grid; passed into the probs argument of quantile when
imethod = 1 to obtain an initial value for the mean of each response. Is over-
written by any value of iprobs.y.
iprobs.y Passed into the probs argument of quantile when imethod = 1 to obtain an
initial value for the mean of each response. Overwrites any value of gprobs.y.
This argument might be deleted in the future.
zero Can be an integer-valued vector, and if so, then it is usually assigned 2 or 2.
Specifies which of the two linear/additive predictors are modelled as an intercept
only. By default, the k parameter (after lsize is applied) is modelled as a single
unknown number that is estimated. It can be modelled as a function of the
explanatory variables by setting zero = NULL; this has been called a NB-H
model by Hilbe (2011). A negative value means that the value is recycled, so
setting 2 means all k are intercept-only. See CommonVGAMffArguments for
more information.

Details

The negative binomial distribution can be motivated in several ways, e.g., as a Poisson distribution
with a mean that is gamma distributed. There are several common parametrizations of the negative
binomial distribution. The one used by negbinomial() uses the mean and an index parameter k,
both which are positive. Specifically, the density of a random variable Y is
y k
y+k1 k
f (y; , k) =
y +k k+

where y = 0, 1, 2, . . ., and > 0 and k > 0. Note that the dispersion parameter is 1/k, so that
as k approaches infinity the negative binomial distribution approaches a Poisson distribution. The
response has variance V ar(Y ) = + 2 /k. When fitted, the fitted.values slot of the object
contains the estimated value of the parameter, i.e., of the mean E(Y ). It is common for some
to use = 1/k as the ancillary or heterogeneity parameter; so common alternatives for lsize are
negloge and reciprocal.
For polya the density is

y+k1 y
f (y; p, k) = (1 p) pk
y
where y = 0, 1, 2, . . ., and k > 0 and 0 < p < 1.
Family function polyaR() is the same as polya() except the order of the two parameters are
switched. The reason is that polyaR() tries to match with rnbinom closely in terms of the ar-
gument order, etc. Should the probability parameter be of primary interest, probably, users will
prefer using polya() rather than polyaR(). Possibly polyaR() will be decommissioned one day.
The negative binomial distribution can be coerced into the classical GLM framework with one of the
parameters being of interest and the other treated as a nuisance/scale parameter (this is implemented
in the MASS library). The VGAM family function negbinomial() treats both parameters on the
same footing, and estimates them both by full maximum likelihood estimation.
506 negbinomial

The parameters and k are independent (diagonal EIM), and the confidence region for k is ex-
tremely skewed so that its standard error is often of no practical use. The parameter 1/k has been
used as a measure of aggregation.
These VGAM family functions handle multiple responses, so that a response matrix can be inputted.
The number of columns is the number of species, say, and setting zero = -2 means that all species
have a k equalling a (different) intercept only.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, rrvglm and vgam.

Warning
Poisson regression corresponds to k equalling infinity. If the data is Poisson or close to Poisson,
numerical problems may occur. Some corrective measures are taken, e.g., k is effectively capped
(relative to the mean) during estimation to some large value and a warning is issued. And setting
stepsize = 0.5 for half stepping is probably a good idea too when the data is extreme.
The negative binomial distribution (NBD) is a strictly unimodal distribution. Any data set that
does not exhibit a mode (somewhere in the middle) makes the estimation problem difficult. Set
trace = TRUE to monitor convergence.
These functions are fragile; the maximum likelihood estimate of the index parameter is fraught (see
Lawless, 1987). In general, the quasipoissonff is more robust. Other alternatives to negbinomial
are to fit a NB-1 or RR-NB (aka NB-P) model; see Yee (2014). Also available are the NB-C, NB-H
and NB-G. Assigning values to the isize argument may lead to a local solution, and smaller values
are preferred over large values when using this argument.
If one wants to force SFS to be used on all observations, then set max.support = 0 or max.chunk.MB = 0.
If one wants to force the exact method to be used for all observations, then set max.support = Inf.
If the computer has much memory, then trying max.chunk.MB = Inf and max.support = Inf
may provide a small speed increase. If SFS is used at all, then the @weights slot of the fitted object
will be a matrix; otherwise that slot will be a 0 x 0 matrix.
Yet to do: write a family function which uses the methods of moments estimator for k.

Note
These 3 functions implement 2 common parameterizations of the negative binomial (NB). Some
people called the NB with integer k the Pascal distribution, whereas if k is real then this is the Polya
distribution. I dont. The one matching the details of rnbinom in terms of p and k is polya().
For polya() the code may fail when p is close to 0 or 1. It is not yet compatible with cqo or cao.
Suppose the response is called ymat. For negbinomial() the diagonal element of the expected in-
formation matrix (EIM) for parameter k involves an infinite series; consequently SFS (see nsimEIM)
is used as the backup algorithm only. SFS should be better if max(ymat) is large, e.g., max(ymat) > 1000,
or if there are any outliers in ymat. The default algorithm involves a finite series approximation to
the support 0:Inf; the arguments max.memory, min.size and cutoff.prob are pertinent.
Regardless of the algorithm used, convergence problems may occur, especially when the response
has large outliers or is large in magnitude. If convergence failure occurs, try using arguments
negbinomial 507

(in recommended decreasing order) max.support, nsimEIM, cutoff.prob, iprobs.y, imethod,

isize, zero, max.chunk.MB.
The function negbinomial can be used by the fast algorithm in cqo, however, setting eq.tolerances = TRUE
and I.tolerances = FALSE is recommended.
In the first example below (Bliss and Fisher, 1953), from each of 6 McIntosh apple trees in an
orchard that had been sprayed, 25 leaves were randomly selected. On each of the leaves, the number
of adult female European red mites were counted.
There are two special uses of negbinomial for handling count data. Firstly, when used by rrvglm
this results in a continuum of models in between and inclusive of quasi-Poisson and negative bi-
nomial regression. This is known as a reduced-rank negative binomial model (RR-NB). It fits a
negative binomial log-linear regression with variance function V ar(Y ) = + 1 2 where 1 and
2 are parameters to be estimated by MLE. Confidence intervals are available for 2 , therefore it
can be decided upon whether the data are quasi-Poisson or negative binomial, if any.
Secondly, the use of negbinomial with parallel = TRUE inside vglm can result in a model
similar to quasipoissonff. This is named the NB-1 model. The dispersion parameter is estimated
by MLE whereas glm uses the method of moments. In particular, it fits a negative binomial log-
linear regression with variance function V ar(Y ) = 0 where 0 is a parameter to be estimated
by MLE. Confidence intervals are available for 0 .

Author(s)
Thomas W. Yee

References
Lawless, J. F. (1987) Negative binomial and mixed Poisson regression. The Canadian Journal of
Statistics 15, 209225.
Hilbe, J. M. (2011) Negative Binomial Regression, 2nd Edition. Cambridge: Cambridge University
Press.
Bliss, C. and Fisher, R. A. (1953) Fitting the negative binomial distribution to biological data.
Biometrics 9, 174200.
Yee, T. W. (2014) Reduced-rank vector generalized linear models with two linear predictors. Com-
putational Statistics and Data Analysis, 71, 889902.

See Also
quasipoissonff, poissonff, zinegbinomial, negbinomial.size (e.g., NB-G), nbcanlink (NB-
C), posnegbinomial, inv.binomial, rnbinom, nbolf, rrvglm, cao, cqo, CommonVGAMffArguments,
simulate.vlm, ppoints, qnbinom.

Examples
# Example 1: apple tree data (Bliss and Fisher, 1953)
appletree <- data.frame(y = 0:7, w = c(70, 38, 17, 10, 9, 3, 2, 1))
fit <- vglm(y ~ 1, negbinomial(deviance = TRUE), data = appletree,
weights = w, crit = "coef") # Obtain the deviance
fit <- vglm(y ~ 1, negbinomial(deviance = TRUE), data = appletree,
weights = w, half.step = FALSE) # Alternative method
508 negbinomial

summary(fit)
coef(fit, matrix = TRUE)
Coef(fit) # For intercept-only models
deviance(fit) # NB2 only; needs 'crit = "coef"' & 'deviance = TRUE' above

# Example 2: simulated data with multiple responses

## Not run:
ndata <- data.frame(x2 = runif(nn <- 200))
ndata <- transform(ndata, y1 = rnbinom(nn, mu = exp(3+x2), size = exp(1)),
y2 = rnbinom(nn, mu = exp(2-x2), size = exp(0)))
fit1 <- vglm(cbind(y1, y2) ~ x2, negbinomial, data = ndata, trace = TRUE)
coef(fit1, matrix = TRUE)

## End(Not run)

# Example 3: large counts implies SFS is used

## Not run:
ndata <- transform(ndata, y3 = rnbinom(nn, mu = exp(10+x2), size = exp(1)))
with(ndata, range(y3)) # Large counts
fit2 <- vglm(y3 ~ x2, negbinomial, data = ndata, trace = TRUE)
coef(fit2, matrix = TRUE)
head(fit2@weights) # Non-empty; SFS was used

## End(Not run)

# Example 4: a NB-1 to estimate a negative binomial with Var(Y) = phi0 * mu

nn <- 200 # Number of observations
phi0 <- 10 # Specify this; should be greater than unity
delta0 <- 1 / (phi0 - 1)
mydata <- data.frame(x2 = runif(nn), x3 = runif(nn))
mydata <- transform(mydata, mu = exp(2 + 3 * x2 + 0 * x3))
mydata <- transform(mydata, y3 = rnbinom(nn, mu = mu, size = delta0 * mu))
## Not run:
plot(y3 ~ x2, data = mydata, pch = "+", col = 'blue',
main = paste("Var(Y) = ", phi0, " * mu", sep = ""), las = 1)
## End(Not run)
nb1 <- vglm(y3 ~ x2 + x3, negbinomial(parallel = TRUE, zero = NULL),
data = mydata, trace = TRUE)
# Extracting out some quantities:
cnb1 <- coef(nb1, matrix = TRUE)
mydiff <- (cnb1["(Intercept)", "loge(size)"] -
cnb1["(Intercept)", "loge(mu)"])
delta0.hat <- exp(mydiff)
(phi.hat <- 1 + 1 / delta0.hat) # MLE of phi
summary(nb1)
# Obtain a 95 percent confidence interval for phi0:
myvec <- rbind(-1, 1, 0, 0)
(se.mydiff <- sqrt(t(myvec) %*% vcov(nb1) %*% myvec))
ci.mydiff <- mydiff + c(-1.96, 1.96) * se.mydiff
ci.delta0 <- ci.exp.mydiff <- exp(ci.mydiff)
(ci.phi0 <- 1 + 1 / rev(ci.delta0)) # The 95 percent conf. interval for phi0

Confint.nb1(nb1) # Quick way to get it

negbinomial.size 509

summary(glm(y3 ~ x2 + x3, quasipoisson, mydata))$disper # cf. moment estimator

negbinomial.size Negative Binomial Distribution Family Function With Known Size

Description
Maximum likelihood estimation of the mean parameter of a negative binomial distribution with
known size parameter.

Usage
negbinomial.size(size = Inf, lmu = "loge", imu = NULL,
iprobs.y = 0.35, imethod = 1,
ishrinkage = 0.95, zero = NULL)

Arguments
size Numeric, positive. Same as argument size of rnbinom. If the response is a
matrix then this is recycled to a matrix of the same dimension, by row (matrix
with byrow = TRUE).
lmu, imu Same as negbinomial.
iprobs.y Same as negbinomial.
imethod, zero Same as negbinomial.
ishrinkage Same as negbinomial.

Details
This VGAM family function estimates only the mean parameter of the negative binomial distri-
bution. See negbinomial for general information. Setting size = 1 gives what I call the NB-G
(geometric model; see Hilbe (2011)). The default, size = Inf, corresponds to the Poisson distri-
bution.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, rrvglm and vgam.

Note
If lmu = "nbcanlink" in negbinomial.size() then the size argument here should be assigned.

Author(s)
Thomas W. Yee
510 normal.vcm

References

Hilbe, J. M. (2011) Negative Binomial Regression, 2nd Edition. Cambridge: Cambridge University
Press.
Yee, T. W. (2014) Reduced-rank vector generalized linear models with two linear predictors. Com-
putational Statistics and Data Analysis.

See Also

negbinomial, nbcanlink (NB-C model), quasipoissonff, poissonff, rnbinom, simulate.vlm.

Examples
# Simulated data with various multiple responses
size1 <- exp(1); size2 <- exp(2); size3 <- exp(0); size4 <- Inf
ndata <- data.frame(x2 = runif(nn <- 1000))
ndata <- transform(ndata, eta1 = -1 - 2 * x2, # eta1 must be negative
size1 = size1)
ndata <- transform(ndata,
mu1 = nbcanlink(eta1, size = size1, inv = TRUE))
ndata <- transform(ndata,
y1 = rnbinom(nn, mu = mu1, size = size1), # NB-C
y2 = rnbinom(nn, mu = exp(2 - x2), size = size2),
y3 = rnbinom(nn, mu = exp(3 + x2), size = size3), # NB-G
y4 = rpois (nn, la = exp(1 + x2)))

# Also known as NB-C with size known (Hilbe, 2011)

fit1 <- vglm(y1 ~ x2, negbinomial.size(size = size1, lmu = "nbcanlink"),
data = ndata, trace = TRUE, crit = "coef")
coef(fit1, matrix = TRUE)
head(fit1@misc$size) # size saved here

fit2 <- vglm(cbind(y2, y3, y4) ~ x2,

negbinomial.size(size = c(size2, size3, size4)),
data = ndata, trace = TRUE)
coef(fit2, matrix = TRUE)
head(fit2@misc$size) # size saved here

normal.vcm Univariate Normal Distribution as a Varying-Coefficient Model

Description

Maximum likelihood estimation of all the coefficients of a LM where each of the usual regression
coefficients is modelled with other explanatory variables via parameter link functions. Thus this is
a basic varying-coefficient model.
normal.vcm 511

Usage
normal.vcm(link.list = list("(Default)" = "identitylink"),
earg.list = list("(Default)" = list()),
lsd = "loge", lvar = "loge",
esd = list(), evar = list(),
var.arg = FALSE, imethod = 1,
icoefficients = NULL, isd = NULL, zero = "sd",
sd.inflation.factor = 2.5)

Arguments
link.list, earg.list
Link functions and extra arguments applied to the coefficients of the LM, ex-
cluding the standard deviation/variance. See CommonVGAMffArguments for more
information. The default is for an identity link to be applied to each of the re-
gression coefficients.
lsd, esd, lvar, evar
Link function and extra argument applied to the standard deviation/variance.
See CommonVGAMffArguments for more information. Same as uninormal.
icoefficients Optional initial values for the coefficients. Recycled to length M 1 (does not
include the standard deviation/variance). Try using this argument if there is a
link function that is not programmed explicitly to handle range restrictions in
the initialize slot.
var.arg, imethod, isd
Same as, or similar to, uninormal.
zero See CommonVGAMffArguments for more information. The default applies to the
last one, viz. the standard deviation/variance parameter.
sd.inflation.factor
Numeric, should be greater than 1. The initial value of the standard deviation is
multiplied by this, unless isd is inputted. Experience has shown that it is safer
to start off with a larger value rather than a smaller one.

Details
This function allows all the usual LM regression coefficients to be modelled as functions of other
explanatory variables via parameter link functions. For example, we may want some of them to be
positive. Or we may want a subset of them to be positive and add to unity. So a class of such models
have been named varying-coefficient models (VCMs).
The usual linear model is specified through argument form2. As with all other VGAM family
functions, the linear/additive predictors are specified through argument formula.
The multilogit link allows a subset of the coefficients to be positive and add to unity. Either
none or more than one call to multilogit is allowed. The last variable will be used as the base-
line/reference group, and therefore excluded from the estimation.
By default, the log of the standard deviation is the last linear/additive predictor. It is recommended
that this parameter be estimated as intercept-only, for numerical stability.
Technically, the Fisher information matrix is of unit-rank for all but the last parameter (the standard
deviation/variance). Hence an approximation is used that pools over all the observations.
512 normal.vcm

This VGAM family function cannot handle multiple responses. Also, this function will probably
not have the full capabilities of the class of varying-coefficient models as described by Hastie and
Tibshirani (1993). However, it should be able to manage some simple models, especially involving
the following links: identity, loge, logoff, loglog, logit, probit, cauchit. cloglog, rhobit,
fisherz.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, and vgam.

Warning
This VGAM family function is fragile. One should monitor convergence, and possibly enter initial
values especially when there are non-identity-link functions. If the initial value of the standard
deviation/variance is too small then numerical problems may occur. One trick is to fit an intercept-
only only model and feed its predict() output into argument etastart of a more complicated
model. The use of the zero argument is recommended in order to keep models as simple as possible.

Note
The standard deviation/variance parameter is best modelled as intercept-only.
Yet to do: allow an argument such as parallel that enables many of the coefficients to be equal.
Fix a bug: Coef() does not work for intercept-only models.

Author(s)
T. W. Yee

References
Hastie, T. and Tibshirani, R. (1993) Varying-coefficient models. J. Roy. Statist. Soc. Ser. B, 55,
757796.

# A model with covariate x2 for the regression coefficients

fit2 <- vglm(y2 ~ 1 + x2,
form2 = ~ 1 + v2 + v3 + v4 + v5 + v6,
normal.vcm(link.list = list("(Intercept)" = "multilogit",
"v2" = "multilogit",
"v3" = "logit",
514 nparam.vlm

"v4" = "loglog",
"(Default)" = "identitylink",
"v5" = "multilogit"),
earg.list = list("(Intercept)" = list(),
"v2" = list(),
"v3" = list(),
"v4" = list(),
"(Default)" = list(),
"v5" = list()),
zero = c(1:2, 6)),
data = ndata, trace = TRUE)

coef(fit2, matrix = TRUE)

summary(fit2)

nparam.vlm Number of Parameters

Description
Returns the number of parameters in a fitted model object.

Usage
nparam(object, ...)
nparam.vlm(object, dpar = TRUE, ...)
nparam.vgam(object, dpar = TRUE, linear.only = FALSE, ...)
nparam.rrvglm(object, dpar = TRUE, ...)
nparam.qrrvglm(object, dpar = TRUE, ...)
nparam.rrvgam(object, dpar = TRUE, ...)

Arguments
object Some VGAM object, for example, having class vglmff-class.
... Other possible arguments fed into the function.
dpar Logical, include any (estimated) dispersion parameters as a parameter?
linear.only Logical, include only the number of linear (parametric) parameters?

Details
The code was copied from the AIC() methods functions.

Value
Returns a numeric value with the corresponding number of parameters. For vgam objects, this may
be real rather than integer, because the nonlinear degrees of freedom is real-valued.
Oalog 515

Warning
This code has not been double-checked.

Author(s)
T. W. Yee.

See Also
VGLMs are described in vglm-class; VGAMs are described in vgam-class; RR-VGLMs are
described in rrvglm-class; AICvlm.

Examples
pneumo <- transform(pneumo, let = log(exposure.time))
(fit1 <- vglm(cbind(normal, mild, severe) ~ let, propodds, data = pneumo))
coef(fit1)
coef(fit1, matrix = TRUE)
nparam(fit1)
(fit2 <- vglm(hits ~ 1, quasipoissonff, weights = ofreq, data = V1))
coef(fit2)
coef(fit2, matrix = TRUE)
nparam(fit2)
nparam(fit2, dpar = FALSE)

Oalog One-Altered Logarithmic Distribution

Description
Density, distribution function, quantile function and random generation for the one-altered logarith-
mic distribution with parameter pobs1.

Usage
doalog(x, shape, pobs1 = 0, log = FALSE)
poalog(q, shape, pobs1 = 0)
qoalog(p, shape, pobs1 = 0)
roalog(n, shape, pobs1 = 0)

Arguments
x, q, n, p Same Unif.
shape, log Same as Otlog).
pobs1 Probability of (an observed) one, called pobs1. The default value of pobs1 = 0
corresponds to the response having a 1-truncated logarithmic distribution.
516 oalog

Details
The probability function of Y is 1 with probability pobs1, else a 1-truncated logarithmic(shape)
distribution.

Value
doalog gives the density and poalog gives the distribution function, qoalog gives the quantile
function, and roalog generates random deviates.

Note
The argument pobs1 is recycled to the required length, and must have values which lie in the interval
[0, 1].

Author(s)
T. W. Yee

## Not run: x <- 0:10

barplot(rbind(doalog(x, shape = shape, pobs1 = pobs1),
dlog(x, shape = shape)),
beside = TRUE, col = c("blue", "orange"), cex.main = 0.7, las = 1,
ylab = "Probability", names.arg = as.character(x),
main = paste("OAL(shape = ", shape, ", pobs1 = ", pobs1,
") [blue] vs", " Logarithmic(shape = ", shape,
") [orange] densities", sep = ""))
## End(Not run)

oalog One-Altered Logarithmic Distribution

Description
Fits a one-altered logarithmic distribution based on a conditional model involving a Bernoulli dis-
tribution and a 1-truncated logarithmic distribution.
oalog 517

Usage
oalog(lpobs1 = "logit", lshape = "logit",
type.fitted = c("mean", "shape", "pobs1", "onempobs1"),
ipobs1 = NULL, gshape = ppoints(8), zero = NULL)

Arguments
lpobs1 Link function for the parameter p1 or , called pobs1 or phi here. See Links
for more choices.
lshape See logff for details.
gshape, type.fitted
See CommonVGAMffArguments and fittedvlm for information.
ipobs1, zero See CommonVGAMffArguments for information.

Details
The response Y is one with probability p1 , or Y has a 1-truncated logarithmic distribution with
probability 1 p1 . Thus 0 < p1 < 1, which is modelled as a function of the covariates. The
one-altered logarithmic distribution differs from the one-inflated logarithmic distribution in that the
former has ones coming from one source, whereas the latter has ones coming from the logarithmic
distribution too. The one-inflated logarithmic distribution is implemented in the VGAM package.
Some people call the one-altered logarithmic a hurdle model.
The input can be a matrix (multiple responses). By default, the two linear/additive predictors of
oalog are (logit(), logit(s))T .

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, and vgam.
The fitted.values slot of the fitted object, which should be extracted by the generic function
fitted, returns the mean (default) which is given by

= + (1 )A

where A is the mean of the one-truncated logarithmic distribution. If type.fitted = "pobs1"

then p1 is returned.

Note
This family function effectively combines binomialff and otlog into one family function.

Author(s)
T. W. Yee

See Also
Oalog, logff, oilog, CommonVGAMffArguments, simulate.vlm.
518 Oapospois

Examples
odata <- data.frame(x2 = runif(nn <- 1000))
odata <- transform(odata, pobs1 = logit(-1 + 2*x2, inverse = TRUE),
shape = logit(-2 + 3*x2, inverse = TRUE))
odata <- transform(odata, y1 = roalog(nn, shape = shape, pobs1 = pobs1),
y2 = roalog(nn, shape = shape, pobs1 = pobs1))
with(odata, table(y1))

ofit <- vglm(cbind(y1, y2) ~ x2, oalog, data = odata, trace = TRUE)
coef(ofit, matrix = TRUE)
head(fitted(ofit))
head(predict(ofit))
summary(ofit)

Oapospois One-Altered Logarithmic Distribution

Description
Density, distribution function, quantile function and random generation for the one-altered positive-
Poisson distribution with parameter pobs1.

Usage
doapospois(x, lambda, pobs1 = 0, log = FALSE)
poapospois(q, lambda, pobs1 = 0)
qoapospois(p, lambda, pobs1 = 0)
roapospois(n, lambda, pobs1 = 0)

Arguments
x, q, n, p Same Unif.
lambda, log Same as Otpospois).
pobs1 Probability of (an observed) one, called pobs1. The default value of pobs1 = 0
corresponds to the response having a 1-truncated positive-Poisson distribution.

Details
The probability function of Y is 1 with probability pobs1, else a 1-truncated positive-Poisson(lambda)
distribution.

Value
doapospois gives the density and poapospois gives the distribution function, qoapospois gives
the quantile function, and roapospois generates random deviates.
oapospoisson 519

Note
The argument pobs1 is recycled to the required length, and must have values which lie in the interval
[0, 1].

Author(s)
T. W. Yee

See Also
oapospoisson, Oipospois, Otpospois.

Examples
lambda <- 3; pobs1 <- 0.30; x <- (-1):7
doapospois(x, lambda = lambda, pobs1 = pobs1)
table(roapospois(100, lambda = lambda, pobs1 = pobs1))

## Not run: x <- 0:10

barplot(rbind(doapospois(x, lambda = lambda, pobs1 = pobs1),
dpospois(x, lambda = lambda)),
beside = TRUE, col = c("blue", "orange"), cex.main = 0.7, las = 1,
ylab = "Probability", names.arg = as.character(x),
main = paste("OAPP(lambda = ", lambda, ", pobs1 = ", pobs1,
") [blue] vs", " PosPoisson(lambda = ", lambda,
") [orange] densities", sep = ""))
## End(Not run)

oapospoisson One-Altered Positive-Poisson Distribution

Description
Fits a one-altered positive-Poisson distribution based on a conditional model involving a Bernoulli
distribution and a 1-truncated positive-Poisson distribution.

Usage
oapospoisson(lpobs1 = "logit", llambda = "loge",
type.fitted = c("mean", "lambda", "pobs1", "onempobs1"),
ipobs1 = NULL, zero = NULL)

Arguments
lpobs1 Link function for the parameter p1 or , called pobs1 or phi here. See Links
for more choices.
llambda See pospoisson for details.
type.fitted See CommonVGAMffArguments and fittedvlm for information.
ipobs1, zero See CommonVGAMffArguments for information.
520 oapospoisson

Details
The response Y is one with probability p1 , or Y has a 1-truncated positive-Poisson distribution
with probability 1 p1 . Thus 0 < p1 < 1, which is modelled as a function of the covariates. The
one-altered positive-Poisson distribution differs from the one-inflated positive-Poisson distribution
in that the former has ones coming from one source, whereas the latter has ones coming from the
positive-Poisson distribution too. The one-inflated positive-Poisson distribution is implemented in
the VGAM package. Some people call the one-altered positive-Poisson a hurdle model.
The input can be a matrix (multiple responses). By default, the two linear/additive predictors of
oapospoisson are (logit(), log())T .

= + (1 )A

where A is the mean of the one-truncated positive-Poisson distribution. If type.fitted = "pobs1"

then p1 is returned.

Note
This family function effectively combines binomialff and otpospoisson into one family function.

Author(s)
T. W. Yee

See Also
Oapospois, pospoisson, oipospoisson, CommonVGAMffArguments, simulate.vlm.

Examples
odata <- data.frame(x2 = runif(nn <- 1000))
odata <- transform(odata, pobs1 = logit(-1 + 2*x2, inverse = TRUE),
lambda = loge( 1 + 1*x2, inverse = TRUE))
odata <- transform(odata, y1 = roapospois(nn, lambda = lambda, pobs1 = pobs1),
y2 = roapospois(nn, lambda = lambda, pobs1 = pobs1))
with(odata, table(y1))

ofit <- vglm(cbind(y1, y2) ~ x2, oapospoisson, data = odata, trace = TRUE)
coef(ofit, matrix = TRUE)
head(fitted(ofit))
head(predict(ofit))
summary(ofit)
Oazeta 521

Oazeta One-Altered Logarithmic Distribution

Description

Density, distribution function, quantile function and random generation for the one-altered zeta
distribution with parameter pobs1.

Usage

doazeta(x, shape, pobs1 = 0, log = FALSE)

poazeta(q, shape, pobs1 = 0)
qoazeta(p, shape, pobs1 = 0)
roazeta(n, shape, pobs1 = 0)

Arguments

x, q, n, p Same Unif.
shape, log Same as Otzeta).
pobs1 Probability of (an observed) one, called pobs1. The default value of pobs1 = 0
corresponds to the response having a 1-truncated zeta distribution.

Details

The probability function of Y is 1 with probability pobs1, else a 1-truncated zeta distribution.

Value

doazeta gives the density and poazeta gives the distribution function, qoazeta gives the quantile
function, and roazeta generates random deviates.

Note

The argument pobs1 is recycled to the required length, and must have values which lie in the interval
[0, 1].

Author(s)

T. W. Yee

See Also

oazeta, Oizeta, Otzeta, zeta.

522 oazeta

Examples
shape <- 1.1; pobs1 <- 0.10; x <- (-1):7
doazeta(x, shape = shape, pobs1 = pobs1)
table(roazeta(100, shape = shape, pobs1 = pobs1))

## Not run: x <- 0:10

barplot(rbind(doazeta(x, shape = shape, pobs1 = pobs1),
dzeta(x, shape = shape)),
beside = TRUE, col = c("blue", "orange"), cex.main = 0.7, las = 1,
ylab = "Probability", names.arg = as.character(x),
main = paste("OAZ(shape = ", shape, ", pobs1 = ", pobs1,
") [blue] vs", " zeta(shape = ", shape,
") [orange] densities", sep = ""))
## End(Not run)

oazeta One-Altered Zeta Distribution

Description
Fits a one-altered zeta distribution based on a conditional model involving a Bernoulli distribution
and a 1-truncated zeta distribution.

Usage
oazeta(lpobs1 = "logit", lshape = "loge",
type.fitted = c("mean", "shape", "pobs1", "onempobs1"),
gshape = exp((-4:3)/4), ishape = NULL, ipobs1 = NULL, zero = NULL)

Arguments
lpobs1 Link function for the parameter p1 or , called pobs1 or phi here. See Links
for more choices.
lshape See zeta for details.
type.fitted See CommonVGAMffArguments and fittedvlm for information.
gshape, ishape, ipobs1, zero
See CommonVGAMffArguments for information.

Details
The response Y is one with probability p1 , or Y has a 1-truncated zeta distribution with probability
1 p1 . Thus 0 < p1 < 1, which is modelled as a function of the covariates. The one-altered zeta
distribution differs from the one-inflated zeta distribution in that the former has ones coming from
one source, whereas the latter has ones coming from the zeta distribution too. The one-inflated zeta
distribution is implemented in the VGAM package. Some people call the one-altered zeta a hurdle
model.
The input can be a matrix (multiple responses). By default, the two linear/additive predictors of
oazeta are (logit(), log(shape))T .
Oilog 523

= + (1 )A

where A is the mean of the one-truncated zeta distribution. If type.fitted = "pobs1" then p1 is
returned.

Note
This family function effectively combines binomialff and otzeta into one family function.

Author(s)
T. W. Yee

See Also
Oazeta, zetaff, oizeta, otzeta, CommonVGAMffArguments, simulate.vlm.

Examples
odata <- data.frame(x2 = runif(nn <- 1000))
odata <- transform(odata, pobs1 = logit(-1 + 2*x2, inverse = TRUE),
shape = loge( 1 + 1*x2, inverse = TRUE))
odata <- transform(odata, y1 = roazeta(nn, shape = shape, pobs1 = pobs1),
y2 = roazeta(nn, shape = shape, pobs1 = pobs1))
with(odata, table(y1))

ofit <- vglm(cbind(y1, y2) ~ x2, oazeta, data = odata, trace = TRUE)
coef(ofit, matrix = TRUE)
head(fitted(ofit))
head(predict(ofit))
summary(ofit)

Oilog One-Inflated Logarithmic Distribution

Description
Density, distribution function, quantile function and random generation for the one-inflated loga-
rithmic distribution with parameter pstr1.
524 Oilog

Usage
doilog(x, shape, pstr1 = 0, log = FALSE)
poilog(q, shape, pstr1 = 0)
qoilog(p, shape, pstr1 = 0)
roilog(n, shape, pstr1 = 0)

Arguments
x, q, p, n Same as Uniform.
shape Vector of parameters that lie in (0, 1).
pstr1 Probability of a structural one (i.e., ignoring the logarithmic distribution), called
. The default value of = 0 corresponds to the response having an ordinary
logarithmic distribution.
log Same as Uniform.

Details
The probability function of Y is 1 with probability , and Logarithmic(prob) with probability
1 . Thus
P (Y = 1) = + (1 )P (W = 1)
where W is distributed as a Logarithmic(shape) random variable. The VGAM family function
oilog estimates by MLE.

Value
doilog gives the density, poilog gives the distribution function, qoilog gives the quantile function,
and roilog generates random deviates.

Note
The argument pstr1 is recycled to the required length, and usually has values which lie in the
interval [0, 1].
These functions actually allow for the zero-deflated logarithmic distribution. Here, pstr1 is also
permitted to lie in the interval [-dlog(1, shape) / (1 - dlog(1, shape)), 0]. The resulting
probability of a unit count is less than the nominal logarithmic value, and the use of pstr1 to stand
for the probability of a structural 1 loses its meaning.
When pstr1 equals -dlog(1, shape) / (1 - dlog(1, shape)) this corresponds to the 1-
truncated logarithmic distribution.

Author(s)
T. W. Yee

See Also
oilog, rlog, logff, Otlog.
oilog 525

Examples
shape <- 0.5; pstr1 <- 0.3; x <- (-1):7
(ii <- doilog(x, shape, pstr1 = pstr1))
max(abs(poilog(1:200, shape) -
cumsum(shape^(1:200) / (-(1:200) * log1p(-shape))))) # Should be 0

## Not run: x <- 0:10

par(mfrow = c(2, 1)) # One-Inflated logarithmic
barplot(rbind(doilog(x, shape, pstr1 = pstr1), dlog(x, shape)),
beside = TRUE, col = c("blue", "orange"),
main = paste("OILogff(", shape, ", pstr1 = ", pstr1, ") (blue) vs",
" Logff(", shape, ") (orange)", sep = ""),
names.arg = as.character(x))

deflat.limit <- -dlog(1, shape) / plog(1, shape, lower.tail = FALSE)

newpstr1 <- round(deflat.limit, 3) + 0.001 # Inside but near the boundary
barplot(rbind(doilog(x, shape, pstr1 = newpstr1),
dlog(x, shape)),
beside = TRUE, col = c("blue","orange"),
main = paste("ODLogff(", shape, ", pstr1 = ", newpstr1, ") (blue) vs",
" Logff(", shape, ") (orange)", sep = ""),
names.arg = as.character(x))
## End(Not run)

oilog One-inflated Logarithmic Distribution Family Function

Description

Fits a 1-inflated logarithmic distribution.

Usage

oilog(lpstr1 = "logit", lshape = "logit",

type.fitted = c("mean", "shape", "pobs1", "pstr1", "onempstr1"),
ishape = NULL, gpstr1 = ppoints(8), gshape = ppoints(8), zero = NULL)

Arguments

lpstr1, lshape Link functions. For lpstr1: the same idea as zipoisson except it applies to a
structural 1.
gpstr1, gshape, ishape
For initial values. See CommonVGAMffArguments for information.
type.fitted, zero
See CommonVGAMffArguments for information.
526 Oiposbinom

Details
The 1-inflated logarithmic distribution is a mixture distribution of the logarithmic distribution with
some probability of obtaining a (structural) 1. Thus there are two sources for obtaining the value 1.
This distribution is written here in a way that retains a similar notation to the one-inflated positive-
Poisson, i.e., the probability P [Y = 1] involves another parameter . See oipospoisson.
This family function can handle multiple responses.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, rrvglm and vgam.

Author(s)
Thomas W. Yee

Oiposbinom One-Inflated Positive Binomial Distribution

Description
Density, distribution function, quantile function and random generation for the one-inflated positive
binomial distribution with parameter pstr1.

Usage
doiposbinom(x, size, prob, pstr1 = 0, log = FALSE)
poiposbinom(q, size, prob, pstr1 = 0)
qoiposbinom(p, size, prob, pstr1 = 0)
roiposbinom(n, size, prob, pstr1 = 0)
Oiposbinom 527

Arguments
x, p, q, n Same as Posbinom.
size, prob Same as Posbinom.
pstr1 Probability of a structural one (i.e., ignoring the positive binomial distribution),
called . The default value of = 0 corresponds to the response having a pos-
itive binomial distribution. However, pstr1 can also be negative, in which case
it ceases its interpretation as a probability, and this is known as one-deflation.
log Logical. Return the logarithm of the answer?

Details
The probability function of Y is 1 with probability , and P osBinomial(size, prob) with proba-
bility 1 . Thus
P (Y = 1) = + (1 )P (W = 1)
where W is distributed as a positive binomial(size, prob) random variable.

Value
doiposbinom gives the density, poiposbinom gives the distribution function, qoiposbinom gives
the quantile function, and roiposbinom generates random deviates.

Note
The argument pstr1 is recycled to the required length, and usually has values which lie in the
interval [0, 1].
These functions actually allow for the zero-deflated binomial distribution. Here, pstr1 is also
permitted to lie in the interval [A, 0] for some positive quantity A. The resulting probability of a
unit value is less than the nominal positive binomial value, and the use of pstr1 to stand for the
probability of a structural 1 loses its meaning.
If pstr1 equals A then this corresponds to the 0- and 1-truncated binomial distribution.

Author(s)
T. W. Yee

See Also
posbinomial, dbinom, binomialff.

Examples
size <- 10; prob <- 0.2; pstr1 <- 0.4; x <- (-1):size
(ii <- doiposbinom(x, size, prob, pstr1 = pstr1))
table(roiposbinom(100, size, prob, pstr1 = pstr1))
round(doiposbinom(x , size, prob, pstr1 = pstr1) * 100) # Should be similar

## Not run: x <- 0:size

par(mfrow = c(2, 1)) # One-Inflated Positive Binomial
528 oiposbinomial

barplot(rbind(doiposbinom(x, size, prob, pstr1 = pstr1),

dposbinom(x, size, prob)),
beside = TRUE, col = c("blue", "orange"),
main = paste("OIPB(", size, ",", prob, ", pstr1 = ", pstr1, ") (blue) vs",
" PosBinomial(", size, ",", prob, ") (orange)", sep = ""),
names.arg = as.character(x))

# Zero-deflated Pos Binomial

deflat.limit <- -dposbinom(1, size, prob) / (1 - dposbinom(1, size, prob))
deflat.limit <- size * prob / (1 + (size-1) * prob - 1 / (1-prob)^(size-1))
newpstr1 <- round(deflat.limit, 3) + 0.001 # A little from the boundary
barplot(rbind(doiposbinom(x, size, prob, pstr1 = newpstr1),
dposbinom(x, size, prob)),
beside = TRUE, col = c("blue","orange"),
main = paste("ODPB(", size, ",", prob, ", pstr1 = ", newpstr1, ") (blue) vs",
" PosBinomial(", size, ",", prob, ") (orange)", sep = ""),
names.arg = as.character(x))
## End(Not run)

oiposbinomial One-Inflated Positive Binomial Distribution Family Function

Description

Fits a one-inflated positive binomial distribution by maximum likelihood estimation.

Usage

oiposbinomial(lpstr1 = "logit", lprob = "logit",

type.fitted = c("mean", "prob", "pobs1", "pstr1", "onempstr1"),
iprob = NULL, gpstr1 = ppoints(9), gprob = ppoints(9),
multiple.responses = FALSE, zero = NULL)

Arguments

lpstr1, lprob Link functions for the parameter and the positive binomial probability pa-
rameter. See Links for more choices. See CommonVGAMffArguments also. For
the one-deflated model see below.
type.fitted See CommonVGAMffArguments and fittedvlm.
iprob, gpstr1, gprob
For initial values; see CommonVGAMffArguments.
multiple.responses
Logical. See binomialff and posbinomial.
zero See CommonVGAMffArguments for information.
oiposbinomial 529

Details
These functions are based on

P (Y = y) = + (1 )N (1 )N /(1 (1 )N ),

for y = 1/N , and

N
P (Y = y) = (1 ) N y (1 )N (1y) /(1 (1 )N ).
Ny

for y = 2/N, . . . , 1. That is, the response is a sample proportion out of N trials, and the argument
size in roiposbinom is N here. Ideally N > 2 is needed. The parameter is the probability of a
structural one, and it satisfies 0 < < 1 (usually). The mean of Y is E(Y ) = + (1 )/(1
(1 )N ) and these are returned as the default fitted values. By default, the two linear/additive
predictors for oiposbinomial() are (logit(), logit())T .

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm and vgam.

Note
The response variable should have one of the formats described by binomialff, e.g., a factor or
two column matrix or a vector of sample proportions with the weights argument specifying the
values of N .
To work well, one ideally needs large values of N and much greater than 0, i.e., the larger N
and are, the better. If N = 1 then the model is unidentifiable since the number of parameters is
excessive.
Estimated probabilities of a structural one and an observed one are returned, as in zipoisson.
The one-deflated positive binomial distribution might be fitted by setting lpstr1 = "identitylink",
albeit, not entirely reliably. See zipoisson for information that can be applied here.

Author(s)
T. W. Yee

See Also
roiposbinom, posbinomial, binomialff, rbinom.

Examples
size <- 10 # Number of trials; N in the notation above
nn <- 200
odata <- data.frame(pstr1 = logit( 0, inverse = TRUE), # 0.50
mubin1 = logit(-1, inverse = TRUE), # Mean of usual binomial
svec = rep(size, length = nn),
x2 = runif(nn))
odata <- transform(odata,
530 Oipospois

mubin2 = logit(-1 + x2, inverse = TRUE))

odata <- transform(odata,
y1 = roiposbinom(nn, svec, pr = mubin1, pstr1 = pstr1),
y2 = roiposbinom(nn, svec, pr = mubin2, pstr1 = pstr1))
with(odata, table(y1))
fit1 <- vglm(y1 / svec ~ 1, oiposbinomial, data = odata,
weights = svec, trace = TRUE, crit = "coef")
fit2 <- vglm(y2 / svec ~ x2, oiposbinomial, data = odata,
weights = svec, trace = TRUE)

coef(fit1, matrix = TRUE)

Coef(fit1) # Useful for intercept-only models
head(fitted(fit1, type = "pobs1")) # Estimate of P(Y = 1)
head(fitted(fit1))
with(odata, mean(y1)) # Compare this with fitted(fit1)
summary(fit1)

Oipospois One-Inflated Positive Poisson Distribution

Description
Density, distribution function, quantile function and random generation for the one-inflated positive
Poisson distribution with parameter pstr1.

Usage
doipospois(x, lambda, pstr1 = 0, log = FALSE)
poipospois(q, lambda, pstr1 = 0)
qoipospois(p, lambda, pstr1 = 0)
roipospois(n, lambda, pstr1 = 0)

Arguments
x, p, q, n Same as Pospois.
lambda Vector of positive means.
pstr1 Probability of a structural one (i.e., ignoring the positive Poisson distribution),
called . The default value of = 0 corresponds to the response having a
positive Poisson distribution.
log Logical. Return the logarithm of the answer?

Details
The probability function of Y is 1 with probability , and P osP oisson() with probability 1 .
Thus
P (Y = 1) = + (1 )P (W = 1)
where W is distributed as a positive P oisson() random variate.
Oipospois 531

Value
doipospois gives the density, poipospois gives the distribution function, qoipospois gives the
quantile function, and roipospois generates random deviates.

Note
The argument pstr1 is recycled to the required length, and usually has values which lie in the
interval [0, 1].
These functions actually allow for the zero-deflated Poisson distribution. Here, pstr1 is also permit-
ted to lie in the interval [-lambda / (expm1(lambda) - lambda), 0]. The resulting probability
of a unit count is less than the nominal positive Poisson value, and the use of pstr1 to stand for the
probability of a structural 1 loses its meaning.
When pstr1 equals -lambda / (expm1(lambda) - lambda) this corresponds to the 0- and
1-truncated Poisson distribution.

Author(s)
T. W. Yee

See Also
Pospois, oapospoisson, oipospoisson, otpospoisson, pospoisson, dpois, poissonff.

Examples
lambda <- 3; pstr1 <- 0.2; x <- (-1):7
(ii <- doipospois(x, lambda, pstr1 = pstr1))
table(roipospois(100, lambda, pstr1 = pstr1))
round(doipospois(1:10, lambda, pstr1 = pstr1) * 100) # Should be similar

## Not run: x <- 0:10

par(mfrow = c(2, 1)) # One-Inflated Positive Poisson
barplot(rbind(doipospois(x, lambda, pstr1 = pstr1), dpospois(x, lambda)),
beside = TRUE, col = c("blue", "orange"),
main = paste("OIPP(", lambda, ", pstr1 = ", pstr1, ") (blue) vs",
" PosPoisson(", lambda, ") (orange)", sep = ""),
names.arg = as.character(x))

deflat.limit <- -lambda / (expm1(lambda) - lambda) # 0-deflated Pos Poisson

newpstr1 <- round(deflat.limit, 3) + 0.001 # Inside and near the boundary
barplot(rbind(doipospois(x, lambda, pstr1 = newpstr1),
dpospois(x, lambda)),
beside = TRUE, col = c("blue","orange"),
main = paste("ODPP(", lambda, ", pstr1 = ", newpstr1, ") (blue) vs",
" PosPoisson(", lambda, ") (orange)", sep = ""),
names.arg = as.character(x))
## End(Not run)
532 oipospoisson

oipospoisson One-inflated Positive Poisson Distribution Family Function

Description
Fits a 1-inflated positive Poisson distribution.

Usage
oipospoisson(lpstr1 = "logit", llambda = "loge",
type.fitted = c("mean", "lambda", "pobs1", "pstr1", "onempstr1"),
ilambda = NULL, gpstr1 = (1:19)/20, gprobs.y = (1:19)/20,
imethod = 1, zero = NULL)

Arguments
lpstr1, llambda
For lpstr1: the same idea as zipoisson except it applies to a structural 1.
ilambda, gpstr1, gprobs.y, imethod
For initial values. See CommonVGAMffArguments for information.
type.fitted, zero
See CommonVGAMffArguments for information.

Details
The 1-inflated positive Poisson distribution is a mixture distribution of the positive (0-truncated)
Poisson distribution with some probability of obtaining a (structural) 1. Thus there are two sources
for obtaining the value 1. It is similar to a zero-inflated Poisson model, except the Poisson is
replaced by a positive Poisson and the 0 is replaced by 1. This distribution is written here in a way
that retains a similar notation to the zero-inflated Poisson, i.e., the probability P [Y = 1] involves
another parameter . See zipoisson.
This family function can handle multiple responses.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, rrvglm and vgam.

Warning
Under- or over-flow may occur if the data is ill-conditioned.

Author(s)
Thomas W. Yee
Oizeta 533

See Also
Oipospois, pospoisson, oapospoisson, otpospoisson, zipoisson, poissonff, simulate.vlm.

Examples
set.seed(1)
pdata <- data.frame(x2 = runif(nn <- 1000)) # Artificial data
pdata <- transform(pdata, pstr1 = 0.5, lambda = exp(3 - x2))
pdata <- transform(pdata, y1 = roipospois(nn, lambda, pstr1 = pstr1))
with(pdata, table(y1))
fit1 <- vglm(y1 ~ x2, oipospoisson, data = pdata, trace = TRUE)
coef(fit1, matrix = TRUE)

Oizeta One-Inflated Zeta Distribution

Description
Density, distribution function, quantile function and random generation for the one-inflated zeta
distribution with parameter pstr1.

Usage
doizeta(x, shape, pstr1 = 0, log = FALSE)
poizeta(q, shape, pstr1 = 0)
qoizeta(p, shape, pstr1 = 0)
roizeta(n, shape, pstr1 = 0)

Arguments
x, q, p, n Same as Uniform.
shape Vector of positive shape parameters.
pstr1 Probability of a structural one (i.e., ignoring the zeta distribution), called . The
default value of = 0 corresponds to the response having an ordinary zeta
distribution.
log Same as Uniform.

Details
The probability function of Y is 1 with probability , and Zeta(shape) with probability 1 .
Thus
P (Y = 1) = + (1 )P (W = 1)
where W is distributed as a zeta(shape) random variable.
534 Oizeta

Value
doizeta gives the density, poizeta gives the distribution function, qoizeta gives the quantile
function, and roizeta generates random deviates.

Note
The argument pstr1 is recycled to the required length, and usually has values which lie in the
interval [0, 1].
These functions actually allow for the zero-deflated zeta distribution. Here, pstr1 is also permitted
to lie in the interval [-dzeta(1, shape) / (1 - dzeta(1, shape)), 0]. The resulting
probability of a unit count is less than the nominal zeta value, and the use of pstr1 to stand for the
probability of a structural 1 loses its meaning.
When pstr1 equals -dzeta(1, shape) / (1 - dzeta(1, shape)) this corresponds to the
1-truncated zeta distribution.

Author(s)
T. W. Yee

## Not run: x <- 0:10

par(mfrow = c(2, 1)) # One-Inflated zeta
barplot(rbind(doizeta(x, shape, pstr1 = pstr1), dzeta(x, shape)),
beside = TRUE, col = c("blue", "orange"),
main = paste("OIZeta(", shape, ", pstr1 = ", pstr1, ") (blue) vs",
" Zeta(", shape, ") (orange)", sep = ""),
names.arg = as.character(x))

deflat.limit <- -dzeta(1, shape) / pzeta(1, shape, lower.tail = FALSE)

newpstr1 <- round(deflat.limit, 3) + 0.001 # Inside but near the boundary
barplot(rbind(doizeta(x, shape, pstr1 = newpstr1),
dzeta(x, shape)),
beside = TRUE, col = c("blue","orange"),
main = paste("ODZeta(", shape, ", pstr1 = ", newpstr1, ") (blue) vs",
" Zeta(", shape, ") (orange)", sep = ""),
names.arg = as.character(x))
## End(Not run)
oizeta 535

oizeta One-inflated Zeta Distribution Family Function

Description
Fits a 1-inflated zeta distribution.

Usage
oizeta(lpstr1 = "logit", lshape = "loge",
type.fitted = c("mean", "shape", "pobs1", "pstr1", "onempstr1"),
ishape = NULL, gpstr1 = ppoints(8), gshape = exp((-3:3) / 4), zero = NULL)

Arguments
lpstr1, lshape For lpstr1: the same idea as zipoisson except it applies to a structural 1.
gpstr1, gshape, ishape
For initial values. See CommonVGAMffArguments for information.
type.fitted, zero
See CommonVGAMffArguments for information.

Details
The 1-inflated zeta distribution is a mixture distribution of the zeta distribution with some prob-
ability of obtaining a (structural) 1. Thus there are two sources for obtaining the value 1. This
distribution is written here in a way that retains a similar notation to the zero-inflated Poisson, i.e.,
the probability P [Y = 1] involves another parameter . See zipoisson.
This family function can handle multiple responses.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, rrvglm and vgam.

Warning
Under- or over-flow may occur if the data is ill-conditioned. Lots of data is needed to estimate the
parameters accurately. Usually, probably the shape parameter is best modelled as intercept-only.

Author(s)
Thomas W. Yee

See Also
Oizeta, zetaff, oazeta, otzeta, diffzeta, zeta, Oizipf.
536 Oizipf

Examples
## Not run: odata <- data.frame(x2 = runif(nn <- 1000)) # Artificial data
odata <- transform(odata, pstr1 = logit(-1 + x2, inverse = TRUE), shape = exp(-0.5))
odata <- transform(odata, y1 = roizeta(nn, shape, pstr1 = pstr1))
with(odata, table(y1))
fit1 <- vglm(y1 ~ x2, oizeta(zero = "shape"), data = odata, trace = TRUE)
coef(fit1, matrix = TRUE)

## End(Not run)

Oizipf One-Inflated Zipf Distribution

Description
Density, distribution function, quantile function and random generation for the one-inflated Zipf
distribution with parameter pstr1.

Usage
doizipf(x, N, shape, pstr1 = 0, log = FALSE)
poizipf(q, N, shape, pstr1 = 0)
qoizipf(p, N, shape, pstr1 = 0)
roizipf(n, N, shape, pstr1 = 0)

Arguments
x, q, p, n Same as Uniform.
N, shape See Zipf.
pstr1 Probability of a structural one (i.e., ignoring the Zipf distribution), called . The
default value of = 0 corresponds to the response having an ordinary Zipf
distribution.
log Same as Uniform.

Details
The probability function of Y is 1 with probability , and Zipf (N, s) with probability 1 . Thus

P (Y = 1) = + (1 )P (W = 1)

where W is distributed as a Zipf (N, s) random variable. The VGAM family function oizeta
estimates the two parameters of this model by Fisher scoring.

Value
doizipf gives the density, poizipf gives the distribution function, qoizipf gives the quantile
function, and roizipf generates random deviates.
oizipf 537

Note
The argument pstr1 is recycled to the required length, and usually has values which lie in the
interval [0, 1].
These functions actually allow for the zero-deflated Zipf distribution. Here, pstr1 is also permitted
to lie in the interval [-dzipf(1, N, s) / (1 - dzipf(1, N, s)), 0]. The resulting probability
of a unit count is less than the nominal zipf value, and the use of pstr1 to stand for the probability
of a structural 1 loses its meaning.
When pstr1 equals -dzipf(1, N, s) / (1 - dzipf(1, N, s)) this corresponds to the
1-truncated zipf distribution.

Author(s)
T. W. Yee

## Not run: x <- 0:10

par(mfrow = c(2, 1)) # One-Inflated zipf
barplot(rbind(doizipf(x, N, shape, pstr1 = pstr1), dzipf(x, N, shape)),
beside = TRUE, col = c("blue", "orange"),
main = paste("OIZipf(", N, ", ", shape, ", pstr1 = ", pstr1, ") (blue) vs",
" Zipf(", N, ", ", shape, ") (orange)", sep = ""),
names.arg = as.character(x))

deflat.limit <- -dzipf(1, N, shape) / (1 - dzipf(1, N, shape))

newpstr1 <- round(deflat.limit, 3) + 0.001 # Inside but near the boundary
barplot(rbind(doizipf(x, N, shape, pstr1 = newpstr1),
dzipf(x, N, shape)),
beside = TRUE, col = c("blue", "orange"),
main = paste("ODZipf(", N, ", ", shape, ", pstr1 = ", newpstr1, ") (blue) vs",
" Zipf(", N, ", ", shape, ") (orange)", sep = ""),
names.arg = as.character(x))
## End(Not run)

oizipf One-inflated Zipf Distribution Family Function

Description
Fits a 1-inflated Zipf distribution.
538 oizipf

Usage
oizipf(N = NULL, lpstr1 = "logit", lshape = "loge",
type.fitted = c("mean", "shape", "pobs1", "pstr1", "onempstr1"),
ishape = NULL, gpstr1 = ppoints(8), gshape = exp((-3:3) / 4), zero = NULL)

Arguments
N Same as zipf.
lpstr1, lshape For lpstr1: the same idea as zipoisson except it applies to a structural 1.
gpstr1, gshape, ishape
For initial values. See CommonVGAMffArguments for information.
type.fitted, zero
See CommonVGAMffArguments for information.

Details
The 1-inflated Zipf distribution is a mixture distribution of the Zipf distribution with some prob-
ability of obtaining a (structural) 1. Thus there are two sources for obtaining the value 1. This
distribution is written here in a way that retains a similar notation to the zero-inflated Poisson, i.e.,
the probability P [Y = 1] involves another parameter . See zipoisson.
This family function can handle multiple responses.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, rrvglm and vgam.

Author(s)
Thomas W. Yee

coef(fit1, matrix = TRUE)

## End(Not run)

olympics 2008 and 2012 Summer Olympic Final Medal Count Data

Description

Final medal count, by country, for the Summer 2008 and 2012 Olympic Games.

Usage

data(olym08)
data(olym12)

Format

A data frame with 87 or 85 observations on the following 6 variables.

rank a numeric vector, overall ranking of the countries.

country a factor.
gold a numeric vector, number of gold medals.
silver a numeric vector, number of silver medals.
bronze a numeric vector, number of bronze medals.
totalmedal a numeric vector, total number of medals.

Details

The events were held during (i) August 824, 2008, in Beijing; and (ii) 27 July12 August, 2012,
in London.

References

The official English website was/is http://en.beijing2008.cn and http://www.london2012.com.

Help from Viet Hoang Quoc is gratefully acknowledged.

See Also

grc.
540 Opt

Examples
summary(olym08)
summary(olym12)
## maybe str(olym08) ; plot(olym08) ...
## Not run: par(mfrow = c(1, 2))
myylim <- c(0, 55)
with(head(olym08, n = 8),
barplot(rbind(gold, silver, bronze),
col = c("gold", "grey", "brown"), # No "silver" or "bronze"!
# "gold", "grey71", "chocolate4",
names.arg = country, cex.names = 0.5, ylim = myylim,
beside = TRUE, main = "2008 Summer Olympic Final Medal Count",
ylab = "Medal count", las = 1,
sub = "Top 8 countries; 'gold'=gold, 'grey'=silver, 'brown'=bronze"))
with(head(olym12, n = 8),
barplot(rbind(gold, silver, bronze),
col = c("gold", "grey", "brown"), # No "silver" or "bronze"!
names.arg = country, cex.names = 0.5, ylim = myylim,
beside = TRUE, main = "2012 Summer Olympic Final Medal Count",
ylab = "Medal count", las = 1,
sub = "Top 8 countries; 'gold'=gold, 'grey'=silver, 'brown'=bronze"))
## End(Not run)

Opt Optimums

Description
Generic function for the optimums (or optima) of a model.

Usage
Opt(object, ...)

Arguments
object An object for which the computation or extraction of an optimum (or optimums)
is meaningful.
... Other arguments fed into the specific methods function of the model. Sometimes
they are fed into the methods function for Coef.

Details
Different models can define an optimum in different ways. Many models have no such notion or
definition.
Optimums occur in quadratic and additive ordination, e.g., CQO or CAO. For these models the
optimum is the value of the latent variable where the maximum occurs, i.e., where the fitted value
achieves its highest value. For quadratic ordination models there is a formula for the optimum but
Opt 541

for additive ordination models the optimum must be searched for numerically. If it occurs on the
boundary, then the optimum is undefined. At an optimum, the fitted value of the response is called
the maximum.

Value

The value returned depends specifically on the methods function invoked.

Note

In ordination, the optimum of a species is sometimes called the species score.

Author(s)

Thomas W. Yee

References

Yee, T. W. (2004) A new technique for maximum-likelihood canonical Gaussian ordination. Eco-
logical Monographs, 74, 685701.
Yee, T. W. (2006) Constrained additive ordination. Ecology, 87, 203213.

See Also

Opt.qrrvglm, Max, Tol.

Examples
set.seed(111) # This leads to the global solution
hspider[,1:6] <- scale(hspider[,1:6]) # Standardized environmental vars
# vvv p1 = cqo(cbind(Alopacce, Alopcune, Alopfabr, Arctlute, Arctperi,
# vvv Auloalbi, Pardlugu, Pardmont, Pardnigr, Pardpull,
# vvv Trocterr, Zoraspin) ~
# vvv WaterCon + BareSand + FallTwig + CoveMoss + CoveHerb + ReflLux,
# vvv Bestof = 2,
# vvv fam = quasipoissonff, data = hspider, Crow1positive=FALSE)
# vvv Opt(p1)

## Not run:
index <- 1:ncol(depvar(p1))
persp(p1, col = index, las = 1, lwd = 2, main = "Vertical lines at the optimums")
abline(v = Opt(p1), lty = 2, col = index)

## End(Not run)
542 ordpoisson

ordpoisson Ordinal Poisson Family Function

Description
Fits a Poisson regression where the response is ordinal (the Poisson counts are grouped between
known cutpoints).

Usage
ordpoisson(cutpoints, countdata = FALSE, NOS = NULL,
Levels = NULL, init.mu = NULL, parallel = FALSE,
zero = NULL, link = "loge")

Arguments
cutpoints Numeric. The cutpoints, Kl . These must be non-negative integers. Inf values
may be included. See below for further details.
countdata Logical. Is the response (LHS of formula) in count-data format? If not then the
response is a matrix or vector with values 1, 2, . . . , L, say, where L is the number
of levels. Such input can be generated with cut with argument labels = FALSE.
If countdata = TRUE then the response is expected to be in the same format
as depvar(fit) where fit is a fitted model with ordpoisson as the VGAM
family function. That is, the response is matrix of counts with L columns (if
NOS = 1).
NOS Integer. The number of species, or more generally, the number of response
random variates. This argument must be specified when countdata = TRUE.
Usually NOS = 1.
Levels Integer vector, recycled to length NOS if necessary. The number of levels for
each response random variate. This argument should agree with cutpoints.
This argument must be specified when countdata = TRUE.
init.mu Numeric. Initial values for the means of the Poisson regressions. Recycled to
length NOS if necessary. Use this argument if the default initial values fail (the
default is to compute an initial value internally).
parallel, zero, link
See poissonff.

Details
This VGAM family function uses maximum likelihood estimation (Fisher scoring) to fit a Poisson
regression to each column of a matrix response. The data, however, is ordinal, and is obtained from
known integer cutpoints. Here, l = 1, . . . , L where L (L 2) is the number of levels. In more
detail, let Y = l if Kl1 < Y Kl where the Kl are the cutpoints. We have K0 = and
KL = . The response for this family function corresponds to Y but we are really interested in
the Poisson regression of Y .
ordpoisson 543

If NOS=1 then the argument cutpoints is a vector (K1 , K2 , . . . , KL ) where the last value (Inf) is
optional. If NOS>1 then the vector should have NOS-1 Inf values separating the cutpoints. For exam-
ple, if there are NOS=3 responses, then something like ordpoisson(cut = c(0, 5, 10, Inf, 20, 30, Inf, 0, 10, 40, In
is valid.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm and vgam.

Warning
The input requires care as little to no checking is done. If fit is the fitted object, have a look at
fit@extra and depvar(fit) to check.

Note
Sometimes there are no observations between two cutpoints. If so, the arguments Levels and NOS
need to be specified too. See below for an example.

Author(s)
Thomas W. Yee

References
Yee, T. W. (2012) Ordinal ordination with normalizing link functions for count data, (in prepara-
tion).

coef(fit, matrix = TRUE)

fit@extra

# Example 2: multivariate and there are no obsns between some cutpoints

cutpts2 <- c(-Inf, 0, 9, 10, 20, 70, 200, 201, Inf)
fcutpts2 <- cutpts2[is.finite(cutpts2)] # finite cutpoints
y2 <- rpois(n, lambda = mymu) # Same model as y1
ystar2 <- cut(y2, breaks = cutpts2, labels = FALSE)
table(ystar2) / sum(table(ystar2))
fit <- vglm(cbind(ystar,ystar2) ~ x2 + x3, fam =
ordpoisson(cutpoi = c(fcutpts,Inf,fcutpts2,Inf),
Levels = c(length(fcutpts)+1,length(fcutpts2)+1),
parallel = TRUE), trace = TRUE)
coef(fit, matrix = TRUE)
fit@extra
constraints(fit)
summary(depvar(fit)) # Some columns have all zeros

Otlog One-truncated Logarithmic Distribution

Description
Density, distribution function, quantile function, and random generation for the one-truncated log-
arithmic distribution.

Usage
dotlog(x, shape, log = FALSE)
potlog(q, shape, log.p = FALSE)
qotlog(p, shape)
rotlog(n, shape)

Arguments
x, q Vector of quantiles. For the density, it should be a vector with integer values > 1
in order for the probabilities to be positive.
p vector of probabilities.
n number of observations. Same as in runif.
shape The parameter value c described in in logff. Here it is called shape because
0 < c < 1 is the range.
log, log.p Logical. If log.p = TRUE then all probabilities p are given as log(p).

Details
The one-truncated logarithmic distribution is a logarithmic distribution but with the probability of
a one being zero. The other probabilities are scaled to add to unity. Some more details are given in
logff.
otlog 545

Value

dotlog gives the density, potlog gives the distribution function, qotlog gives the quantile function,
and rotlog generates random deviates.

Note

Given some response data, the VGAM family function otlog estimates the parameter shape. Func-
tion potlog() suffers from the problems that plog sometimes has.

Author(s)

T. W. Yee

See Also

otlog, rlog, Oilog.

Examples
dotlog(1:20, 0.5)
rotlog(20, 0.5)

## Not run: shape <- 0.8; x <- 1:10

plot(x, dotlog(x, shape = shape), type = "h", ylim = 0:1,
sub = "shape=0.8", las = 1, col = "blue", ylab = "Probability",
main = "1-truncated logarithmic distribution: blue=PMF; orange=CDF")
lines(x + 0.1, potlog(x, shape = shape), col = "orange", lty = 3, type = "h")
## End(Not run)

otlog One-truncated Logarithmic Distribution

Description

Estimating the (single) parameter of the 1-truncated logarithmic distribution.

Usage

otlog(lshape = "logit", gshape = ppoints(8), zero = NULL)

Arguments
lshape, gshape, zero
Same as logff.
546 Otpospois

Details
The 1-truncated logarithmic distribution is a logarithmic distribution but with the probability of a
one being zero. The other probabilities are scaled to add to unity. Some more details can be found
at logff. Multiple responses are permitted.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, and vgam.

Author(s)
T. W. Yee

See Also
Otlog, logff, oalog, oilog, simulate.vlm.

Examples
odata <- data.frame(y1 = rotlog(n = 1000, shape = logit(1/3, inverse = TRUE)))
ofit <- vglm(y1 ~ 1, otlog, data = odata, trace = TRUE, crit = "c")
coef(ofit, matrix = TRUE)
Coef(ofit)
## Not run: with(odata,
hist(y1, shape = TRUE, breaks = seq(0.5, max(y1) + 0.5, by = 1),
border = "blue"))
x <- seq(1, with(odata, max(y1)), by = 1)
with(odata, lines(x, dotlog(x, Coef(ofit)[1]), col = "orange", type = "h", lwd = 2))
## End(Not run)

Otpospois One-truncated Positive-Poisson Distribution

Description
Density, distribution function, quantile function, and random generation for the one-truncated positive-
Poisson distribution.

Usage
dotpospois(x, lambda, log = FALSE)
potpospois(q, lambda, log.p = FALSE)
qotpospois(p, lambda)
rotpospois(n, lambda)
Otpospois 547

Arguments

x, q, p, n Same as Pospois.
lambda, log, log.p
Same as Pospois.

Details

The one-truncated positive-Poisson is a Poisson distribution but with the probability of a one and
a zero being 0. That is, its support is 2, 3, . . . . The other probabilities are scaled to add to unity.
Some more details are given in pospoisson.

Value

dotpospois gives the density, potpospois gives the distribution function, qotpospois gives the
quantile function, and rotpospois generates random deviates.

Note

Given some response data, the VGAM family function otpospoisson estimates the parameter
lambda.

Author(s)

T. W. Yee

See Also

otpospoisson, Pospois, Oipospois.

Examples

dotpospois(1:20, 0.5)
rotpospois(20, 0.5)

## Not run: lambda <- 4; x <- 1:10

plot(x, dotpospois(x, lambda = lambda), type = "h", ylim = 0:1,
sub = "lambda=4", las = 1, col = "blue", ylab = "Probability",
main = "1-truncated positive-Poisson distribution: blue=PMF; orange=CDF")
lines(x + 0.1, potpospois(x, lambda = lambda), col = "orange", lty = 3, type = "h")
## End(Not run)
548 otpospoisson

otpospoisson One-truncated Poisson Distribution

Description
Estimating the (single) parameter of the 1-truncated positive Poisson distribution.

Usage
otpospoisson(llambda = "loge",
type.fitted = c("mean", "lambda", "prob0", "prob1"),
ilambda = NULL, imethod = 1, zero = NULL)

Arguments
llambda, type.fitted, ilambda
Same as pospoisson.
imethod, zero Same as pospoisson.

Details
The 1-truncated positive Poisson distribution has support on 2, 3, . . . . It is a Poisson distribution
but with the probability of a one or zero being 0. The other probabilities are scaled to add to unity.
Some more details can be found at pospoisson. Multiple responses are permitted.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, and vgam.

Author(s)
T. W. Yee

See Also
Otpospois, oipospoisson, simulate.vlm.

Examples
odata <- data.frame(y1 = rotpospois(n = 1000, lambda = loge(1, inverse = TRUE)))
ofit <- vglm(y1 ~ 1, otpospoisson, data = odata, trace = TRUE, crit = "c")
coef(ofit, matrix = TRUE)
Coef(ofit)
## Not run: with(odata,
hist(y1, prob = TRUE, breaks = seq(0.5, max(y1) + 0.5, by = 1),
border = "blue"))
x <- seq(1, with(odata, max(y1)), by = 1)
with(odata, lines(x, dotpospois(x, Coef(ofit)[1]), col = "orange", type = "h", lwd = 2))
Otzeta 549

## End(Not run)

Otzeta One-truncated Zeta Distribution

Description
Density, distribution function, quantile function, and random generation for the one-truncated zeta
distribution.

Usage
dotzeta(x, shape, log = FALSE)
potzeta(q, shape, log.p = FALSE)
qotzeta(p, shape)
rotzeta(n, shape)

Arguments
x, q, p, n Same as in runif.
shape The positive shape parameter described in in zetaff. Here it is called shape
because it is positive.
log, log.p Same as in runif.

Details
The one-truncated zeta distribution is a zeta distribution but with the probability of a one being zero.
The other probabilities are scaled to add to unity. Some more details are given in zetaff.

Value
dotzeta gives the density, potzeta gives the distribution function, qotzeta gives the quantile
function, and rotzeta generates random deviates.

Note
Given some response data, the VGAM family function otzeta estimates the parameter shape.

Author(s)
T. W. Yee

See Also
Otzeta, zetaff, Oizeta.
550 otzeta

Examples
dotzeta(1:20, 0.5)
rotzeta(20, 0.5)

## Not run: shape <- 0.8; x <- 1:10

plot(x, dotzeta(x, shape = shape), type = "h", ylim = 0:1,
sub = "shape=0.8", las = 1, col = "blue", ylab = "Probability",
main = "1-truncated zeta distribution: blue=PMF; orange=CDF")
lines(x + 0.1, potzeta(x, shape = shape), col = "orange", lty = 3, type = "h")
## End(Not run)

otzeta One-truncated Zeta Distribution Family Function

Description
Estimates the parameter of the 1-truncated zeta distribution.

Usage
otzeta(lshape = "loge", ishape = NULL, gshape = exp((-4:3)/4), zero = NULL)

Arguments
lshape, ishape, gshape, zero
Same as zetaff.

Details
The 1-truncated zeta distribution is the ordinary zeta distribution but with the probability of one
being 0. Thus the other probabilities are scaled up (i.e., divided by 1 P [Y = 1]). The mean is
returned by default as the fitted values. More details can be found at zetaff. Multiple responses
are handled.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, and vgam.

Author(s)
T. W. Yee

See Also
Otzeta, zetaff, oizeta, diffzeta, zeta, dzeta, hzeta, zipf.
oxtemp 551

Examples

odata <- data.frame(x2 = runif(nn <- 1000)) # Artificial data

odata <- transform(odata, shape = loge(-0.25 + x2, inverse = TRUE))
odata <- transform(odata, y1 = rotzeta(nn, shape))
with(odata, table(y1))
ofit <- vglm(y1 ~ x2, otzeta, data = odata, trace = TRUE, crit = "coef")
coef(ofit, matrix = TRUE)

oxtemp Oxford Temperature Data

Description

Annual maximum temperatures collected at Oxford, UK.

Usage

data(oxtemp)

Format

A data frame with 80 observations on the following 2 variables.

maxtemp Annual maximum temperatures (in degrees Fahrenheit).

year The values 1901 to 1980.

Details

The data were collected from 1901 to 1980.

Source

Unknown.

Examples

## Not run: fit <- vglm(maxtemp ~ 1, gevff, data = oxtemp, trace = TRUE)
552 Paralogistic

Paralogistic The Paralogistic Distribution

Description
Density, distribution function, quantile function and random generation for the paralogistic distri-
bution with shape parameter a and scale parameter scale.

Usage
dparalogistic(x, scale = 1, shape1.a, log = FALSE)
pparalogistic(q, scale = 1, shape1.a, lower.tail = TRUE, log.p = FALSE)
qparalogistic(p, scale = 1, shape1.a, lower.tail = TRUE, log.p = FALSE)
rparalogistic(n, scale = 1, shape1.a)

Arguments
x, q vector of quantiles.
p vector of probabilities.
n number of observations. If length(n) > 1, the length is taken to be the number
required.
shape1.a shape parameter.
scale scale parameter.
log Logical. If log=TRUE then the logarithm of the density is returned.
lower.tail, log.p
Same meaning as in pnorm or qnorm.

Details
See paralogistic, which is the VGAM family function for estimating the parameters by maxi-
mum likelihood estimation.

Value
dparalogistic gives the density, pparalogistic gives the distribution function, qparalogistic
gives the quantile function, and rparalogistic generates random deviates.

Note
The paralogistic distribution is a special case of the 4-parameter generalized beta II distribution.

Author(s)
T. W. Yee and Kai Huang
paralogistic 553

References

Kleiber, C. and Kotz, S. (2003) Statistical Size Distributions in Economics and Actuarial Sciences,
Hoboken, NJ, USA: Wiley-Interscience.

See Also

paralogistic, genbetaII.

Examples
pdata <- data.frame(y = rparalogistic(n = 3000, scale = exp(1), exp(2)))
fit <- vglm(y ~ 1, paralogistic(lss = FALSE, ishape1.a = 4.1),
data = pdata, trace = TRUE)
coef(fit, matrix = TRUE)
Coef(fit)

paralogistic Paralogistic Distribution Family Function

Description

Maximum likelihood estimation of the 2-parameter paralogistic distribution.

Usage

paralogistic(lscale = "loge", lshape1.a = "loge", iscale = NULL,

ishape1.a = NULL, imethod = 1, lss = TRUE, gscale = exp(-5:5),
gshape1.a = seq(0.75, 4, by = 0.25), probs.y = c(0.25, 0.5, 0.75),
zero = "shape")

Arguments

lss See CommonVGAMffArguments for important information.

lshape1.a, lscale
Parameter link functions applied to the (positive) parameters a and scale. See
Links for more choices.
iscale, ishape1.a, imethod, zero
See CommonVGAMffArguments for information. For imethod = 2 a good initial
value for ishape1.a is needed to obtain good estimates for the other parameter.
gscale, gshape1.a
See CommonVGAMffArguments for information.
probs.y See CommonVGAMffArguments for information.
554 paralogistic

Details
The 2-parameter paralogistic distribution is the 4-parameter generalized beta II distribution with
shape parameter p = 1 and a = q. It is the 3-parameter Singh-Maddala distribution with a = q.
More details can be found in Kleiber and Kotz (2003).
The 2-parameter paralogistic has density

f (y) = a2 y a1 /[ba {1 + (y/b)a }1+a ]

for a > 0, b > 0, y 0. Here, b is the scale parameter scale, and a is the shape parameter. The
mean is
E(Y ) = b (1 + 1/a) (a 1/a)/(a)
provided a > 1; these are returned as the fitted values. This family function handles multiple
responses.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, and vgam.

Note
See the notes in genbetaII.

Author(s)
T. W. Yee

References
Kleiber, C. and Kotz, S. (2003) Statistical Size Distributions in Economics and Actuarial Sciences,
Hoboken, NJ, USA: Wiley-Interscience.

See Also
Paralogistic, sinmad, genbetaII, betaII, dagum, fisk, inv.lomax, lomax, inv.paralogistic.

Examples
pdata <- data.frame(y = rparalogistic(n = 3000, exp(1), scale = exp(1)))
fit <- vglm(y ~ 1, paralogistic(lss = FALSE), data = pdata, trace = TRUE)
fit <- vglm(y ~ 1, paralogistic(ishape1.a = 2.3, iscale = 5),
data = pdata, trace = TRUE)
coef(fit, matrix = TRUE)
Coef(fit)
summary(fit)
Pareto 555

Pareto The Pareto Distribution

Description
Density, distribution function, quantile function and random generation for the Pareto(I) distribution
with parameters scale and shape.

Usage
dpareto(x, scale = 1, shape, log = FALSE)
ppareto(q, scale = 1, shape, lower.tail = TRUE, log.p = FALSE)
qpareto(p, scale = 1, shape, lower.tail = TRUE, log.p = FALSE)
rpareto(n, scale = 1, shape)

Arguments
x, q vector of quantiles.
p vector of probabilities.
n number of observations. Same as in runif.
scale, shape the and k parameters.
log Logical. If log = TRUE then the logarithm of the density is returned.
lower.tail, log.p
Same meaning as in pnorm or qnorm.

Details
See paretoff, the VGAM family function for estimating the parameter k by maximum likelihood
estimation, for the formula of the probability density function and the range restrictions imposed on
the parameters.

Value
dpareto gives the density, ppareto gives the distribution function, qpareto gives the quantile
function, and rpareto generates random deviates.

Author(s)
T. W. Yee and Kai Huang

References
Forbes, C., Evans, M., Hastings, N. and Peacock, B. (2011) Statistical Distributions, Hoboken, NJ,
USA: John Wiley and Sons, Fourth edition.
556 paretoff

paretoff Pareto and Truncated Pareto Distribution Family Functions

Description
Estimates one of the parameters of the Pareto(I) distribution by maximum likelihood estimation.
Also includes the upper truncated Pareto(I) distribution.

Usage
paretoff(scale = NULL, lshape = "loge")
truncpareto(lower, upper, lshape = "loge", ishape = NULL, imethod = 1)

Arguments
lshape Parameter link function applied to the parameter k. See Links for more choices.
A log link is the default because k is positive.
scale Numeric. The parameter below. If the user inputs a number then it is assumed
known with this value. The default means it is estimated by maximum likelihood
estimation, which means min(y) is used, where y is the response vector.
lower, upper Numeric. Lower and upper limits for the truncated Pareto distribution. Each
must be positive and of length 1. They are called and U below.
ishape Numeric. Optional initial value for the shape parameter. A NULL means a value
is obtained internally. If failure to converge occurs try specifying a value, e.g., 1
or 2.
imethod See CommonVGAMffArguments for information. If failure to converge occurs
then try specifying a value for ishape.
paretoff 557

Details
A random variable Y has a Pareto distribution if

P [Y > y] = C/y k

for some positive k and C. This model is important in many applications due to the power law
probability tail, especially for large values of y.
The Pareto distribution, which is used a lot in economics, has a probability density function that can
be written
f (y; , k) = kk /y k+1
for 0 < < y and 0 < k. The is called the scale parameter, and it is either assumed known or
else min(y) is used. The parameter k is called the shape parameter. The mean of Y is k/(k 1)
provided k > 1. Its variance is 2 k/((k 1)2 (k 2)) provided k > 2.
The upper truncated Pareto distribution has a probability density function that can be written

f (y) = kk /[y k+1 (1 (/U )k )]

for 0 < < y < U < and k > 0. Possibly, better names for k are the index and tail parameters.
Here, and U are known. The mean of Y is kk (U 1k 1k )/[(1 k)(1 (/U )k )].

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, and vgam.

Warning
The usual or unbounded Pareto distribution has two parameters (called and k here) but the family
function paretoff estimates only k using iteratively reweighted least squares. The MLE of the
parameter lies on the boundary and is min(y) where y is the response. Consequently, using the
default argument values, the standard errors are incorrect when one does a summary on the fitted
object. If the user inputs a value for alpha then it is assumed known with this value and then
summary on the fitted object should be correct. Numerical problems may occur for small k, e.g.,
k < 1.

Note
Outside of economics, the Pareto distribution is known as the Bradford distribution.
For paretoff, if the estimate of k is less than or equal to unity then the fitted values will be NAs.
Also, paretoff fits the Pareto(I) distribution. See paretoIV for the more general Pareto(IV/III/II)
distributions, but there is a slight change in notation: s = k and b = .
In some applications the Pareto law is truncated by a natural upper bound on the probability tail.
The upper truncated Pareto distribution has three parameters (called , U and k here) but the family
function truncpareto() estimates only k. With known lower and upper limits, the ML estimator
of k has the usual properties of MLEs. Aban (2006) discusses other inferential details.

Author(s)
T. W. Yee
558 ParetoIV

References

Forbes, C., Evans, M., Hastings, N. and Peacock, B. (2011) Statistical Distributions, Hoboken, NJ,
USA: John Wiley and Sons, Fourth edition.
Aban, I. B., Meerschaert, M. M. and Panorska, A. K. (2006) Parameter estimation for the truncated
Pareto distribution, Journal of the American Statistical Association, 101(473), 270277.

See Also

Pareto, Truncpareto, paretoIV, gpd, benini1.

Examples

alpha <- 2; kay <- exp(3)

pdata <- data.frame(y = rpareto(n = 1000, scale = alpha, shape = kay))
fit <- vglm(y ~ 1, paretoff, data = pdata, trace = TRUE)
fit@extra # The estimate of alpha is here
head(fitted(fit))
with(pdata, mean(y))
coef(fit, matrix = TRUE)
summary(fit) # Standard errors are incorrect!!

# Here, alpha is assumed known

fit2 <- vglm(y ~ 1, paretoff(scale = alpha), data = pdata, trace = TRUE)
fit2@extra # alpha stored here
head(fitted(fit2))
coef(fit2, matrix = TRUE)
summary(fit2) # Standard errors are okay

# Upper truncated Pareto distribution

lower <- 2; upper <- 8; kay <- exp(2)
pdata3 <- data.frame(y = rtruncpareto(n = 100, lower = lower,
upper = upper, shape = kay))
fit3 <- vglm(y ~ 1, truncpareto(lower, upper), data = pdata3, trace = TRUE)
coef(fit3, matrix = TRUE)
c(fit3@misc$lower, fit3@misc$upper)

ParetoIV The Pareto(IV/III/II) Distributions

Description

Density, distribution function, quantile function and random generation for the Pareto(IV/III/II)
distributions.
ParetoIV 559

Usage
dparetoIV(x, location = 0, scale = 1, inequality = 1, shape = 1, log = FALSE)
pparetoIV(q, location = 0, scale = 1, inequality = 1, shape = 1,
lower.tail = TRUE, log.p = FALSE)
qparetoIV(p, location = 0, scale = 1, inequality = 1, shape = 1,
lower.tail = TRUE, log.p = FALSE)
rparetoIV(n, location = 0, scale = 1, inequality = 1, shape = 1)
dparetoIII(x, location = 0, scale = 1, inequality = 1, log = FALSE)
pparetoIII(q, location = 0, scale = 1, inequality = 1,
lower.tail = TRUE, log.p = FALSE)
qparetoIII(p, location = 0, scale = 1, inequality = 1,
lower.tail = TRUE, log.p = FALSE)
rparetoIII(n, location = 0, scale = 1, inequality = 1)
dparetoII(x, location = 0, scale = 1, shape = 1, log = FALSE)
pparetoII(q, location = 0, scale = 1, shape = 1,
lower.tail = TRUE, log.p = FALSE)
qparetoII(p, location = 0, scale = 1, shape = 1,
lower.tail = TRUE, log.p = FALSE)
rparetoII(n, location = 0, scale = 1, shape = 1)
dparetoI(x, scale = 1, shape = 1, log = FALSE)
pparetoI(q, scale = 1, shape = 1,
lower.tail = TRUE, log.p = FALSE)
qparetoI(p, scale = 1, shape = 1,
lower.tail = TRUE, log.p = FALSE)
rparetoI(n, scale = 1, shape = 1)

Arguments
x, q vector of quantiles.
p vector of probabilities.
n number of observations. Same as in runif.
location the location parameter.
scale, shape, inequality
the (positive) scale, inequality and shape parameters.
log Logical. If log = TRUE then the logarithm of the density is returned.
lower.tail, log.p
Same meaning as in pnorm or qnorm.

Details
For the formulas and other details see paretoIV.

Value
Functions beginning with the letter d give the density, functions beginning with the letter p give the
distribution function, functions beginning with the letter q give the quantile function, and functions
beginning with the letter r generates random deviates.
560 paretoIV

Note

The functions [dpqr]paretoI are the same as [dpqr]pareto except for a slight change in notation:
s = k and b = ; see Pareto.

Author(s)

T. W. Yee and Kai Huang

References

Brazauskas, V. (2003) Information matrix for Pareto(IV), Burr, and related distributions. Comm.
Statist. Theory and Methods 32, 315325.
Arnold, B. C. (1983) Pareto Distributions. Fairland, Maryland: International Cooperative Publish-
ing House.

See Also

paretoIV, Pareto.

Examples
## Not run:
x <- seq(-0.2, 4, by = 0.01)
loc <- 0; Scale <- 1; ineq <- 1; shape <- 1.0
plot(x, dparetoIV(x, loc, Scale, ineq, shape), type = "l", col = "blue",
main = "Blue is density, orange is cumulative distribution function",
sub = "Purple are 5,10,...,95 percentiles", ylim = 0:1, las = 1, ylab = "")
abline(h = 0, col = "blue", lty = 2)
Q <- qparetoIV(seq(0.05, 0.95,by = 0.05), loc, Scale, ineq, shape)
lines(Q, dparetoIV(Q, loc, Scale, ineq, shape), col = "purple", lty = 3, type = "h")
lines(x, pparetoIV(x, loc, Scale, ineq, shape), col = "orange")
abline(h = 0, lty = 2)

## End(Not run)

paretoIV Pareto(IV/III/II) Distribution Family Functions

Description

Estimates three of the parameters of the Pareto(IV) distribution by maximum likelihood estimation.
Some special cases of this distribution are also handled.
paretoIV 561

Usage
paretoIV(location = 0, lscale = "loge", linequality = "loge", lshape = "loge",
iscale = 1, iinequality = 1, ishape = NULL, imethod = 1)
paretoIII(location = 0, lscale = "loge", linequality = "loge",
iscale = NULL, iinequality = NULL)
paretoII(location = 0, lscale = "loge", lshape = "loge",
iscale = NULL, ishape = NULL)

Arguments
location Location parameter, called a below. It is assumed known.
lscale, linequality, lshape
Parameter link functions for the scale parameter (called b below), inequality
parameter (called g below), and shape parameter (called s below). See Links
for more choices. A log link is the default for all because all these parameters
are positive.
iscale, iinequality, ishape
Initial values for the parameters. A NULL value means that it is obtained inter-
nally. If convergence failure occurs, use these arguments to input some alterna-
tive initial values.
imethod Method of initialization for the shape parameter. Currently only values 1 and 2
are available. Try the other value if convergence failure occurs.

Details
The Pareto(IV) distribution, which is used in actuarial science, economics, finance and telecommu-
nications, has a cumulative distribution function that can be written

F (y) = 1 [1 + ((y a)/b)1/g ]s

for y > a, b > 0, g > 0 and s > 0. The a is called the location parameter, b the scale parameter, g
the inequality parameter, and s the shape parameter.
The location parameter is assumed known otherwise the Pareto(IV) distribution will not be a regular
family. This assumption is not too restrictive in modelling because in typical applications this
parameter is known, e.g., in insurance and reinsurance it is pre-defined by a contract and can be
represented as a deductible or a retention level.
The inequality parameter is so-called because of its interpretation in the economics context. If
we choose a unit shape parameter value and a zero location parameter value then the inequality
parameter is the Gini index of inequality, provided g 1.
The fitted values are currently the median, e.g., qparetoIV is used for paretoIV().
There are a number of special cases of the Pareto(IV) distribution. These include the Pareto(I),
Pareto(II), Pareto(III), and Burr family of distributions. Denoting P IV (a, b, g, s) as the Pareto(IV)
distribution, the Burr distribution Burr(b, g, s) is P IV (a = 0, b, 1/g, s), the Pareto(III) distribu-
tion P III(a, b, g) is P IV (a, b, g, s = 1), the Pareto(II) distribution P II(a, b, s) is P IV (a, b, g =
1, s), and the Pareto(I) distribution P I(b, s) is P IV (b, b, g = 1, s). Thus the Burr distribution can
be fitted using the negloge link function and using the default location=0 argument. The Pareto(I)
distribution can be fitted using paretoff but there is a slight change in notation: s = k and b = .
562 paretoIV

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, and vgam.

Warning
The Pareto(IV) distribution is very general, for example, special cases include the Pareto(I), Pareto(II),
Pareto(III), and Burr family of distributions. [Johnson et al. (1994) says on p.19 that fitting Type
IV by ML is very difficult and rarely attempted]. Consequently, reasonably good initial values are
recommended, and convergence to a local solution may occur. For this reason setting trace=TRUE
is a good idea for monitoring the convergence. Large samples are ideally required to get reasonable
results.

Note
The extra slot of the fitted object has a component called "location" which stores the location
parameter value(s).

Author(s)
T. W. Yee

References
Johnson N. L., Kotz S., and Balakrishnan N. (1994) Continuous Univariate Distributions, Volume
1, 2nd ed. New York: Wiley.
Brazauskas, V. (2003) Information matrix for Pareto(IV), Burr, and related distributions. Comm.
Statist. Theory and Methods 32, 315325.
Arnold, B. C. (1983) Pareto Distributions. Fairland, Maryland: International Cooperative Publish-
ing House.

Perks The Perks Distribution

Description

Density, cumulative distribution function, quantile function and random generation for the Perks
distribution.

Usage

dperks(x, scale = 1, shape, log = FALSE)

pperks(q, scale = 1, shape, lower.tail = TRUE, log.p = FALSE)
qperks(p, scale = 1, shape, lower.tail = TRUE, log.p = FALSE)
rperks(n, scale = 1, shape)

Arguments

x, q vector of quantiles.
p vector of probabilities.
n number of observations. Same as in runif.
log Logical. If log = TRUE then the logarithm of the density is returned.
lower.tail, log.p
Same meaning as in pnorm or qnorm.
shape, scale positive shape and scale parameters.

Details

See perks for details.

Value

dperks gives the density, pperks gives the cumulative distribution function, qperks gives the quan-
tile function, and rperks generates random deviates.

Author(s)

T. W. Yee and Kai Huang

See Also

perks.
564 perks

Examples
probs <- seq(0.01, 0.99, by = 0.01)
Shape <- exp(-1.0); Scale <- exp(1);
max(abs(pperks(qperks(p = probs, Shape, Scale),
Shape, Scale) - probs)) # Should be 0

## Not run: x <- seq(-0.1, 07, by = 0.01);

plot(x, dperks(x, Shape, Scale), type = "l", col = "blue", las = 1,
main = "Blue is density, orange is cumulative distribution function",
sub = "Purple lines are the 10,20,...,90 percentiles",
ylab = "", ylim = 0:1)
abline(h = 0, col = "blue", lty = 2)
lines(x, pperks(x, Shape, Scale), col = "orange")
probs <- seq(0.1, 0.9, by = 0.1)
Q <- qperks(probs, Shape, Scale)
lines(Q, dperks(Q, Shape, Scale), col = "purple", lty = 3, type = "h")
pperks(Q, Shape, Scale) - probs # Should be all zero
abline(h = probs, col = "purple", lty = 3)
## End(Not run)

perks Perks Distribution Family Function

Description

Maximum likelihood estimation of the 2-parameter Perks distribution.

Usage

perks(lscale = "loge", lshape = "loge",

iscale = NULL, ishape = NULL,
gscale = exp(-5:5), gshape = exp(-5:5),
nsimEIM = 500, oim.mean = FALSE, zero = NULL, nowarning = FALSE)

Arguments

nowarning Logical. Suppress a warning? Ignored for VGAM 0.9-7 and higher.
lscale, lshape Parameter link functions applied to the shape parameter shape, scale parameter
scale. All parameters are treated as positive here See Links for more choices.
iscale, ishape Optional initial values. A NULL means a value is computed internally.
gscale, gshape See CommonVGAMffArguments.
nsimEIM, zero See CommonVGAMffArguments.
oim.mean To be currently ignored.
perks 565

Details
The Perks distribution has cumulative distribution function
1/
1+
F (y; , ) = 1
1 + ey

which leads to a probability density function

1/
f (y; , ) = [1 + ] ey /(1 + ey )1+1/

for > 0, > 0, y > 0. Here, is called the scale parameter scale, and is called a shape
parameter. The moments for this distribution do not appear to be available in closed form.
Simulated Fisher scoring is used and multiple responses are handled.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, and vgam.

Warning
A lot of care is needed because this is a rather difficult distribution for parameter estimation. If
the self-starting initial values fail then try experimenting with the initial value arguments, espe-
cially iscale. Successful convergence depends on having very good initial values. Also, monitor
convergence by setting trace = TRUE.

Author(s)
T. W. Yee

References
Perks, W. (1932) On some experiments in the graduation of mortality statistics. Journal of the
Institute of Actuaries, 63, 1240.
Richards, S. J. (2012) A handbook of parametric survival models for actuarial use. Scandinavian
Actuarial Journal. 125.

fit1 <- vglm(y1 ~ 1, perks, data = pdata, trace = TRUE)

coef(fit1, matrix = TRUE)
summary(fit1)

## End(Not run)

perspqrrvglm Perspective plot for QRR-VGLMs

Description
Produces a perspective plot for a CQO model (QRR-VGLM). It is only applicable for rank-1 or
rank-2 models with argument noRRR = ~ 1.

Usage
perspqrrvglm(x, varI.latvar = FALSE, refResponse = NULL, show.plot = TRUE,
xlim = NULL, ylim = NULL, zlim = NULL,
gridlength = if (Rank == 1) 301 else c(51,51),
which.species = NULL,
xlab = if (Rank == 1) "Latent Variable" else "Latent Variable 1",
ylab = if (Rank == 1) "Expected Value" else "Latent Variable 2",
zlab = "Expected value", labelSpecies = FALSE,
stretch = 1.05, main = "", ticktype = "detailed",
col = if (Rank == 1) par()$col else "white",
llty = par()$lty, llwd = par()$lwd,
add1 = FALSE, ...)

Arguments
x Object of class "qrrvglm", i.e., a constrained quadratic ordination (CQO) ob-
ject.
varI.latvar Logical that is fed into Coef.qrrvglm.
refResponse Integer or character that is fed into Coef.qrrvglm.
show.plot Logical. Plot it?
xlim, ylim Limits of the x- and y-axis. Both are numeric of length 2. See par.
zlim Limits of the z-axis. Numeric of length 2. Ignored if rank is 1. See par.
gridlength Numeric. The fitted values are evaluated on a grid, and this argument regulates
the fineness of the grid. If Rank = 2 then the argument is recycled to length
2, and the two numbers are the number of grid points on the x- and y-axes
respectively.
which.species Numeric or character vector. Indicates which species are to be plotted. The
default is to plot all of them. If numeric, it should contain values in the set
{1,2,. . . ,S} where S is the number of species.
perspqrrvglm 567

xlab, ylab Character caption for the x-axis and y-axis. By default, a suitable caption is
found. See the xlab argument in plot or title.
zlab Character caption for the z-axis. Used only if Rank = 2. By default, a suitable
caption is found. See the xlab argument in plot or title.
labelSpecies Logical. Whether the species should be labelled with their names. Used for
Rank = 1 only. The position of the label is just above the species maximum.
stretch Numeric. A value slightly more than 1, this argument adjusts the height of the
y-axis. Used for Rank = 1 only.
main Character, giving the title of the plot. See the main argument in plot or title.
ticktype Tick type. Used only if Rank = 2. See persp for more information.
col Color. See persp for more information.
llty Line type. Rank-1 models only. See the lty argument of par.
llwd Line width. Rank-1 models only. See the lwd argument of par.
add1 Logical. Add to an existing plot? Used only for rank-1 models.
... Arguments passed into persp. Useful arguments here include theta and phi,
which control the position of the eye.

Details

For a rank-1 model, a perspective plot is similar to lvplot.qrrvglm but plots the curves along a
fine grid and there is no rugplot to show the site scores.
For a rank-2 model, a perspective plot has the first latent variable as the x-axis, the second latent
variable as the y-axis, and the expected value (fitted value) as the z-axis. The result of a CQO is
that each species has a response surface with elliptical contours. This function will, at each grid
point, work out the maximum fitted value over all the species. The resulting response surface is
plotted. Thus rare species will be obscured and abundant species will dominate the plot. To view
rare species, use the which.species argument to select a subset of the species.
A perspective plot will be performed if noRRR = ~ 1, and Rank = 1 or 2. Also, all the tolerance
matrices of those species to be plotted must be positive-definite.

Value

For a rank-2 model, a list with the following components.

fitted A (G1 G2 ) by M matrix of fitted values on the grid. Here, G1 and G2 are the
two values of gridlength.
latvar1grid, latvar2grid
The grid points for the x-axis and y-axis.
max.fitted A G1 by G2 matrix of maximum of the fitted values over all species. These are
the values that are plotted on the z-axis.

For a rank-1 model, the components latvar2grid and max.fitted are NULL.
568 perspqrrvglm

Note
Yee (2004) does not refer to perspective plots. Instead, contour plots via lvplot.qrrvglm are used.
For rank-1 models, a similar function to this one is lvplot.qrrvglm. It plots the fitted values at
the actual site score values rather than on a fine grid here. The result has the advantage that the user
sees the curves as a direct result from a model fitted to data whereas here, it is easy to think that the
smooth bell-shaped curves are the truth because the data is more of a distance away.

Author(s)
Thomas W. Yee

References
Yee, T. W. (2004) A new technique for maximum-likelihood canonical Gaussian ordination. Eco-
logical Monographs, 74, 685701.

See Also
persp, cqo, Coef.qrrvglm, lvplot.qrrvglm, par, title.

Examples
## Not run:
hspider[, 1:6] <- scale(hspider[, 1:6]) # Good idea when I.tolerances = TRUE
set.seed(111)
r1 <- cqo(cbind(Alopacce, Alopcune, Alopfabr, Arctlute, Arctperi,
Auloalbi, Pardmont, Pardnigr, Pardpull, Trocterr) ~
WaterCon + BareSand + FallTwig + CoveMoss + CoveHerb + ReflLux,
poissonff, data = hspider, trace = FALSE, I.tolerances = TRUE)
set.seed(111) # r2 below is an ill-conditioned model
r2 <- cqo(cbind(Alopacce, Alopcune, Alopfabr, Arctlute, Arctperi,
Auloalbi, Pardmont, Pardnigr, Pardpull, Trocterr) ~
WaterCon + BareSand + FallTwig + CoveMoss + CoveHerb + ReflLux,
isd.lv = c(2.4, 1.0), Muxfactor = 3.0, trace = FALSE,
poissonff, data = hspider, Rank = 2, eq.tolerances = TRUE)

sort(deviance(r1, history = TRUE)) # A history of all the fits

sort(deviance(r2, history = TRUE)) # A history of all the fits
if (deviance(r2) > 857) stop("suboptimal fit obtained")

persp(r1, xlim = c(-6, 5), col = 1:4, label = TRUE)

# Involves all species

persp(r2, xlim = c(-6, 5), ylim = c(-4, 5), theta = 10, phi = 20, zlim = c(0, 220))
# Omit the two dominant species to see what is behind them
persp(r2, xlim = c(-6, 5), ylim = c(-4, 5), theta = 10, phi = 20, zlim = c(0, 220),
which = (1:10)[-c(8, 10)]) # Use zlim to retain the original z-scale

## End(Not run)
pgamma.deriv 569

pgamma.deriv Derivatives of the Incomplete Gamma Integral

Description
The first two derivatives of the incomplete gamma integral.

Usage
pgamma.deriv(q, shape, tmax = 100)

Arguments
q, shape As in pgamma but these must be vectors of positive values only and finite.
tmax Maximum number of iterations allowed in the computation (per q value).

Details
Write x = q and shape = a. The first and second derivatives with respect to q and a are returned.
This function is similar in spirit to pgamma; define
Z x
1
P (a, x) = ta1 et dt
(a) 0

so that P (a, x) is pgamma(x, a). Currently a 6-column matrix is returned (in the future this may
change and an argument may be supplied so that only what is required by the user is computed.)
The computations use a series expansion for a x 1 or or x < a, else otherwise a continued
fraction expansion. Machine overflow can occur for large values of x when x is much greater than
a.

Value
The first 5 columns, running from left to right, are the derivatives with respect to: x, x2 , a, a2 , xa.
The 6th column is P (a, x) (but it is not as accurate as calling pgamma directly).

Note
If convergence does not occur then try increasing the value of tmax.
Yet to do: add more arguments to give greater flexibility in the accuracy desired and to compute
only quantities that are required by the user.

Author(s)
T. W. Yee wrote the wrapper function to the Fortran subroutine written by R. J. Moore. The subrou-
tine was modified to run using double precision. The original code came from http://lib.stat.cmu.edu/apstat/187.
but this website has since become stale.
570 pgamma.deriv.unscaled

References
Moore, R. J. (1982) Algorithm AS 187: Derivatives of the Incomplete Gamma Integral. Journal of
the Royal Statistical Society, Series C (Applied Statistics), 31(3), 330335.

pgamma.deriv.unscaled Derivatives of the Incomplete Gamma Integral (Unscaled Version)

Description
The first two derivatives of the incomplete gamma integral with scaling.

Usage
pgamma.deriv.unscaled(q, shape)

Arguments
q, shape As in pgamma and pgamma.deriv but these must be vectors of positive values
only and finite.

Details
Define Z x
G(x, a) = ta1 et dt
0

so that G(x, a) is pgamma(x, a) * gamma(a). Write x = q and shape = a. The 0th and first
and second derivatives with respect to a of G are returned. This function is similar in spirit to
pgamma.deriv but here there is no gamma function to scale things. Currently a 3-column matrix
is returned (in the future this may change and an argument may be supplied so that only what is
required by the user is computed.) This function is based on Wingo (1989).
plotdeplot.lmscreg 571

Value
The 3 columns, running from left to right, are the 0:2th derivatives with respect to a.

Warning
These function seems inaccurate for q = 1 and q = 2; see the plot below.

Author(s)
T. W. Yee.

References
See truncweibull.

## Not run: par(mfrow = c(1, 3))

for (jay in 1:3) {
plot(aa, ans.u[, jay], type = "l", col = "blue", cex.lab = 1.5,
cex.axis = 1.5, las = 1, main = colnames(ans.u)[jay],
log = "", xlab = "shape", ylab = "")
abline(h = 0, v = 1:2, lty = "dashed", col = "gray") # Inaccurate at 1 and 2
}

## End(Not run)

plotdeplot.lmscreg Density Plot for LMS Quantile Regression

Description
Plots a probability density function associated with a LMS quantile regression.

Usage
plotdeplot.lmscreg(answer, y.arg, add.arg = FALSE,
xlab = "", ylab = "density", xlim = NULL, ylim = NULL,
llty.arg = par()$lty, col.arg = par()$col,
llwd.arg = par()$lwd, ...)
572 plotdeplot.lmscreg

Arguments
answer Output from functions of the form deplot.??? where ??? is the name of the
VGAM LMS family function, e.g., lms.yjn. See below for details.
y.arg Numerical vector. The values of the response variable at which to evaluate the
density. This should be a grid that is fine enough to ensure the plotted curves are
smooth.
add.arg Logical. Add the density to an existing plot?
xlab, ylab Caption for the x- and y-axes. See par.
xlim, ylim Limits of the x- and y-axes. See par.
llty.arg Line type. See the lty argument of par.
col.arg Line color. See the col argument of par.
llwd.arg Line width. See the lwd argument of par.
... Arguments passed into the plot function when setting up the entire plot. Useful
arguments here include main and las.

Details
The above graphical parameters offer some flexibility when plotting the quantiles.

Value
The list answer, which has components

newdata The argument newdata above from the argument list of deplot.lmscreg, or a
one-row data frame constructed out of the x0 argument.
y The argument y.arg above.
density Vector of the density function values evaluated at y.arg.

Note
While the graphical arguments of this function are useful to the user, this function should not be
called directly.

Author(s)
Thomas W. Yee

References
Yee, T. W. (2004) Quantile regression via vector generalized additive models. Statistics in Medicine,
23, 22952315.

See Also
deplot.lmscreg.
plotqrrvglm 573

Examples
fit <- vgam(BMI ~ s(age, df = c(4,2)), lms.bcn(zero = 1), bmi.nz)
## Not run: y = seq(15, 43, by = 0.25)
deplot(fit, x0 = 20, y = y, xlab = "BMI", col = "green", llwd = 2,
main = "BMI distribution at ages 20 (green), 40 (blue), 60 (orange)")
deplot(fit, x0 = 40, y = y, add = TRUE, col = "blue", llwd = 2)
deplot(fit, x0 = 60, y = y, add = TRUE, col = "orange", llwd = 2) -> aa

names(aa@post$deplot)
aa@post$deplot$newdata
head(aa@post$deplot$y)
head(aa@post$deplot$density)
## End(Not run)

plotqrrvglm Model Diagnostic Plots for QRR-VGLMs

Description
The residuals of a QRR-VGLM are plotted for model diagnostic purposes.

Usage
plotqrrvglm(object, rtype = c("response", "pearson", "deviance", "working"),
ask = FALSE,
main = paste(Rtype, "residuals vs latent variable(s)"),
xlab = "Latent Variable",
I.tolerances = object@control$eq.tolerances, ...)

Arguments
object An object of class "qrrvglm".
rtype Character string giving residual type. By default, the first one is chosen.
ask Logical. If TRUE, the user is asked to hit the return key for the next plot.
main Character string giving the title of the plot.
xlab Character string giving the x-axis caption.
I.tolerances Logical. This argument is fed into Coef(object, I.tolerances = I.tolerances).
... Other plotting arguments (see par).

Details
Plotting the residuals can be potentially very useful for checking that the model fit is adequate.

Value
The original object.
574 plotqtplot.lmscreg

Note
An ordination plot of a QRR-VGLM can be obtained by lvplot.qrrvglm.

Author(s)
Thomas W. Yee

References
Yee, T. W. (2004) A new technique for maximum-likelihood canonical Gaussian ordination. Eco-
logical Monographs, 74, 685701.

plotqtplot.lmscreg Quantile Plot for LMS Quantile Regression

Description
Plots the quantiles associated with a LMS quantile regression.

Usage
plotqtplot.lmscreg(fitted.values, object, newdata = NULL,
percentiles = object@misc$percentiles, lp = NULL,
add.arg = FALSE, y = if (length(newdata)) FALSE else TRUE,
spline.fit = FALSE, label = TRUE, size.label = 0.06,
xlab = NULL, ylab = "",
pch = par()$pch, pcex = par()$cex, pcol.arg = par()$col,
plotqtplot.lmscreg 575

xlim = NULL, ylim = NULL,

llty.arg = par()$lty, lcol.arg = par()$col, llwd.arg = par()$lwd,
tcol.arg = par()$col, tadj = 1, ...)

Arguments
fitted.values Matrix of fitted values.
object A VGAM quantile regression model, i.e., an object produced by modelling func-
tions such as vglm and vgam with a family function beginning with "lms.", e.g.,
lms.yjn.
newdata Data frame at which predictions are made. By default, the original data are used.
percentiles Numerical vector with values between 0 and 100 that specify the percentiles
(quantiles). The default is to use the percentiles when fitting the model. For
example, the value 50 corresponds to the median.
lp Length of percentiles.
add.arg Logical. Add the quantiles to an existing plot?
y Logical. Add the response as points to the plot?
spline.fit Logical. Add a spline curve to the plot?
label Logical. Add the percentiles (as text) to the plot?
size.label Numeric. How much room to leave at the RHS for the label. It is in percent (of
the range of the primary variable).
xlab Caption for the x-axis. See par.
ylab Caption for the x-axis. See par.
pch Plotting character. See par.
pcex Character expansion of the points. See par.
pcol.arg Color of the points. See the col argument of par.
xlim Limits of the x-axis. See par.
ylim Limits of the y-axis. See par.
llty.arg Line type. Line type. See the lty argument of par.
lcol.arg Color of the lines. See the col argument of par.
llwd.arg Line width. See the lwd argument of par.
tcol.arg Color of the text (if label is TRUE). See the col argument of par.
tadj Text justification. See the adj argument of par.
... Arguments passed into the plot function when setting up the entire plot. Useful
arguments here include main and las.

Details
The above graphical parameters offer some flexibility when plotting the quantiles.

Value
The matrix of fitted values.
576 plotrcim0

Note
While the graphical arguments of this function are useful to the user, this function should not be
called directly.

Author(s)
Thomas W. Yee

References
Yee, T. W. (2004) Quantile regression via vector generalized additive models. Statistics in Medicine,
23, 22952315.

plotrcim0 Main Effects Plot for a Row-Column Interaction Model (RCIM)

Description
Produces a main effects plot for Row-Column Interaction Models (RCIMs).

Usage
plotrcim0(object, centered = TRUE, which.plots = c(1, 2),
hline0 = TRUE, hlty = "dashed", hcol = par()$col, hlwd = par()$lwd,
rfirst = 1, cfirst = 1,
rtype = "h", ctype = "h",
rcex.lab = 1, rcex.axis = 1, rtick = FALSE,
ccex.lab = 1, ccex.axis = 1, ctick = FALSE,
rmain = "Row effects", rsub = "",
rxlab = "", rylab = "Row effects",
cmain = "Column effects", csub = "",
cxlab= "", cylab = "Column effects",
rcol = par()$col, ccol = par()$col,
no.warning = FALSE, ...)
plotrcim0 577

Arguments
object An rcim object. This should be of rank-0, i.e., main effects only and no interac-
tions.
which.plots Numeric, describing which plots are to be plotted. The row effects plot is 1 and
the column effects plot is 2. Set the value 0, say, for no plots at all.
centered Logical. If TRUE then the row and column effects are centered (but not scaled)
by scale. If FALSE then the raw effects are used (of which the first are zero by
definition).
hline0, hlty, hcol, hlwd
hline0 is logical. If TRUE then a horizontal line is plotted at 0 and the other
arguments describe this line. Probably having hline0 = TRUE only makes
sense when centered = TRUE.
rfirst, cfirst rfirst is the level of row that is placed first in the row effects plot, etc.
rmain, cmain Character. rmain is the main label in the row effects plot, etc.
rtype, ctype, rsub, csub
See the type and sub arguments of plot.
rxlab, rylab, cxlab, cylab
Character. For the row effects plot, rxlab is xlab and rylab is ylab; see par.
Ditto for cxlab and cylab for the column effects plot.
rcex.lab, ccex.lab
Numeric. rcex.lab is cex for the row effects plot label, etc.
rcex.axis, ccex.axis
Numeric. rcex.axis is the cex argument for the row effects axis label, etc.
rtick, ctick Logical. If rtick = TRUE then add ticks to the row effects plot, etc.
rcol, ccol rcol give a colour for the row effects plot, etc.
no.warning Logical. If TRUE then no warning is issued if the model is not rank-0.
... Arguments fed into both plot calls.

Details
This function plots the row and column effects of a rank-0 RCIM. As the result is a main effects
plot of a regression analysis, its interpretation when centered = FALSE is relative to the baseline
(reference level) of a row and column, and should also be considered in light of the link function
used. Many arguments that start with "r" refer to the row effects plot, and "c" for the column
effects plot.

Value
The original object with the post slot assigned additional information from the plot.

Note
This function should be only used to plot the object of rank-0 RCIM. If the rank is positive then it
will issue a warning.
Using an argument ylim will mean the row and column effects are plotted on a common scale; see
plot.window.
578 plotvgam

Author(s)
T. W. Yee, A. F. Hadi.

# Negative binomial example

## Not run:
fit1 <- rcim(alcoff.e, negbinomial, trace = TRUE)
plot(fit1, ylim = c(-2, 2))
## End(Not run)

# Univariate normal example

fit2 <- rcim(alcoff.e, uninormal, trace = TRUE)
## Not run: plot(fit2, ylim = c(-200, 400))

# Median-polish example
## Not run:
fit3 <- rcim(alcoff.e, alaplace1(tau = 0.5), maxit = 1000, trace = FALSE)
plot(fit3, ylim = c(-200, 250))
## End(Not run)

# Zero-inflated Poisson example on "crashp" (no 0s in alcoff)

## Not run:
cbind(rowSums(crashp)) # Easy to see the data
cbind(colSums(crashp)) # Easy to see the data
fit4 <- rcim(Rcim(crashp, rbaseline = "5", cbaseline = "Sun"),
zipoissonff, trace = TRUE)
plot(fit4, ylim = c(-3, 3))
## End(Not run)

plotvgam Default VGAM Plotting

plotvgam 579

Description
Component functions of a vgam-class object can be plotted with plotvgam(). These are on the
scale of the linear/additive predictor.

Usage
plotvgam(x, newdata = NULL, y = NULL, residuals = NULL,
rugplot = TRUE, se = FALSE, scale = 0, raw = TRUE,
offset.arg = 0, deriv.arg = 0, overlay = FALSE,
type.residuals = c("deviance", "working", "pearson", "response"),
plot.arg = TRUE, which.term = NULL, which.cf = NULL,
control = plotvgam.control(...), varxij = 1, ...)

Arguments
x A fitted VGAM object, e.g., produced by vgam, vglm, or rrvglm.
newdata Data frame. May be used to reconstruct the original data set.
y Unused.
residuals Logical. If TRUE then residuals are plotted. See type.residuals
rugplot Logical. If TRUE then a rug plot is plotted at the foot of each plot. These values
are jittered to expose ties.
se Logical. If TRUE then approximate 2 pointwise standard error bands are in-
cluded in the plot.
scale Numerical. By default, each plot will have its own y-axis scale. However, by
specifying a value, each plots y-axis scale will be at least scale wide.
raw Logical. If TRUE then the smooth functions are those obtained directly by the al-
gorithm, and are plotted without having to premultiply with the constraint matri-
ces. If FALSE then the smooth functions have been premultiply by the constraint
matrices. The raw argument is directly fed into predict.vgam().
offset.arg Numerical vector of length r. These are added to the component functions.
Useful for separating out the functions when overlay is TRUE. If overlay is
TRUE and there is one covariate then using the intercept values as the offsets can
be a good idea.
deriv.arg Numerical. The order of the derivative. Should be assigned an small integer
such as 0, 1, 2. Only applying to s() terms, it plots the derivative.
overlay Logical. If TRUE then component functions of the same covariate are overlaid
on each other. The functions are centered, so offset.arg can be useful when
overlay is TRUE.
type.residuals if residuals is TRUE then the first possible value of this vector, is used to specify
the type of residual.
plot.arg Logical. If FALSE then no plot is produced.
which.term Character or integer vector containing all terms to be plotted, e.g., which.term = c("s(age)", "s(heigh
or which.term = c(2, 5, 9). By default, all are plotted.
which.cf An integer-valued vector specifying which linear/additive predictors are to be
plotted. The values must be from the set {1,2,. . . ,r}. By default, all are plotted.
580 plotvgam

control Other control parameters. See plotvgam.control.

... Other arguments that can be fed into plotvgam.control. This includes line
colors, line widths, line types, etc.
varxij Positive integer. Used if xij of vglm.control was used, this chooses which
inner argument the component is plotted against. This argument is related to
raw = TRUE and terms such as NS(dum1, dum2) and constraint matrices that
have more than one column. The default would plot the smooth against dum1
but setting varxij = 2 could mean plotting the smooth against dum2. See the
VGAM website for further information.

Details
In this help file M is the number of linear/additive predictors, and r is the number of columns of
the constraint matrix of interest.
Many of plotvgam()s options can be found in plotvgam.control, e.g., line types, line widths,
colors.

Value
The original object, but with the preplot slot of the object assigned information regarding the plot.

Note
While plot(fit) will work if class(fit) is "vgam", it is necessary to use plotvgam(fit) ex-
plicitly otherwise.
plotvgam() is quite buggy at the moment.

Author(s)
Thomas W. Yee

See Also
vgam, plotvgam.control, predict.vgam, plotvglm, vglm.

Examples
coalminers <- transform(coalminers, Age = (age - 42) / 5)
fit <- vgam(cbind(nBnW, nBW, BnW, BW) ~ s(Age),
binom2.or(zero = NULL), data = coalminers)
## Not run: par(mfrow = c(1,3))
plot(fit, se = TRUE, ylim = c(-3, 2), las = 1)
plot(fit, se = TRUE, which.cf = 1:2, lcol = "blue", scol = "orange",
ylim = c(-3, 2))
plot(fit, se = TRUE, which.cf = 1:2, lcol = "blue", scol = "orange",
overlay = TRUE)
## End(Not run)
plotvgam.control 581

plotvgam.control Control Function for plotvgam()

Description
Provides default values for many arguments available for plotvgam().

Usage
plotvgam.control(which.cf = NULL,
xlim = NULL, ylim = NULL, llty = par()$lty,
slty = "dashed", pcex = par()$cex,
pch = par()$pch, pcol = par()$col,
lcol = par()$col, rcol = par()$col,
scol = par()$col, llwd = par()$lwd, slwd = par()$lwd,
add.arg = FALSE, one.at.a.time = FALSE,
.include.dots = TRUE, noxmean = FALSE,
shade = FALSE, shcol = "gray80", ...)

Arguments
which.cf Integer vector specifying which component functions are to be plotted (for each
covariate). Must have values from the set {1,2,. . . ,M }.
xlim Range for the x-axis.
ylim Range for the y-axis.
llty Line type for the fitted functions (lines). Fed into par(lty).
slty Line type for the standard error bands. Fed into par(lty).
pcex Character expansion for the points (residuals). Fed into par(cex).
pch Character used for the points (residuals). Same as par(pch).
pcol Color of the points. Fed into par(col).
lcol Color of the fitted functions (lines). Fed into par(col).
rcol Color of the rug plot. Fed into par(col).
scol Color of the standard error bands. Fed into par(col).
llwd Line width of the fitted functions (lines). Fed into par(lwd).
slwd Line width of the standard error bands. Fed into par(lwd).
add.arg Logical. If TRUE then the plot will be added to an existing plot, otherwise a new
plot will be made.
one.at.a.time Logical. If TRUE then the plots are done one at a time, with the user having to
hit the return key between the plots.
.include.dots Not to be used by the user.
noxmean Logical. If TRUE then the point at the mean of x, which is added when standard
errors are specified and it thinks the function is linear, is not added. One might
use this argument if ylab is specified.
582 plotvglm

shade, shcol shade is logical; if TRUE then the pointwise SE band is shaded gray by default.
The colour can be adjusted by setting shcol. These arguments are ignored un-
less se = TRUE and overlay = FALSE; If shade = TRUE then scol is ignored.
... Other arguments that may be fed into par(). In the above, M is the number of
linear/additive predictors.

Details
The most obvious features of plotvgam can be controlled by the above arguments.

Value
A list with values matching the arguments.

Author(s)
Thomas W. Yee

References
Yee, T. W. and Wild, C. J. (1996) Vector generalized additive models. Journal of the Royal Statisti-
cal Society, Series B, Methodological, 58, 481493.

plotvglm Plots for VGLMs

Description
Currently this function plots the Pearson residuals versus the linear predictors (M plots) and plots
the Pearson residuals versus the hat values (M plots).

Usage
plotvglm(x, which = "(All)", ...)

Arguments
x An object of class "vglm" (see vglm-class) or inherits from that class.
which If a subset of the plots is required, specify a subset of the numbers 1:(2*M). The
default is to plot them all.
... Arguments fed into the primitive plot functions.
pneumo 583

Details
This function is under development. Currently it plots the Pearson residuals against the predicted
values (on the transformed scale) and the hat values. There are 2M plots in total, therefore users
should call par to assign, e.g., the mfrow argument. Note: Section 3.7 of Yee (2015) describes the
Pearson residuals and hat values for VGLMs.

Value
Returns the object invisibly.

Author(s)
T. W. Yee

See Also
plotvgam, plotvgam.control, vglm.

Examples
## Not run:
ndata <- data.frame(x2 = runif(nn <- 200))
ndata <- transform(ndata, y1 = rnbinom(nn, mu = exp(3+x2), size = exp(1)))
fit1 <- vglm(y1 ~ x2, negbinomial, data = ndata, trace = TRUE)
coef(fit1, matrix = TRUE)
par(mfrow = c(2, 2))
plot(fit1)

# Manually produce the four plots

plot(fit1, which = 1, col = "blue", las = 1, main = "main1")
abline(h = 0, lty = "dashed", col = "gray50")
plot(fit1, which = 2, col = "blue", las = 1, main = "main2")
abline(h = 0, lty = "dashed", col = "gray50")
plot(fit1, which = 3, col = "blue", las = 1, main = "main3")
plot(fit1, which = 4, col = "blue", las = 1, main = "main4")

## End(Not run)

pneumo Pneumoconiosis in Coalminers Data

Description
The pneumo data frame has 8 rows and 4 columns. Exposure time is explanatory, and there are 3
ordinal response variables.

Usage
data(pneumo)
584 poisson.points

Format
This data frame contains the following columns:

exposure.time a numeric vector, in years

normal a numeric vector, counts
mild a numeric vector, counts
severe a numeric vector, counts

Details
These were collected from coalface workers. In the original data set, the two most severe categories
were combined.

Source
Ashford, J.R., 1959. An approach to the analysis of data for semi-quantal responses in biological
assay. Biometrics, 15, 573581.

References
McCullagh, P. and Nelder, J. A. (1989) Generalized Linear Models, 2nd ed. London: Chapman &
Hall.

poisson.points Poisson-points-on-a-plane/volume Distances Distribution

Description
Estimating the density parameter of the distances from a fixed point to the u-th nearest point, in a
plane or volume.

Usage
poisson.points(ostatistic, dimension = 2, link = "loge",
idensity = NULL, imethod = 1)
poisson.points 585

Arguments
ostatistic Order statistic. A single positive value, usually an integer. For example, the
value 5 means the response are the distances of the fifth nearest value to that
point (usually over many planes or volumes). Non-integers are allowed be-
cause the value 1.5 coincides with maxwell when dimension = 2. Note: if
ostatistic = 1 and dimension = 2 then this VGAM family function coin-
cides with rayleigh.
dimension The value 2 or 3; 2 meaning a plane and 3 meaning a volume.
link Parameter link function applied to the (positive) density parameter, called be-
low. See Links for more choices.
idensity Optional initial value for the parameter. A NULL value means a value is obtained
internally. Use this argument if convergence failure occurs.
imethod An integer with value 1 or 2 which specifies the initialization method for .
If failure to converge occurs try another value and/or else specify a value for
idensity.

Details
Suppose the number of points in any region of area A of the plane is a Poisson random variable with
mean A (i.e., is the density of the points). Given a fixed point P , define D1 , D2 ,. . . to be the
distance to the nearest point to P , second nearest to P , etc. This VGAM family function estimates
since the probability density function for Du is easily derived, u = 1, 2, . . .. Here, u corresponds
to the argument ostatistic.
Similarly, suppose the number of points in any volume V is a Poisson random variable with mean
V where, once again, is the density of the points. This VGAM family function estimates by
specifying the argument ostatistic and using dimension = 3.
The mean of Du is returned as the fitted values. Newton-Raphson is the same as Fisher-scoring.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, rrvglm and vgam.

Warning
Convergence may be slow if the initial values are far from the solution. This often corresponds to
the situation when the response values are all close to zero, i.e., there is a high density of points.
Formulae such as the means have not been fully checked.

Author(s)
T. W. Yee

See Also
poissonff, maxwell, rayleigh.
586 poissonff

Examples
pdata <- data.frame(y = rgamma(10, shape = exp(-1))) # Not proper data!
ostat <- 2
fit <- vglm(y ~ 1, poisson.points(ostat, 2), data = pdata,
trace = TRUE, crit = "coef")
fit <- vglm(y ~ 1, poisson.points(ostat, 3), data = pdata,
trace = TRUE, crit = "coef") # Slow convergence?
fit <- vglm(y ~ 1, poisson.points(ostat, 3, idensi = 1), data = pdata,
trace = TRUE, crit = "coef")
head(fitted(fit))
with(pdata, mean(y))
coef(fit, matrix = TRUE)
Coef(fit)

poissonff Poisson Family Function

Description
Family function for a generalized linear model fitted to Poisson responses. The dispersion parame-
ters may be known or unknown.

Usage
poissonff(link = "loge", dispersion = 1, onedpar = FALSE, imu = NULL,
imethod = 1, parallel = FALSE, zero = NULL, bred = FALSE,
earg.link = FALSE, type.fitted = c("mean", "quantiles"),
percentiles = c(25, 50, 75))

Arguments
link Link function applied to the mean or means. See Links for more choices and
information.
dispersion Dispersion parameter. By default, maximum likelihood is used to estimate the
model because it is known. However, the user can specify dispersion = 0
to have it estimated, or else specify a known positive value (or values if the
response is a matrixone value per column).
onedpar One dispersion parameter? If the response is a matrix, then a separate dispersion
parameter will be computed for each response (column), by default. Setting
onedpar=TRUE will pool them so that there is only one dispersion parameter to
be estimated.
parallel A logical or formula. Used only if the response is a matrix.
imu, imethod See CommonVGAMffArguments for more information.
zero Can be an integer-valued vector specifying which linear/additive predictors are
modelled as intercepts only. The values must be from the set {1,2,. . . ,M }, where
M is the number of columns of the matrix response. See CommonVGAMffArguments
for more information.
poissonff 587

bred, earg.link
Details at CommonVGAMffArguments. Setting bred = TRUE should work for
multiple responses and all VGAM link functions; it has been tested for loge,
identity but further testing is required.
type.fitted, percentiles
Details at CommonVGAMffArguments.

Details

M defined above is the number of linear/additive predictors.

If the dispersion parameter is unknown, then the resulting estimate is not fully a maximum likeli-
hood estimate.
A dispersion parameter that is less/greater than unity corresponds to under-/over-dispersion relative
to the Poisson model. Over-dispersion is more common in practice.
When fitting a Quadratic RR-VGLM (see cqo), the response is a matrix of M , say, columns (e.g.,
one column per species). Then there will be M dispersion parameters (one per column of the
response matrix) if dispersion = 0 and onedpar = FALSE.

Value

An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, vgam, rrvglm, cqo, and cao.

Warning

With multiple responses, assigning a known dispersion parameter for each response is not handled
well yet. Currently, only a single known dispersion parameter is handled well.

Note

This function will handle a matrix response automatically.

The call poissonff(dispersion=0, ...) is equivalent to quasipoissonff(...). The latter was
written so that R users of quasipoisson() would only need to add a ff to the end of the family
function name.
Regardless of whether the dispersion parameter is to be estimated or not, its value can be seen from
the output from the summary() of the object.

Author(s)

Thomas W. Yee

References

McCullagh, P. and Nelder, J. A. (1989) Generalized Linear Models, 2nd ed. London: Chapman &
Hall.
588 PoissonPoints

See Also
Links, quasipoissonff, genpoisson, zipoisson, pospoisson, oipospoisson, otpospoisson,
skellam, mix2poisson, cens.poisson, ordpoisson, amlpoisson, inv.binomial, simulate.vlm,
loge, polf, rrvglm, cqo, cao, binomialff, quasibinomialff, poisson, poisson.points, ruge,
V1.

Examples
poissonff()

set.seed(123)
pdata <- data.frame(x2 = rnorm(nn <- 100))
pdata <- transform(pdata, y1 = rpois(nn, exp(1 + x2)),
y2 = rpois(nn, exp(1 + x2)))
(fit1 <- vglm(cbind(y1, y2) ~ x2, poissonff, data = pdata))
(fit2 <- vglm(y1 ~ x2, poissonff(bred = TRUE), data = pdata))
coef(fit1, matrix = TRUE)
coef(fit2, matrix = TRUE)

nn <- 200
cdata <- data.frame(x2 = rnorm(nn), x3 = rnorm(nn), x4 = rnorm(nn))
cdata <- transform(cdata, lv1 = 0 + x3 - 2*x4)
cdata <- transform(cdata, lambda1 = exp(3 - 0.5 * (lv1-0)^2),
lambda2 = exp(2 - 0.5 * (lv1-1)^2),
lambda3 = exp(2 - 0.5 * ((lv1+4)/2)^2))
cdata <- transform(cdata, y1 = rpois(nn, lambda1),
y2 = rpois(nn, lambda2),
y3 = rpois(nn, lambda3))
## Not run: lvplot(p1, y = TRUE, lcol = 2:4, pch = 2:4, pcol = 2:4, rug = FALSE)

PoissonPoints Poisson Points Distribution

Description
Density for the PoissonPoints distribution.

Usage
dpois.points(x, lambda, ostatistic, dimension = 2, log = FALSE)

Arguments
x vector of quantiles.
lambda the mean density of points.
ostatistic positive values, usually integers.
dimension Either 2 and/or 3.
log Logical; if TRUE, the logarithm is returned.
polf 589

Details
See poisson.points, the VGAM family function for estimating the parameters, for the formula of
the probability density function and other details.

Value
dpois.points gives the density.

See Also
poisson.points, dpois, Maxwell.

Examples
## Not run: lambda <- 1; xvec <- seq(0, 2, length = 400)
plot(xvec, dpois.points(xvec, lambda, ostat = 1, dimension = 2),
type = "l", las = 1, col = "blue",
sub = "First order statistic",
main = paste("PDF of PoissonPoints distribution with lambda = ",
lambda, " and on the plane", sep = ""))
## End(Not run)

polf Poisson-Ordinal Link Function

Description
Computes the Poisson-ordinal transformation, including its inverse and the first two derivatives.

Usage
polf(theta, cutpoint = NULL,
inverse = FALSE, deriv = 0, short = TRUE, tag = FALSE)

Arguments
theta Numeric or character. See below for further details.
cutpoint The cutpoints should be non-negative integers. If polf() is used as the link
function in cumulative then one should choose reverse = TRUE, parallel = TRUE.
inverse, deriv, short, tag
Details at Links.

Details
The Poisson-ordinal link function (POLF) can be applied to a parameter lying in the unit interval.
Its purpose is to link cumulative probabilities associated with an ordinal response coming from an
underlying Poisson distribution. If the cutpoint is zero then a complementary log-log link is used.
See Links for general information about VGAM link functions.
590 polf

Value
See Yee (2012) for details.

Warning
Prediction may not work on vglm or vgam etc. objects if this link function is used.

Note
Numerical values of theta too close to 0 or 1 or out of range result in large positive or negative
values, or maybe 0 depending on the arguments. Although measures have been taken to handle
cases where theta is too close to 1 or 0, numerical instabilities may still arise.
In terms of the threshold approach with cumulative probabilities for an ordinal response this link
function corresponds to the Poisson distribution (see poissonff) that has been recorded as an ordi-
nal response using known cutpoints.

Author(s)
Thomas W. Yee

References
Yee, T. W. (2012) Ordinal ordination with normalizing link functions for count data, (in prepara-
tion).

See Also
Links, ordpoisson, poissonff, nbolf, golf, cumulative.

Examples
## Not run:
polf("p", cutpoint = 2, short = FALSE)
polf("p", cutpoint = 2, tag = TRUE)

p <- seq(0.01, 0.99, by = 0.01)

y <- polf(p, cutpoint = 2)
y. <- polf(p, cutpoint = 2, deriv = 1)
max(abs(polf(y, cutpoint = 2, inv = TRUE) - p)) # Should be 0

#\ dontrun{ par(mfrow = c(2, 1), las = 1)

#plot(p, y, type = "l", col = "blue", main = "polf()")
#abline(h = 0, v = 0.5, col = "orange", lty = "dashed")
#
#plot(p, y., type = "l", col = "blue",
# main = "(Reciprocal of) first POLF derivative")
#}

# Rutherford and Geiger data

ruge <- data.frame(yy = rep(0:14,
Polono 591

times = c(57,203,383,525,532,408,273,139,45,27,10,4,0,1,1)))
with(ruge, length(yy)) # 2608 1/8-minute intervals
cutpoint <- 5
ruge <- transform(ruge, yy01 = ifelse(yy <= cutpoint, 0, 1))
fit <- vglm(yy01 ~ 1, binomialff(link = polf(cutpoint = cutpoint)), ruge)
coef(fit, matrix = TRUE)
exp(coef(fit))

# Another example
pdata <- data.frame(x2 = sort(runif(nn <- 1000)))
pdata <- transform(pdata, x3 = runif(nn))
pdata <- transform(pdata, mymu = exp( 3 + 1 * x2 - 2 * x3))
pdata <- transform(pdata, y1 = rpois(nn, lambda = mymu))
cutpoints <- c(-Inf, 10, 20, Inf)
pdata <- transform(pdata, cuty = Cut(y1, breaks = cutpoints))
#\ dontrun{ with(pdata, plot(x2, x3, col = cuty, pch = as.character(cuty))) }
with(pdata, table(cuty) / sum(table(cuty)))
fit <- vglm(cuty ~ x2 + x3, data = pdata, trace = TRUE,
cumulative(reverse = TRUE,
parallel = TRUE,
link = polf(cutpoint = cutpoints[2:3]),
multiple.responses = TRUE))
head(depvar(fit))
head(fitted(fit))
head(predict(fit))
coef(fit)
coef(fit, matrix = TRUE)
constraints(fit)
fit@misc$earg

## End(Not run)

Polono The Poisson Lognormal Distribution

Description

Density, distribution function and random generation for the Poisson lognormal distribution.

Usage

dpolono(x, meanlog = 0, sdlog = 1, bigx = 170, ...)

ppolono(q, meanlog = 0, sdlog = 1,
isOne = 1 - sqrt( .Machine$double.eps ), ...)
rpolono(n, meanlog = 0, sdlog = 1)
592 Polono

Arguments
x, q vector of quantiles.
n number of observations. If length(n) > 1 then the length is taken to be the
number required.
meanlog, sdlog
the mean and standard deviation of the normal distribution (on the log scale).
They match the arguments in Lognormal.
bigx Numeric. This argument is for handling large values of x and/or when integrate
fails. A first order Taylor series approximation [Equation (7) of Bulmer (1974)]
is used at values of x that are greater or equal to this argument. For bigx = 10,
he showed that the approximation has a relative error less than 0.001 for values
of meanlog and sdlog likely to be encountered in practice. The argument can
be assigned Inf in which case the approximation is not used.
isOne Used to test whether the cumulative probabilities have effectively reached unity.
... Arguments passed into integrate.

Details
The Poisson lognormal distribution is similar to the negative binomial in that it can be motivated by
a Poisson distribution whose mean parameter comes from a right skewed distribution (gamma for
the negative binomial and lognormal for the Poisson lognormal distribution).

Value
dpolono gives the density, ppolono gives the distribution function, and rpolono generates random
deviates.

Note
By default, dpolono involves numerical integration that is performed using integrate. Conse-
quently, computations are very slow and numerical problems may occur (if so then the use of ...
may be needed). Alternatively, for extreme values of x, meanlog, sdlog, etc., the use of bigx = Inf
avoids the call to integrate, however the answer may be a little inaccurate.
For the maximum likelihood estimation of the 2 parameters a VGAM family function called polono(),
say, has not been written yet.

Author(s)
T. W. Yee. Some anonymous soul kindly wrote ppolono() and improved the original dpolono().

References
Bulmer, M. G. (1974) On fitting the Poisson lognormal distribution to species-abundance data.
Biometrics, 30, 101110.

See Also
lognormal, poissonff, negbinomial.
posbernoulli.b 593

Examples
meanlog <- 0.5; sdlog <- 0.5; yy <- 0:19
sum(proby <- dpolono(yy, m = meanlog, sd = sdlog)) # Should be 1
max(abs(cumsum(proby) - ppolono(yy, m = meanlog, sd = sdlog))) # Should be 0

## Not run: opar = par(no.readonly = TRUE)

par(mfrow = c(2, 2))
plot(yy, proby, type = "h", col = "blue", ylab = "P[Y=y]", log = "",
main = paste("Poisson lognormal(m = ", meanlog,
", sdl = ", sdlog, ")", sep = ""))

y <- 0:190 # More extreme values; use the approximation and plot on a log scale
(sum(proby <- dpolono(y, m = meanlog, sd = sdlog, bigx = 100))) # Should be 1
plot(y, proby, type = "h", col = "blue", ylab = "P[Y=y] (log)", log = "y",
main = paste("Poisson lognormal(m = ", meanlog,
", sdl = ", sdlog, ")", sep = "")) # Note the kink at bigx

# Random number generation

table(y <- rpolono(n = 1000, m = meanlog, sd = sdlog))
hist(y, breaks = ((-1):max(y))+0.5, prob = TRUE, border = "blue")
par(opar)
## End(Not run)

posbernoulli.b Positive Bernoulli Family Function with Behavioural Effects

Description
Fits a GLM-/GAM-like model to multiple Bernoulli responses where each row in the capture his-
tory matrix response has at least one success (capture). Capture history behavioural effects are
accommodated.

Usage
posbernoulli.b(link = "logit", drop.b = FALSE ~ 1,
type.fitted = c("likelihood.cond", "mean.uncond"), I2 = FALSE,
ipcapture = NULL, iprecapture = NULL,
p.small = 1e-4, no.warning = FALSE)

Arguments
link, drop.b, ipcapture, iprecapture
See CommonVGAMffArguments for information about these arguments. By de-
fault the parallelism assumption does not apply to the intercept. With an intercept-
only model setting drop.b = TRUE ~ 1 results in the M0 /Mh model.
I2 Logical. This argument is used for terms that are not parallel. If TRUE then the
constraint matrix diag(2) (the general default constraint matrix in VGAM) is
594 posbernoulli.b

used, else cbind(0:1, 1). The latter means the first element/column corre-
sponds to the behavioural effect. Consequently it and its standard error etc. can
be accessed directly without subtracting two quantities.
type.fitted Details at posbernoulli.tb.
p.small, no.warning
See posbernoulli.t.

Details

This model (commonly known as Mb /Mbh in the capturerecapture literature) operates on a cap-
ture history matrix response of 0s and 1s (n ). See posbernoulli.t for details, e.g., common
assumptions with other models. Once an animal is captured for the first time, it is marked/tagged
so that its future capture history can be recorded. The effect of the recapture probability is mod-
elled through a second linear/additive predictor. It is well-known that some species of animals
are affected by capture, e.g., trap-shy or trap-happy. This VGAM family function does allow
the capture history to be modelled via such behavioural effects. So does posbernoulli.tb but
posbernoulli.t cannot.
The number of linear/additive predictors is M = 2, and the default links are (logit pc , logit pr )T
where pc is the probability of capture and pr is the probability of recapture. The fitted value returned
is of the same dimension as the response matrix, and depends on the capture history: prior to being
first captured, it is pcapture. Afterwards, it is precapture.
By default, the constraint matrices for the intercept term and the other covariates are set up so
that pr differs from pc by a simple binary effect, on a logit scale. However, this difference (the
behavioural effect) is more directly estimated by having I2 = FALSE. Then it allows an estimate
of the trap-happy/trap-shy effect; these are positive/negative values respectively. If I2 = FALSE
then the (nonstandard) constraint matrix used is cbind(0:1, 1), meaning the first element can be
interpreted as the behavioural effect.

Value

An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, and vgam.

Note

The dependent variable is not scaled to row proportions. This is the same as posbernoulli.t and
posbernoulli.tb but different from posbinomial and binomialff.

Author(s)

Thomas W. Yee.

References

See posbernoulli.t.
posbernoulli.b 595

See Also
posbernoulli.t and posbernoulli.tb (including estimating N ), deermice, dposbern, rposbern,
posbinomial, aux.posbernoulli.t, prinia.

Examples
# deermice data ,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,

# Fit a M_b model

M.b <- vglm(cbind(y1, y2, y3, y4, y5, y6) ~ 1,
posbernoulli.b, data = deermice, trace = TRUE)
coef(M.b)["(Intercept):1"] # Behavioural effect on the logit scale
coef(M.b, matrix = TRUE)
constraints(M.b, matrix = TRUE)
summary(M.b, presid = FALSE)

# Fit a M_bh model

M.bh <- vglm(cbind(y1, y2, y3, y4, y5, y6) ~ sex + weight,
posbernoulli.b, data = deermice, trace = TRUE)
coef(M.bh, matrix = TRUE)
coef(M.bh)["(Intercept):1"] # Behavioural effect on the logit scale
constraints(M.bh) # (2,1) element of "(Intercept)" is for the behavioural effect
summary(M.bh, presid = FALSE) # Significant positive (trap-happy) behavioural effect
# Approx. 95 percent confidence for the behavioural effect:
SE.M.bh <- coef(summary(M.bh))["(Intercept):1", "Std. Error"]
coef(M.bh)["(Intercept):1"] + c(-1, 1) * 1.96 * SE.M.bh

# Fit a M_h model

M.h <- vglm(cbind(y1, y2, y3, y4, y5, y6) ~ sex + weight,
posbernoulli.b(drop.b = TRUE ~ sex + weight),
data = deermice, trace = TRUE)
coef(M.h, matrix = TRUE)
constraints(M.h, matrix = TRUE)
summary(M.h, presid = FALSE)

# Fit a M_0 model

M.0 <- vglm(cbind( y1 + y2 + y3 + y4 + y5 + y6,
6 - y1 - y2 - y3 - y4 - y5 - y6) ~ 1,
posbinomial, data = deermice, trace = TRUE)
coef(M.0, matrix = TRUE)
summary(M.0, presid = FALSE)

# Simulated data set ,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,

set.seed(123); nTimePts <- 5; N <- 1000 # N is the popn size
pdata <- rposbern(n = N, nTimePts = nTimePts, pvars = 2, is.popn = TRUE)
nrow(pdata) # Less than N (because some animals were never captured)
# The truth: xcoeffs are c(-2, 1, 2) and cap.effect = +1

M.bh.2 <- vglm(cbind(y1, y2, y3, y4, y5) ~ x2,

posbernoulli.b, data = pdata, trace = TRUE)
coef(M.bh.2)
596 posbernoulli.t

coef(M.bh.2, matrix = TRUE)

constraints(M.bh.2, matrix = TRUE)
summary(M.bh.2, presid = FALSE)
head(depvar(M.bh.2)) # Capture history response matrix
head(M.bh.2@extra$cap.hist1) # Info on its capture history
head(M.bh.2@extra$cap1) # When it was first captured
head(fitted(M.bh.2)) # Depends on capture history
(trap.effect <- coef(M.bh.2)["(Intercept):1"]) # Should be +1
head(model.matrix(M.bh.2, type = "vlm"), 21)
head(pdata)
summary(pdata)
dim(depvar(M.bh.2))
vcov(M.bh.2)

M.bh.2@extra$N.hat # Estimate of the population size; should be about N

M.bh.2@extra$SE.N.hat # SE of the estimate of the population size
# An approximate 95 percent confidence interval:
round(M.bh.2@extra$N.hat + c(-1, 1) * 1.96 * M.bh.2@extra$SE.N.hat, 1)

posbernoulli.t Positive Bernoulli Family Function with Time Effects

Description

Fits a GLM/GAM-like model to multiple Bernoulli responses where each row in the capture history
matrix response has at least one success (capture). Sampling occasion effects are accommodated.

Usage

posbernoulli.t(link = "logit", parallel.t = FALSE ~ 1, iprob = NULL,

p.small = 1e-4, no.warning = FALSE)

Arguments
link, iprob, parallel.t
See CommonVGAMffArguments for information. By default, the parallelism as-
sumption does not apply to the intercept. Setting parallel.t = FALSE ~ -1,
or equivalently parallel.t = FALSE ~ 0, results in the M0 /Mh model.
p.small, no.warning
A small probability value used to give a warning for the HorvitzThompson
estimator. Any estimated probability value less than p.small will result in a
warning, however, setting no.warning = TRUE will suppress this warning if
it occurs. This is because the Horvitz-Thompson estimator is the sum of the
reciprocal of such probabilities, therefore any probability that is too close to 0
will result in an unstable estimate.
posbernoulli.t 597

Details
These models (commonly known as Mt or Mth (no prefix h means it is an intercept-only model) in
the capturerecapture literature) operate on a capture history matrix response of 0s and 1s (n ).
Each column is a sampling occasion where animals are potentially captured (e.g., a field trip), and
each row is an individual animal. Capture is a 1, else a 0. No removal of animals from the population
is made (closed population), e.g., no immigration or emigration. Each row of the response matrix
has at least one capture. Once an animal is captured for the first time, it is marked/tagged so that
its future capture history can be recorded. Then it is released immediately back into the population
to remix. It is released immediately after each recapture too. It is assumed that the animals are
independent and that, for a given animal, each sampling occasion is independent. And animals do
not lose their marks/tags, and all marks/tags are correctly recorded.
The number of linear/additive predictors is equal to the number of sampling occasions, i.e., M = ,
say. The default link functions are (logit p1 , . . . , logit p )T where each pj denotes the probability
of capture at time point j. The fitted value returned is a matrix of probabilities of the same dimension
as the response matrix.
A conditional likelihood is maximized here using Fisher scoring. Each sampling occasion has a
separate probability that is modelled here. The probabilities can be constrained to be equal by
setting parallel.t = FALSE ~ 0; then the results are effectively the same as posbinomial except
the binomial constants are not included in the log-likelihood. If parallel.t = TRUE ~ 0 then
each column should have at least one 1 and at least one 0.
It is well-known that some species of animals are affected by capture, e.g., trap-shy or trap-happy.
This VGAM family function does not allow any behavioral effect to be modelled (posbernoulli.b
and posbernoulli.tb do) because the denominator of the likelihood function must be free of
behavioral effects.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, and vgam.
Upon fitting the extra slot has a (list) component called N.hat which is a point estimate of the
population size N (it is the Horvitz-Thompson (1952) estimator). And there is a component called
SE.N.hat containing its standard error.

Note
The weights argument of vglm need not be assigned, and the default is just a matrix of ones.
Fewer numerical problems are likely to occur for parallel.t = TRUE. Data-wise, each sampling
occasion may need at least one success (capture) and one failure. Less stringent conditions in the
data are needed when parallel.t = TRUE. Ditto when parallelism is applied to the intercept too.
The response matrix is returned unchanged; i.e., not converted into proportions like posbinomial.
If the response matrix has column names then these are used in the labelling, else prob1, prob2,
etc. are used.
Using AIC() or BIC() to compare posbernoulli.t, posbernoulli.b, posbernoulli.tb models
with a posbinomial model requires posbinomial(omit.constant = TRUE) because one needs
to remove the normalizing constant from the log-likelihood function. See posbinomial for an
example.
598 posbernoulli.t

Author(s)
Thomas W. Yee.

References
Huggins, R. M. (1991) Some practical aspects of a conditional likelihood approach to capture ex-
periments. Biometrics, 47, 725732.
Huggins, R. M. and Hwang, W.-H. (2011) A review of the use of conditional likelihood in capture
recapture experiments. International Statistical Review, 79, 385400.
Otis, D. L. and Burnham, K. P. and White, G. C. and Anderson, D. R. (1978) Statistical inference
from capture data on closed animal populations, Wildlife Monographs, 62, 3135.
Yee, T. W. and Stoklosa, J. and Huggins, R. M. (2015) The VGAM package for capturerecapture
data using the conditional likelihood. Journal of Statistical Software, 65, 133. http://www.
jstatsoft.org/v65/i05/.

See Also
posbernoulli.b, posbernoulli.tb, Select, deermice, Huggins89table1, Huggins89.t1, dposbern,
rposbern, posbinomial, AICvlm, BICvlm, prinia.

Examples
M.t <- vglm(cbind(y1, y2, y3, y4, y5, y6) ~ 1, posbernoulli.t,
data = deermice, trace = TRUE)
coef(M.t, matrix = TRUE)
constraints(M.t, matrix = TRUE)
summary(M.t, presid = FALSE)

M.h.1 <- vglm(Select(deermice, "y") ~ sex + weight, trace = TRUE,

posbernoulli.t(parallel.t = FALSE ~ -1), data = deermice)
coef(M.h.1, matrix = TRUE)
constraints(M.h.1)
summary(M.h.1, presid = FALSE)
head(depvar(M.h.1)) # Response capture history matrix
dim(depvar(M.h.1))

M.th.2 <- vglm(cbind(y1, y2, y3, y4, y5, y6) ~ sex + weight, trace = TRUE,
posbernoulli.t(parallel.t = FALSE), data = deermice)
lrtest(M.h.1, M.th.2) # Test the parallelism assumption wrt sex and weight
coef(M.th.2)
coef(M.th.2, matrix = TRUE)
constraints(M.th.2)
summary(M.th.2, presid = FALSE)
head(model.matrix(M.th.2, type = "vlm"), 21)

M.th.2@extra$N.hat # Estimate of the population size; should be about N

M.th.2@extra$SE.N.hat # SE of the estimate of the population size
# An approximate 95 percent confidence interval:
round(M.th.2@extra$N.hat + c(-1, 1) * 1.96 * M.th.2@extra$SE.N.hat, 1)
posbernoulli.tb 599

# Fit a M_h model, effectively the parallel M_t model, using posbinomial()
deermice <- transform(deermice, ysum = y1 + y2 + y3 + y4 + y5 + y6,
tau = 6)
M.h.3 <- vglm(cbind(ysum, tau - ysum) ~ sex + weight,
posbinomial(omit.constant = TRUE), data = deermice, trace = TRUE)
max(abs(coef(M.h.1) - coef(M.h.3))) # Should be zero
logLik(M.h.3) - logLik(M.h.1) # Difference is due to the binomial constants

posbernoulli.tb Positive Bernoulli Family Function with Time and Behavioural Effects

Description
Fits a GLM/GAM-like model to multiple Bernoulli responses where each row in the capture history
matrix response has at least one success (capture). Sampling occasion effects and behavioural
effects are accommodated.

Usage
posbernoulli.tb(link = "logit", parallel.t = FALSE ~ 1,
parallel.b = FALSE ~ 0, drop.b = FALSE ~ 1,
type.fitted = c("likelihood.cond", "mean.uncond"),
imethod = 1, iprob = NULL,
p.small = 1e-4, no.warning = FALSE,
ridge.constant = 0.01, ridge.power = -4)

Arguments
link, imethod, iprob
See CommonVGAMffArguments for information.
parallel.t, parallel.b, drop.b
A logical, or formula with a logical as the response. See CommonVGAMffArguments
for information. The parallel.-type arguments specify whether the constraint
matrices have a parallelism assumption for the temporal and behavioural ef-
fects. Argument parallel.t means parallel with respect to time, and matches
the same argument name in posbernoulli.t.
Suppose the model is intercept-only. Setting parallel.t = FALSE ~ 0 results
in the Mb model. Setting drop.b = FALSE ~ 0 results in the Mt model because
it drops columns off the constraint matrices corresponding to any behavioural ef-
fect. Setting parallel.t = FALSE ~ 0 and setting parallel.b = FALSE ~ 0
results in the Mb model. Setting parallel.t = FALSE ~ 0, parallel.b = FALSE ~ 0
and drop.b = FALSE ~ 0 results in the M0 model. Note the default for
parallel.t and parallel.b may be unsuitable for data sets which have a
large because of the large number of parameters; it might be too flexible.
If it is desired to have the behaviour affect some of the other covariates then set
drop.b = TRUE ~ 0.
600 posbernoulli.tb

The default model has a different intercept for each sampling occasion, a time-
parallelism assumption for all other covariates, and a dummy variable represent-
ing a single behavioural effect (also in the intercept).
The most flexible model is to set parallel.b = TRUE ~ 0, parallel.t = TRUE ~ 0
and drop.b = TRUE ~ 0. This means that all possible temporal and behavioural
effects are estimated, for the intercepts and other covariates. Such a model is not
recommended; it will contain a lot of paramters.
type.fitted Character, one of the choices for the type of fitted value returned. The default is
the first one. Partial matching is okay. For "likelihood.cond": the probability
defined by the conditional likelihood. For "mean.uncond": the unconditional
mean, which should agree with colMeans applied to the response matrix for
intercept-only models.
ridge.constant, ridge.power
Determines the ridge parameters at each IRLS iteration. They are the constant
and power (exponent) for the ridge adjustment for the working weight matrices
(the capture probability block matrix, hence the first diagonal values). At
iteration a of the IRLS algorithm a positive value is added to the first diagonal
elements of the working weight matrices to make them positive-definite. This
adjustment is the mean of the diagonal elements of wz multipled by K ap
where K is ridge.constant and p is ridge.power. This is always positive but
decays to zero as iterations proceed (provided p is negative etc.).
p.small, no.warning
See posbernoulli.t.

Details
This model (commonly known as Mtb /Mtbh in the capturerecapture literature) operates on a re-
sponse matrix of 0s and 1s (n ). See posbernoulli.t for information that is in common. It
allows time and behavioural effects to be modelled.
Evidently, the expected information matrix (EIM) seems not of full rank (especially in early it-
erations), so ridge.constant and ridge.power are used to try fix up the problem. The default
link functions are (logit pc1 , . . . , logit pc , logit pr2 , . . . , logit pr )T where the subscript c denotes
capture, the subscript r denotes recapture, and it is not possible to recapture the animal at sampling
occasion 1. Thus M = 2 1. The parameters are currently prefixed by pcapture and precapture
for the capture and recapture probabilities. This VGAM family function may be further modified
in the future.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, and vgam.

Note
It is a good idea to apply the parallelism assumption to each sampling occasion except possibly
with respect to the intercepts. Also, a simple behavioural effect such as being modelled using the
intercept is recommended; if the behavioural effect is not parallel and/or allowed to apply to other
posbernoulli.tb 601

covariates then there will probably be too many parameters, and hence, numerical problems. See
M_tbh.1 below.
It is a good idea to monitor convergence. Simpler models such as the M0 /Mh models are best fitted
with posbernoulli.t or posbernoulli.b or posbinomial.

Author(s)
Thomas W. Yee.

References
See posbernoulli.t.

See Also
posbernoulli.b (including N.hat), posbernoulli.t, posbinomial, Select, fill1, Huggins89table1,
Huggins89.t1, deermice, prinia.

Examples
## Not run:
# Example 1: simulated data
nTimePts <- 5 # (aka tau == # of sampling occasions)
nnn <- 1000 # Number of animals
pdata <- rposbern(n = nnn, nTimePts = nTimePts, pvars = 2)
dim(pdata); head(pdata)

M_tbh.1 <- vglm(cbind(y1, y2, y3, y4, y5) ~ x2,

posbernoulli.tb, data = pdata, trace = TRUE)
coef(M_tbh.1) # First element is the behavioural effect
coef(M_tbh.1, matrix = TRUE)
constraints(M_tbh.1, matrix = TRUE)
summary(M_tbh.1, presid = FALSE) # Standard errors are approximate
head(fitted(M_tbh.1))
head(model.matrix(M_tbh.1, type = "vlm"), 21)
dim(depvar(M_tbh.1))

M_tbh.2 <- vglm(cbind(y1, y2, y3, y4, y5) ~ x2,

posbernoulli.tb(parallel.t = FALSE ~ 0),
data = pdata, trace = TRUE)
coef(M_tbh.2) # First element is the behavioural effect
coef(M_tbh.2, matrix = TRUE)
constraints(M_tbh.2, matrix = TRUE)
summary(M_tbh.2, presid = FALSE) # Standard errors are approximate
head(fitted(M_tbh.2))
head(model.matrix(M_tbh.2, type = "vlm"), 21)
dim(depvar(M_tbh.2))

# Example 2: deermice subset data

fit1 <- vglm(cbind(y1, y2, y3, y4, y5, y6) ~ sex + weight,
posbernoulli.t, data = deermice, trace = TRUE)
coef(fit1)
602 posbernUC

coef(fit1, matrix = TRUE)

constraints(fit1, matrix = TRUE)
summary(fit1, presid = FALSE) # Standard errors are approximate

# fit1 is the same as Fit1 (a M_{th} model):

Fit1 <- vglm(cbind(y1, y2, y3, y4, y5, y6) ~ sex + weight,
posbernoulli.tb(drop.b = TRUE ~ sex + weight,
parallel.t = TRUE), # No parallelism for the intercept
data = deermice, trace = TRUE)
constraints(Fit1)

## End(Not run)

posbernUC Positive Bernoulli Sequence Model

Description
Density, and random generation for multiple Bernoulli responses where each row in the response
matrix has at least one success.

Usage
rposbern(n, nTimePts = 5, pvars = length(xcoeff), xcoeff = c(-2, 1, 2),
Xmatrix = NULL, cap.effect = 1, is.popn = FALSE,
link = "logit", earg.link = FALSE)
dposbern(x, prob, prob0 = prob, log = FALSE)

Arguments
x response vector or matrix. Should only have 0 and 1 values, at least two columns,
and each row should have at least one 1.
nTimePts Number of sampling occasions. Called in posbernoulli.b and posbernoulli.t.
n number of observations. Usually a single positive integer, else the length of the
vector is used. See argument is.popn.
is.popn Logical. If TRUE then argument n is the population size and what is returned
may have substantially less rows than n. That is, if an animal has at least one
one in its sequence then it is returned, else that animal is not returned because it
never was captured.
Xmatrix Optional X matrix. If given, the X matrix is not generated internally.
cap.effect Numeric, the capture effect. Added to the linear predictor if captured previously.
A positive or negative value corresponds to a trap-happy and trap-shy effect
respectively.
pvars Number of other numeric covariates that make up the linear predictor. Labelled
x1, x2, . . . , where the first is an intercept, and the others are independent stan-
dard runif random variates. The first pvars elements of xcoeff are used.
posbernUC 603

xcoeff The regression coefficients of the linear predictor. These correspond to x1, x2,
. . . , and the first is for the intercept. The length of xcoeff must be at least
pvars.
link, earg.link
The former is used to generate the probabilities for capture at each occasion.
Other details at CommonVGAMffArguments.
prob, prob0 Matrix of probabilities for the numerator and denominators respectively. The
default does not correspond to the Mb model since the Mb model has a denom-
inator which involves the capture history.
log Logical. Return the logarithm of the answer?

Details
The form of the conditional likelihood is described in posbernoulli.b and/or posbernoulli.t
and/or posbernoulli.tb. The denominator is equally shared among the elements of the matrix x.

Value
rposbern returns a data frame with some attributes. The function generates random deviates (
columns labelled y1, y2, . . . ) for the response. Some indicator columns are also included (those
starting with ch are for previous capture history). The default setting corresponds to a Mbh model
that has a single trap-happy effect. Covariates x1, x2, . . . have the same affect on capture/recapture
at every sampling occasion (see the argument parallel.t in, e.g., posbernoulli.tb).
The function dposbern gives the density,

Note
The r-type function is experimental only and does not follow the usual conventions of r-type R
functions. It may change a lot in the future. The d-type function is more conventional and is less
likely to change.

Author(s)
Thomas W. Yee.

See Also
posbernoulli.tb, posbernoulli.b, posbernoulli.t.

Examples
rposbern(n = 10)
attributes(pdata <- rposbern(n = 100))
M.bh <- vglm(cbind(y1, y2, y3, y4, y5) ~ x2 + x3, posbernoulli.b(I2 = FALSE),
data = pdata, trace = TRUE)
constraints(M.bh)
summary(M.bh)
604 Posbinom

Posbinom Positive-Binomial Distribution

Description
Density, distribution function, quantile function and random generation for the positive-binomial
distribution.

Usage
dposbinom(x, size, prob, log = FALSE)
pposbinom(q, size, prob)
qposbinom(p, size, prob)
rposbinom(n, size, prob)

Arguments
x, q vector of quantiles.
p vector of probabilities.
n number of observations. Fed into runif.
size number of trials. It is the N symbol in the formula given in posbinomial and
should be positive.
prob probability of success on each trial. Should be in (0, 1).
log See dbinom.

Details
The positive-binomial distribution is a binomial distribution but with the probability of a zero being
zero. The other probabilities are scaled to add to unity. The mean therefore is

/(1 (1 )N )

where is the argument prob above. As increases, the positive-binomial and binomial distribu-
tions become more similar. Unlike similar functions for the binomial distribution, a zero value of
prob is not permitted here.

Value
dposbinom gives the density, pposbinom gives the distribution function, qposbinom gives the quan-
tile function, and rposbinom generates random deviates.

Note
For dposbinom(), if arguments size or prob equal 0 then a NaN is returned.
The family function posbinomial estimates the parameters by maximum likelihood estimation.
posbinomial 605

Author(s)
T. W. Yee.

See Also
posbinomial, dposbern, zabinomial, zibinomial, rbinom.

Examples
prob <- 0.2; size <- 10
table(y <- rposbinom(n = 1000, size, prob))
mean(y) # Sample mean
size * prob / (1 - (1 - prob)^size) # Population mean

(ii <- dposbinom(0:size, size, prob))

cumsum(ii) - pposbinom(0:size, size, prob) # Should be 0s
table(rposbinom(100, size, prob))

table(qposbinom(runif(1000), size, prob))

round(dposbinom(1:10, size, prob) * 1000) # Should be similar

## Not run: barplot(rbind(dposbinom(x = 0:size, size, prob),

dbinom(x = 0:size, size, prob)),
beside = TRUE, col = c("blue", "green"),
main = paste("Positive-binomial(", size, ",",
prob, ") (blue) vs",
" Binomial(", size, ",", prob, ") (green)", sep = ""),
names.arg = as.character(0:size), las = 1)
## End(Not run)

# Simulated data example

nn <- 1000; sizeval1 <- 10; sizeval2 <- 20
pdata <- data.frame(x2 = seq(0, 1, length = nn))
pdata <- transform(pdata, prob1 = logit(-2 + 2 * x2, inverse = TRUE),
prob2 = logit(-1 + 1 * x2, inverse = TRUE),
sizev1 = rep(sizeval1, len = nn),
sizev2 = rep(sizeval2, len = nn))
pdata <- transform(pdata, y1 = rposbinom(nn, size = sizev1, prob = prob1),
y2 = rposbinom(nn, size = sizev2, prob = prob2))
with(pdata, table(y1))
with(pdata, table(y2))
# Multiple responses
fit2 <- vglm(cbind(y1, y2) ~ x2, posbinomial(multiple.responses = TRUE),
trace = TRUE, data = pdata, weight = cbind(sizev1, sizev2))
coef(fit2, matrix = TRUE)

posbinomial Positive Binomial Distribution Family Function

606 posbinomial

Description

Fits a positive binomial distribution.

Usage

posbinomial(link = "logit", multiple.responses = FALSE, parallel = FALSE,

omit.constant = FALSE, p.small = 1e-4, no.warning = FALSE,
zero = NULL)

Arguments
link, multiple.responses, parallel, zero
Details at CommonVGAMffArguments.
omit.constant Logical. If TRUE then the constant (lchoose(size, size * yprop) is omitted
from the loglikelihood calculation. If the model is to be compared using
AIC() or BIC() (see AICvlm or BICvlm) to the likes of posbernoulli.tb etc.
then it is important to set omit.constant = TRUE because all models then
will not have any normalizing constants in the likelihood function. Hence they
become comparable. This is because the M0 Otis et al. (1978) model coincides
with posbinomial(). See below for an example. Also see posbernoulli.t
regarding estimating the population size (N.hat and SE.N.hat) if the number of
trials is the same for all observations.
p.small, no.warning
See posbernoulli.t.

Details

The positive binomial distribution is the ordinary binomial distribution but with the probability of
zero being zero. Thus the other probabilities are scaled up (i.e., divided by 1 P (Y = 0)). The
fitted values are the ordinary binomial distribution fitted values, i.e., the usual mean.
In the capturerecapture literature this model is called the M0 if it is an intercept-only model.
Otherwise it is called the Mh when there are covariates. It arises from a sum of a sequence of -
Bernoulli random variates subject to at least one success (capture). Here, each animal has the same
probability of capture or recapture, regardless of the sampling occasions. Independence between
animals and between sampling occasions etc. is assumed.

Value

An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, and vgam.

Warning

Under- or over-flow may occur if the data is ill-conditioned.

posbinomial 607

Note
The input for this family function is the same as binomialff.
If multiple.responses = TRUE then each column of the matrix response should be a count (the
number of successes), and the weights argument should be a matrix of the same dimension as the
response containing the number of trials. If multiple.responses = FALSE then the response input
should be the same as binomialff.
Yet to be done: a quasi.posbinomial() which estimates a dispersion parameter.

Author(s)
Thomas W. Yee

References
Otis, D. L. et al. (1978) Statistical inference from capture data on closed animal populations,
Wildlife Monographs, 62, 3135.
Patil, G. P. (1962) Maximum likelihood estimation for generalised power series distributions and its
application to a truncated binomial distribution. Biometrika, 49, 227237.
Pearson, K. (1913) A Monograph on Albinism in Man. Drapers Company Research Memoirs.

See Also
posbernoulli.b, posbernoulli.t, posbernoulli.tb, binomialff, AICvlm, BICvlm, simulate.vlm.

Examples
# Number of albinotic children in families with 5 kids (from Patil, 1962) ,,,,
albinos <- data.frame(y = c(rep(1, 25), rep(2, 23), rep(3, 10), 4, 5),
n = rep(5, 60))
fit1 <- vglm(cbind(y, n-y) ~ 1, posbinomial, albinos, trace = TRUE)
summary(fit1)
Coef(fit1) # = MLE of p = 0.3088
head(fitted(fit1))
sqrt(vcov(fit1, untransform = TRUE)) # SE = 0.0322

# Fit a M_0 model (Otis et al. 1978) to the deermice data ,,,,,,,,,,,,,,,,,,,,,,,
M.0 <- vglm(cbind( y1 + y2 + y3 + y4 + y5 + y6,
6 - y1 - y2 - y3 - y4 - y5 - y6) ~ 1, trace = TRUE,
posbinomial(omit.constant = TRUE), data = deermice)
coef(M.0, matrix = TRUE)
Coef(M.0)
constraints(M.0, matrix = TRUE)
summary(M.0)
c( N.hat = M.0@extra$N.hat, # Since tau = 6, i.e., 6 Bernoulli trials per
SE.N.hat = M.0@extra$SE.N.hat) # observation is the same for each observation

# Compare it to the M_b using AIC and BIC

M.b <- vglm(cbind(y1, y2, y3, y4, y5, y6) ~ 1, trace = TRUE,
posbernoulli.b, data = deermice)
608 Posgeom

sort(c(M.0 = AIC(M.0), M.b = AIC(M.b))) # Okay since omit.constant = TRUE

sort(c(M.0 = BIC(M.0), M.b = BIC(M.b))) # Okay since omit.constant = TRUE

Posgeom Positive-Geometric Distribution

Description
Density, distribution function, quantile function and random generation for the positive-geometric
distribution.

Usage
dposgeom(x, prob, log = FALSE)
pposgeom(q, prob)
qposgeom(p, prob)
rposgeom(n, prob)

Arguments
x, q vector of quantiles.
p vector of probabilities.
n number of observations. Fed into runif.
prob vector of probabilities of success (of an ordinary geometric distribution). Short
vectors are recycled.
log logical.

Details
The positive-geometric distribution is a geometric distribution but with the probability of a zero
being zero. The other probabilities are scaled to add to unity. The mean therefore is 1/prob.
As prob decreases, the positive-geometric and geometric distributions become more similar. Like
similar functions for the geometric distribution, a zero value of prob is not permitted here.

Value
dposgeom gives the density, pposgeom gives the distribution function, qposgeom gives the quantile
function, and rposgeom generates random deviates.

Author(s)
T. W. Yee

See Also
zageometric, zigeometric, rgeom.
Posnegbin 609

Examples
prob <- 0.75; y <- rposgeom(n = 1000, prob)
table(y)
mean(y) # Sample mean
1 / prob # Population mean

(ii <- dposgeom(0:7, prob))

cumsum(ii) - pposgeom(0:7, prob) # Should be 0s
table(rposgeom(100, prob))

table(qposgeom(runif(1000), prob))
round(dposgeom(1:10, prob) * 1000) # Should be similar

## Not run:
x <- 0:5
barplot(rbind(dposgeom(x, prob), dgeom(x, prob)),
beside = TRUE, col = c("blue", "orange"),
main = paste("Positive geometric(", prob, ") (blue) vs",
" geometric(", prob, ") (orange)", sep = ""),
names.arg = as.character(x), las = 1, lwd = 2)
## End(Not run)

Posnegbin Positive-Negative Binomial Distribution

Description
Density, distribution function, quantile function and random generation for the positive-negative
binomial distribution.

Usage
dposnegbin(x, size, prob = NULL, munb = NULL, log = FALSE)
pposnegbin(q, size, prob = NULL, munb = NULL,
lower.tail = TRUE, log.p = FALSE)
qposnegbin(p, size, prob = NULL, munb = NULL)
rposnegbin(n, size, prob = NULL, munb = NULL)

Arguments
x, q vector of quantiles.
p vector of probabilities.
n number of observations. Fed into runif.
size, prob, munb, log
Same arguments as that of an ordinary negative binomial distribution (see dnbinom).
Some arguments have been renamed slightly.
610 Posnegbin

Short vectors are recycled. The parameter 1/size is known as a dispersion

parameter; as size approaches infinity, the negative binomial distribution ap-
proaches a Poisson distribution.
Note that prob must lie in (0, 1), otherwise a NaN is returned.
log.p, lower.tail
Same arguments as that of an ordinary negative binomial distribution (see pnbinom).

Details
The positive-negative binomial distribution is a negative binomial distribution but with the proba-
bility of a zero being zero. The other probabilities are scaled to add to unity. The mean therefore
is
/(1 p(0))
where the mean of an ordinary negative binomial distribution.

Value
dposnegbin gives the density, pposnegbin gives the distribution function, qposnegbin gives the
quantile function, and rposnegbin generates n random deviates.

Author(s)
T. W. Yee

See Also
posnegbinomial, zanegbinomial, zinegbinomial, rnbinom.

Examples
munb <- 5; size <- 4; n <- 1000
table(y <- rposnegbin(n, munb = munb, size = size))
mean(y) # sample mean
munb / (1 - (size / (size + munb))^size) # population mean
munb / pnbinom(0, mu = munb, size = size, lower.tail = FALSE) # same as before

x <- (-1):17
(ii <- dposnegbin(x, munb = munb, size = size))
max(abs(cumsum(ii) - pposnegbin(x, munb = munb, size = size))) # Should be 0

## Not run:
x <- 0:10
barplot(rbind(dposnegbin(x, munb = munb, size = size),
dnbinom(x, mu = munb, size = size)),
posnegbinomial 611

beside = TRUE, col = c("blue","green"),

main = paste("dposnegbin(munb = ", munb, ", size = ", size, ") (blue) vs",
" dnbinom(mu = ", munb, ", size = ", size, ") (green)", sep = ""),
names.arg = as.character(x))
## End(Not run)

# Another test for pposnegbin()

nn <- 5000
mytab <- cumsum(table(rposnegbin(nn, munb = munb, size = size))) / nn
myans <- pposnegbin(sort(as.numeric(names(mytab))), munb = munb, size = size)
max(abs(mytab - myans)) # Should be 0

posnegbinomial Positive Negative Binomial Distribution Family Function

Description
Maximum likelihood estimation of the two parameters of a positive negative binomial distribution.

Usage
posnegbinomial(zero = "size", type.fitted = c("mean", "munb", "prob0"),
mds.min = 0.001, nsimEIM = 500, cutoff.prob = 0.999,
eps.trig = 1e-07, max.support = 4000, max.chunk.MB = 30,
lmunb = "loge", lsize = "loge", imethod = 1,
imunb = NULL, iprobs.y = NULL,
gprobs.y = ppoints(8), isize = NULL,
gsize.mux = exp(c(-30, -20, -15, -10, -6:3)))

Arguments
lmunb Link function applied to the munb parameter, which is the mean nb of an ordi-
nary negative binomial distribution. See Links for more choices.
lsize Parameter link function applied to the dispersion parameter, called k. See Links
for more choices.
isize Optional initial value for k, an index parameter. The value 1/k is known as a
dispersion parameter. If failure to converge occurs try different values (and/or
use imethod). If necessary this vector is recycled to length equal to the number
of responses. A value NULL means an initial value for each response is computed
internally using a range of values.
nsimEIM, zero, eps.trig
See CommonVGAMffArguments.
mds.min, iprobs.y, cutoff.prob
Similar to negbinomial.
imunb, max.support
Similar to negbinomial.
612 posnegbinomial

max.chunk.MB, gsize.mux
Similar to negbinomial.
imethod, gprobs.y
See negbinomial.
type.fitted See CommonVGAMffArguments for details.

Details
The positive negative binomial distribution is an ordinary negative binomial distribution but with
the probability of a zero response being zero. The other probabilities are scaled to sum to unity.
This family function is based on negbinomial and most details can be found there. To avoid confu-
sion, the parameter munb here corresponds to the mean of an ordinary negative binomial distribution
negbinomial. The mean of posnegbinomial is

nb /(1 p(0))

where p(0) = (k/(k + nb ))k is the probability an ordinary negative binomial distribution has a
zero value.
The parameters munb and k are not independent in the positive negative binomial distribution,
whereas they are in the ordinary negative binomial distribution.
This function handles multiple responses, so that a matrix can be used as the response. The number
of columns is the number of species, say, and setting zero = -2 means that all species have a k
equalling a (different) intercept only.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, rrvglm and vgam.

Warning
This family function is fragile; at least two cases will lead to numerical problems. Firstly, the
positive-Poisson model corresponds to k equalling infinity. If the data is positive-Poisson or close
to positive-Poisson, then the estimated k will diverge to Inf or some very large value. Secondly,
if the data is clustered about the value 1 because the munb parameter is close to 0 then numerical
problems will also occur. Users should set trace = TRUE to monitor convergence. In the situation
when both cases hold, the result returned (which will be untrustworthy) will depend on the initial
values.
The negative binomial distribution (NBD) is a strictly unimodal distribution. Any data set that
does not exhibit a mode (in the middle) makes the estimation problem difficult. The positive NBD
inherits this feature. Set trace = TRUE to monitor convergence.
See the example below of a data set where posbinomial() fails; the so-called solution is extremely
poor. This is partly due to a lack of a unimodal shape because the number of counts decreases only.
This long tail makes it very difficult to estimate the mean parameter with any certainty. The result
too is that the size parameter is numerically fraught.
This VGAM family function inherits the same warnings as negbinomial. And if k is much less
than 1 then the estimation may be slow.
posnegbinomial 613

Note
If the estimated k is very large then fitting a pospoisson model is a good idea.
If both munb and k are large then it may be necessary to decrease eps.trig and increase max.support
so that the EIMs are positive-definite, e.g., eps.trig = 1e-8 and max.support = Inf.

Author(s)
Thomas W. Yee

References
Barry, S. C. and Welsh, A. H. (2002) Generalized additive modelling and zero inflated count data.
Ecological Modelling, 157, 179188.
Williamson, E. and Bretherton, M. H. (1964) Tables of the logarithmic series distribution. Annals
of Mathematical Statistics, 35, 284297.

See Also
rposnegbin, pospoisson, negbinomial, zanegbinomial, rnbinom, CommonVGAMffArguments,
corbet, logff, simulate.vlm.

Examples
pdata <- data.frame(x2 = runif(nn <- 1000))
pdata <- transform(pdata, y1 = rposnegbin(nn, munb = exp(0+2*x2), size = exp(1)),
y2 = rposnegbin(nn, munb = exp(1+2*x2), size = exp(3)))
fit <- vglm(cbind(y1, y2) ~ x2, posnegbinomial, data = pdata, trace = TRUE)
coef(fit, matrix = TRUE)
dim(depvar(fit)) # Using dim(fit@y) is not recommended

# Another artificial data example

pdata2 <- data.frame(munb = exp(2), size = exp(3)); nn <- 1000
pdata2 <- transform(pdata2, y3 = rposnegbin(nn, munb = munb, size = size))
with(pdata2, table(y3))
fit <- vglm(y3 ~ 1, posnegbinomial, data = pdata2, trace = TRUE)
coef(fit, matrix = TRUE)
with(pdata2, mean(y3)) # Sample mean
head(with(pdata2, munb/(1-(size/(size+munb))^size)), 1) # Population mean
head(fitted(fit), 3)
head(predict(fit), 3)

# Example: Corbet (1943) butterfly Malaya data

fit <- vglm(ofreq ~ 1, posnegbinomial, weights = species, data = corbet)
coef(fit, matrix = TRUE)
Coef(fit)
(khat <- Coef(fit)["size"])
pdf2 <- dposnegbin(x = with(corbet, ofreq), mu = fitted(fit), size = khat)
print(with(corbet, cbind(ofreq, species, fitted = pdf2*sum(species))), dig = 1)
614 Posnorm

## Not run:
with(corbet,
matplot(ofreq, cbind(species, fitted = pdf2*sum(species)), las = 1,
xlab = "Observed frequency (of individual butterflies)",
type = "b", ylab = "Number of species", col = c("blue", "orange"),
main = "blue 1s = observe; orange 2s = fitted"))

## End(Not run)

## Not run:
# This data (courtesy of Maxim Gerashchenko) causes posbinomial() to fail
pnbd.fail <- data.frame(
y1 = c(1:16, 18:21, 23:28, 33:38, 42, 44, 49:51, 55, 56, 58,
59, 61:63, 66, 73, 76, 94, 107, 112, 124, 190, 191, 244),
ofreq = c(130, 80, 38, 23, 22, 11, 21, 14, 6, 7, 9, 9, 9, 4, 4, 5, 1,
4, 6, 1, 3, 2, 4, 3, 4, 5, 3, 1, 2, 1, 1, 4, 1, 2, 2, 1, 3,
1, 1, 2, 2, 2, 1, 3, 2, 1, 1, 1, 1, 1, 1, 1, 1, 1))
fit.fail <- vglm(y1 ~ 1, weights = ofreq, posnegbinomial,
trace = TRUE, data = pnbd.fail)

## End(Not run)

Posnorm The Positive-Normal Distribution

Description
Density, distribution function, quantile function and random generation for the univariate positive-
normal distribution.

Usage
dposnorm(x, mean = 0, sd = 1, log = FALSE)
pposnorm(q, mean = 0, sd = 1, lower.tail = TRUE, log.p = FALSE)
qposnorm(p, mean = 0, sd = 1, lower.tail = TRUE, log.p = FALSE)
rposnorm(n, mean = 0, sd = 1)

Arguments
x, q vector of quantiles.
p vector of probabilities.
n number of observations. If length(n) > 1 then the length is taken to be the
number required.
mean, sd, log, lower.tail, log.p
see rnorm.

Details
See posnormal, the VGAM family function for estimating the parameters, for the formula of the
probability density function and other details.
posnormal 615

Value

dposnorm gives the density, pposnorm gives the distribution function, qposnorm gives the quantile
function, and rposnorm generates random deviates.

Author(s)

T. W. Yee

See Also

posnormal.

Examples
## Not run: m <- 0.8; x <- seq(-1, 4, len = 501)
plot(x, dposnorm(x, m = m), type = "l", las = 1, ylim = 0:1,
ylab = paste("posnorm(m = ", m, ", sd = 1)"), col = "blue",
main = "Blue is density, orange is cumulative distribution function",
sub = "Purple lines are the 10,20,...,90 percentiles")
abline(h = 0, col = "grey")
lines(x, pposnorm(x, m = m), col = "orange", type = "l")
probs <- seq(0.1, 0.9, by = 0.1)
Q <- qposnorm(probs, m = m)
lines(Q, dposnorm(Q, m = m), col = "purple", lty = 3, type = "h")
lines(Q, pposnorm(Q, m = m), col = "purple", lty = 3, type = "h")
abline(h = probs, col = "purple", lty = 3)
max(abs(pposnorm(Q, m = m) - probs)) # Should be 0

## End(Not run)

posnormal Positive Normal Distribution Family Function

Description

Fits a positive (univariate) normal distribution.

Usage

posnormal(lmean = "identitylink", lsd = "loge",

eq.mean = FALSE, eq.sd = FALSE,
gmean = exp((-5:5)/2), gsd = exp((-1:5)/2),
imean = NULL, isd = NULL, probs.y = 0.10, imethod = 1,
nsimEIM = NULL, zero = "sd")
616 posnormal

Arguments
lmean, lsd Link functions for the mean and standard deviation parameters of the usual uni-
variate normal distribution. They are and respectively. See Links for more
choices.
gmean, gsd, imethod
See CommonVGAMffArguments for more information. gmean and gsd currently
operate on a multiplicative scale, on the sample mean and the sample standard
deviation, respectively.
imean, isd Optional initial values for and . A NULL means a value is computed internally.
See CommonVGAMffArguments for more information.
eq.mean, eq.sd See CommonVGAMffArguments for more information. The fact that these argu-
ments are supported results in default constraint matrices being a permutation
of the identity matrix (effectively trivial constraints).
zero, nsimEIM, probs.y
See CommonVGAMffArguments for information.

Details
The positive normal distribution is the ordinary normal distribution but with the probability of zero
or less being zero. The rest of the probability density function is scaled up. Hence the probability
density function can be written

1 1 2 2
f (y) = exp (y ) / / [1 (/)]
2 2

where () is the cumulative distribution function of a standard normal (pnorm). Equivalently, this
is
1 ((y )/)
f (y) = .
1 (/)
where () is the probability density function of a standard normal distribution (dnorm).
The mean of Y is
(/)
E(Y ) = + .
1 (/)
This family function handles multiple responses.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, and vgam.

Warning
It is recommended that trace = TRUE be used to monitor convergence; sometimes the estimated
mean is -Inf and the estimated mean standard deviation is Inf, especially when the sample size is
small. Under- or over-flow may occur if the data is ill-conditioned.
Pospois 617

Note

The response variable for this family function is the same as uninormal except positive values are
required. Reasonably good initial values are needed.
The distribution of the reciprocal of a positive normal random variable is known as an alpha distri-
bution.

Author(s)

Thomas W. Yee

See Also

uninormal, tobit.

Examples
pdata <- data.frame(Mean = 1.0, SD = exp(1.0))
pdata <- transform(pdata, y = rposnorm(n <- 1000, m = Mean, sd = SD))

## Not run: with(pdata, hist(y, prob = TRUE, border = "blue",

main = paste("posnorm(m =", Mean[1], ", sd =", round(SD[1], 2),")")))
## End(Not run)
fit <- vglm(y ~ 1, posnormal, data = pdata, trace = TRUE)
coef(fit, matrix = TRUE)
(Cfit <- Coef(fit))
mygrid <- with(pdata, seq(min(y), max(y), len = 200)) # Add the fit to the histogram
## Not run: lines(mygrid, dposnorm(mygrid, Cfit[1], Cfit[2]), col = "orange")

Pospois Positive-Poisson Distribution

Description

Density, distribution function, quantile function and random generation for the positive-Poisson
distribution.

Usage

dpospois(x, lambda, log = FALSE)

ppospois(q, lambda)
qpospois(p, lambda)
rpospois(n, lambda)
618 Pospois

Arguments
x, q vector of quantiles.
p vector of probabilities.
n number of observations. Fed into runif.
lambda vector of positive means (of an ordinary Poisson distribution). Short vectors are
recycled.
log logical.

Details
The positive-Poisson distribution is a Poisson distribution but with the probability of a zero being
zero. The other probabilities are scaled to add to unity. The mean therefore is

/(1 exp()).

As increases, the positive-Poisson and Poisson distributions become more similar. Unlike similar
functions for the Poisson distribution, a zero value of lambda returns a NaN.

Value
dpospois gives the density, ppospois gives the distribution function, qpospois gives the quantile
function, and rpospois generates random deviates.

Note
The family function pospoisson estimates by maximum likelihood estimation.

Author(s)
T. W. Yee

See Also
pospoisson, zapoisson, zipoisson, rpois.

Examples
lambda <- 2; y = rpospois(n = 1000, lambda)
table(y)
mean(y) # Sample mean
lambda / (1 - exp(-lambda)) # Population mean

(ii <- dpospois(0:7, lambda))

cumsum(ii) - ppospois(0:7, lambda) # Should be 0s
table(rpospois(100, lambda))

table(qpospois(runif(1000), lambda))
round(dpospois(1:10, lambda) * 1000) # Should be similar

## Not run: x <- 0:7

pospoisson 619

barplot(rbind(dpospois(x, lambda), dpois(x, lambda)),

beside = TRUE, col = c("blue", "orange"),
main = paste("Positive Poisson(", lambda, ") (blue) vs",
" Poisson(", lambda, ") (orange)", sep = ""),
names.arg = as.character(x), las = 1, lwd = 2)
## End(Not run)

pospoisson Positive Poisson Distribution Family Function

Description
Fits a positive Poisson distribution.

Usage
pospoisson(link = "loge", type.fitted = c("mean", "lambda", "prob0"),
expected = TRUE, ilambda = NULL, imethod = 1, zero = NULL)

Arguments
link Link function for the usual mean (lambda) parameter of an ordinary Poisson
distribution. See Links for more choices.
expected Logical. Fisher scoring is used if expected = TRUE, else Newton-Raphson.
ilambda, imethod, zero
See CommonVGAMffArguments for information.
type.fitted See CommonVGAMffArguments for details.

Details
The positive Poisson distribution is the ordinary Poisson distribution but with the probability of zero
being zero. Thus the other probabilities are scaled up (i.e., divided by 1 P [Y = 0]). The mean,
/(1 exp()), can be obtained by the extractor function fitted applied to the object.
A related distribution is the zero-inflated Poisson, in which the probability P [Y = 0] involves
another parameter . See zipoisson.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, rrvglm and vgam.

Warning
Under- or over-flow may occur if the data is ill-conditioned.
620 powerlink

Note

This family function can handle multiple responses.

Yet to be done: a quasi.pospoisson which estimates a dispersion parameter.

Author(s)

Thomas W. Yee

References

Coleman, J. S. and James, J. (1961) The equilibrium size distribution of freely-forming groups.
Sociometry, 24, 3645.

See Also

Pospois, posnegbinomial, poissonff, otpospoisson, zapoisson, zipoisson, simulate.vlm.

Examples
# Data from Coleman and James (1961)
cjdata <- data.frame(y = 1:6, freq = c(1486, 694, 195, 37, 10, 1))
fit <- vglm(y ~ 1, pospoisson, data = cjdata, weights = freq)
Coef(fit)
summary(fit)
fitted(fit)

pdata <- data.frame(x2 = runif(nn <- 1000)) # Artificial data

pdata <- transform(pdata, lambda = exp(1 - 2 * x2))
pdata <- transform(pdata, y1 = rpospois(nn, lambda))
with(pdata, table(y1))
fit <- vglm(y1 ~ x2, pospoisson, data = pdata, trace = TRUE, crit = "coef")
coef(fit, matrix = TRUE)

powerlink Power Link Function

Description

Computes the power transformation, including its inverse and the first two derivatives.

Usage

powerlink(theta, power = 1, inverse = FALSE, deriv = 0,

short = TRUE, tag = FALSE)
powerlink 621

Arguments
theta Numeric or character. See below for further details.
power This denotes the power or exponent.
inverse, deriv, short, tag
Details at Links.

Details
The power link function raises a parameter by a certain value of power. Care is needed because it
is very easy to get numerical problems, e.g., if power=0.5 and theta is negative.

Value
For powerlink with deriv = 0, then theta raised to the power of power. And if inverse = TRUE
then theta raised to the power of 1/power.
For deriv = 1, then the function returns d theta / d eta as a function of theta if inverse = FALSE,
else if inverse = TRUE then it returns the reciprocal.

Note
Numerical problems may occur for certain combinations of theta and power. Consequently this
link function should be used with caution.

Author(s)
Thomas W. Yee

prats Pregnant Rats Toxological Experiment Data

Description

A small toxological experiment data. The subjects are fetuses from two randomized groups of
pregnant rats, and they were given a placebo or chemical treatment. The number with birth defects
were recorded, as well as each litter size.

Usage

data(prats)

Format

A data frame with the following variables.

treatment A 0 means control; a 1 means the chemical treatment.

alive, litter.size The number of fetuses alive at 21 days, out of the number of fetuses alive at 4 days
(the litter size).

Details

The data concerns a toxological experiment where the subjects are fetuses from two randomized
groups of 16 pregnant rats each, and they were given a placebo or chemical treatment. The number
with birth defects andn the litter size were recorded. Half the rats were fed a control diet during
pregnancy and lactation, and the diet of the other half was treated with a chemical. For each litter
the number of pups alive at 4 days and the number of pups that survived the 21 day lactation period,
were recorded.

Source

Weil, C. S. (1970) Selection of the valid number of sampling units and a consideration of their
combination in toxicological studies involving reproduction, teratogenesis or carcinogenesis. Food
and Cosmetics Toxicology, 8(2), 177182.

References

Williams, D. A. (1975) The Analysis of Binary Responses From Toxicological Experiments Involv-
ing Reproduction and Teratogenicity. Biometrics, 31(4), 949952.

See Also

betabinomial, betabinomialff.
predictqrrvglm 623

Examples
prats
colSums(subset(prats, treatment == 0))
colSums(subset(prats, treatment == 1))
summary(prats)

predictqrrvglm Predict Method for a CQO fit

Description
Predicted values based on a constrained quadratic ordination (CQO) object.

Usage
predictqrrvglm(object, newdata=NULL,
type = c("link", "response", "latvar", "terms"),
se.fit = FALSE, deriv = 0, dispersion = NULL,
extra = object@extra, varI.latvar = FALSE, refResponse = NULL, ...)

Arguments
object Object of class inheriting from "qrrvglm".
newdata An optional data frame in which to look for variables with which to predict. If
omitted, the fitted linear predictors are used.
type, se.fit, dispersion, extra
See predictvglm.
deriv Derivative. Currently only 0 is handled.
varI.latvar, refResponse
Arguments passed into Coef.qrrvglm.
... Currently undocumented.

Details
Obtains predictions from a fitted CQO object. Currently there are lots of limitations of this function;
it is unfinished.

Value
See predictvglm.

Note
This function is not robust and has not been checked fully.
624 predictvglm

Author(s)

T. W. Yee

References

Yee, T. W. (2004) A new technique for maximum-likelihood canonical Gaussian ordination. Eco-
logical Monographs, 74, 685701.

See Also

cqo.

Examples
hspider[,1:6] <- scale(hspider[,1:6]) # Standardize the environmental vars
set.seed(1234)
# vvv p1 <- cqo(cbind(Alopacce, Alopcune, Alopfabr, Arctlute,
# vvv Arctperi, Auloalbi, Pardlugu, Pardmont,
# vvv Pardnigr, Pardpull, Trocterr, Zoraspin) ~
# vvv WaterCon + BareSand + FallTwig + CoveMoss + CoveHerb + ReflLux,
# vvv poissonff, data = hspider, Crow1positive = FALSE, I.tol = TRUE)
# vvv sort(deviance(p1, history = TRUE)) # A history of all the iterations

# vvv head(predict(p1))

# The following should be all zeros

# vvv max(abs(predict(p1, new = head(hspider)) - head(predict(p1))))
# vvv max(abs(predict(p1, new = head(hspider), type = "res")-head(fitted(p1))))

predictvglm Predict Method for a VGLM fit

Description

Predicted values based on a vector generalized linear model (VGLM) object.

Usage

predictvglm(object, newdata = NULL,

type = c("link", "response", "terms"),
se.fit = FALSE, deriv = 0, dispersion = NULL,
untransform = FALSE,
type.fitted = NULL, percentiles = NULL, ...)
predictvglm 625

Arguments
object Object of class inheriting from "vlm", e.g., vglm.
newdata An optional data frame in which to look for variables with which to predict. If
omitted, the fitted linear predictors are used.
type The value of this argument can be abbreviated. The type of prediction required.
The default is the first one, meaning on the scale of the linear predictors. This
should be a n M matrix.
The alternative "response" is on the scale of the response variable, and de-
pending on the family function, this may or may not be the mean. Often this
is the fitted value, e.g., fitted(vglmObject) (see fittedvlm). Note that the
response is output from the @linkinv slot, where the eta argument is the nM
matrix of linear predictors.
The "terms" option returns a matrix giving the fitted values of each term in the
model formula on the linear predictor scale. The terms have been centered.
se.fit logical: return standard errors?
deriv Non-negative integer. Currently this must be zero. Later, this may be imple-
mented for general values.
dispersion Dispersion parameter. This may be inputted at this stage, but the default is to
use the dispersion parameter of the fitted model.
type.fitted Some VGAM family functions have an argument by the same name. If so, then
one can obtain fitted values by setting type = "response" and choosing a
value of type.fitted from whats available. If type.fitted = "quantiles"
is available then the percentiles argument can be used to specify what quantile
values are requested.
percentiles Used only if type.fitted = "quantiles" is available and is selected.
untransform Logical. Reverses any parameter link function. This argument only works if
type = "link", se.fit = FALSE, deriv = 0. Setting untransform = TRUE
does not work for all VGAM family functions; only ones where there is a one-
to-one correspondence between a simple link function and a simple parameter
might work.
... Arguments passed into predictvlm.

Details
Obtains predictions and optionally estimates standard errors of those predictions from a fitted vglm
object.
This code implements smart prediction (see smartpred).

Value
If se.fit = FALSE, a vector or matrix of predictions. If se.fit = TRUE, a list with components
fitted.values Predictions
se.fit Estimated standard errors
df Degrees of freedom
sigma The square root of the dispersion parameter
626 predictvglm

Warning
This function may change in the future.

Note
Setting se.fit = TRUE and type = "response" will generate an error.
The arguments type.fitted and percentiles are provided in this function to give more conve-
nience than modifying the extra slot directly.

Author(s)
Thomas W. Yee

References
Yee, T. W. and Hastie, T. J. (2003) Reduced-rank vector generalized linear models. Statistical
Modelling, 3, 1541.

See Also
predict, vglm, predictvlm, smartpred.

Examples
# Illustrates smart prediction
pneumo <- transform(pneumo, let = log(exposure.time))
fit <- vglm(cbind(normal, mild, severe) ~ poly(c(scale(let)), 2),
propodds, data = pneumo, trace = TRUE, x.arg = FALSE)
class(fit)

(q0 <- head(predict(fit)))

(q1 <- predict(fit, newdata = head(pneumo)))
(q2 <- predict(fit, newdata = head(pneumo)))
all.equal(q0, q1) # Should be TRUE
all.equal(q1, q2) # Should be TRUE

head(predict(fit))
head(predict(fit, untransform = TRUE))

p0 <- head(predict(fit, type = "response"))

p1 <- head(predict(fit, type = "response", newdata = pneumo))
p2 <- head(predict(fit, type = "response", newdata = pneumo))
p3 <- head(fitted(fit))
all.equal(p0, p1) # Should be TRUE
all.equal(p1, p2) # Should be TRUE
all.equal(p2, p3) # Should be TRUE

predict(fit, type = "terms", se = TRUE)

prentice74 627

prentice74 Prentice (1974) Log-gamma Distribution

Description
Estimation of a 3-parameter log-gamma distribution described by Prentice (1974).

Usage
prentice74(llocation = "identitylink", lscale = "loge", lshape = "identitylink",
ilocation = NULL, iscale = NULL, ishape = NULL,
zero = c("scale", "shape"))

Arguments
llocation, lscale, lshape
Parameter link function applied to the location parameter a, positive scale pa-
rameter b and the shape parameter q, respectively. See Links for more choices.
ilocation, iscale
Initial value for a and b, respectively. The defaults mean an initial value is
determined internally for each.
ishape Initial value for q. If failure to converge occurs, try some other value. The
default means an initial value is determined internally.
zero Can be an integer-valued vector specifying which linear/additive predictors are
modelled as intercepts-only. Then the values must be from the set {1,2,3}. See
CommonVGAMffArguments for more information.

Details
The probability density function is given by

f (y; a, b, q) = |q| exp(w/q 2 ew )/(b (1/q 2 )),

for shape parameter q 6= 0, positive scale parameter b > 0, location parameter a, and all real y.
Here, w = (y a)q/b + (1/q 2 ) where is the digamma function, digamma. The mean of Y is a
(returned as the fitted values). This is a different parameterization compared to lgamma3.
Special cases: q = 0 is the normal distribution with standard deviation b, q = 1 is the extreme
value distribution for maximums, q = 1 is the extreme value distribution for minima (Weibull). If
q > 0 then the distribution is left skew, else q < 0 is right skew.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, and vgam.
628 prinia

Warning
The special case q = 0 is not handled, therefore estimates of q too close to zero may cause numerical
problems.

Note
The notation used here differs from Prentice (1974): = a, = b. Fisher scoring is used.

Author(s)
T. W. Yee

References
Prentice, R. L. (1974) A log gamma model and its maximum likelihood estimation. Biometrika, 61,
539544.

See Also
lgamma3, lgamma, gengamma.stacy.

Examples
pdata <- data.frame(x2 = runif(nn <- 1000))
pdata <- transform(pdata, loc = -1 + 2*x2, Scale = exp(1))
pdata <- transform(pdata, y = rlgamma(nn, loc = loc, scale = Scale, shape = 1))
fit <- vglm(y ~ x2, prentice74(zero = 2:3), data = pdata, trace = TRUE)
coef(fit, matrix = TRUE) # Note the coefficients for location

prinia Yellow-bellied Prinia

Description
A data frame with yellow-bellied Prinia.

Usage
data(prinia)

Format
A data frame with 151 observations on the following 23 variables.

length a numeric vector, the scaled wing length (zero mean and unit variance).
fat a numeric vector, fat index; originally 1 (no fat) to 4 (very fat) but converted to 0 (no fat) versus
1 otherwise.
cap a numeric vector, number of times the bird was captured or recaptured.
probit 629

noncap a numeric vector, number of times the bird was not captured.
y01, y02, y03, y04, y05, y06 a numeric vector of 0s and 1s; for noncapture and capture resp.
y07, y08, y09, y10, y11, y12 same as above.
y13, y14, y15, y16, y17, y18, y19 same as above.

Details
The yellow-bellied Prinia Prinia flaviventris is a common bird species located in Southeast Asia. A
capturerecapture experiment was conducted at the Mai Po Nature Reserve in Hong Kong during
1991, where captured individuals had their wing lengths measured and fat index recorded. A total
of 19 weekly capture occasions were considered, where 151 distinct birds were captured.
More generally, the prinias are a genus of small insectivorous birds, and are sometimes referred
to as wren-warblers. They are a little-known group of the tropical and subtropical Old World, the
roughly 30 species being divided fairly equally between Africa and Asia.

Source
Thanks to Paul Yip for permission to make this data available.
Hwang, W.-H. and Huggins, R. M. (2007) Application of semiparametric regression models in the
analysis of capturerecapture experiments. Australian and New Zealand Journal of Statistics 49,
191202.

Examples
head(prinia)
summary(prinia)
rowSums(prinia[, c("cap", "noncap")]) # 19s

# Fit a positive-binomial distribution (M.h) to the data:

fit1 <- vglm(cbind(cap, noncap) ~ length + fat, posbinomial, data = prinia)

# Fit another positive-binomial distribution (M.h) to the data:

# The response input is suitable for posbernoulli.*-type functions.
fit2 <- vglm(cbind(y01, y02, y03, y04, y05, y06, y07, y08, y09, y10,
y11, y12, y13, y14, y15, y16, y17, y18, y19) ~
length + fat, posbernoulli.b(drop.b = FALSE ~ 0), data = prinia)

probit Probit Link Function

Description
Computes the probit transformation, including its inverse and the first two derivatives.

Usage
probit(theta, bvalue = NULL, inverse = FALSE, deriv = 0,
short = TRUE, tag = FALSE)
630 probit

Arguments
theta Numeric or character. See below for further details.
bvalue See Links.
inverse, deriv, short, tag
Details at Links.

Details
The probit link function is commonly used for parameters that lie in the unit interval. Numerical
values of theta close to 0 or 1 or out of range result in Inf, -Inf, NA or NaN.

Value
For deriv = 0, the probit of theta, i.e., qnorm(theta) when inverse = FALSE, and if inverse =
TRUE then pnorm(theta).
For deriv = 1, then the function returns d eta / d theta as a function of theta if inverse = FALSE,
else if inverse = TRUE then it returns the reciprocal.

Note
Numerical instability may occur when theta is close to 1 or 0. One way of overcoming this is to
use bvalue.
In terms of the threshold approach with cumulative probabilities for an ordinal response this link
function corresponds to the univariate normal distribution (see uninormal).

Author(s)
Thomas W. Yee

References
McCullagh, P. and Nelder, J. A. (1989) Generalized Linear Models, 2nd ed. London: Chapman &
Hall.

See Also
Links, logit, cloglog, cauchit.

Examples
p <- seq(0.01, 0.99, by = 0.01)
probit(p)
max(abs(probit(probit(p), inverse = TRUE) - p)) # Should be 0

p <- c(seq(-0.02, 0.02, by = 0.01), seq(0.97, 1.02, by = 0.01))

probit(p) # Has NAs
probit(p, bvalue = .Machine$double.eps) # Has no NAs

## Not run: p <- seq(0.01, 0.99, by = 0.01); par(lwd = (mylwd <- 2))
propodds 631

plot(p, logit(p), type = "l", col = "limegreen", ylab = "transformation",

las = 1, main = "Some probability link functions")
lines(p, probit(p), col = "purple")
lines(p, cloglog(p), col = "chocolate")
lines(p, cauchit(p), col = "tan")
abline(v = 0.5, h = 0, lty = "dashed")
legend(0.1, 4.0, c("logit", "probit", "cloglog", "cauchit"),
col = c("limegreen", "purple", "chocolate", "tan"), lwd = mylwd)
par(lwd = 1)
## End(Not run)

propodds Proportional Odds Model for Ordinal Regression

Description
Fits the proportional odds model to a (preferably ordered) factor response.

Usage
propodds(reverse = TRUE, whitespace = FALSE)

Arguments
reverse, whitespace
Logical. Fed into arguments of the same name in cumulative.

Details
The proportional odds model is a special case from the class of cumulative link models. It involves
a logit link applied to cumulative probabilities and a strong parallelism assumption. A parallelism
assumption means there is less chance of numerical problems because the fitted probabilities will
remain between 0 and 1; however the parallelism assumption ought to be checked, e.g., via a likeli-
hood ratio test. This VGAM family function is merely a shortcut for cumulative(reverse = reverse, link = "logit", p
Please see cumulative for more details on this model.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, and vgam.

Warning
No check is made to verify that the response is ordinal if the response is a matrix; see ordered.

Author(s)
Thomas W. Yee
632 prplot

References
Agresti, A. (2010) Analysis of Ordinal Categorical Data, 2nd ed. Hoboken, NJ, USA: Wiley.
Yee, T. W. (2010) The VGAM package for categorical data analysis. Journal of Statistical Software,
32, 134. http://www.jstatsoft.org/v32/i10/.
Yee, T. W. and Wild, C. J. (1996) Vector generalized additive models. Journal of the Royal Statisti-
cal Society, Series B, Methodological, 58, 481493.

# Check that the model is linear in let ----------------------

fit2 <- vgam(cbind(normal, mild, severe) ~ s(let, df = 2), propodds, data = pneumo)
## Not run: plot(fit2, se = TRUE, lcol = 2, scol = 2)

# Check the proportional odds assumption with a LRT ----------

(fit3 <- vglm(cbind(normal, mild, severe) ~ let,
cumulative(parallel = FALSE, reverse = TRUE), data = pneumo))
pchisq(deviance(fit) - deviance(fit3),
df = df.residual(fit) - df.residual(fit3), lower.tail = FALSE)
lrtest(fit3, fit) # Easier

prplot Probability Plots for Categorical Data Analysis

Description
Plots the fitted probabilities for some very simplified special cases of categorical data analysis mod-
els.

Usage
prplot(object, control = prplot.control(...), ...)

prplot.control(xlab = NULL, ylab = "Probability", main = NULL, xlim = NULL,

ylim = NULL, lty = par()$lty, col = par()$col, rcol = par()$col,
lwd = par()$lwd, rlwd = par()$lwd, las = par()$las, rug.arg = FALSE, ...)
prplot 633

Arguments

object Currently only an cumulative object. This includes a propodds object since
that VGAM family function is a special case of cumulative.
control List containing some basic graphical parameters.
xlab, ylab, main, xlim, ylim, lty
See par and ... below.
col, rcol, lwd, rlwd, las, rug.arg
See par and ... below. Arguments starting with r refer to the rug. Argument
rug.arg is logical: add a rug for the distinct values of the explanatory variable?
... Arguments such as xlab which are fed into prplot.control(). Only a small
selection of graphical arguments from par are offered.

Details

For models involving one term in the RHS of the formula this function plots the fitted probabilities
against the single explanatory variable.

Value

The object is returned invisibly with the preplot slot assigned. This is obtained by a call to
plotvgam().

Note

This function is rudimentary.

See Also

cumulative.

Examples

pneumo <- transform(pneumo, let = log(exposure.time))

fit <- vglm(cbind(normal, mild, severe) ~ let, propodds, data = pneumo)
M <- npred(fit) # Or fit@misc$M
## Not run: prplot(fit)
prplot(fit, lty = 1:M, col = (1:M)+2, rug = TRUE, las = 1,
ylim = c(0, 1), rlwd = 2)
## End(Not run)
634 put.smart

put.smart Adds a List to the End of the List .smart.prediction

Description

Adds a list to the end of the list .smart.prediction in smartpredenv.

Usage

put.smart(smart)

Arguments

smart a list containing parameters needed later for smart prediction.

Details

put.smart is used in "write" mode within a smart function. It saves parameters at the time
of model fitting, which are later used for prediction. The function put.smart is the opposite of
get.smart, and both deal with the same contents.

Value

Nothing is returned.

Side Effects

The variable .smart.prediction.counter in smartpredenv is incremented beforehand, and .smart.prediction[[.smar

is assigned the list smart. If the list .smart.prediction in smartpredenv is not long enough to
hold smart, then it is made larger, and the variable .max.smart in smartpredenv is adjusted ac-
cordingly.

See Also

get.smart.

Examples

print(sm.min1)
qrrvglm.control 635

qrrvglm.control Control Function for QRR-VGLMs (CQO)

Description
Algorithmic constants and parameters for a constrained quadratic ordination (CQO), by fitting a
quadratic reduced-rank vector generalized linear model (QRR-VGLM), are set using this function.
It is the control function for cqo.

Usage
qrrvglm.control(Rank = 1,
Bestof = if (length(Cinit)) 1 else 10,
checkwz = TRUE,
Cinit = NULL,
Crow1positive = TRUE,
epsilon = 1.0e-06,
EqualTolerances = NULL,
eq.tolerances = TRUE,
Etamat.colmax = 10,
FastAlgorithm = TRUE,
GradientFunction = TRUE,
Hstep = 0.001,
isd.latvar = rep_len(c(2, 1, rep_len(0.5, Rank)), Rank),
iKvector = 0.1,
iShape = 0.1,
ITolerances = NULL,
I.tolerances = FALSE,
maxitl = 40,
imethod = 1,
Maxit.optim = 250,
MUXfactor = rep_len(7, Rank),
noRRR = ~ 1, Norrr = NA,
optim.maxit = 20,
Parscale = if (I.tolerances) 0.001 else 1.0,
sd.Cinit = 0.02,
SmallNo = 5.0e-13,
trace = TRUE,
Use.Init.Poisson.QO = TRUE,
wzepsilon = .Machine$double.eps^0.75, ...)

Arguments
In the following, R is the Rank, M is the number of linear predictors, and S
is the number of responses (species). Thus M = S for binomial and Poisson
responses, and M = 2S for the negative binomial and 2-parameter gamma
distributions.
636 qrrvglm.control

The numerical rank R of the model, i.e., the number of ordination axes. Must be
an element from the set {1,2,. . . ,min(M ,p2 )} where the vector of explanatory
variables x is partitioned into (x1 ,x2 ), which is of dimension p1 + p2 . The
variables making up x1 are given by the terms in the noRRR argument, and the
rest of the terms comprise x2 .
Rank
Bestof Integer. The best of Bestof models fitted is returned. This argument helps guard
against local solutions by (hopefully) finding the global solution from many fits.
The argument has value 1 if an initial value for C is inputted using Cinit.
checkwz logical indicating whether the diagonal elements of the working weight matri-
ces should be checked whether they are sufficiently positive, i.e., greater than
wzepsilon. If not, any values less than wzepsilon are replaced with this value.
Cinit Optional initial C matrix, which must be a p2 by R matrix. The default is to
apply .Init.Poisson.QO() to obtain initial values.
Crow1positive Logical vector of length Rank (recycled if necessary): are the elements of the
first row of C positive? For example, if Rank is 4, then specifying Crow1positive = c(FALSE, TRUE
will force C[1, 1] and C[1, 3] to be negative, and C[1, 2] and C[1, 4] to be pos-
itive. This argument allows for a reflection in the ordination axes because the
coefficients of the latent variables are unique up to a sign.
epsilon Positive numeric. Used to test for convergence for GLMs fitted in C. Larger
values mean a loosening of the convergence criterion. If an error code of 3 is
reported, try increasing this value.
eq.tolerances Logical indicating whether each (quadratic) predictor will have equal tolerances.
Having eq.tolerances = TRUE can help avoid numerical problems, espe-
cially with binary data. Note that the estimated (common) tolerance matrix
may or may not be positive-definite. If it is then it can be scaled to the R
by R identity matrix, i.e., made equivalent to I.tolerances = TRUE. Set-
ting I.tolerances = TRUE will force a common R by R identity matrix as
the tolerance matrix to the data even if it is not appropriate. In general, setting
I.tolerances = TRUE is preferred over eq.tolerances = TRUE because, if it
works, it is much faster and uses less memory. However, I.tolerances = TRUE
requires the environmental variables to be scaled appropriately. See Details for
more details.
EqualTolerances
Defunct argument. Use eq.tolerances instead.
Etamat.colmax Positive integer, no smaller than Rank. Controls the amount of memory used by
.Init.Poisson.QO(). It is the maximum number of columns allowed for the
pseudo-response and its weights. In general, the larger the value, the better the
initial value. Used only if Use.Init.Poisson.QO = TRUE.
FastAlgorithm Logical. Whether a new fast algorithm is to be used. The fast algorithm results
in a large speed increases compared to Yee (2004). Some details of the fast algo-
rithm are found in Appendix A of Yee (2006). Setting FastAlgorithm = FALSE
will give an error.
GradientFunction
Logical. Whether optims argument gr is used or not, i.e., to compute gradient
values. Used only if FastAlgorithm is TRUE. The default value is usually faster
on most problems.
qrrvglm.control 637

Hstep Positive value. Used as the step size in the finite difference approximation to the
derivatives by optim.
isd.latvar Initial standard deviations for the latent variables (site scores). Numeric, pos-
itive and of length R (recycled if necessary). This argument is used only if
I.tolerances = TRUE. Used by .Init.Poisson.QO() to obtain initial values
for the constrained coefficients C adjusted to a reasonable value. It adjusts the
spread of the site scores relative to a common species tolerance of 1 for each
ordination axis. A value between 0.5 and 10 is recommended; a value such as
10 means that the range of the environmental space is very large relative to the
niche width of the species. The successive values should decrease because the
first ordination axis should have the most spread of site scores, followed by the
second ordination axis, etc.
iKvector, iShape
Numeric, recycled to length S if necessary. Initial values used for estimat-
ing the positive k and parameters of the negative binomial and 2-parameter
gamma distributions respectively. For further information see negbinomial and
gamma2. These arguments override the ik and ishape arguments in negbinomial
and gamma2.
I.tolerances Logical. If TRUE then the (common) tolerance matrix is the R by R iden-
tity matrix by definition. Note that having I.tolerances = TRUE implies
eq.tolerances = TRUE, but not vice versa. Internally, the quadratic terms
will be treated as offsets (in GLM jargon) and so the models can potentially be
fitted very efficiently. However, it is a very good idea to center and scale all
numerical variables in the x2 vector. See Details for more details. The success
of I.tolerances = TRUE often depends on suitable values for isd.latvar
and/or MUXfactor.
ITolerances Defunct argument. Use I.tolerances instead.
maxitl Maximum number of times the optimizer is called or restarted. Most users
should ignore this argument.
imethod Method of initialization. A positive integer 1 or 2 or 3 etc. depending on the
VGAM family function. Currently it is used for negbinomial and gamma2 only,
and used within the C.
Maxit.optim Positive integer. Number of iterations given to the function optim at each of the
optim.maxit iterations.
MUXfactor Multiplication factor for detecting large offset values. Numeric, positive and of
length R (recycled if necessary). This argument is used only if I.tolerances = TRUE.
Offsets are 0.5 multiplied by the sum of the squares of all R latent variable val-
ues. If the latent variable values are too large then this will result in numerical
problems. By too large, it is meant that the standard deviation of the latent vari-
able values are greater than MUXfactor[r] * isd.latvar[r] for r=1:Rank
(this is why centering and scaling all the numerical predictor variables in x2 is
recommended). A value about 3 or 4 is recommended. If failure to converge
occurs, try a slightly lower value.
optim.maxit Positive integer. Number of times optim is invoked. At iteration i, the ith value
of Maxit.optim is fed into optim.
638 qrrvglm.control

noRRR Formula giving terms that are not to be included in the reduced-rank regression
(or formation of the latent variables), i.e., those belong to x1 . Those variables
which do not make up the latent variable (reduced-rank regression) correspond
to the B1 matrix. The default is to omit the intercept term from the latent vari-
ables.
Norrr Defunct. Please use noRRR. Use of Norrr will become an error soon.
Parscale Numerical and positive-valued vector of length C (recycled if necessary). Passed
into optim(..., control = list(parscale = Parscale)); the elements
of C become C / Parscale. Setting I.tolerances = TRUE results in line
searches that are very large, therefore C has to be scaled accordingly to avoid
large step sizes. See Details for more information. Its probably best to leave
this argument alone.
sd.Cinit Standard deviation of the initial values for the elements of C. These are normally
distributed with mean zero. This argument is used only if Use.Init.Poisson.QO = FALSE
and C is not inputted using Cinit.
trace Logical indicating if output should be produced for each iteration. The de-
fault is TRUE because the calculations are numerically intensive, meaning it may
take a long time, so that the user might think the computer has locked up if
trace = FALSE.
SmallNo Positive numeric between .Machine$double.eps and 0.0001. Used to avoid
under- or over-flow in the IRLS algorithm. Used only if FastAlgorithm is
TRUE.
Use.Init.Poisson.QO
Logical. If TRUE then the function .Init.Poisson.QO() is used to obtain initial
values for the canonical coefficients C. If FALSE then random numbers are used
instead.
wzepsilon Small positive number used to test whether the diagonals of the working weight
matrices are sufficiently positive.
... Ignored at present.

Details
Recall that the central formula for CQO is
M
X
= B1T x1 + A + ( T Dm )em
m=1

where x1 is a vector (usually just a 1 for an intercept), x2 is a vector of environmental variables,

= C T x2 is a R-vector of latent variables, em is a vector of 0s but with a 1 in the mth position.
QRR-VGLMs are an extension of RR-VGLMs and allow for maximum likelihood solutions to
constrained quadratic ordination (CQO) models.
Having I.tolerances = TRUE means all the tolerance matrices are the order-R identity matrix,
i.e., it forces bell-shaped curves/surfaces on all species. This results in a more difficult optimiza-
tion problem (especially for 2-parameter models such as the negative binomial and gamma) be-
cause of overflow errors and it appears there are more local solutions. To help avoid the over-
flow errors, scaling C by the factor Parscale can help enormously. Even better, scaling C by
qrrvglm.control 639

specifying isd.latvar is more understandable to humans. If failure to converge occurs, try ad-
justing Parscale, or better, setting eq.tolerances = TRUE (and hope that the estimated toler-
ance matrix is positive-definite). To fit an equal-tolerances model, it is firstly best to try setting
I.tolerances = TRUE and varying isd.latvar and/or MUXfactor if it fails to converge. If it still
fails to converge after many attempts, try setting eq.tolerances = TRUE, however this will usually
be a lot slower because it requires a lot more memory.
With a R > 1 model, the latent variables are always uncorrelated, i.e., the variance-covariance
matrix of the site scores is a diagonal matrix.
If setting eq.tolerances = TRUE is used and the common estimated tolerance matrix is positive-
definite then that model is effectively the same as the I.tolerances = TRUE model (the two are
transformations of each other). In general, I.tolerances = TRUE is numerically more unstable and
presents a more difficult problem to optimize; the arguments isd.latvar and/or MUXfactor often
must be assigned some good value(s) (possibly found by trial and error) in order for convergence
to occur. Setting I.tolerances = TRUE forces a bell-shaped curve or surface onto all the species
data, therefore this option should be used with deliberation. If unsuitable, the resulting fit may be
very misleading. Usually it is a good idea for the user to set eq.tolerances = FALSE to see which
species appear to have a bell-shaped curve or surface. Improvements to the fit can often be achieved
using transformations, e.g., nitrogen concentration to log nitrogen concentration.
Fitting a CAO model (see cao) first is a good idea for pre-examining the data and checking whether
it is appropriate to fit a CQO model.

Value
A list with components matching the input names.

Warning
The default value of Bestof is a bare minimum for many datasets, therefore it will be necessary to
increase its value to increase the chances of obtaining the global solution.

Note
When I.tolerances = TRUE it is a good idea to apply scale to all the numerical variables that
make up the latent variable, i.e., those of x2 . This is to make them have mean 0, and hence avoid
large offset values which cause numerical problems.
This function has many arguments that are common with rrvglm.control and vglm.control.
It is usually a good idea to try fitting a model with I.tolerances = TRUE first, and if convergence is
unsuccessful, then try eq.tolerances = TRUE and I.tolerances = FALSE. Ordination diagrams
with eq.tolerances = TRUE have a natural interpretation, but with eq.tolerances = FALSE
they are more complicated and requires, e.g., contours to be overlaid on the ordination diagram (see
lvplot.qrrvglm).
In the example below, an equal-tolerances CQO model is fitted to the hunting spiders data. Because
I.tolerances = TRUE, it is a good idea to center all the x2 variables first. Upon fitting the model,
the actual standard deviation of the site scores are computed. Ideally, the isd.latvar argument
should have had this value for the best chances of getting good initial values. For comparison, the
model is refitted with that value and it should run more faster and reliably.
640 qtplot.gumbel

Author(s)

Thomas W. Yee

References

Yee, T. W. (2004) A new technique for maximum-likelihood canonical Gaussian ordination. Eco-
logical Monographs, 74, 685701.
Yee, T. W. (2006) Constrained additive ordination. Ecology, 87, 203213.

See Also

cqo, rcqo, Coef.qrrvglm, Coef.qrrvglm-class, optim, binomialff, poissonff, negbinomial,

gamma2, gaussianff.

Examples
## Not run: # Poisson CQO with equal tolerances
set.seed(111) # This leads to the global solution
hspider[,1:6] <- scale(hspider[,1:6]) # Good idea when I.tolerances = TRUE
p1 <- cqo(cbind(Alopacce, Alopcune, Alopfabr, Arctlute, Arctperi, Auloalbi,
Pardlugu, Pardmont, Pardnigr, Pardpull, Trocterr, Zoraspin) ~
WaterCon + BareSand + FallTwig + CoveMoss + CoveHerb + ReflLux,
quasipoissonff, data = hspider, eq.tolerances = TRUE)
sort(deviance(p1, history = TRUE)) # A history of all the iterations

(isd.latvar <- apply(latvar(p1), 2, sd)) # Should be approx isd.latvar

# Refit the model with better initial values

set.seed(111) # This leads to the global solution
p1 <- cqo(cbind(Alopacce, Alopcune, Alopfabr, Arctlute, Arctperi, Auloalbi,
Pardlugu, Pardmont, Pardnigr, Pardpull, Trocterr, Zoraspin) ~
WaterCon + BareSand + FallTwig + CoveMoss + CoveHerb + ReflLux,
I.tolerances = TRUE, quasipoissonff, data = hspider,
isd.latvar = isd.latvar) # Note the use of isd.latvar here
sort(deviance(p1, history = TRUE)) # A history of all the iterations

## End(Not run)

qtplot.gumbel Quantile Plot for Gumbel Regression

Description

Plots quantiles associated with a Gumbel model.

qtplot.gumbel 641

Usage
qtplot.gumbel(object, show.plot = TRUE,
y.arg = TRUE, spline.fit = FALSE, label = TRUE,
R = object@misc$R, percentiles = object@misc$percentiles,
add.arg = FALSE, mpv = object@misc$mpv,
xlab = NULL, ylab = "", main = "",
pch = par()$pch, pcol.arg = par()$col,
llty.arg = par()$lty, lcol.arg = par()$col, llwd.arg = par()$lwd,
tcol.arg = par()$col, tadj = 1, ...)

Arguments
object A VGAM extremes model of the Gumbel type, produced by modelling func-
tions such as vglm and vgam, and with a family function that is either gumbel or
gumbelff.
show.plot Logical. Plot it? If FALSE no plot will be done.
y.arg Logical. Add the raw data on to the plot?
spline.fit Logical. Use a spline fit through the fitted percentiles? This can be useful if
there are large gaps between some values along the covariate.
label Logical. Label the percentiles?
R See gumbel.
percentiles See gumbel.
add.arg Logical. Add the plot to an existing plot?
mpv See gumbel.
xlab Caption for the x-axis. See par.
ylab Caption for the y-axis. See par.
main Title of the plot. See title.
pch Plotting character. See par.
pcol.arg Color of the points. See the col argument of par.
llty.arg Line type. Line type. See the lty argument of par.
lcol.arg Color of the lines. See the col argument of par.
llwd.arg Line width. See the lwd argument of par.
tcol.arg Color of the text (if label is TRUE). See the col argument of par.
tadj Text justification. See the adj argument of par.
... Arguments passed into the plot function when setting up the entire plot. Useful
arguments here include sub and las.

Details
There should be a single covariate such as time. The quantiles specified by percentiles are plotted.
642 qtplot.lmscreg

Value
The object with a list called qtplot in the post slot of object. (If show.plot = FALSE then just
the list is returned.) The list contains components

fitted.values The percentiles of the response, possibly including the MPV.

percentiles The percentiles (small vector of values between 0 and 100.

Note
Unlike gumbel, one cannot have percentiles = NULL.

Author(s)
Thomas W. Yee

qtplot(fit1, perc = 97, mpv = FALSE, lcol = 3, tcol = 3,

lwd = 2, tadj = 0.4, add = TRUE) -> saved
head(saved@post$qtplot$fitted)

## End(Not run)

qtplot.lmscreg Quantile Plot for LMS Quantile Regression

Description
Plots quantiles associated with a LMS quantile regression.

Usage
qtplot.lmscreg(object, newdata = NULL,
percentiles = object@misc$percentiles,
show.plot = TRUE, ...)
qtplot.lmscreg 643

Arguments
object A VGAM quantile regression model, i.e., an object produced by modelling func-
tions such as vglm and vgam with a family function beginning with "lms.", e.g.,
lms.yjn.
newdata Optional data frame for computing the quantiles. If missing, the original data is
used.
percentiles Numerical vector with values between 0 and 100 that specify the percentiles
(quantiles). The default are the percentiles used when the model was fitted.
show.plot Logical. Plot it? If FALSE no plot will be done.
... Graphical parameter that are passed into plotqtplot.lmscreg.

Details
The primary variable is defined as the main covariate upon which the regression or smoothing is
performed. For example, in medical studies, it is often the age. In VGAM, it is possible to handle
more than one covariate, however, the primary variable must be the first term after the intercept.

Value
A list with the following components.
fitted.values A vector of fitted percentile values.
percentiles The percentiles used.

Note
plotqtplot.lmscreg does the actual plotting.

Author(s)
Thomas W. Yee

References
Yee, T. W. (2004) Quantile regression via vector generalized additive models. Statistics in Medicine,
23, 22952315.

See Also
plotqtplot.lmscreg, deplot.lmscreg, lms.bcn, lms.bcg, lms.yjn.

Examples
## Not run:
fit <- vgam(BMI ~ s(age, df = c(4, 2)), lms.bcn(zero=1), data = bmi.nz)
qtplot(fit)
qtplot(fit, perc = c(25, 50, 75, 95), lcol = "blue", tcol = "blue", llwd = 2)

## End(Not run)
644 quasibinomialff

quasibinomialff Quasi-Binomial Family Function

Description
Family function for fitting generalized linear models to binomial responses, where the dispersion
parameters are unknown.

Usage
quasibinomialff(link = "logit", multiple.responses = FALSE,
onedpar = !multiple.responses, parallel = FALSE, zero = NULL)

Arguments
link Link function. See Links for more choices.
multiple.responses
Multiple responses? If TRUE, then the response is interpreted as M binary re-
sponses, where M is the number of columns of the response matrix. In this case,
the response matrix should have zero/one values only.
If FALSE and the response is a (2-column) matrix, then the number of successes
is given in the first column and the second column is the number of failures.
onedpar One dispersion parameter? If multiple.responses, then a separate dispersion
parameter will be computed for each response (column), by default. Setting
onedpar=TRUE will pool them so that there is only one dispersion parameter to
be estimated.
parallel A logical or formula. Used only if multiple.responses is TRUE. This argument
allows for the parallelism assumption whereby the regression coefficients for a
variable is constrained to be equal over the M linear/additive predictors.
zero Can be an integer-valued vector specifying which linear/additive predictors are
modelled as intercepts only. The values must be from the set {1,2,. . . ,M }, where
M is the number of columns of the matrix response. See CommonVGAMffArguments
for more information.

Details
The final model is not fully estimated by maximum likelihood since the dispersion parameter is
unknown (see pp.1248 of McCullagh and Nelder (1989) for more details).
A dispersion parameter that is less/greater than unity corresponds to under-/over-dispersion relative
to the binomial model. Over-dispersion is more common in practice.
Setting multiple.responses=TRUE is necessary when fitting a Quadratic RR-VGLM (see cqo)
because the response will be a matrix of M columns (e.g., one column per species). Then there will
be M dispersion parameters (one per column of the response).
quasibinomialff 645

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, vgam, rrvglm, cqo, and cao.

Warning
The log-likelihood pertaining to the ordinary family is used to test for convergence during estima-
tion, and is printed out in the summary.

Note
If multiple.responses is FALSE (the default), then the response can be of one of three formats:
a factor (first level taken as success), a vector of proportions of success, or a 2-column matrix
(first column = successes) of counts. The argument weights in the modelling function can also
be specified. In particular, for a general vector of proportions, you will need to specify weights
because the number of trials is needed.
If multiple.responses is TRUE, then the matrix response can only be of one format: a matrix of
1s and 0s (1=success).
This function is only a front-end to the VGAM family function binomialff(); indeed, quasibinomialff(...)
is equivalent to binomialff(..., dispersion=0). Here, the argument dispersion=0 signifies
that the dispersion parameter is to be estimated.
Regardless of whether the dispersion parameter is to be estimated or not, its value can be seen from
the output from the summary() of the object.

Author(s)
Thomas W. Yee

References
McCullagh, P. and Nelder, J. A. (1989) Generalized Linear Models, 2nd ed. London: Chapman &
Hall.

See Also
binomialff, rrvglm, cqo, cao, logit, probit, cloglog, cauchit, poissonff, quasipoissonff,
quasibinomial.

Examples
quasibinomialff()
quasibinomialff(link = "probit")

# Nonparametric logistic regression

hunua <- transform(hunua, a.5 = sqrt(altitude)) # Transformation of altitude
fit1 <- vglm(agaaus ~ poly(a.5, 2), quasibinomialff, hunua)
fit2 <- vgam(agaaus ~ s(a.5, df = 2), quasibinomialff, hunua)
## Not run:
plot(fit2, se = TRUE, llwd = 2, lcol = "orange", scol = "orange",
646 quasipoissonff

xlab = "sqrt(altitude)", ylim = c(-3, 1),

main = "GAM and quadratic GLM fitted to species data")
plotvgam(fit1, se = TRUE, lcol = "blue", scol = "blue", add = TRUE, llwd = 2)

## End(Not run)
fit1@misc$dispersion # dispersion parameter
logLik(fit1)

# Here, the dispersion parameter defaults to 1

fit0 <- vglm(agaaus ~ poly(a.5, 2), binomialff, hunua)
fit0@misc$dispersion # dispersion parameter

quasipoissonff Quasi-Poisson Family Function

Description
Fits a generalized linear model to a Poisson response, where the dispersion parameter is unknown.

Usage
quasipoissonff(link = "loge", onedpar = FALSE,
parallel = FALSE, zero = NULL)

Arguments
link Link function. See Links for more choices.
onedpar One dispersion parameter? If the response is a matrix, then a separate dispersion
parameter will be computed for each response (column), by default. Setting
onedpar=TRUE will pool them so that there is only one dispersion parameter to
be estimated.
parallel A logical or formula. Used only if the response is a matrix.
zero Can be an integer-valued vector specifying which linear/additive predictors are
modelled as intercepts only. The values must be from the set {1,2,. . . ,M }, where
M is the number of columns of the matrix response. See CommonVGAMffArguments
for more information.

Details
M defined above is the number of linear/additive predictors.
If the dispersion parameter is unknown, then the resulting estimate is not fully a maximum likeli-
hood estimate.
A dispersion parameter that is less/greater than unity corresponds to under-/over-dispersion relative
to the Poisson model. Over-dispersion is more common in practice.
When fitting a Quadratic RR-VGLM, the response is a matrix of M , say, columns (e.g., one column
per species). Then there will be M dispersion parameters (one per column of the response matrix).
quasipoissonff 647

Value

An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, vgam, rrvglm, cqo, and cao.

Warning

See the warning in quasibinomialff.

Note

This function will handle a matrix response automatically.

The call poissonff(dispersion = 0, ...) is equivalent to quasipoissonff(...). The latter
was written so that R users of quasipoisson() would only need to add a ff to the end of the
family function name.
Regardless of whether the dispersion parameter is to be estimated or not, its value can be seen from
the output from the summary() of the object.

Author(s)

Thomas W. Yee

References

McCullagh, P. and Nelder, J. A. (1989) Generalized Linear Models, 2nd ed. London: Chapman &
Hall.

See Also

poissonff, negbinomial, loge, rrvglm, cqo, cao, binomialff, quasibinomialff, quasipoisson.

Examples
quasipoissonff()

## Not run: n <- 200; p <- 5; S <- 5

mydata <- rcqo(n, p, S, fam = "poisson", eq.tol = FALSE)
myform <- attr(mydata, "formula")
p1 <- cqo(myform, fam = quasipoissonff, eq.tol = FALSE, data = mydata)
sort(deviance(p1, history = TRUE)) # A history of all the iterations
lvplot(p1, y = TRUE, lcol = 1:S, pch = 1:S, pcol = 1:S)
summary(p1) # The dispersion parameters are estimated

## End(Not run)
648 Qvar

Qvar Quasi-variances Preprocessing Function

Description
Takes a vglm fit or a variance-covariance matrix, and preprocesses it for rcim and uninormal so
that quasi-variances can be computed.

Usage
Qvar(object, factorname = NULL, which.linpred = 1,
coef.indices = NULL, labels = NULL,
dispersion = NULL, reference.name = "(reference)", estimates = NULL)

Arguments
object A "vglm" object or a variance-covariance matrix, e.g., vcov(vglm.object).
The former is preferred since it contains all the information needed. If a ma-
trix then factorname and/or coef.indices should be specified to identify the
factor.
which.linpred A single integer from the set 1:M. Specifies which linear predictor to use. Let
the value of which.linpred be called j. Then the factor should appear in that
linear predictor, hence the jth row of the constraint matrix corresponding to the
factor should have at least one nonzero value. Currently the jth row must have
exactly one nonzero value because programming it for more than one nonzero
value is difficult.
factorname Character. If the vglm object contains more than one factor as explanatory vari-
able then this argument should be the name of the factor of interest. If object
is a variance-covariance matrix then this argument should also be specified.
labels Character. Optional, for labelling the variance-covariance matrix.
dispersion Numeric. Optional, passed into vcov() with the same argument name.
reference.name Character. Label for for the reference level.
coef.indices Optional numeric vector of length at least 3 specifying the indices of the factor
from the variance-covariance matrix.
estimates an optional vector of estimated coefficients (redundant if object is a model).

Details
Suppose a factor with L levels is an explanatory variable in a regression model. By default, R treats
the first level as baseline so that its coefficient is set to zero. It estimates the other L 1 coeffi-
cients, and with its associated standard errors, this is the conventional output. From the complete
variance-covariance matrix one can compute L quasi-variances based on all pairwise difference of
the coefficients. They are based on an approximation, and can be treated as uncorrelated. In min-
imizing the relative (not absolute) errors it is not hard to see that the estimation involves a RCIM
(rcim) with an exponential link function (explink).
Qvar 649

If object is a model, then at least one of factorname or coef.indices must be non-NULL. The
value of coef.indices, if non-NULL, determines which rows and columns of the models variance-
covariance matrix to use. If coef.indices contains a zero, an extra row and column are included
at the indicated position, to represent the zero variances and covariances associated with a reference
level. If coef.indices is NULL, then factorname should be the name of a factor effect in the
model, and is used in order to extract the necessary variance-covariance estimates.
Quasi-variances were first implemented in R with qvcalc. This implementation draws heavily from
that.

Value
A L by L matrix whose i-j element is the logarithm of the variance of the ith coefficient minus the
jth coefficient, for all values of i and j. The diagonal elements are abitrary and are set to zero.
The matrix has an attribute that corresponds to the prior weight matrix; it is accessed by uninormal
and replaces the usual weights argument. of vglm. This weight matrix has ones on the off-diagonals
and some small positive number on the diagonals.

Warning
Negative quasi-variances may occur (one of them and only one), though they are rare in practice. If
so then numerical problems may occur. See qvcalc() for more information.

Note
This is an adaptation of qvcalc() in qvcalc. It should work for all vglm models with one linear
predictor, i.e., M = 1. For M > 1 the factor should appear only in one of the linear predictors.
It is important to set maxit to be larger than usual for rcim since convergence is slow. Upon
successful convergence the ith row effect and the ith column effect should be equal. A simple
computation involving the fitted and predicted values allows the quasi-variances to be extracted
(see example below).
A function to plot comparison intervals has not been written here.

Author(s)
T. W. Yee, based heavily on qvcalc() in qvcalc written by David Firth.

References
Firth, D. (2003) Overcoming the reference category problem in the presentation of statistical mod-
els. Sociological Methodology 33, 118.
Firth, D. and de Menezes, R. X. (2004) Quasi-variances. Biometrika 91, 6580.
Yee, T. W. and Hadi, A. F. (2014) Row-column interaction models, with an R implementation.
Computational Statistics, 29, 14271445.

See Also
rcim, vglm, qvar, uninormal, explink, qvcalc() in qvcalc, ships.
650 Qvar

Examples

# Example 1
data("ships", package = "MASS")

Shipmodel <- vglm(incidents ~ type + year + period,

quasipoissonff, offset = log(service),
# trace = TRUE, model = TRUE,
data = ships, subset = (service > 0))

# Easiest form of input

fit1 <- rcim(Qvar(Shipmodel, "type"), uninormal("explink"), maxit = 99)
qvar(fit1) # Easy method to get the quasi-variances
qvar(fit1, se = TRUE) # Easy method to get the quasi-standard errors

(quasiVar <- exp(diag(fitted(fit1))) / 2) # Version 1

(quasiVar <- diag(predict(fit1)[, c(TRUE, FALSE)]) / 2) # Version 2
(quasiSE <- sqrt(quasiVar))

# Another form of input

fit2 <- rcim(Qvar(Shipmodel, coef.ind = c(0, 2:5), reference.name = "typeA"),
uninormal("explink"), maxit = 99)
## Not run: qvplot(fit2, col = "green", lwd = 3, scol = "blue", slwd = 2, las = 1)

# The variance-covariance matrix is another form of input (not recommended)

fit3 <- rcim(Qvar(cbind(0, rbind(0, vcov(Shipmodel)[2:5, 2:5])),
labels = c("typeA", "typeB", "typeC", "typeD", "typeE"),
estimates = c(typeA = 0, coef(Shipmodel)[2:5])),
uninormal("explink"), maxit = 99)
(QuasiVar <- exp(diag(fitted(fit3))) / 2) # Version 1
(QuasiVar <- diag(predict(fit3)[, c(TRUE, FALSE)]) / 2) # Version 2
(QuasiSE <- sqrt(quasiVar))
## Not run: qvplot(fit3)

# Example 2: a model with M > 1 linear predictors

## Not run: require("VGAMdata")
xs.nz.f <- subset(xs.nz, sex == "F")
xs.nz.f <- subset(xs.nz.f, !is.na(babies) & !is.na(age) & !is.na(ethnicity))
xs.nz.f <- subset(xs.nz.f, ethnicity != "Other")

clist <- list("sm.bs(age, df = 4)" = rbind(1, 0),

"sm.bs(age, df = 3)" = rbind(0, 1),
"ethnicity" = diag(2),
"(Intercept)" = diag(2))
fit1 <- vglm(babies ~ sm.bs(age, df = 4) + sm.bs(age, df = 3) + ethnicity,
zipoissonff(zero = NULL), xs.nz.f,
constraints = clist, trace = TRUE)
Fit1 <- rcim(Qvar(fit1, "ethnicity", which.linpred = 1),
uninormal("explink", imethod = 1), maxit = 99, trace = TRUE)
Fit2 <- rcim(Qvar(fit1, "ethnicity", which.linpred = 2),
uninormal("explink", imethod = 1), maxit = 99, trace = TRUE)
qvar 651

## End(Not run)
## Not run: par(mfrow = c(1, 2))
qvplot(Fit1, scol = "blue", pch = 16, main = expression(eta[1]),
slwd = 1.5, las = 1, length.arrows = 0.07)
qvplot(Fit2, scol = "blue", pch = 16, main = expression(eta[2]),
slwd = 1.5, las = 1, length.arrows = 0.07)

## End(Not run)

qvar Quasi-variances Extraction Function

Description
Takes a rcim fit of the appropriate format and returns either the quasi-variances or quasi-standard
errors.

Usage
qvar(object, se = FALSE, ...)

Arguments
object A rcim object that has family function uninormal with the explink link. See
below for an example.
se Logical. If FALSE then the quasi-variances are returned, else the square root of
them, called quasi-standard errors.
... Currently unused.

Details
This simple function is ad hoc and simply is equivalent to computing the quasi-variances by diag(predict(fit1)[, c(TRUE,
This function is for convenience only. Serious users of quasi-variances ought to understand why
and how this function works.

Value
A vector of quasi-variances or quasi-standard errors.

Author(s)
T. W. Yee.

See Also
rcim, uninormal, explink, Qvar, ships.
652 Rayleigh

Examples
data("ships", package = "MASS")
Shipmodel <- vglm(incidents ~ type + year + period,
quasipoissonff, offset = log(service),
data = ships, subset = (service > 0))

# Easiest form of input

fit1 <- rcim(Qvar(Shipmodel, "type"), uninormal("explink"), maxit = 99)
qvar(fit1) # Quasi-variances
qvar(fit1, se = TRUE) # Quasi-standard errors

# Manually compute them:

(quasiVar <- exp(diag(fitted(fit1))) / 2) # Version 1
(quasiVar <- diag(predict(fit1)[, c(TRUE, FALSE)]) / 2) # Version 2
(quasiSE <- sqrt(quasiVar))

## Not run: qvplot(fit1, col = "green", lwd = 3, scol = "blue", slwd = 2, las = 1)

Rayleigh The Rayleigh Distribution

Description
Density, distribution function, quantile function and random generation for the Rayleigh distribution
with parameter a.

Usage
drayleigh(x, scale = 1, log = FALSE)
prayleigh(q, scale = 1, lower.tail = TRUE, log.p = FALSE)
qrayleigh(p, scale = 1, lower.tail = TRUE, log.p = FALSE)
rrayleigh(n, scale = 1)

Arguments
x, q vector of quantiles.
p vector of probabilities.
n number of observations. Fed into runif.
scale the scale parameter b.
log Logical. If log = TRUE then the logarithm of the density is returned.
lower.tail, log.p
Same meaning as in pnorm or qnorm.

Details
See rayleigh, the VGAM family function for estimating the scale parameter b by maximum like-
lihood estimation, for the formula of the probability density function and range restrictions on the
parameter b.
rayleigh 653

Value
drayleigh gives the density, prayleigh gives the distribution function, qrayleigh gives the quan-
tile function, and rrayleigh generates random deviates.

Note
The Rayleigh distribution is related to the Maxwell distribution.

Author(s)
T. W. Yee and Kai Huang

References
Forbes, C., Evans, M., Hastings, N. and Peacock, B. (2011) Statistical Distributions, Hoboken, NJ,
USA: John Wiley and Sons, Fourth edition.

rayleigh Rayleigh Distribution Family Function

Description
Estimating the parameter of the Rayleigh distribution by maximum likelihood estimation. Right-
censoring is allowed.

Usage
rayleigh(lscale = "loge", nrfs = 1/3 + 0.01,
oim.mean = TRUE, zero = NULL)
cens.rayleigh(lscale = "loge", oim = TRUE)
654 rayleigh

Arguments
lscale Parameter link function applied to the scale parameter b. See Links for more
choices. A log link is the default because b is positive.
nrfs Numeric, of length one, with value in [0, 1]. Weighting factor between Newton-
Raphson and Fisher scoring. The value 0 means pure Newton-Raphson, while
1 means pure Fisher scoring. The default value uses a mixture of the two algo-
rithms, and retaining positive-definite working weights.
oim.mean Logical, used only for intercept-only models. TRUE means the mean of the OIM
elements are used as working weights. If TRUE then this argument has top prior-
ity for working out the working weights. FALSE means use another algorithm.
oim Logical. For censored data only, TRUE means the Newton-Raphson algorithm,
and FALSE means Fisher scoring.
zero Details at CommonVGAMffArguments.

Details
The Rayleigh distribution, which is used in physics, has a probability density function that can be
written
f (y) = y exp(0.5(y/b)2 )/b2
p
for y > 0 and b > 0. The mean of Y is b /2 (returned as the fitted values) and its variance is
b2 (4 )/2.
The VGAM family function cens.rayleigh handles right-censored data (the true value is greater
than the observed value). To indicate which type of censoring, input extra = list(rightcensored = vec2)
where vec2 is a logical vector the same length as the response. If the component of this list is miss-
ing then the logical values are taken to be FALSE. The fitted object has this component stored in the
extra slot.
The VGAM family function rayleigh handles multiple responses.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, rrvglm and vgam.

Warning
The theory behind the argument oim is not fully complete.

Note
The poisson.points family function is more general so that if ostatistic = 1 and dimension = 2
then it coincides with rayleigh. Other related distributions are the Maxwell and Weibull distribu-
tions.

Author(s)
T. W. Yee
Rcim 655

References

Forbes, C., Evans, M., Hastings, N. and Peacock, B. (2011) Statistical Distributions, Hoboken, NJ,
USA: John Wiley and Sons, Fourth edition.

See Also

Rayleigh, genrayleigh, riceff, maxwell, weibullR, poisson.points, simulate.vlm.

Examples
nn <- 1000; Scale <- exp(2)
rdata <- data.frame(ystar = rrayleigh(nn, scale = Scale))
fit <- vglm(ystar ~ 1, rayleigh, data = rdata, trace = TRUE, crit = "coef")
head(fitted(fit))
with(rdata, mean(ystar))
coef(fit, matrix = TRUE)
Coef(fit)

# Censored data
rdata <- transform(rdata, U = runif(nn, 5, 15))
rdata <- transform(rdata, y = pmin(U, ystar))
## Not run: par(mfrow = c(1, 2))
hist(with(rdata, ystar)); hist(with(rdata, y))
## End(Not run)
extra <- with(rdata, list(rightcensored = ystar > U))
fit <- vglm(y ~ 1, cens.rayleigh, data = rdata, trace = TRUE,
extra = extra, crit = "coef")
table(fit@extra$rightcen)
coef(fit, matrix = TRUE)
head(fitted(fit))

Rcim Mark the Baseline of Row and Column on a Matrix data

Description

Rearrange the rows and columns of the input so that the first row and first column are baseline.
This function is for rank-zero row-column interaction models (RCIMs; i.e., general main effects
models).

Usage

Rcim(mat, rbaseline = 1, cbaseline = 1)

656 Rcim

Arguments

mat Matrix, of dimension r by c. It is best that it is labelled with row and column
names.
rbaseline, cbaseline
Numeric (row number of the matrix mat) or character (matching a row name
of mat) that the user wants as the row baseline or reference level. Similarly
cbaseline for the column.

Details

This is a data preprocessing function for rcim. For rank-zero row-column interaction models this
function establishes the baseline (or reference) levels of the matrix response with respect to the row
and columnsthese become the new first row and column.

Value

Matrix of the same dimension as the input, with rbaseline and cbaseline specifying the first
rows and columns. The default is no change in mat.

Note

This function is similar to moffset; see moffset for information about the differences. If numeric,
the arguments rbaseline and cbaseline differ from arguments roffset and coffset in moffset
by 1 (when elements of the matrix agree).

Author(s)

Alfian F. Hadi and T. W. Yee.

See Also

moffset, rcim, plotrcim0.

Examples

(alcoff.e <- moffset(alcoff, roffset = "6", postfix = "*"))

(aa <- Rcim(alcoff, rbaseline = "11", cbaseline = "Sun"))
(bb <- moffset(alcoff, "11", "Sun", postfix = "*"))
aa - bb # Note the difference!
rcqo 657

rcqo Constrained Quadratic Ordination

Description
Random generation for constrained quadratic ordination (CQO).

Usage
rcqo(n, p, S, Rank = 1,
family = c("poisson", "negbinomial", "binomial-poisson",
"Binomial-negbinomial", "ordinal-poisson",
"Ordinal-negbinomial", "gamma2"),
eq.maximums = FALSE, eq.tolerances = TRUE, es.optimums = FALSE,
lo.abundance = if (eq.maximums) hi.abundance else 10,
hi.abundance = 100, sd.latvar = head(1.5/2^(0:3), Rank),
sd.optimums = ifelse(es.optimums, 1.5/Rank, 1) *
ifelse(scale.latvar, sd.latvar, 1),
sd.tolerances = 0.25, Kvector = 1, Shape = 1,
sqrt.arg = FALSE, log.arg = FALSE, rhox = 0.5, breaks = 4,
seed = NULL, optimums1.arg = NULL, Crow1positive = TRUE,
xmat = NULL, scale.latvar = TRUE)

Arguments
n Number of sites. It is denoted by n below.
p Number of environmental variables, including an intercept term. It is denoted
by p below. Must be no less than 1 + R in value.
S Number of species. It is denoted by S below.
Rank The rank or the number of latent variables or true dimension of the data on the
reduced space. This must be either 1, 2, 3 or 4. It is denoted by R.
family What type of species data is to be returned. The first choice is the default. If
binomial then a 0 means absence and 1 means presence. If ordinal then the
breaks argument is passed into the breaks argument of cut. Note that either
the Poisson or negative binomial distributions are used to generate binomial and
ordinal data, and that an upper-case choice is used for the negative binomial
distribution (this makes it easier for the user). If "gamma2" then this is the 2-
parameter gamma distribution.
eq.maximums Logical. Does each species have the same maximum? See arguments lo.abundance
and hi.abundance.
eq.tolerances Logical. Does each species have the same tolerance? If TRUE then the common
value is 1 along every latent variable, i.e., all species tolerance matrices are the
order-R identity matrix.
658 rcqo

es.optimums Logical. Do the species have equally spaced optimums? If TRUE then the quan-
tity S 1/R must be an integer with value 2 or more. That is, there has to be an
appropriate number of species in total. This is so that a grid of optimum values
is possible in R-dimensional latent variable space in order to place the species
optimums. Also see the argument sd.tolerances.
lo.abundance, hi.abundance
Numeric. These are recycled to a vector of length S. The species have a maxi-
mum between lo.abundance and hi.abundance. That is, at their optimal en-
vironment, the mean abundance of each species is between the two componen-
twise values. If eq.maximums is TRUE then lo.abundance and hi.abundance
must have the same values. If eq.maximums is FALSE then the logarithm of the
maximums are uniformly distributed between log(lo.abundance) and log(hi.abundance).
sd.latvar Numeric, of length R (recycled if necessary). Site scores along each latent vari-
able have these standard deviation values. This must be a decreasing sequence
of values because the first ordination axis contains the greatest spread of the
species site scores, followed by the second axis, followed by the third axis, etc.
sd.optimums Numeric, of length R (recycled if necessary). If es.optimums = FALSE then,
for the rth latent variable axis, the optimums of the species are generated from
a normal distribution centered about 0. If es.optimums = TRUE then the S
optimums are equally spaced about 0 along every latent variable axis. Regard-
less of the value of es.optimums, the optimums are then scaled to give standard
deviation sd.optimums[r].
sd.tolerances Logical. If eq.tolerances = FALSE then, for the rth latent variable, the
species tolerances are chosen from a normal distribution with mean 1 and
standard deviation sd.tolerances[r]. However, the first species y1 has its
tolerance matrix set equal to the order-R identity matrix. All tolerance matri-
ces for all species are diagonal in this function. This argument is ignored if
eq.tolerances is TRUE, otherwise it is recycled to length R if necessary.
Kvector A vector of positive k values (recycled to length S if necessary) for the negative
binomial distribution (see negbinomial for details). Note that a natural default
value does not exist, however the default value here is probably a realistic one,
and that for large values of one has V ar(Y ) = 2 /k approximately.
Shape A vector of positive values (recycled to length S if necessary) for the 2-
parameter gamma distribution (see gamma2 for details). Note that a natural de-
fault value does not exist, however the default value here is probably a realistic
one, and that V ar(Y ) = 2 /.
sqrt.arg Logical. Take the square-root of the negative binomial counts? Assigning
sqrt.arg = TRUE when family="negbinomial" means that the resulting
species data can be considered very crudely to be approximately Poisson dis-
tributed. They will not integers in general but much easier (less numerical prob-
lems) to estimate using something like cqo(..., family="poissonff").
log.arg Logical. Take the logarithm of the gamma random variates? Assigning log.arg = TRUE
when family="gamma2" means that the resulting species data can be consid-
ered very crudely to be approximately Gaussian distributed about its (quadratic)
mean. The result is that it is much easier (less numerical problems) to estimate
using something like cqo(..., family="gaussianff").
rcqo 659

rhox Numeric, less than 1 in absolute value. The correlation between the environmen-
tal variables. The correlation matrix is a matrix of 1s along the diagonal and
rhox in the off-diagonals. Note that each environmental variable is normally
distributed with mean 0. The standard deviation of each environmental variable
is chosen so that the site scores have the determined standard deviation, as given
by argument sd.latvar.
breaks If family is assigned an ordinal value then this argument is used to define the
cutpoints. It is fed into the breaks argument of cut.
seed If given, it is passed into set.seed. This argument can be used to obtain re-
producible results. If set, the value is saved as the "seed" attribute of the re-
turned value. The default will not change the random generator state, and return
.Random.seed as "seed" attribute.
optimums1.arg If assigned and Rank = 1 then these are the explicity optimums. Recycled to
length S.
Crow1positive See qrrvglm.control for details.
xmat The n by p 1 environmental matrix can be inputted.
scale.latvar Logical. If FALSE the argument sd.latvar is ignored and no scaling of the
latent variable values is performed.

Details
This function generates data coming from a constrained quadratic ordination (CQO) model. In
particular, data coming from a species packing model can be generated with this function. The
species packing model states that species have equal tolerances, equal maximums, and optimums
which are uniformly distributed over the latent variable space. This can be achieved by assigning
the arguments es.optimums = TRUE, eq.maximums = TRUE, eq.tolerances = TRUE.
At present, the Poisson and negative binomial abundances are generated first using lo.abundance
and hi.abundance, and if family is binomial or ordinal then it is converted into these forms.
In CQO theory the n by p matrix X is partitioned into two parts X1 and X2 . The matrix X2 contains
the real environmental variables whereas the variables in X1 are just for adjustment purposes; they
contain the intercept terms and other variables that one wants to adjust for when (primarily) looking
at the variables in X2 . This function has X1 only being a matrix of ones, i.e., containing an intercept
only.

Value
A n by p 1 + S data frame with components and attributes. In the following the attributes are
labelled with double quotes.
x2, x3, x4, ..., xp
The environmental variables. This makes up the n by p 1 X2 matrix. Note
that x1 is not present; it is effectively a vector of ones since it corresponds to an
intercept term when cqo is applied to the data.
y1, y2, x3, ..., yS
The species data. This makes up the n by S matrix Y . This will be of the form
described by the argument family.
660 rcqo

"concoefficients"
The p 1 by R matrix of constrained coefficients (or canonical coefficients).
These are also known as weights or loadings.
"formula" The formula involving the species and environmental variable names. This can
be used directly in the formula argument of cqo.
"log.maximums" The S-vector of species maximums, on a log scale. These are uniformly dis-
tributed between log(lo.abundance) and log(hi.abundance).
"latvar" The n by R matrix of site scores. Each successive column (latent variable) has
sample standard deviation equal to successive values of sd.latvar.
"eta" The linear/additive predictor value.
"optimums" The S by R matrix of species optimums.
"tolerances" The S by R matrix of species tolerances. These are the square root of the
diagonal elements of the tolerance matrices (recall that all tolerance matrices
are restricted to being diagonal in this function).
Other attributes are "break", "family", "Rank", "lo.abundance", "hi.abundance", "eq.tolerances",
"eq.maximums", "seed" as used.

Note
This function is under development and is not finished yet. There may be a few bugs.
Yet to do: add an argument that allows absences to be equal to the first level if ordinal data is
requested.

Author(s)
T. W. Yee

References
Yee, T. W. (2004) A new technique for maximum-likelihood canonical Gaussian ordination. Eco-
logical Monographs, 74, 685701.
Yee, T. W. (2006) Constrained additive ordination. Ecology, 87, 203213.
ter Braak, C. J. F. and Prentice, I. C. (1988) A theory of gradient analysis. Advances in Ecological
Research, 18, 271317.

See Also
cqo, qrrvglm.control, cut, binomialff, poissonff, negbinomial, gamma2, gaussianff.

Examples
## Not run:
# Example 1: Species packing model:
n <- 100; p <- 5; S <- 5
mydata <- rcqo(n, p, S, es.opt = TRUE, eq.max = TRUE)
names(mydata)
(myform <- attr(mydata, "formula"))
rdiric 661

fit <- cqo(myform, poissonff, mydata, Bestof = 3) # eq.tol = TRUE

matplot(attr(mydata, "latvar"), mydata[,-(1:(p-1))], col = 1:S)
persp(fit, col = 1:S, add = TRUE)
lvplot(fit, lcol = 1:S, y = TRUE, pcol = 1:S) # The same plot as above

# Compare the fitted model with the 'truth'

concoef(fit) # The fitted model
attr(mydata, "concoefficients") # The 'truth'

c(apply(attr(mydata, "latvar"), 2, sd),

apply(latvar(fit), 2, sd)) # Both values should be approx equal

# Example 2: negative binomial data fitted using a Poisson model:

n <- 200; p <- 5; S <- 5
mydata <- rcqo(n, p, S, fam = "negbin", sqrt = TRUE)
myform <- attr(mydata, "formula")
fit <- cqo(myform, fam = poissonff, dat = mydata) # I.tol = TRUE,
lvplot(fit, lcol = 1:S, y = TRUE, pcol = 1:S)
# Compare the fitted model with the 'truth'
concoef(fit) # The fitted model
attr(mydata, "concoefficients") # The 'truth'

# Example 3: gamma2 data fitted using a Gaussian model:

n <- 200; p <- 5; S <- 3
mydata <- rcqo(n, p, S, fam = "gamma2", log.arg = TRUE)
fit <- cqo(attr(mydata, "formula"),
fam = gaussianff, data = mydata) # I.tol = TRUE,
matplot(attr(mydata, "latvar"),
exp(mydata[, -(1:(p-1))]), col = 1:S) # 'raw' data
# Fitted model to transformed data:
lvplot(fit, lcol = 1:S, y = TRUE, pcol = 1:S)
# Compare the fitted model with the 'truth'
concoef(fit) # The fitted model
attr(mydata, "concoefficients") # The 'truth'

## End(Not run)

rdiric The Dirichlet distribution

Description
Generates Dirichlet random variates.

Usage
rdiric(n, shape, dimension = NULL, is.matrix.shape = FALSE)
662 rdiric

Arguments

n number of observations. Note it has two meanings, see is.matrix.shape be-

low.
shape the shape parameters. These must be positive. If dimension is specifed, values
are recycled if necessary to length dimension.
dimension the dimension of the distribution. If dimension is not numeric then it is taken
to be length(shape) (or ncol(shape) if is.matrix.shape == TRUE).
is.matrix.shape
Logical. If TRUE then shape must be a matrix, and then n is no longer the number
of rows of the answer but the answer has n * nrow(shape) rows. If FALSE (the
default) then shape is a vector and each of the n rows of the answer have shape
as its shape parameters.

Details

This function is based on a relationship between the gamma and Dirichlet distribution. Random
gamma variates are generated, and then Dirichlet random variates are formed from these.

Value

A n by dimension matrix of Dirichlet random variates. Each element is positive, and each row will
sum to unity. If shape has names then these will become the column names of the answer.

Author(s)

Thomas W. Yee

References

Lange, K. (2002) Mathematical and Statistical Methods for Genetic Analysis, 2nd ed. New York:
Springer-Verlag.

See Also

dirichlet is a VGAM family function for fitting a Dirichlet distribution to data.

Examples
ddata <- data.frame(rdiric(n = 1000, shape = c(y1 = 3, y2 = 1, y3 = 4)))
fit <- vglm(cbind(y1, y2, y3) ~ 1, dirichlet, data = ddata, trace = TRUE)
Coef(fit)
coef(fit, matrix = TRUE)
rec.exp1 663

rec.exp1 Upper Record Values from a 1-parameter Exponential Distribution

Description
Maximum likelihood estimation of the rate parameter of a 1-parameter exponential distribution
when the observations are upper record values.

Usage
rec.exp1(lrate = "loge", irate = NULL, imethod = 1)

Arguments
lrate Link function applied to the rate parameter. See Links for more choices.
irate Numeric. Optional initial values for the rate. The default value NULL means they
are computed internally, with the help of imethod.
imethod Integer, either 1 or 2 or 3. Initial method, three algorithms are implemented.
Choose the another value if convergence fails, or use irate.

Details
The response must be a vector or one-column matrix with strictly increasing values.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, and vgam.

Note
By default, this family function has the intercept-only MLE as the initial value, therefore conver-
gence may only take one iteration. Fisher scoring is used.

Author(s)
T. W. Yee

References
Arnold, B. C. and Balakrishnan, N. and Nagaraja, H. N. (1998) Records, New York: John Wiley &
Sons.

length(y) / y[length(y)] # MLE of rate

fit <- vglm(y ~ 1, rec.exp1, trace = TRUE)

coef(fit, matrix = TRUE)
Coef(fit)

rec.normal Upper Record Values from a Univariate Normal Distribution

Description
Maximum likelihood estimation of the two parameters of a univariate normal distribution when the
observations are upper record values.

Usage
rec.normal(lmean = "identitylink", lsd = "loge",
imean = NULL, isd = NULL, imethod = 1, zero = NULL)

Arguments
lmean, lsd Link functions applied to the mean and sd parameters. See Links for more
choices.
imean, isd Numeric. Optional initial values for the mean and sd. The default value NULL
means they are computed internally, with the help of imethod.
imethod Integer, either 1 or 2 or 3. Initial method, three algorithms are implemented.
Choose the another value if convergence fails, or use imean and/or isd.
zero Can be an integer vector, containing the value 1 or 2. If so, the mean or stan-
dard deviation respectively are modelled as an intercept only. Usually, setting
zero = 2 will be used, if used at all. The default value NULL means both lin-
ear/additive predictors are modelled as functions of the explanatory variables.
See CommonVGAMffArguments for more information.

Details
The response must be a vector or one-column matrix with strictly increasing values.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, and vgam.
reciprocal 665

Note
This family function tries to solve a difficult problem, and the larger the data set the better. Conver-
gence failure can commonly occur, and convergence may be very slow, so set maxit = 200, trace = TRUE,
say. Inputting good initial values are advised.
This family function uses the BFGS quasi-Newton update formula for the working weight matrices.
Consequently the estimated variance-covariance matrix may be inaccurate or simply wrong! The
standard errors must be therefore treated with caution; these are computed in functions such as
vcov() and summary().

Author(s)
T. W. Yee

References
Arnold, B. C. and Balakrishnan, N. and Nagaraja, H. N. (1998) Records, New York: John Wiley &
Sons.

reciprocal Reciprocal Link Function

Description
Computes the reciprocal transformation, including its inverse and the first two derivatives.

Usage
reciprocal(theta, bvalue = NULL, inverse = FALSE, deriv = 0,
short = TRUE, tag = FALSE)
negreciprocal(theta, bvalue = NULL, inverse = FALSE, deriv = 0,
short = TRUE, tag = FALSE)
666 reciprocal

Arguments
theta Numeric or character. See below for further details.
bvalue See Links.
inverse, deriv, short, tag
Details at Links.

Details
The reciprocal link function is a special case of the power link function. Numerical values of
theta close to 0 result in Inf, -Inf, NA or NaN.
The negreciprocal link function computes the negative reciprocal, i.e., 1/.

Value
For reciprocal: for deriv = 0, the reciprocal of theta, i.e., 1/theta when inverse = FALSE,
and if inverse = TRUE then 1/theta. For deriv = 1, then the function returns d theta / d eta
as a function of theta if inverse = FALSE, else if inverse = TRUE then it returns the reciprocal.

Note
Numerical instability may occur when theta is close to 0.

Author(s)
Thomas W. Yee

References
McCullagh, P. and Nelder, J. A. (1989) Generalized Linear Models, 2nd ed. London: Chapman &
Hall.

rhobit Rhobit Link Function

Description
Computes the rhobit link transformation, including its inverse and the first two derivatives.

Usage
rhobit(theta, bminvalue = NULL, bmaxvalue = NULL,
inverse = FALSE, deriv = 0, short = TRUE, tag = FALSE)

Arguments
theta Numeric or character. See below for further details.
bminvalue, bmaxvalue
Optional boundary values, e.g., values of theta which are less than or equal
to -1 can be replaced by bminvalue before computing the link function value.
And values of theta which are greater than or equal to 1 can be replaced by
bmaxvalue before computing the link function value. See Links.
inverse, deriv, short, tag
Details at Links.

Details
The rhobit link function is commonly used for parameters that lie between 1 and 1. Numerical
values of theta close to 1 or 1 or out of range result in Inf, -Inf, NA or NaN.

Value
For deriv = 0, the rhobit of theta, i.e., log((1 + theta)/(1 - theta)) when inverse = FALSE,
and if inverse = TRUE then (exp(theta) - 1)/(exp(theta) + 1).
For deriv = 1, then the function returns d eta / d theta as a function of theta if inverse = FALSE,
else if inverse = TRUE then it returns the reciprocal.

Note
Numerical instability may occur when theta is close to 1 or 1. One way of overcoming this is to
use bminvalue, etc.
The correlation parameter of a standard bivariate normal distribution lies between 1 and 1, there-
fore this function can be used for modelling this parameter as a function of explanatory variables.
The link function rhobit is very similar to fisherz, e.g., just twice the value of fisherz.

Author(s)
Thomas W. Yee
668 Rice

x <- c(seq(-1.02, -0.98, by = 0.01), seq(0.97, 1.02, by = 0.01))

rhobit(x) # Has NAs
rhobit(x, bminvalue = -1 + .Machine$double.eps,
bmaxvalue = 1 - .Machine$double.eps) # Has no NAs

Rice The Rice Distribution

Description
Density, distribution function, quantile function and random generation for the Rician distribution.

Usage
drice(x, sigma, vee, log = FALSE)
price(q, sigma, vee, lower.tail = TRUE, log.p = FALSE, ...)
qrice(p, sigma, vee, lower.tail = TRUE, log.p = FALSE, ...)
rrice(n, sigma, vee)

Arguments
x, q vector of quantiles.
p vector of probabilities.
n number of observations. Same as in runif.
vee, sigma See riceff.
... Other arguments such as lower.tail.
lower.tail, log.p
Same meaning as in pnorm or qnorm.
log Logical. If log = TRUE then the logarithm of the density is returned.

Details
See riceff, the VGAM family function for estimating the two parameters, for the formula of the
probability density function and other details.
Formulas for price() and qrice() are based on the Marcum-Q function.
riceff 669

Value
drice gives the density, price gives the distribution function, qrice gives the quantile function,
and rrice generates random deviates.

Author(s)
T. W. Yee and Kai Huang

x <- seq(0, 4, by = 0.01); vee <- 1; sigma <- 1

probs <- seq(0.05, 0.95, by = 0.05)
plot(x, drice(x, vee = vee, sigma = sigma), type = "l", col = "blue",
main = "Blue is density, orange is cumulative distribution function",
ylim = c(0, 1), sub = "Purple are 5, 10, ..., 95 percentiles",
las = 1, ylab = "", cex.main = 0.9)
abline(h = 0:1, col = "black", lty = 2)
Q <- qrice(probs, sigma, vee = vee)
lines(Q, drice(qrice(probs, sigma, vee = vee),
sigma, vee = vee), col = "purple", lty = 3, type = "h")
lines(x, price(x, sigma, vee = vee), type = "l", col = "orange")
lines(Q, drice(Q, sigma, vee = vee), col = "purple", lty = 3, type = "h")
lines(Q, price(Q, sigma, vee = vee), col = "purple", lty = 3, type = "h")
abline(h = probs, col = "purple", lty = 3)
max(abs(price(Q, sigma, vee = vee) - probs)) # Should be 0

## End(Not run)

riceff Rice Distribution Family Function

Description
Estimates the two parameters of a Rice distribution by maximum likelihood estimation.
670 riceff

Usage
riceff(lsigma = "loge", lvee = "loge", isigma = NULL,
ivee = NULL, nsimEIM = 100, zero = NULL, nowarning = FALSE)

Arguments
nowarning Logical. Suppress a warning? Ignored for VGAM 0.9-7 and higher.
lvee, lsigma Link functions for the v and parameters. See Links for more choices and for
general information.
ivee, isigma Optional initial values for the parameters. If convergence failure occurs (this
VGAM family function seems to require good initial values) try using these
arguments. See CommonVGAMffArguments for more information.
nsimEIM, zero See CommonVGAMffArguments for information.

Details
The Rician distribution has density function
y
f (y; v, ) = exp((y 2 + v 2 )/(2 2 )) I0 (yv/ 2 )
2
where y > 0, v > 0, > 0 and I0 is the modified Bessel function of the first kind with
orderpzero. When v = 0 the Rice distribution reduces to a Rayleigh distribution. The mean
is /2 exp(z/2)((1 z)I0 (z/2) zI1 (z/2)) (returned as the fitted values) where z =
v 2 /(2 2 ). Simulated Fisher scoring is implemented.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm and vgam.

Note
Convergence problems may occur for data where v = 0; if so, use rayleigh or possibly use an
identity link.
When v is large (greater than 3, say) then the mean is approximately v and the standard deviation
is approximately .

Author(s)
T. W. Yee

References
Rice, S. O. (1945) Mathematical Analysis of Random Noise. Bell System Technical Journal, 24,
46156.

See Also
drice, rayleigh, besselI, simulate.vlm.
rigff 671

Examples
## Not run: sigma <- exp(1); vee <- exp(2)
rdata <- data.frame(y = rrice(n <- 1000, sigma, vee = vee))
fit <- vglm(y ~ 1, riceff, data = rdata, trace = TRUE, crit = "coef")
c(with(rdata, mean(y)), fitted(fit)[1])
coef(fit, matrix = TRUE)
Coef(fit)
summary(fit)

## End(Not run)

rigff Reciprocal Inverse Gaussian distribution

Description
Estimation of the parameters of a reciprocal inverse Gaussian distribution.

Usage
rigff(lmu = "identitylink", llambda = "loge", imu = NULL, ilambda = 1)

Arguments
lmu, llambda Link functions for mu and lambda. See Links for more choices.
imu, ilambda Initial values for mu and lambda. A NULL means a value is computed internally.

Details
See Jorgensen (1997) for details.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, and vgam.

Note
This distribution is potentially useful for dispersion modelling.

Author(s)
T. W. Yee

References
Jorgensen, B. (1997) The Theory of Dispersion Models. London: Chapman & Hall
672 rlplot.gevff

rlplot.gevff Return Level Plot for GEV Fits

Description
A return level plot is constructed for a GEV-type model.

Usage
rlplot.gevff(object, show.plot = TRUE,
probability = c((1:9)/100, (1:9)/10, 0.95, 0.99, 0.995, 0.999),
add.arg = FALSE, xlab = if(log.arg) "Return Period (log-scale)" else
"Return Period", ylab = "Return Level",
main = "Return Level Plot",
pch = par()$pch, pcol.arg = par()$col, pcex = par()$cex,
llty.arg = par()$lty, lcol.arg = par()$col, llwd.arg = par()$lwd,
slty.arg = par()$lty, scol.arg = par()$col, slwd.arg = par()$lwd,
ylim = NULL, log.arg = TRUE, CI = TRUE, epsilon = 1e-05, ...)

Arguments
object A VGAM extremes model of the GEV-type, produced by vglm with a family
function either "gev" or "gevff".
show.plot Logical. Plot it? If FALSE no plot will be done.
probability Numeric vector of probabilities used.
add.arg Logical. Add the plot to an existing plot?
xlab Caption for the x-axis. See par.
ylab Caption for the y-axis. See par.
main Title of the plot. See title.
pch Plotting character. See par.
pcol.arg Color of the points. See the col argument of par.
pcex Character expansion of the points. See the cex argument of par.
llty.arg Line type. Line type. See the lty argument of par.
lcol.arg Color of the lines. See the col argument of par.
rlplot.gevff 673

llwd.arg Line width. See the lwd argument of par.

slty.arg, scol.arg, slwd.arg
Correponding arguments for the lines used for the confidence intervals. Used
only if CI=TRUE.
ylim Limits for the y-axis. Numeric of length 2.
log.arg Logical. If TRUE then log="" otherwise log="x". This changes the labelling of
the x-axis only.
CI Logical. Add in a 95 percent confidence interval?
epsilon Numeric, close to zero. Used for the finite-difference approximation to the first
derivatives with respect to each parameter. If too small, numerical problems will
occur.
... Arguments passed into the plot function when setting up the entire plot. Useful
arguments here include sub and las.

Details
A return level plot plots zp versus log(yp ). It is linear if the shape parameter = 0. If < 0
then the plot is convex with asymptotic limit as p approaches zero at /. And if > 0
then the plot is concave and has no finite bound. Here, G(zp ) = 1 p where 0 < p < 1 (p
corresponds to the argument probability) and G is the cumulative distribution function of the
GEV distribution. The quantity zp is known as the return level associated with the return period
1/p. For many applications, this means zp is exceeded by the annual maximum in any particular
year with probability p.
The points in the plot are the actual data.

Value
In the post slot of the object is a list called rlplot with list components

yp -log(probability), which is used on the x-axis.

zp values which are used for the y-axis
lower, upper lower and upper confidence limits for the 95 percent confidence intervals evalu-
ated at the values of probability (if CI=TRUE).

Note
The confidence intervals are approximate, being based on finite-difference approximations to deriva-
tives.

Author(s)
T. W. Yee

References
Coles, S. (2001) An Introduction to Statistical Modeling of Extreme Values. London: Springer-
Verlag.
674 rrar

# Identity link for all parameters:

fit2 <- vglm(y ~ 1, gevff(lshape = identitylink, lscale = identitylink,
iscale = 10), data = gdata, trace = TRUE)
coef(fit2, matrix = TRUE)
## Not run:
par(mfrow = c(1, 2))
rlplot(fit) -> i1
rlplot(fit2, pcol = "darkorange", lcol = "blue", log.arg = FALSE,
scol = "darkgreen", slty = "dashed", las = 1) -> i2
range(i2@post$rlplot$upper - i1@post$rlplot$upper) # Should be near 0
range(i2@post$rlplot$lower - i1@post$rlplot$lower) # Should be near 0

## End(Not run)

rrar Nested reduced-rank autoregressive models for multiple time series

Description
Estimates the parameters of a nested reduced-rank autoregressive model for multiple time series.

Usage
rrar(Ranks = 1, coefstart = NULL)

Arguments
Ranks Vector of integers: the ranks of the model. Each value must be at least one and
no more than M, where M is the number of response variables in the time series.
The length of Ranks is the lag, which is often denoted by the symbol L in the
literature.
coefstart Optional numerical vector of initial values for the coefficients. By default, the
family function chooses these automatically.

Details
Full details are given in Ahn and Reinsel (1988). Convergence may be very slow, so setting
maxits = 50, say, may help. If convergence is not obtained, you might like to try inputting
different initial values.
Setting trace = TRUE in vglm is useful for monitoring the progress at each iteration.
rrar 675

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm and vgam.

Note
This family function should be used within vglm and not with rrvglm because it does not fit into
the RR-VGLM framework exactly. Instead, the reduced-rank model is formulated as a VGLM!
A methods function Coef.rrar, say, has yet to be written. It would return the quantities Ak1, C, D,
omegahat, Phi, etc. as slots, and then show.Coef.rrar would also need to be written.

Author(s)
T. W. Yee

References
Ahn, S. and Reinsel, G. C. (1988) Nested reduced-rank autoregressive models for multiple time
series. Journal of the American Statistical Association, 83, 849856.

par(mar = c(4, 4, 2, 2) + 0.1, mfrow = c(4, 1))

for (ii in 1:4) {
plot(year, fit@misc$Z[, ii], main = paste("Z", ii, sep = ""),
type = "l", xlab = "", ylab = "", las = 1, col = "blue")
points(year, fit@misc$Z[, ii], pch = "*", col = "blue")
}
676 rrvglm

## End(Not run)

rrvglm Fitting Reduced-Rank Vector Generalized Linear Models (RR-

VGLMs)

Description
A reduced-rank vector generalized linear model (RR-VGLM) is fitted. RR-VGLMs are VGLMs
but some of the constraint matrices are estimated. In this documentation, M is the number of linear
predictors.

Usage
rrvglm(formula, family, data = list(), weights = NULL, subset = NULL,
na.action = na.fail, etastart = NULL, mustart = NULL,
coefstart = NULL, control = rrvglm.control(...), offset = NULL,
method = "rrvglm.fit", model = FALSE, x.arg = TRUE, y.arg = TRUE,
contrasts = NULL, constraints = NULL, extra = NULL,
qr.arg = FALSE, smart = TRUE, ...)

Arguments
formula, family, weights
See vglm.
data an optional data frame containing the variables in the model. By default the vari-
ables are taken from environment(formula), typically the environment from
which rrvglm is called.
subset, na.action
See vglm.
etastart, mustart, coefstart
See vglm.
control a list of parameters for controlling the fitting process. See rrvglm.control for
details.
offset, model, contrasts
See vglm.
method the method to be used in fitting the model. The default (and presently only)
method rrvglm.fit uses iteratively reweighted least squares (IRLS).
x.arg, y.arg logical values indicating whether the model matrix and response vector/matrix
used in the fitting process should be assigned in the x and y slots. Note the
model matrix is the LM model matrix; to get the VGLM model matrix type
model.matrix(vglmfit) where vglmfit is a vglm object.
constraints See vglm.
extra, smart, qr.arg
See vglm.
... further arguments passed into rrvglm.control.
rrvglm 677

Details

The central formula is given by

= B1T x1 + A

where x1 is a vector (usually just a 1 for an intercept), x2 is another vector of explanatory variables,
and = C T x2 is an R-vector of latent variables. Here, is a vector of linear predictors, e.g., the
mth element is m = log(E[Ym ]) for the mth Poisson response. The matrices B1 , A and C are
estimated from the data, i.e., contain the regression coefficients. For ecologists, the central formula
represents a constrained linear ordination (CLO) since it is linear in the latent variables. It means
that the response is a monotonically increasing or decreasing function of the latent variables.
For identifiability it is common to enforce corner constraints on A: by default, the top R by R
submatrix is fixed to be the order-R identity matrix and the remainder of A is estimated.
The underlying algorithm of RR-VGLMs is iteratively reweighted least squares (IRLS) with an
optimizing algorithm applied within each IRLS iteration (e.g., alternating algorithm).
In theory, any VGAM family function that works for vglm and vgam should work for rrvglm too.
The function that actually does the work is rrvglm.fit; it is vglm.fit with some extra code.

Value

An object of class "rrvglm", which has the the same slots as a "vglm" object. The only difference
is that the some of the constraint matrices are estimates rather than known. But VGAM stores the
models the same internally. The slots of "vglm" objects are described in vglm-class.

Note

The arguments of rrvglm are in general the same as those of vglm but with some extras in rrvglm.control.
The smart prediction (smartpred) library is packed with the VGAM library.
In an example below, a rank-1 stereotype model of Anderson (1984) is fitted to some car data.
The reduced-rank regression is performed, adjusting for two covariates. Setting a trivial constraint
matrix for the latent variable variables in x2 avoids a warning message when it is overwritten by a
(common) estimated constraint matrix. It shows that German cars tend to be more expensive than
American cars, given a car of fixed weight and width.
If fit <- rrvglm(..., data = mydata) then summary(fit) requires corner constraints and no
missing values in mydata. Often the estimated variance-covariance matrix of the parameters is not
positive-definite; if this occurs, try refitting the model with a different value for Index.corner.
For constrained quadratic ordination (CQO) see cqo for more details about QRR-VGLMs.
With multiple binary responses, one must use binomialff(multiple.responses = TRUE) to
indicate that the response is a matrix with one response per column. Otherwise, it is interpreted as
a single binary response variable.

Author(s)

Thomas W. Yee
678 rrvglm

References
Yee, T. W. and Hastie, T. J. (2003) Reduced-rank vector generalized linear models. Statistical
Modelling, 3, 1541.
Yee, T. W. (2004) A new technique for maximum-likelihood canonical Gaussian ordination. Eco-
logical Monographs, 74, 685701.
Anderson, J. A. (1984) Regression and ordered categorical variables. Journal of the Royal Statisti-
cal Society, Series B, Methodological, 46, 130.
Yee, T. W. (2014) Reduced-rank vector generalized linear models with two linear predictors. Com-
putational Statistics and Data Analysis, 71, 889902.

See Also
rrvglm.control, lvplot.rrvglm (same as biplot.rrvglm), rrvglm-class, grc, cqo, vglmff-class,
vglm, vglm-class, smartpred, rrvglm.fit. Special family functions include negbinomial zipoisson
and zinegbinomial. (see Yee (2014) and COZIGAM). Methods functions include Coef.rrvglm,
summary.rrvglm, etc. Data include crashi.

Examples
## Not run:
# Example 1: RR negative binomial with Var(Y) = mu + delta1 * mu^delta2
nn <- 1000 # Number of observations
delta1 <- 3.0 # Specify this
delta2 <- 1.5 # Specify this; should be greater than unity
a21 <- 2 - delta2
mydata <- data.frame(x2 = runif(nn), x3 = runif(nn))
mydata <- transform(mydata, mu = exp(2 + 3 * x2 + 0 * x3))
mydata <- transform(mydata,
y2 = rnbinom(nn, mu = mu, size = (1/delta1)*mu^a21))
plot(y2 ~ x2, data = mydata, pch = "+", col = 'blue', las = 1,
main = paste("Var(Y) = mu + ", delta1, " * mu^", delta2, sep = ""))
rrnb2 <- rrvglm(y2 ~ x2 + x3, negbinomial(zero = NULL),
data = mydata, trace = TRUE)

a21.hat <- (Coef(rrnb2)@A)["loge(size)", 1]

beta11.hat <- Coef(rrnb2)@B1["(Intercept)", "loge(mu)"]
beta21.hat <- Coef(rrnb2)@B1["(Intercept)", "loge(size)"]
(delta1.hat <- exp(a21.hat * beta11.hat - beta21.hat))
(delta2.hat <- 2 - a21.hat)
# exp(a21.hat * predict(rrnb2)[1,1] - predict(rrnb2)[1,2]) # delta1.hat
summary(rrnb2)

# Obtain a 95 percent confidence interval for delta2:

se.a21.hat <- sqrt(vcov(rrnb2)["I(latvar.mat)", "I(latvar.mat)"])
ci.a21 <- a21.hat + c(-1, 1) * 1.96 * se.a21.hat
(ci.delta2 <- 2 - rev(ci.a21)) # The 95 percent confidence interval

Confint.rrnb(rrnb2) # Quick way to get it

# Plot the abundances and fitted values against the latent variable
rrvglm-class 679

plot(y2 ~ latvar(rrnb2), data = mydata, col = "blue",

xlab = "Latent variable", las = 1)
ooo <- order(latvar(rrnb2))
lines(fitted(rrnb2)[ooo] ~ latvar(rrnb2)[ooo], col = "orange")

# Example 2: stereotype model (reduced-rank multinomial logit model)

data(car.all)
scar <- subset(car.all,
is.element(Country, c("Germany", "USA", "Japan", "Korea")))
fcols <- c(13,14,18:20,22:26,29:31,33,34,36) # These are factors
scar[, -fcols] <- scale(scar[, -fcols]) # Standardize all numerical vars
ones <- matrix(1, 3, 1)
clist <- list("(Intercept)" = diag(3), Width = ones, Weight = ones,
Disp. = diag(3), Tank = diag(3), Price = diag(3),
Frt.Leg.Room = diag(3))
set.seed(111)
fit <- rrvglm(Country ~ Width + Weight + Disp. + Tank +
Price + Frt.Leg.Room,
multinomial, data = scar, Rank = 2, trace = TRUE,
constraints = clist, noRRR = ~ 1 + Width + Weight,
Uncor = TRUE, Corner = FALSE, Bestof = 2)
fit@misc$deviance # A history of the fits
Coef(fit)
biplot(fit, chull = TRUE, scores = TRUE, clty = 2, Ccex = 2,
ccol = "blue", scol = "orange", Ccol = "darkgreen", Clwd = 2,
main = "1=Germany, 2=Japan, 3=Korea, 4=USA")

## End(Not run)

rrvglm-class Class rrvglm

Description
Reduced-rank vector generalized linear models.

Objects from the Class

Objects can be created by calls to rrvglm.

Slots
extra: Object of class "list"; the extra argument on entry to vglm. This contains any extra
information that might be needed by the family function.
family: Object of class "vglmff". The family function.
iter: Object of class "numeric". The number of IRLS iterations used.
predictors: Object of class "matrix" with M columns which holds the M linear predictors.
assign: Object of class "list", from class "vlm". This named list gives information matching
the columns and the (LM) model matrix terms.
680 rrvglm-class

call: Object of class "call", from class "vlm". The matched call.
coefficients: Object of class "numeric", from class "vlm". A named vector of coefficients.
constraints: Object of class "list", from class "vlm". A named list of constraint matrices
used in the fitting.
contrasts: Object of class "list", from class "vlm". The contrasts used (if any).
control: Object of class "list", from class "vlm". A list of parameters for controlling the fitting
process. See vglm.control for details.
criterion: Object of class "list", from class "vlm". List of convergence criterion evaluated at
the final IRLS iteration.
df.residual: Object of class "numeric", from class "vlm". The residual degrees of freedom.
df.total: Object of class "numeric", from class "vlm". The total degrees of freedom.
dispersion: Object of class "numeric", from class "vlm". The scaling parameter.
effects: Object of class "numeric", from class "vlm". The effects.
fitted.values: Object of class "matrix", from class "vlm". The fitted values. This is usually
the mean but may be quantiles, or the location parameter, e.g., in the Cauchy model.
misc: Object of class "list", from class "vlm". A named list to hold miscellaneous parameters.
model: Object of class "data.frame", from class "vlm". The model frame.
na.action: Object of class "list", from class "vlm". A list holding information about missing
values.
offset: Object of class "matrix", from class "vlm". If non-zero, a M -column matrix of offsets.
post: Object of class "list", from class "vlm" where post-analysis results may be put.
preplot: Object of class "list", from class "vlm" used by plotvgam; the plotting parameters
may be put here.
prior.weights: Object of class "matrix", from class "vlm" holding the initially supplied weights.
qr: Object of class "list", from class "vlm". QR decomposition at the final iteration.
R: Object of class "matrix", from class "vlm". The R matrix in the QR decomposition used in
the fitting.
rank: Object of class "integer", from class "vlm". Numerical rank of the fitted model.
residuals: Object of class "matrix", from class "vlm". The working residuals at the final IRLS
iteration.
ResSS: Object of class "numeric", from class "vlm". Residual sum of squares at the final IRLS
iteration with the adjusted dependent vectors and weight matrices.
smart.prediction: Object of class "list", from class "vlm". A list of data-dependent parame-
ters (if any) that are used by smart prediction.
terms: Object of class "list", from class "vlm". The terms object used.
weights: Object of class "matrix", from class "vlm". The weight matrices at the final IRLS
iteration. This is in matrix-band form.
x: Object of class "matrix", from class "vlm". The model matrix (LM, not VGLM).
xlevels: Object of class "list", from class "vlm". The levels of the factors, if any, used in
fitting.
rrvglm-class 681

y: Object of class "matrix", from class "vlm". The response, in matrix form.
Xm2: Object of class "matrix", from class "vlm". See vglm-class).
Ym2: Object of class "matrix", from class "vlm". See vglm-class).
callXm2: Object of class "call", from class "vlm". The matched call for argument form2.

Extends
Class "vglm", directly. Class "vlm", by class "vglm".

Methods
biplot signature(x = "rrvglm"): biplot.
Coef signature(object = "rrvglm"): more detailed coefficients giving A, B1 , C, etc.
biplot signature(object = "rrvglm"): biplot.
print signature(x = "rrvglm"): short summary of the object.
summary signature(object = "rrvglm"): a more detailed summary of the object.

Note
The slots of "rrvglm" objects are currently identical to "vglm" objects.

Author(s)
Thomas W. Yee

References
Yee, T. W. and Hastie, T. J. (2003) Reduced-rank vector generalized linear models. Statistical
Modelling, 3, 1541.
Yee, T. W. and Wild, C. J. (1996) Vector generalized additive models. Journal of the Royal Statisti-
cal Society, Series B, Methodological, 58, 481493.

See Also
rrvglm, lvplot.rrvglm, vglmff-class.

Examples
## Not run: # Rank-1 stereotype model of Anderson (1984)
pneumo <- transform(pneumo, let = log(exposure.time),
x3 = runif(nrow(pneumo))) # x3 is unrelated
fit <- rrvglm(cbind(normal, mild, severe) ~ let + x3,
multinomial, data = pneumo, Rank = 1)
Coef(fit)

## End(Not run)
682 rrvglm.control

rrvglm.control Control Function for rrvglm()

Description
Algorithmic constants and parameters for running rrvglm are set using this function.

Usage
rrvglm.control(Rank = 1, Algorithm = c("alternating", "derivative"),
Corner = TRUE, Uncorrelated.latvar = FALSE,
Wmat = NULL, Svd.arg = FALSE,
Index.corner = if (length(str0))
head((1:1000)[-str0], Rank) else 1:Rank,
Ainit = NULL, Alpha = 0.5, Bestof = 1, Cinit = NULL,
Etamat.colmax = 10,
sd.Ainit = 0.02, sd.Cinit = 0.02, str0 = NULL,
noRRR = ~1, Norrr = NA,
noWarning = FALSE,
trace = FALSE, Use.Init.Poisson.QO = FALSE,
checkwz = TRUE, Check.rank = TRUE, Check.cm.rank = TRUE,
wzepsilon = .Machine$double.eps^0.75, ...)

Arguments
Rank The numerical rank R of the model. Must be an element from the set {1,2,. . . ,min(M ,p2)}.
Here, the vector of explanatory variables x is partitioned into (x1,x2), which is
of dimension p1+p2. The variables making up x1 are given by the terms in
noRRR argument, and the rest of the terms comprise x2.
Algorithm Character string indicating what algorithm is to be used. The default is the first
one.
Corner Logical indicating whether corner constraints are to be used. This is one method
for ensuring a unique solution. If TRUE, Index.corner specifies the R rows of
the constraint matrices that are use as the corner constraints, i.e., they hold an
order-R identity matrix.
Uncorrelated.latvar
Logical indicating whether uncorrelated latent variables are to be used. This is
normalization forces the variance-covariance matrix of the latent variables to be
diag(Rank), i.e., unit variance and uncorrelated. This constraint does not lead
to a unique solution because it can be rotated.
Wmat Yet to be done.
Svd.arg Logical indicating whether a singular value decomposition of the outer product
is to computed. This is another normalization which ensures uniqueness. See
the argument Alpha below.
Index.corner Specifies the R rows of the constraint matrices that are used for the corner con-
straints, i.e., they hold an order-R identity matrix.
rrvglm.control 683

Alpha The exponent in the singular value decomposition that is used in the first part:
if the SVD is U DV T then the first and second parts are U D and D1 V T
respectively. A value of 0.5 is symmetrical. This argument is used only when
Svd.arg=TRUE.
Bestof Integer. The best of Bestof models fitted is returned. This argument helps guard
against local solutions by (hopefully) finding the global solution from many fits.
The argument works only when the function generates its own initial value for
C, i.e., when C is not passed in as initial values.
Ainit, Cinit Initial A and C matrices which may speed up convergence. They must be of the
correct dimension.
Etamat.colmax Positive integer, no smaller than Rank. Controls the amount of memory used by
.Init.Poisson.QO(). It is the maximum number of columns allowed for the
pseudo-response and its weights. In general, the larger the value, the better the
initial value. Used only if Use.Init.Poisson.QO=TRUE.
str0 Integer vector specifying which rows of the estimated constraint matrices (A)
are to be all zeros. These are called structural zeros. Must not have any com-
mon value with Index.corner, and be a subset of the vector 1:M. The default,
str0 = NULL, means no structural zero rows at all.
sd.Ainit, sd.Cinit
Standard deviation of the initial values for the elements of A and C. These are
normally distributed with mean zero. This argument is used only if Use.Init.Poisson.QO = FALSE.
noRRR Formula giving terms that are not to be included in the reduced-rank regres-
sion. That is, noRRR specifes which explanatory variables are in the x1 vector
of rrvglm, and the rest go into x2 . The x1 variables constitute the B1 matrix in
Yee and Hastie (2003). Those x2 variables which are subject to the reduced-rank
regression correspond to the B2 matrix. Set noRRR = NULL for the reduced-rank
regression to be applied to every explanatory variable including the intercept.
Norrr Defunct. Please use noRRR. Use of Norrr will become an error soon.
trace Logical indicating if output should be produced for each iteration.
Use.Init.Poisson.QO
Logical indicating whether the .Init.Poisson.QO() should be used to obtain
initial values for the C. The function uses a new method that can work well if the
data are Poisson counts coming from an equal-tolerances QRR-VGLM (CQO).
This option is less realistic for RR-VGLMs compared to QRR-VGLMs.
checkwz logical indicating whether the diagonal elements of the working weight matri-
ces should be checked whether they are sufficiently positive, i.e., greater than
wzepsilon. If not, any values less than wzepsilon are replaced with this value.
noWarning, Check.rank, Check.cm.rank
Same as vglm.control. Ignored for VGAM 0.9-7 and higher.
wzepsilon Small positive number used to test whether the diagonals of the working weight
matrices are sufficiently positive.
... Variables in . . . are passed into vglm.control. If the derivative algorithm is
used then . . . are also passed into rrvglm.optim.control; and if the alternating
algorithm is used then . . . are also passed into valt.control.
In the above, R is the Rank and M is the number of linear predictors.
684 rrvglm.control

Details

VGAM supports three normalizations to ensure a unique solution. Of these, only corner constraints
will work with summary of RR-VGLM objects.

Value

A list with components matching the input names. Some error checking is done, but not much.

Note

The arguments in this function begin with an upper case letter to help avoid interference with those
of vglm.control.
In the example below a rank-1 stereotype model (Anderson, 1984) is fitted.

Author(s)

Thomas W. Yee

References

Yee, T. W. and Hastie, T. J. (2003) Reduced-rank vector generalized linear models. Statistical
Modelling, 3, 1541.

See Also

rrvglm, rrvglm.optim.control, rrvglm-class, vglm, vglm.control, cqo.

Examples

## Not run:
set.seed(111)
pneumo <- transform(pneumo, let = log(exposure.time),
x3 = runif(nrow(pneumo))) # x3 is random noise
fit <- rrvglm(cbind(normal, mild, severe) ~ let + x3,
multinomial, data = pneumo, Rank = 1, Index.corner = 2)
constraints(fit)
vcov(fit)
summary(fit)

## End(Not run)
rrvglm.optim.control 685

rrvglm.optim.control Control Function for rrvglm() Calling optim()

Description
Algorithmic constants and parameters for running optim within rrvglm are set using this function.

Usage
rrvglm.optim.control(Fnscale = 1, Maxit = 100,
Switch.optimizer = 3, Abstol = -Inf,
Reltol = sqrt(.Machine$double.eps), ...)

Arguments
Fnscale Passed into optim as fnscale.
Maxit Passed into optim as maxit.
Switch.optimizer
Iteration number when the "Nelder-Mead" method of optim is switched to the
quasi-Newton "BFGS" method. Assigning Switch.optimizer a negative num-
ber means always BFGS, while assigning Switch.optimizer a value greater
than maxits means always use Nelder-Mead.
Abstol Passed into optim as abstol.
Reltol Passed into optim as reltol.
... Ignored.

Details
See optim for more details.

Value
A list with components equal to the arguments.

Note
The transition between optimization methods may be unstable, so users may have to vary the value
of Switch.optimizer.
Practical experience with Switch.optimizer shows that setting it to too large a value may lead to
a local solution, whereas setting it to a low value will obtain the global solution. It appears that, if
BFGS kicks in too late when the Nelder-Mead algorithm is starting to converge to a local solution,
then switching to BFGS will not be sufficient to bypass convergence to that local solution.

Author(s)
Thomas W. Yee
686 ruge

See Also

rrvglm.control, optim.

ruge Rutherford-Geiger Polonium Data

Description

Decay counts of polonium recorded by Rutherford and Geiger (1910).

Usage

data(ruge)

Format

This data frame contains the following columns:

counts a numeric vector, counts or frequencies

number a numeric vector, the number of decays

Details

These are the radioactive decay counts of polonium recorded by Rutherford and Geiger (1910)
representing the number of scintillations in 2608 1/8 minute intervals. For example, there were
57 frequencies of zero counts. The counts can be thought of as being approximately Poisson dis-
tributed.

Source

Rutherford, E. and Geiger, H. (1910) The Probability Variations in the Distribution of alpha Parti-
cles, Philosophical Magazine, 20, 698704.

Examples
lambdahat <- with(ruge, weighted.mean(number, w = counts))
(N <- with(ruge, sum(counts)))
with(ruge, cbind(number, counts,
fitted = round(N * dpois(number, lam = lambdahat))))
s 687

s Defining Smooths in VGAM Formulas

Description
s is used in the definition of (vector) smooth terms within vgam formulas. This corresponds to 1st-
generation VGAMs that use backfitting for their estimation. The effective degrees of freedom is
prespecified.

Usage
s(x, df = 4, spar = 0, ...)

Arguments
x covariate (abscissae) to be smoothed. Note that x must be a single variable
and not a function of a variable. For example, s(x) is fine but s(log(x)) will
fail. In this case, let logx <- log(x) (in the data frame), say, and then use
s(logx). At this stage bivariate smoothers (x would be a two-column matrix)
are not implemented.
df numerical vector of length r. Effective degrees of freedom: must lie between
1 (linear fit) and n (interpolation). Thus one could say that df-1 is the effec-
tive nonlinear degrees of freedom (ENDF) of the smooth. Recycling of values
will be used if df is not of length r. If spar is positive then this argument is
ignored. Thus s() means that the effective degrees of freedom is prespecified.
If it is known that the component function(s) are more wiggly than usual then
try increasing the value of this argument.
spar numerical vector of length r. Positive smoothing parameters (after scaling) .
Larger values mean more smoothing so that the solution approaches a linear fit
for that component function. A zero value means that df is used. Recycling of
values will be used if spar is not of length r.
... Ignored for now.

Details
In this help file M is the number of additive predictors and r is the number of component functions
to be estimated (so that r is an element from the set {1,2,. . . ,M }). Also, if n is the number of
distinct abscissae, then s will fail if n < 7.
s, which is symbolic and does not perform any smoothing itself, only handles a single covariate.
Note that s works in vgam only. It has no effect in vglm (actually, it is similar to the identity
function I so that s(x2) is the same as x2 in the LM model matrix). It differs from the s() of
the gam package and the s of the mgcv package; they should not be mixed together. Also, terms
involving s should be simple additive terms, and not involving interactions and nesting etc. For
example, myfactor:s(x2) is not a good idea.
688 s

Value
A vector with attributes that are (only) used by vgam.

Note
The vector cubic smoothing spline which s() represents is computationally demanding for large
M . The cost is approximately O(nM 3 ) where n is the number of unique abscissae.
Currently a bug relating to the use of s() is that only constraint matrices whose columns are or-
thogonal are handled correctly. If any s() term has a constraint matrix that does not satisfy this
condition then a warning is issued. See is.buggy for more information.
A more modern alternative to using s with vgam is to use sm.os or sm.ps. This does not require
backfitting and allows automatic smoothing parameter selection. However, this alternative should
only be used when the sample size is reasonably large (> 500, say). These are called Generation-2
VGAMs.
Another alternative to using s with vgam is bs and/or ns with vglm. The latter implements half-
stepping, which is helpful if convergence is difficult.

Author(s)
Thomas W. Yee

References
Yee, T. W. and Wild, C. J. (1996) Vector generalized additive models. Journal of the Royal Statisti-
cal Society, Series B, Methodological, 58, 481493.

See Also
vgam, is.buggy, sm.os, sm.ps, vsmooth.spline.

Examples
# Nonparametric logistic regression
fit1 <- vgam(agaaus ~ s(altitude, df = 2), binomialff, data = hunua)
## Not run: plot(fit1, se = TRUE)

# Bivariate logistic model with artificial data

nn <- 300
bdata <- data.frame(x1 = runif(nn), x2 = runif(nn))
bdata <- transform(bdata,
y1 = rbinom(nn, size = 1, prob = logit(sin(2 * x2), inverse = TRUE)),
y2 = rbinom(nn, size = 1, prob = logit(sin(2 * x2), inverse = TRUE)))
fit2 <- vgam(cbind(y1, y2) ~ x1 + s(x2, 3), trace = TRUE,
binom2.or(exchangeable = TRUE), data = bdata)
coef(fit2, matrix = TRUE) # Hard to interpret
## Not run: plot(fit2, se = TRUE, which.term = 2, scol = "blue")
sc.studentt2 689

sc.studentt2 Scaled Student t Distribution with 2 df Family Function

Description
Estimates the location and scale parameters of a scaled Student t distribution with 2 degrees of
freedom, by maximum likelihood estimation.

Usage
sc.studentt2(percentile = 50, llocation = "identitylink", lscale = "loge",
ilocation = NULL, iscale = NULL, imethod = 1, zero = "scale")

Arguments
percentile A numerical vector containing values between 0 and 100, which are the quantiles
and expectiles. They will be returned as fitted values.
llocation, lscale
See Links for more choices, and CommonVGAMffArguments.
ilocation, iscale, imethod, zero
See CommonVGAMffArguments for details.

Details
Koenker (1993) solved for the distribution whose quantiles are equal to its expectiles. Its canonical
form has mean and mode at 0, and has a heavy tail (in fact, its variance is infinite).
The standard (canonical) form of this distribution can be endowed with a location and scale
parameter. The standard form has a density that can be written as
f (z) = 2/(4 + z 2 )3/2
for real y. Then z = (y a)/b for location and scale parameters a and b > 0. The mean of Y is
a. By default, 1 = a) and 2 = log(b). The expectiles/quantiles corresponding to percentile
are returned as the fitted values; in particular, percentile = 50 corresponds to the mean (0.5
expectile) and median (0.5 quantile).

Note that if Y has a standard dsc.t2 then Y = 2T2 where T2 has a Student-t distribution with 2
degrees of freedom. The two parameters here can also be estimated using studentt2 by specifying
df = 2 and making an adjustment for the scale parameter, however, this VGAM family function is
more efficient since the EIM is known (Fisher scoring is implemented.)

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, rrvglm and vgam.

Author(s)
T. W. Yee
690 Select

References

Koenker, R. (1993) When are expectiles percentiles? (solution) Econometric Theory, 9, 526527.

See Also

dsc.t2, studentt2.

Examples
set.seed(123); nn <- 1000
kdata <- data.frame(x2 = sort(runif(nn)))
kdata <- transform(kdata, mylocat = 1 + 3 * x2,
myscale = 1)
kdata <- transform(kdata, y = rsc.t2(nn, loc = mylocat, scale = myscale))
fit <- vglm(y ~ x2, sc.studentt2(perc = c(1, 50, 99)), data = kdata)
fit2 <- vglm(y ~ x2, studentt2(df = 2), data = kdata) # 'same' as fit

coef(fit, matrix = TRUE)

head(fitted(fit))
head(predict(fit))

# Nice plot of the results

## Not run: plot(y ~ x2, data = kdata, col = "blue", las = 1,
sub = paste("n =", nn),
main = "Fitted quantiles/expectiles using the sc.studentt2() distribution")
matplot(with(kdata, x2), fitted(fit), add = TRUE, type = "l", lwd = 3)
legend("bottomright", lty = 1:3, lwd = 3, legend = colnames(fitted(fit)),
col = 1:3)
## End(Not run)

fit@extra$percentile # Sample quantiles

Select Select Variables for a Formula Response or the RHS of a Formula

Description

Select variables from a data frame whose names begin with a certain character string.

Usage

Select(data = list(), prefix = "y",

lhs = NULL, rhs = NULL, rhs2 = NULL, rhs3 = NULL,
as.character = FALSE, as.formula.arg = FALSE, tilde = TRUE,
exclude = NULL, sort.arg = TRUE)
Select 691

Arguments
data A data frame or a matrix.
prefix A vector of character strings, or a logical. If a character then the variables cho-
sen from data begin with the value of prefix. If a logical then only TRUE is
accepted and all the variables in data are chosen.
lhs A character string. The response of a formula.
rhs A character string. Included as part of the RHS a formula. Set rhs = "0" to
suppress the intercept.
rhs2, rhs3 Same as rhs but appended to its RHS, i.e., paste0(rhs, " + ", rhs2, " + ", rhs3).
If used, rhs should be used first, and then possibly rhs2 and then possibly rhs3.
as.character Logical. Return the answer as a character string?
as.formula.arg Logical. Is the answer a formula?
tilde Logical. If as.character and as.formula.arg are both TRUE then include the
tilde in the formula?
exclude Vector of character strings. Exclude these variables explicitly.
sort.arg Logical. Sort the variables?

Details
This is meant as a utility function to avoid manually: (i) making a cbind call to construct a big
matrix response, and (ii) constructing a formula involving a lot of terms. The savings can be made
because the variables of interest begin with some prefix, e.g., with the character "y".

Value
If as.character = FALSE and as.formula.arg = FALSE then a matrix such as cbind(y1, y2, y3).
If as.character = TRUE and as.formula.arg = FALSE then a character string such as "cbind(y1, y2, y3)".
If as.character = FALSE and as.formula.arg = TRUE then a formula such as lhs ~ y1 + y2 + y3.
If as.character = TRUE and as.formula.arg = TRUE then a character string such as "lhs ~ y1 + y2 + y3".
See the examples below. By default, if no variables beginning the the value of prefix is found then
a NULL is returned. Setting prefix = " " is a way of selecting no variables.

Note
This function is a bit experimental at this stage and may change in the short future. Some of its utility
may be better achieved using subset and its select argument, e.g., subset(pdata, TRUE, select = y01:y10).
For some models such as posbernoulli.t the order of the variables in the xij argument is crucial,
therefore care must be taken with the argument sort.arg. In some instances, it may be good to
rename variables y1 to y01, y2 to y02, etc. when there are variables such as y14.
Currently subsetcol() and Select() are identical. One of these functions might be withdrawn in
the future.

Author(s)
T. W. Yee.
692 seq2binomial

See Also
vglm, cbind, subset, formula, fill.

Examples
Pneumo <- pneumo
colnames(Pneumo) <- c("y1", "y2", "y3", "x2") # The "y" variables are response
Pneumo$x1 <- 1; Pneumo$x3 <- 3; Pneumo$x <- 0; Pneumo$x4 <- 4 # Add these

Select(data = Pneumo) # Same as with(Pneumo, cbind(y1, y2, y3))

Select(Pneumo, "x")
Select(Pneumo, "x", sort = FALSE, as.char = TRUE)
Select(Pneumo, "x", exclude = "x1")
Select(Pneumo, "x", exclude = "x1", as.char = TRUE)
Select(Pneumo, c("x", "y"))
Select(Pneumo, "z") # Now returns a NULL
Select(Pneumo, " ") # Now returns a NULL
Select(Pneumo, prefix = TRUE, as.formula = TRUE)
Select(Pneumo, "x", exclude = c("x3", "x1"), as.formula = TRUE,
lhs = "cbind(y1, y2, y3)", rhs = "0")
Select(Pneumo, "x", exclude = "x1", as.formula = TRUE, as.char = TRUE,
lhs = "cbind(y1, y2, y3)", rhs = "0")

# Now a 'real' example:

Huggins89table1 <- transform(Huggins89table1, x3.tij = t01)
tab1 <- subset(Huggins89table1,
rowSums(Select(Huggins89table1, "y")) > 0)
# Same as
# subset(Huggins89table1, y1 + y2 + y3 + y4 + y5 + y6 + y7 + y8 + y9 + y10 > 0)

# Long way to do it:

fit.th <-
vglm(cbind(y01, y02, y03, y04, y05, y06, y07, y08, y09, y10) ~ x2 + x3.tij,
xij = list(x3.tij ~ t01 + t02 + t03 + t04 + t05 + t06 + t07 + t08 +
t09 + t10 - 1),
posbernoulli.t(parallel.t = TRUE ~ x2 + x3.tij),
data = tab1, trace = TRUE,
form2 = ~ x2 + x3.tij + t01 + t02 + t03 + t04 + t05 + t06 + t07 + t08 +
t09 + t10)
# Short way to do it:
Fit.th <- vglm(Select(tab1, "y") ~ x2 + x3.tij,
xij = list(Select(tab1, "t", as.formula = TRUE,
sort = FALSE, lhs = "x3.tij", rhs = "0")),
posbernoulli.t(parallel.t = TRUE ~ x2 + x3.tij),
data = tab1, trace = TRUE,
form2 = Select(tab1, prefix = TRUE, as.formula = TRUE))

seq2binomial The Two-stage Sequential Binomial Distribution Family Function

seq2binomial 693

Description
Estimation of the probabilities of a two-stage binomial distribution.

Usage
seq2binomial(lprob1 = "logit", lprob2 = "logit",
iprob1 = NULL, iprob2 = NULL,
parallel = FALSE, zero = NULL)

Arguments
lprob1, lprob2 Parameter link functions applied to the two probabilities, called p and q below.
See Links for more choices.
iprob1, iprob2 Optional initial value for the first and second probabilities respectively. A NULL
means a value is obtained in the initialize slot.
parallel, zero Details at Links. If parallel = TRUE then the constraint also applies to the
intercept.

Details
This VGAM family function fits the model described by Crowder and Sweeting (1989) which is
described as follows. Each of m spores has a probability p of germinating. Of the y1 spores that
germinate, each has a probability q of bending in a particular direction. Let y2 be the number that
bend in the specified direction. The probability model for this data is P (y1 , y2 ) =

m y1 y1 y2
p (1 p)my1 q (1 q)y1 y2
y1 y2

for 0 < p < 1, 0 < q < 1, y1 = 1, . . . , m and y2 = 1, . . . , y1 . Here, p is prob1, q is prob2.

Although the Authors refer to this as the bivariate binomial model, I have named it the (two-stage)
sequential binomial model. Fisher scoring is used.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm and vgam.

Note
The response must be a two-column matrix of sample proportions corresponding to y1 and y2 . The
m values should be inputted with the weights argument of vglm and vgam. The fitted value is a
two-column matrix of estimated probabilities p and q. A common form of error is when there are
no trials for y1 , e.g., if mvector below has some values which are zero.

Author(s)
Thomas W. Yee
694 setup.smart

References
Crowder, M. and Sweeting, T. (1989). Bayesian inference for a bivariate binomial distribution.
Biometrika, 76, 599603.

setup.smart Smart Prediction Setup

Description
Sets up smart prediction in one of two modes: "write" and "read".

Usage
setup.smart(mode.arg, smart.prediction = NULL, max.smart = 30)

Arguments
mode.arg mode.arg must be "write" or "read". If in "read" mode then smart.prediction
must be assigned the data structure .smart.prediction that was created while
fitting. This is stored in [email protected] or object$smart.prediction
where object is the name of the fitted object.
Simplex 695

smart.prediction
If in "read" mode then smart.prediction must be assigned the list of data de-
pendent parameters, which is stored on the fitted object. Otherwise, smart.prediction
is ignored.
max.smart max.smart is the initial length of the list .smart.prediction. It is not impor-
tant because .smart.prediction is made larger if needed.

Details
This function is only required by programmers writing a modelling function such as lm and glm, or a
prediction functions of such, e.g., predict.lm. The function setup.smart operates by mimicking
the operations of a first-in first-out stack (better known as a queue).

Value
Nothing is returned.

Side Effects
In "write" mode .smart.prediction in smartpredenv is assigned an empty list with max.smart
components. In "read" mode .smart.prediction in smartpredenv is assigned smart.prediction.
Then .smart.prediction.counter in smartpredenv is assigned the value 0, and .smart.prediction.mode
and .max.smart are written to smartpredenv too.

## Not run: # Put at the beginning of predict.lm

setup.smart("read", smart.prediction = object$smart.prediction)

## End(Not run)

Simplex Simplex Distribution

Description
Density function, and random generation for the simplex distribution.
696 Simplex

Usage

dsimplex(x, mu = 0.5, dispersion = 1, log = FALSE)

rsimplex(n, mu = 0.5, dispersion = 1)

Arguments

x Vector of quantiles. The support of the distribution is the interval (0, 1).
mu, dispersion Mean and dispersion parameters. The former lies in the interval (0, 1) and the
latter is positive.
n, log Same usage as runif.

Details

The VGAM family function simplex fits this model; see that online help for more information. For
rsimplex() the rejection method is used; it may be very slow if the density is highly peaked, and
will fail if the density asymptotes at the boundary.

Value

dsimplex(x) gives the density function, rsimplex(n) gives n random variates.

Author(s)

T. W. Yee

See Also

simplex.

Examples

sigma <- c(4, 2, 1) # Dispersion parameter

mymu <- c(0.1, 0.5, 0.7); xxx <- seq(0, 1, len = 501)
## Not run: par(mfrow = c(3, 3)) # Figure 2.1 of Song (2007)
for (iii in 1:3)
for (jjj in 1:3) {
plot(xxx, dsimplex(xxx, mymu[jjj], sigma[iii]),
type = "l", col = "blue", xlab = "", ylab = "", main =
paste("mu = ", mymu[jjj], ", sigma = ", sigma[iii], sep = "")) }
## End(Not run)
simplex 697

simplex Simplex Distribution Family Function

Description
The two parameters of the univariate standard simplex distribution are estimated by full maximum
likelihood estimation.

Usage
simplex(lmu = "logit", lsigma = "loge", imu = NULL, isigma = NULL,
imethod = 1, ishrinkage = 0.95, zero = "sigma")

Arguments
lmu, lsigma Link function for mu and sigma. See Links for more choices.
imu, isigma Optional initial values for mu and sigma. A NULL means a value is obtained
internally.
imethod, ishrinkage, zero
See CommonVGAMffArguments for information.

Details
The probability density function can be written

f (y; , ) = [2 2 (y(1 y))3 ]0.5 exp[0.5(y )2 /( 2 y(1 y)2 (1 )2 )]

for 0 < y < 1, 0 < < 1, and > 0. The mean of Y is (called mu, and returned as the fitted
values).
The second parameter, sigma, of this standard simplex distribution is known as the dispersion pa-
rameter. The unit variance function is V () = 3 (1 )3 . Fisher scoring is applied to both
parameters.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, and vgam.

Note
This distribution is potentially useful for dispersion modelling. Numerical problems may occur
when mu is very close to 0 or 1.

Author(s)
T. W. Yee
698 simulate.vlm

References
Jorgensen, B. (1997) The Theory of Dispersion Models. London: Chapman & Hall
Song, P. X.-K. (2007) Correlated Data Analysis: Modeling, Analytics, and Applications. Springer.

See Also
dsimplex, dirichlet, rig, binomialff.

Examples
sdata <- data.frame(x2 = runif(nn <- 1000))
sdata <- transform(sdata, eta1 = 1 + 2 * x2,
eta2 = 1 - 2 * x2)
sdata <- transform(sdata, y = rsimplex(nn, mu = logit(eta1, inverse = TRUE),
dispersion = exp(eta2)))
(fit <- vglm(y ~ x2, simplex(zero = NULL), data = sdata, trace = TRUE))
coef(fit, matrix = TRUE)
summary(fit)

simulate.vlm Simulate Responses for VGLMs and VGAMs

Description
Simulate one or more responses from the distribution corresponding to a fitted model object.

Usage
## S3 method for class 'vlm'
simulate(object, nsim = 1, seed = NULL, ...)

Arguments
object an object representing a fitted model. Usually an object of class vglm-class or
vgam-class.
nsim, seed Same as simulate.
... additional optional arguments.

Details
This is a methods function for simulate and hopefully should behave in a very similar manner.
Only VGAM family functions with a simslot slot have been implemented for simulate.

Value
Similar to simulate. Note that many VGAM family functions can handle multiple responses. This
can result in a longer data frame with more rows (nsim multiplied by n rather than the ordinary
n). In the future an argument may be available so that there is always n rows no matter how many
responses were inputted.
simulate.vlm 699

Warning
With multiple response and/or multivariate responses, the order of the elements may differ. For
some VGAM families, the order is n N F , where n is the sample size, N is nsim and F is
ncol(fitted(vglmObject)). For other VGAM families, the order is n F N . An example of
each is given below.

See Also
Currently the VGAM family functions with a simslot slot are: alaplace1, alaplace2, betabinomial,
betabinomialff, betaR, betaff, biamhcop, bifrankcop, bilogistic, binomialff, binormal,
binormalcop, biclaytoncop, cauchy, cauchy1, chisq, dirichlet, dagum, erlang, exponential,
bifgmcop, fisk, gamma1, gamma2, gammaR, gengamma.stacy, geometric, gompertz, gumbelII,
hzeta, inv.lomax, inv.paralogistic, kumar, lgamma1, lgamma3, lindley, lino, logff, logistic1,
logistic, lognormal, lomax, makeham, negbinomial, negbinomial.size, paralogistic, perks,
poissonff, posnegbinomial, posnormal, pospoisson, polya, polyaR, posbinomial, rayleigh,
riceff, simplex, sinmad, slash, studentt, studentt2, studentt3, triangle, uninormal,
yulesimon, zageometric, zageometricff, zanegbinomial, zanegbinomialff, zapoisson, zapoissonff,
zigeometric, zigeometricff, zinegbinomial, zipf, zipoisson, zipoissonff.
See also RNG about random number generation in R, vglm, vgam for model fitting.

Examples
nn <- 10; mysize <- 20; set.seed(123)
bdata <- data.frame(x2 = rnorm(nn))
bdata <- transform(bdata,
y1 = rbinom(nn, size = mysize, p = logit(1+x2, inverse = TRUE)),
y2 = rbinom(nn, size = mysize, p = logit(1+x2, inverse = TRUE)),
f1 = factor(as.numeric(rbinom(nn, size = 1,
p = logit(1+x2, inverse = TRUE)))))
(fit1 <- vglm(cbind(y1, aaa = mysize - y1) ~ x2, # Matrix response (2-colns)
binomialff, data = bdata))
(fit2 <- vglm(f1 ~ x2, binomialff, model = TRUE, data = bdata)) # Factor response

set.seed(123); simulate(fit1, nsim = 8)

set.seed(123); c(simulate(fit2, nsim = 3)) # Use c() when model = TRUE

# An n x N x F example
set.seed(123); n <- 100
bdata <- data.frame(x2 = runif(n), x3 = runif(n))
bdata <- transform(bdata, y1 = rnorm(n, 1 + 2 * x2),
y2 = rnorm(n, 3 + 4 * x2))
fit1 <- vglm(cbind(y1, y2) ~ x2, binormal(eq.sd = TRUE), data = bdata)
nsim <- 1000 # Number of simulations for each observation
my.sims <- simulate(fit1, nsim = nsim)
dim(my.sims) # A data frame
aaa <- array(unlist(my.sims), c(n, nsim, ncol(fitted(fit1)))) # n by N by F
summary(rowMeans(aaa[, , 1]) - fitted(fit1)[, 1]) # Should be all 0s
summary(rowMeans(aaa[, , 2]) - fitted(fit1)[, 2]) # Should be all 0s

# An n x F x N example
700 Sinmad

n <- 100; set.seed(111); nsim <- 1000

zdata <- data.frame(x2 = runif(n))
zdata <- transform(zdata, lambda1 = loge(-0.5 + 2 * x2, inverse = TRUE),
lambda2 = loge( 0.5 + 2 * x2, inverse = TRUE),
pstr01 = logit( 0, inverse = TRUE),
pstr02 = logit(-1.0, inverse = TRUE))
zdata <- transform(zdata, y1 = rzipois(n, lambda = lambda1, pstr0 = pstr01),
y2 = rzipois(n, lambda = lambda2, pstr0 = pstr02))
zip.fit <- vglm(cbind(y1, y2) ~ x2, zipoissonff, data = zdata, crit = "coef")
my.sims <- simulate(zip.fit, nsim = nsim)
dim(my.sims) # A data frame
aaa <- array(unlist(my.sims), c(n, ncol(fitted(zip.fit)), nsim)) # n by F by N
summary(rowMeans(aaa[, 1, ]) - fitted(zip.fit)[, 1]) # Should be all 0s
summary(rowMeans(aaa[, 2, ]) - fitted(zip.fit)[, 2]) # Should be all 0s

Sinmad The Singh-Maddala Distribution

Description
Density, distribution function, quantile function and random generation for the Singh-Maddala dis-
tribution with shape parameters a and q, and scale parameter scale.

Usage
dsinmad(x, scale = 1, shape1.a, shape3.q, log = FALSE)
psinmad(q, scale = 1, shape1.a, shape3.q, lower.tail = TRUE, log.p = FALSE)
qsinmad(p, scale = 1, shape1.a, shape3.q, lower.tail = TRUE, log.p = FALSE)
rsinmad(n, scale = 1, shape1.a, shape3.q)

Arguments
x, q vector of quantiles.
p vector of probabilities.
n number of observations. If length(n) > 1, the length is taken to be the number
required.
shape1.a, shape3.q
shape parameters.
scale scale parameter.
log Logical. If log = TRUE then the logarithm of the density is returned.
lower.tail, log.p
Same meaning as in pnorm or qnorm.

Details
See sinmad, which is the VGAM family function for estimating the parameters by maximum like-
lihood estimation.
sinmad 701

Value
dsinmad gives the density, psinmad gives the distribution function, qsinmad gives the quantile
function, and rsinmad generates random deviates.

Note
The Singh-Maddala distribution is a special case of the 4-parameter generalized beta II distribution.

Author(s)
T. W. Yee and Kai Huang

References
Kleiber, C. and Kotz, S. (2003) Statistical Size Distributions in Economics and Actuarial Sciences,
Hoboken, NJ, USA: Wiley-Interscience.

sinmad Singh-Maddala Distribution Family Function

Description
Maximum likelihood estimation of the 3-parameter Singh-Maddala distribution.

Usage
sinmad(lscale = "loge", lshape1.a = "loge", lshape3.q = "loge",
iscale = NULL, ishape1.a = NULL, ishape3.q = NULL, imethod = 1,
lss = TRUE, gscale = exp(-5:5), gshape1.a = exp(-5:5),
gshape3.q = exp(-5:5), probs.y = c(0.25, 0.5, 0.75),
zero = "shape")
702 sinmad

Arguments
lss See CommonVGAMffArguments for important information.
lshape1.a, lscale, lshape3.q
Parameter link functions applied to the (positive) parameters a, scale, and q.
See Links for more choices.
iscale, ishape1.a, ishape3.q, imethod, zero
See CommonVGAMffArguments for information. For imethod = 2 a good initial
value for ishape3.q is needed to obtain good estimates for the other parameters.
gscale, gshape1.a, gshape3.q
See CommonVGAMffArguments for information.
probs.y See CommonVGAMffArguments for information.

Details
The 3-parameter Singh-Maddala distribution is the 4-parameter generalized beta II distribution with
shape parameter p = 1. It is known under various other names, such as the Burr XII (or just the
Burr distribution), Pareto IV, beta-P, and generalized log-logistic distribution. More details can be
found in Kleiber and Kotz (2003).
Some distributions which are special cases of the 3-parameter Singh-Maddala are the Lomax (a =
1), Fisk (q = 1), and paralogistic (a = q).
The Singh-Maddala distribution has density

f (y) = aqy a1 /[ba {1 + (y/b)a }1+q ]

for a > 0, b > 0, q > 0, y 0. Here, b is the scale parameter scale, and the others are shape
parameters. The cumulative distribution function is

F (y) = 1 [1 + (y/b)a ]q .

The mean is
E(Y ) = b (1 + 1/a) (q 1/a)/(q)
provided a < 1 < aq; these are returned as the fitted values. This family function handles
multiple responses.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, and vgam.

Note
See the notes in genbetaII.

Author(s)
T. W. Yee
Skellam 703

References
Kleiber, C. and Kotz, S. (2003) Statistical Size Distributions in Economics and Actuarial Sciences,
Hoboken, NJ, USA: Wiley-Interscience.

See Also
Sinmad, genbetaII, betaII, dagum, fisk, inv.lomax, lomax, paralogistic, inv.paralogistic,
simulate.vlm.

Examples
sdata <- data.frame(y = rsinmad(n = 1000, shape1 = exp(1),
scale = exp(2), shape3 = exp(0)))
fit <- vglm(y ~ 1, sinmad(lss = FALSE), data = sdata, trace = TRUE)
fit <- vglm(y ~ 1, sinmad(lss = FALSE, ishape1.a = exp(1)),
data = sdata, trace = TRUE)
coef(fit, matrix = TRUE)
Coef(fit)
summary(fit)

# Harder problem (has the shape3.q parameter going to infinity)

set.seed(3)
sdata <- data.frame(y1 = rbeta(1000, 6, 6))
# hist(with(sdata, y1))
if (FALSE) {
# These struggle
fit1 <- vglm(y1 ~ 1, sinmad(lss = FALSE), data = sdata, trace = TRUE)
fit1 <- vglm(y1 ~ 1, sinmad(lss = FALSE), data = sdata, trace = TRUE,
crit = "coef")
Coef(fit1)
}
# Try this remedy:
fit2 <- vglm(y1 ~ 1, sinmad(lss = FALSE, ishape3.q = 3, lshape3.q = "loglog"),
data = sdata, trace = TRUE, stepsize = 0.05, maxit = 99)
coef(fit2, matrix = TRUE)
Coef(fit2)

Skellam The Skellam Distribution

Description
Density and random generation for the Skellam distribution.

Usage
dskellam(x, mu1, mu2, log = FALSE)
rskellam(n, mu1, mu2)
704 skellam

Arguments
x vector of quantiles.
n number of observations. Same as runif.
mu1, mu2 See skellam
.
log Logical; if TRUE, the logarithm is returned.

Details
See skellam, the VGAM family function for estimating the parameters, for the formula of the
probability density function and other details.

Value
dskellam gives the density, and rskellam generates random deviates.

Warning
Numerical problems may occur for data if 1 and/or 2 are large. The normal approximation for
this case has not been implemented yet.

skellam Skellam Distribution Family Function

Description
Estimates the two parameters of a Skellam distribution by maximum likelihood estimation.

Usage
skellam(lmu1 = "loge", lmu2 = "loge", imu1 = NULL, imu2 = NULL,
nsimEIM = 100, parallel = FALSE, zero = NULL)
skellam 705

Arguments

lmu1, lmu2 Link functions for the 1 and 2 parameters. See Links for more choices and
for general information.
imu1, imu2 Optional initial values for the parameters. See CommonVGAMffArguments for
more information. If convergence failure occurs (this VGAM family function
seems to require good initial values) try using these arguments.
nsimEIM, parallel, zero
See CommonVGAMffArguments for information. In particular, setting parallel=TRUE
will constrain the two means to be equal.

Details

The Skellam distribution models the difference between two independent Poisson distributions
(with means j , say). It has density function
y/2
1
f (y; 1 , 2 ) = exp(1 2 ) I|y| (2 1 2 )
2

where y is an integer, 1 > 0, 2 > 0. Here, Iv is the modified Bessel function of the first kind
with order v.
The mean is 1 2 (returned as the fitted values), and the variance is 1 + 2 . Simulated Fisher
scoring is implemented.

Value

An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm and vgam.

Warning

This VGAM family function seems fragile and very sensitive to the initial values. Use very cau-
tiously!!

Note

Numerical problems may occur for data if 1 and/or 2 are large.

References

Skellam, J. G. (1946) The frequency distribution of the difference between two Poisson variates
belonging to different populations. Journal of the Royal Statistical Society, Series A, 109, 296.

See Also

dskellam, dpois, poissonff.

706 skewnorm

Examples
## Not run:
sdata <- data.frame(x2 = runif(nn <- 1000))
sdata <- transform(sdata, mu1 = exp(1 + x2), mu2 = exp(1 + x2))
sdata <- transform(sdata, y = rskellam(nn, mu1, mu2))
fit1 <- vglm(y ~ x2, skellam, data = sdata, trace = TRUE, crit = "coef")
fit2 <- vglm(y ~ x2, skellam(parallel = TRUE), data = sdata, trace = TRUE)
coef(fit1, matrix = TRUE)
coef(fit2, matrix = TRUE)
summary(fit1)
# Likelihood ratio test for equal means:
pchisq(2 * (logLik(fit1) - logLik(fit2)),
df = df.residual(fit2) - df.residual(fit1), lower.tail = FALSE)
lrtest(fit1, fit2) # Alternative

## End(Not run)

skewnorm Skew-Normal Distribution

Description
Density and random generation for the univariate skew-normal distribution.

Usage
dskewnorm(x, location = 0, scale = 1, shape = 0, log = FALSE)
rskewnorm(n, location = 0, scale = 1, shape = 0)

Arguments
x vector of quantiles.
n number of observations. Same as runif.
location The location parameter . A vector.
scale The scale parameter . A positive vector.
shape The shape parameter. It is called in skewnormal.
log Logical. If log=TRUE then the logarithm of the density is returned.

Details
See skewnormal, which currently only estimates the shape parameter. More generally here, Z =
+ Y where Y has a standard skew-normal distribution (see skewnormal), is the location
parameter and is the scale parameter.

Value
dskewnorm gives the density, rskewnorm generates random deviates.
skewnormal 707

Note

The default values of all three parameters corresponds to the skew-normal being the standard normal
distribution.

Author(s)

T. W. Yee

References

http://tango.stat.unipd.it/SN.

See Also

skewnormal.

Examples
## Not run: N <- 200 # Grid resolution
shape <- 7; x <- seq(-4, 4, len = N)
plot(x, dskewnorm(x, shape = shape), type = "l", col = "blue", las = 1,
ylab = "", lty = 1, lwd = 2)
abline(v = 0, h = 0, col = "grey")
lines(x, dnorm(x), col = "orange", lty = 2, lwd = 2)
legend("topleft", leg = c(paste("Blue = dskewnorm(x, ", shape,")", sep = ""),
"Orange = standard normal density"), lty = 1:2, lwd = 2,
col = c("blue", "orange"))
## End(Not run)

skewnormal Univariate Skew-Normal Distribution Family Function

Description

Maximum likelihood estimation of the shape parameter of a univariate skew-normal distribution.

Usage

skewnormal(lshape = "identitylink", ishape = NULL, nsimEIM = NULL)

Arguments
lshape, ishape, nsimEIM
See Links and CommonVGAMffArguments.
708 skewnormal

Details

The univariate skew-normal distribution has a density function that can be written

f (y) = 2 (y) (y)

where is the shape parameter. Here, is the standard normal density and its cumulative
distribution function. When = 0 the result is a standard normal distribution. When = 1 it
models the distribution of the maximum of two independent standard normal variates. When the
absolute value of the shape parameter increases the skewness of the distribution increases. The
limit as the shape parameter tends to positive infinity results in the folded normal distribution or
half-normal distribution. When the shape parameter changes its sign, the density is reflected about
y = 0.
p
The mean of the distribution is = 2/((1 + 2 )) and these are returned as the fitted values.
The variance of the distribution is 1 2 . The Newton-Raphson algorithm is used unless the
nsimEIM argument is used.

Value

An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, and vgam.

Warning

It is well known that the EIM of Azzalinis skew-normal distribution is singular for skewness pa-
rameter tending to zero, and thus produces influential problems.

Note

It is a good idea to use several different initial values to ensure that the global solution is obtained.
This family function will be modified (hopefully soon) to handle a location and scale parameter too.

Author(s)

Thomas W. Yee

References

Azzalini, A. A. (1985) A class of distributions which include the normal. Scandinavian Journal of
Statistics, 12, 171178.
Azzalini, A. and Capitanio, A. (1999) Statistical applications of the multivariate skew-normal dis-
tribution. Journal of the Royal Statistical Society, Series B, Methodological, 61, 579602.

See Also

skewnorm, uninormal, foldnormal.

Slash 709

Examples
sdata <- data.frame(y1 = rskewnorm(nn <- 1000, shape = 5))
fit1 <- vglm(y1 ~ 1, skewnormal, data = sdata, trace = TRUE)
coef(fit1, matrix = TRUE)
head(fitted(fit1), 1)
with(sdata, mean(y1))
## Not run: with(sdata, hist(y1, prob = TRUE))
x <- with(sdata, seq(min(y1), max(y1), len = 200))
with(sdata, lines(x, dskewnorm(x, shape = Coef(fit1)), col = "blue"))
## End(Not run)

sdata <- data.frame(x2 = runif(nn))

sdata <- transform(sdata, y2 = rskewnorm(nn, shape = 1 + 2*x2))
fit2 <- vglm(y2 ~ x2, skewnormal, data = sdata, trace = TRUE, crit = "coef")
summary(fit2)

Slash Slash Distribution

Description

Density function, distribution function, and random generation for the slash distribution.

Usage

dslash(x, mu = 0, sigma = 1, log = FALSE, smallno = .Machine$double.eps*1000)

pslash(q, mu = 0, sigma = 1, very.negative = -10000,
lower.tail = TRUE, log.p = FALSE)
rslash(n, mu = 0, sigma = 1)

Arguments

x, q vector of quantiles.
n Same as runif.
mu, sigma the mean and standard deviation of the univariate normal distribution.
log Logical. If TRUE then the logarithm of the density is returned.
very.negative Numeric, of length 1. A large negative value. For (q-mu)/sigma values less
than this, the value 0 is returned because integrate tends to fail. A warning is
issued. Similarly, if (q-mu)/sigma is greater than abs(very.negative) then 1
is returned with a warning.
smallno See slash.
lower.tail, log.p
Same meaning as in pnorm or qnorm.
710 slash

Details
See slash, the VGAM family function for estimating the two parameters by maximum likelihood
estimation, for the formula of the probability density function and other details.
Function pslash uses a for () loop and integrate, meaning its very slow. It may also be
inaccurate for extreme values of q, and returns with 1 or 0 values when too extreme compared to
very.negative.

Value
dslash gives the density, and pslash gives the distribution function, rslash generates random
deviates.

Note
pslash is very slow.

Author(s)
Thomas W. Yee and C. S. Chee

curve(pslash, col = "blue", -5, 5, ylim = 0:1)

pslash(c(-Inf, -20000, 20000, Inf)) # Gives a warning

## End(Not run)

slash Slash Distribution Family Function

Description
Estimates the two parameters of the slash distribution by maximum likelihood estimation.
slash 711

Usage

slash(lmu = "identitylink", lsigma = "loge",

imu = NULL, isigma = NULL, gprobs.y = ppoints(8), nsimEIM = 250,
zero = NULL, smallno = .Machine$double.eps*1000)

Arguments

lmu, lsigma Parameter link functions applied to the and parameters, respectively. See
Links for more choices.
imu, isigma Initial values. A NULL means an initial value is chosen internally. See CommonVGAMffArguments
for more information.
gprobs.y Used to compute the initial values for mu. This argument is fed into the probs
argument of quantile to construct a grid, which is used to evaluate the log-
likelihood. This must have values between 0 and 1.
nsimEIM, zero See CommonVGAMffArguments for information.
smallno Small positive number, used to test for the singularity.

Details

The standard slash distribution is the distribution of the ratio of a standard normal variable to an
independent standard uniform(0,1) variable. It is mainly of use in simulation studies. One of its
properties is that it has heavy tails, similar to those of the Cauchy.
The general slash distribution can be obtained by replacing the univariate normal variable by a
general normal N (, ) random variable. It has a density that can be written as
p
1/(2 (2)) if y = ,
f (y) = 2
p 2
1 exp((((y )/) )/2))/( (2pi)((y )/) ) if y 6= .

where and are the mean and standard deviation of the univariate normal distribution respectively.

Value

An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, and vgam.

Note

Fisher scoring using simulation is used. Convergence is often quite slow. Numerical problems may
occur.

Author(s)

T. W. Yee and C. S. Chee

712 sm.os

References
Johnson, N. L. and Kotz, S. and Balakrishnan, N. (1994) Continuous Univariate Distributions, 2nd
edition, Volume 1, New York: Wiley.
Kafadar, K. (1982) A Biweight Approach to the One-Sample Problem Journal of the American
Statistical Association, 77, 416424.

sm.os Defining OSullivan Spline Smooths in VGAM Formulas

Description
This function represents an O-spline smooth term in a vgam formula and confers automatic smooth-
ing parameter selection.

Usage
sm.os(x, ..., niknots = 6, spar = -1, o.order = 2,
alg.niknots = c("s", ".nknots.smspl")[1], all.knots = FALSE,
ridge.adj = 1e-5, spillover = 0.01, maxspar = 1e12,
outer.ok = FALSE, fixspar = FALSE)

Arguments
x covariate (abscissae) to be smoothed. Also called the regressor. If the xij facil-
ity is used then these covariates are inputted via the ... argument.
... Used to accommodate the other M 1 covariates when the xij facility is used.
See Section 3.4.4 of Yee (2015) for something very similar. This argument,
found in the second argument, means that the other argument names must be
fully specified if used, e.g., outer.ok and not outer. See the example below. In
the example below, the term in the main formula is sm.os(gcost.air, gcost.trn, gcost.bus)
and one might be tempted to use something like sm.os(gcost) to represent that
xij term. However, this is not recommended because sm.os(gcost) might not
sm.os 713

have the same number of columns as sm.os(gcost.air, gcost.trn, gcost.bus)

etc. That is, it is best to select one of the diagonal elements of the block matrix
to represent that term.
niknots numeric, the number of interior knots, called K below. The default is to use this
value. If you want alg.niknots to operate then assign NULL to this argument.
alg.niknots character. The algorithm used to determine the number of interior knots. Only
used when all.knots = FALSE and niknots = NULL. Note that ".nknots.smspl"
corresponds to the default of smooth.spline. The value "s" corresponds to the
same algorithm as s.
all.knots logical. If TRUE then all distinct points in x are used as the interior knots.
If FALSE (default) then a subset of x[] is used, specifically x[j] where the
niknots indices are quantiles that are evenly spaced with respect to the argu-
ment probssee quantile. If all.knots = FALSE and niknots = NULL then
the argument alg.niknots is used to compute niknots.
spar, maxspar spar is a vector of smoothing parameters. Negative values mean that magic
will choose initial values in order to do the optimization at each P-IRLS iter-
ation. Positive values mean that they are used as initial values for magic. If
fixspar = TRUE then spar should be assigned a vector of positive values (but
having values less than maxspar); then the smoothing parameters will be fixed
and magic will not be used.
o.order The order of the OSullivan penalzed spline. Any one value from 1:4 is accept-
able. The degree of the spline is 2 * o.order - 1, so that cubic splines are
the default. Setting o.order = 1 results in a linear spline which is a piecewise
linear function.
ridge.adj small positive number to stabilize linear dependencies among B-spline bases.
spillover small and positive proportion of the range used on the outside of the boundary
values. This defines the endpoints a and b that cover the data xi , i.e., we are
interested in the interval [a, b] which contains all the abscissae. The interior
knots are strictly inside (a, b).
outer.ok Fed into the argument (by the same name) of splineDesign.
fixspar logical. If TRUE then spar should be a vector with positive values and the
smoothing parameters are fixed at those values. If FALSE then spar contains
the initial values for the smoothing parameters, and magic is called to determine
(hopefully) some good values for the smoothing parameters.

Details
This function is currently used by vgam to allow automatic smoothing parameter selection based on
O-splines to minimize an UBRE quantity. In contrast, s operates by having a prespecified amount
of smoothing, e.g., its df argument. When the sample size is reasonably large this function is
recommended over s also because backfitting is not required. This function therefore allows 2nd-
generation VGAMs to be fitted (called G2-VGAMs, or Penalized-VGAMs).
This function should only be used with vgam. This function uses quantile to choose the knots,
whereas sm.ps chooses equally-spaced knots. As Wand and Ormerod (2008) write, in most sit-
uations the differences will be minor, but it is possible for problems to arise for either strategy
by constructing certain regression functions and predictor variable distributions. Any differences
714 sm.os

between O-splines and P-splines tend to be at the boundaries. O-splines have natural boundary
constraints so that the solution is linear beyond the boundary knots.
Some arguments in decreasing order of precedence are: all.knots, niknots, alg.niknots.
Unlike s, which is symbolic and does not perform any smoothing itself, this function does compute
the penalized spline when used by vgamit creates the appropriate columns of the model matrix.
When this function is used within vgam, automatic smoothing parameter selection is implemented
by calling magic after the necessary link-ups are done.
By default this function centres the component function. This function is also smart; it can be
used for smart prediction (Section 18.6 of Yee (2015)). Automatic smoothing parameter selection
is performed using performance-oriented iteration whereby an optimization problem is solved at
each IRLS iteration.
This function works better when the sample size is large, e.g., when in the hundreds, say.

Value

A matrix with attributes that are (only) used by vgam. The number of rows of the matrix is
length(x). The number of columns is a function of the number of interior knots K and the order
of the O-spline m: K + 2m 1. In code, this is niknots + 2 * o.order - 1.

Warning

Being introduced into VGAM for the first time, this function (and those associated with it) should
be used cautiously. Not all options are fully working or have been tested yet, and there are bound
to be some bugs lurking around.

Note

This function is currently under development and may change in the future.
One might try using this function with vglm so as to fit a regression spline, however, the default
value of niknots will probably be too high for most data sets.

Author(s)

T. W. Yee, with some of the essential R code coming from the appendix of Wand and Ormerod
(2008).

References

Wand, M. P. and Ormerod, J. T. (2008). On semiparametric regression with OSullivan penalized

splines. Australian and New Zealand Journal of Statistics, 50(2): 179198.

See Also

vgam, sm.ps, s, smartpred, is.smart, summarypvgam, smooth.spline, splineDesign, bs, magic.

sm.os 715

Examples

sm.os(runif(20))

## Not run:
data("TravelMode", package = "AER") # Need to install "AER" first
air.df <- subset(TravelMode, mode == "air") # Form 4 smaller data frames
bus.df <- subset(TravelMode, mode == "bus")
trn.df <- subset(TravelMode, mode == "train")
car.df <- subset(TravelMode, mode == "car")
TravelMode2 <- data.frame(income = air.df$income,
wait.air = air.df$wait - car.df$wait,
wait.trn = trn.df$wait - car.df$wait,
wait.bus = bus.df$wait - car.df$wait,
gcost.air = air.df$gcost - car.df$gcost,
gcost.trn = trn.df$gcost - car.df$gcost,
gcost.bus = bus.df$gcost - car.df$gcost,
wait = air.df$wait) # Value is unimportant
TravelMode2$mode <- subset(TravelMode, choice == "yes")$mode # The response
TravelMode2 <- transform(TravelMode2, incom.air = income, incom.trn = 0,
incom.bus = 0)
set.seed(1)
TravelMode2 <- transform(TravelMode2,
junkx2 = runif(nrow(TravelMode2)))

tfit2 <-
vgam(mode ~ sm.os(gcost.air, gcost.trn, gcost.bus) + ns(junkx2, 4) +
sm.os(incom.air, incom.trn, incom.bus) + wait ,
crit = "coef",
multinomial(parallel = FALSE ~ 1), data = TravelMode2,
xij = list(sm.os(gcost.air, gcost.trn, gcost.bus) ~
sm.os(gcost.air, gcost.trn, gcost.bus) +
sm.os(gcost.trn, gcost.bus, gcost.air) +
sm.os(gcost.bus, gcost.air, gcost.trn),
sm.os(incom.air, incom.trn, incom.bus) ~
sm.os(incom.air, incom.trn, incom.bus) +
sm.os(incom.trn, incom.bus, incom.air) +
sm.os(incom.bus, incom.air, incom.trn),
wait ~ wait.air + wait.trn + wait.bus),
form2 = ~ sm.os(gcost.air, gcost.trn, gcost.bus) +
sm.os(gcost.trn, gcost.bus, gcost.air) +
sm.os(gcost.bus, gcost.air, gcost.trn) +
wait +
sm.os(incom.air, incom.trn, incom.bus) +
sm.os(incom.trn, incom.bus, incom.air) +
sm.os(incom.bus, incom.air, incom.trn) +
junkx2 + ns(junkx2, 4) +
incom.air + incom.trn + incom.bus +
gcost.air + gcost.trn + gcost.bus +
wait.air + wait.trn + wait.bus)
par(mfrow = c(2, 2))
plot(tfit2, se = TRUE, lcol = "orange", scol = "blue", ylim = c(-4, 4))
summary(tfit2)
716 sm.ps

## End(Not run)

sm.ps Defining Penalized Spline Smooths in VGAM Formulas

Description
This function represents a P-spline smooth term in a vgam formula and confers automatic smoothing
parameter selection.

Usage
sm.ps(x, ..., ps.int = NULL, spar = -1, degree = 3, p.order = 2,
ridge.adj = 1e-5, spillover = 0.01, maxspar = 1e12,
outer.ok = FALSE, mux = NULL, fixspar = FALSE)

Arguments
x, ... See sm.os.
ps.int the number of equally-spaced B-spline intervals. Note that the number of knots
is equal to ps.int + 2*degree + 1. The default, signified by NULL, means that
the maximum of the value 7 and degree is chosen. This usually means 6 interior
knots for big data sets. However, if this is too high compared to the length of x,
then some adjustment is made. In the case where mux is assigned a numerical
value (suggestions: some value between 1 and 2) then ceiling(mux * log(length(unique(x.index)))
is used, where x.index is the combined data. No matter what, the above is not
guaranteed to work on every data set. This argument may change in the future.
See also argument mux.
spar, maxspar See sm.os.
mux numeric. If given, then this argument multiplies log(length(unique(x))) to
obtain ps.int. If ps.int is given then this argument is ignored.
degree degree of B-spline basis. Usually this will be 2 or 3; and the values 1 or 4 might
possibly be used.
p.order order of difference penalty (0 is the ridge penalty).
ridge.adj, spillover
See sm.os.
outer.ok, fixspar
See sm.os.

Details
This function can be used by vgam to allow automatic smoothing parameter selection based on
P-splines and minimizing an UBRE quantity.
This function should only be used with vgam and is an alternative to sm.os; see that function for
some details that also apply here.
sm.ps 717

Value
A matrix with attributes that are (only) used by vgam. The number of rows of the matrix is
length(x) and the number of columns is ps.int + degree - 1. The latter is because the
function is centred.

Warning
See sm.os.

Note
This function is currently under development and may change in the future. In particular, the default
for ps.int is subject to change.

Author(s)
B. D. Marx wrote the original function. Subsequent edits were made by T. W. Yee and C. Somchit.

References
Eilers, P. H. C. and Marx, B. D. (1996). Flexible smoothing with B-splines and penalties (with
comments and rejoinder). Statistical Science, 11(2): 89121.

See Also
sm.os, s, vgam, smartpred, is.smart, summarypvgam, splineDesign, bs, magic.

Examples
sm.ps(runif(20))
sm.ps(runif(20), ps.int = 5)

junkx2 = runif(nrow(TravelMode2)))

tfit2 <-
vgam(mode ~ sm.ps(gcost.air, gcost.trn, gcost.bus) + ns(junkx2, 4) +
sm.ps(incom.air, incom.trn, incom.bus) + wait ,
crit = "coef",
multinomial(parallel = FALSE ~ 1), data = TravelMode2,
xij = list(sm.ps(gcost.air, gcost.trn, gcost.bus) ~
sm.ps(gcost.air, gcost.trn, gcost.bus) +
sm.ps(gcost.trn, gcost.bus, gcost.air) +
sm.ps(gcost.bus, gcost.air, gcost.trn),
sm.ps(incom.air, incom.trn, incom.bus) ~
sm.ps(incom.air, incom.trn, incom.bus) +
sm.ps(incom.trn, incom.bus, incom.air) +
sm.ps(incom.bus, incom.air, incom.trn),
wait ~ wait.air + wait.trn + wait.bus),
form2 = ~ sm.ps(gcost.air, gcost.trn, gcost.bus) +
sm.ps(gcost.trn, gcost.bus, gcost.air) +
sm.ps(gcost.bus, gcost.air, gcost.trn) +
wait +
sm.ps(incom.air, incom.trn, incom.bus) +
sm.ps(incom.trn, incom.bus, incom.air) +
sm.ps(incom.bus, incom.air, incom.trn) +
junkx2 + ns(junkx2, 4) +
incom.air + incom.trn + incom.bus +
gcost.air + gcost.trn + gcost.bus +
wait.air + wait.trn + wait.bus)
par(mfrow = c(2, 2))
plot(tfit2, se = TRUE, lcol = "orange", scol = "blue", ylim = c(-4, 4))
summary(tfit2)

## End(Not run)

smart.expression S Expression for Smart Functions

Description

smart.expression is an S expression for a smart function to call itself. It is best if you go through
it line by line, but most users will not need to know anything about it. It requires the primary
argument of the smart function to be called "x".
The list component match.call must be assigned the value of match.call() in the smart function;
this is so that the smart function can call itself later.

See Also

match.call.
smart.mode.is 719

Examples
print(sm.min2)

smart.mode.is Determine What Mode the Smart Prediction is In

Description

Determine which of three modes the smart prediction is currently in.

Usage

smart.mode.is(mode.arg = NULL)

Arguments

mode.arg a character string, either "read", "write" or "neutral".

Details

Smart functions such as bs and poly need to know what mode smart prediction is in. If it is
in "write" mode then the parameters are saved to .smart.prediction using put.smart. If in
"read" mode then the parameters are read in using get.smart. If in "neutral" mode then the
smart function behaves like an ordinary function.

Value

If mode.arg is given, then either TRUE or FALSE is returned. If mode.arg is not given, then the mode
("neutral", "read" or "write") is returned. Usually, the mode is "neutral".

See Also

put.smart, bs, poly.

Examples
print(sm.min1)
smart.mode.is() # Returns "neutral"
smart.mode.is(smart.mode.is()) # Returns TRUE
720 smartpred

smartpred Smart Prediction

Description

Data-dependent parameters in formula terms can cause problems in when predicting. The smart-
pred package saves data-dependent parameters on the object so that the bug is fixed. The lm and
glm functions have been fixed properly. Note that the VGAM package by T. W. Yee automatically
comes with smart prediction.

Usage

sm.bs(x, df = NULL, knots = NULL, degree = 3, intercept = FALSE,

Boundary.knots = range(x))
sm.ns(x, df = NULL, knots = NULL, intercept = FALSE,
Boundary.knots = range(x))
sm.poly(x, ..., degree = 1, coefs = NULL, raw = FALSE)
sm.scale(x, center = TRUE, scale = TRUE)

Arguments

x The x argument is actually common to them all.

df, knots, intercept, Boundary.knots
See bs and/or ns.
degree, ..., coefs, raw
See poly.
center, scale See scale.

Details

R version 1.6.0 introduced a partial fix for the prediction problem because it does not work all
the time, e.g., for terms such as I(poly(x, 3)), poly(c(scale(x)), 3), bs(scale(x), 3),
scale(scale(x)). See the examples below. Smart prediction, however, will always work.
The basic idea is that the functions in the formula are now smart, and the modelling functions make
use of these smart functions. Smart prediction works in two ways: using smart.expression, or
using a combination of put.smart and get.smart.

Value

The usual value returned by bs, ns, poly and scale, When used with functions such as vglm the
data-dependent parameters are saved on one slot component called smart.prediction.
smartpred 721

Side Effects
The variables .max.smart, .smart.prediction and .smart.prediction.counter are created
while the model is being fitted. They are created in a new environment called smartpredenv.
These variables are deleted after the model has been fitted. However, if there is an error in the
model fitting function or the fitting model is killed (e.g., by typing control-C) then these variables
will be left in smartpredenv. At the beginning of model fitting, these variables are deleted if
present in smartpredenv.
During prediction, the variables .smart.prediction and .smart.prediction.counter are re-
constructed and read by the smart functions when the model frame is re-evaluated. After prediction,
these variables are deleted.
If the modelling function is used with argument smart = FALSE (e.g., vglm(..., smart = FALSE))
then smart prediction will not be used, and the results should match with the original R functions.

WARNING
The functions bs, ns, poly and scale are now left alone (from 2014-05 onwards) and no longer
smart. They work via safe prediction. The smart versions of these functions have been renamed and
they begin with "sm.".
The functions predict.bs and predict.ns are not smart. That is because they operate on objects
that contain attributes only and do not have list components or slots. The function predict.poly
is not smart.

Author(s)
T. W. Yee and T. J. Hastie

See Also
get.smart.prediction, get.smart, put.smart, smart.expression, smart.mode.is, setup.smart,
wrapup.smart. For vgam in VGAM, sm.ps is important. Commonly used data-dependent func-
tions include scale, poly, bs, ns. In R, the functions bs and ns are in the splines package, and this
library is automatically loaded in because it contains compiled code that bs and ns call.
The functions vglm, vgam, rrvglm and cqo in T. W. Yees VGAM package are examples of mod-
elling functions that employ smart prediction.

Examples
# Create some data first
n <- 20
set.seed(86) # For reproducibility of the random numbers
ldata <- data.frame(x2 = sort(runif(n)), y = sort(runif(n)))
library("splines") # To get ns() in R

# This will work for R 1.6.0 and later

fit <- lm(y ~ ns(x2, df = 5), data = ldata)
## Not run:
plot(y ~ x2, data = ldata)
lines(fitted(fit) ~ x2, data = ldata)
722 sratio

new.ldata <- data.frame(x2 = seq(0, 1, len = n))

points(predict(fit, new.ldata) ~ x2, new.ldata, type = "b", col = 2, err = -1)

## End(Not run)

# The following fails for R 1.6.x and later. It can be

# made to work with smart prediction provided
# ns is changed to sm.ns and scale is changed to sm.scale:
fit1 <- lm(y ~ ns(scale(x2), df = 5), data = ldata)
## Not run:
plot(y ~ x2, data = ldata, main = "Safe prediction fails")
lines(fitted(fit1) ~ x2, data = ldata)
points(predict(fit1, new.ldata) ~ x2, new.ldata, type = "b", col = 2, err = -1)

## End(Not run)

# Fit the above using smart prediction

## Not run:
library("VGAM") # The following requires the VGAM package to be loaded
fit2 <- vglm(y ~ sm.ns(sm.scale(x2), df = 5), uninormal, data = ldata)
[email protected]
plot(y ~ x2, data = ldata, main = "Smart prediction")
lines(fitted(fit2) ~ x2, data = ldata)
points(predict(fit2, new.ldata, type = "response") ~ x2, data = new.ldata,
type = "b", col = 2, err = -1)

## End(Not run)

sratio Ordinal Regression with Stopping Ratios

Description
Fits a stopping ratio logit/probit/cloglog/cauchit/... regression model to an ordered (preferably)
factor response.

Usage
sratio(link = "logit", parallel = FALSE, reverse = FALSE,
zero = NULL, whitespace = FALSE)

Arguments
link Link function applied to the M stopping ratio probabilities. See Links for more
choices.
parallel A logical, or formula specifying which terms have equal/unequal coefficients.
reverse Logical. By default, the stopping ratios used are j = logit(P [Y = j|Y j])
for j = 1, . . . , M . If reverse is TRUE, then j = logit(P [Y = j+1|Y j+1])
will be used.
sratio 723

zero Can be an integer-valued vector specifying which linear/additive predictors are

modelled as intercepts only. The values must be from the set {1,2,. . . ,M }. The
default value means none are modelled as intercept-only terms.
whitespace See CommonVGAMffArguments for information.

Details
In this help file the response Y is assumed to be a factor with ordered values 1, 2, . . . , M + 1, so
that M is the number of linear/additive predictors j .
There are a number of definitions for the continuation ratio in the literature. To make life easier,
in the VGAM package, we use continuation ratios (see cratio) and stopping ratios. Continuation
ratios deal with quantities such as logit(P[Y>j|Y>=j]).

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, rrvglm and vgam.

Warning
No check is made to verify that the response is ordinal if the response is a matrix; see ordered.

Author(s)
Thomas W. Yee

References
Agresti, A. (2013) Categorical Data Analysis, 3rd ed. Hoboken, NJ, USA: Wiley.
Simonoff, J. S. (2003) Analyzing Categorical Data, New York, USA: Springer-Verlag.
McCullagh, P. and Nelder, J. A. (1989) Generalized Linear Models, 2nd ed. London: Chapman &
Hall.
Yee, T. W. (2010) The VGAM package for categorical data analysis. Journal of Statistical Software,
32, 134. http://www.jstatsoft.org/v32/i10/.
724 studentt

See Also

cratio, acat, cumulative, multinomial, margeff, pneumo, logit, probit, cloglog, cauchit.

Examples
pneumo <- transform(pneumo, let = log(exposure.time))
(fit <- vglm(cbind(normal, mild, severe) ~ let,
sratio(parallel = TRUE), data = pneumo))
coef(fit, matrix = TRUE)
constraints(fit)
predict(fit)
predict(fit, untransform = TRUE)

studentt Student t Distribution

Description

Estimating the parameters of a Student t distribution.

Usage

studentt (ldf = "loglog", idf = NULL, tol1 = 0.1, imethod = 1)

studentt2(df = Inf, llocation = "identitylink", lscale = "loge",
ilocation = NULL, iscale = NULL, imethod = 1, zero = "scale")
studentt3(llocation = "identitylink", lscale = "loge", ldf = "loglog",
ilocation = NULL, iscale = NULL, idf = NULL,
imethod = 1, zero = c("scale", "df"))

Arguments
llocation, lscale, ldf
Parameter link functions for each parameter, e.g., for degrees of freedom . See
Links for more choices. The defaults ensures the parameters are in range. A
loglog link keeps the degrees of freedom greater than unity; see below.
ilocation, iscale, idf
Optional initial values. If given, the values must be in range. The default is to
compute an initial value internally.
tol1 A positive value, the tolerance for testing whether an initial value is 1. Best to
leave this argument alone.
df Numeric, user-specified degrees of freedom. It may be of length equal to the
number of columns of a response matrix.
imethod, zero See CommonVGAMffArguments.
studentt 725

Details
The Student t density function is
(+1)/2
y2

(( + 1)/2)
f (y; ) = 1+
(/2)

for all real y. Then E(Y ) = 0 if > 1 (returned as the fitted values), and V ar(Y ) = /( 2)
for > 2. When = 1 then the Student t-distribution corresponds to the standard Cauchy distri-
bution, cauchy1. When = 2 with a scale parameter of sqrt(2) then the Student t-distribution
corresponds to the standard (Koenker) distribution, sc.studentt2. The degrees of freedom can be
treated as a parameter to be estimated, and as a real and not an integer. The Student t distribution is
used for a variety of reasons in statistics, including robust regression.
Let Y = (T )/ where and are the location and scale parameters respectively. Then
studentt3 estimates the location, scale and degrees of freedom parameters. And studentt2 es-
timates the location, scale parameters for a user-specified degrees of freedom, df. And studentt
estimates the degrees of freedom parameter only. The fitted values are the location parameters. By
default the linear/additive predictors are (, log(), log log())T or subsets thereof.
In general convergence can be slow, especially when there are covariates.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, and vgam.

Note
studentt3() and studentt2() can handle multiple responses.
Practical experience has shown reasonably good initial values are required. If convergence failure
occurs try using arguments such as idf. Local solutions are also possible, especially when the
degrees of freedom is close to unity or the scale parameter is close to zero.
A standard normal distribution corresponds to a t distribution with infinite degrees of freedom. Con-
sequently, if the data is close to normal, there may be convergence problems; best to use uninormal
instead.

Author(s)
T. W. Yee

References
Student (1908) The probable error of a mean. Biometrika, 6, 125.
Zhu, D. and Galbraith, J. W. (2010) A generalized asymmetric Student-t distribution with applica-
tion to financial econometrics. Journal of Econometrics, 157, 297305.

See Also
uninormal, cauchy1, logistic, huber2, sc.studentt2, TDist, simulate.vlm.
726 summarypvgam

Examples
tdata <- data.frame(x2 = runif(nn <- 1000))
tdata <- transform(tdata, y1 = rt(nn, df = exp(exp(0.5 - x2))),
y2 = rt(nn, df = exp(exp(0.5 - x2))))
fit1 <- vglm(y1 ~ x2, studentt, data = tdata, trace = TRUE)
coef(fit1, matrix = TRUE)

fit2 <- vglm(y1 ~ x2, studentt2(df = exp(exp(0.5))), data = tdata)

coef(fit2, matrix = TRUE) # df inputted into studentt2() not quite right

fit3 <- vglm(cbind(y1, y2) ~ x2, studentt3, data = tdata, trace = TRUE)
coef(fit3, matrix = TRUE)

summarypvgam Summarizing Penalized Vector Generalized Additive Model Fits

Description
These functions are all methods for class "pvgam" or summary.pvgam objects.

Usage
summarypvgam(object, dispersion = NULL, digits = options()$digits - 2,
presid = TRUE)
## S3 method for class 'summary.pvgam'
show(x, quote = TRUE, prefix = "", digits = options()$digits -
2, signif.stars = getOption("show.signif.stars"))

Arguments
object an object of class "pvgam", which is the result of a call to vgam with at least one
sm.os or sm.ps term.
x an object of class "summary.pvgam", which is the result of a call to summarypvgam().
dispersion, digits, presid
See summaryvglm.
quote, prefix, signif.stars
See summaryvglm.

Details
This methods function reports a summary more similar to summary.gam from mgcv than summary.gam()
from gam. It applies to G2-VGAMs using sm.os and O-splines, else sm.ps and P-splines. In partic-
ular, the hypothesis test for whether each sm.os or sm.ps term can be deleted follows quite closely
to summary.gam. The p-values from this type of test tend to be biased downwards (too small) and
corresponds to p.type = 5. It is hoped in the short future that improved p-values be implemented,
somewhat like the default of summary.gam. This methods function was adapted from summary.gam.
summaryvgam 727

Value

summarypvgam returns an object of class "summary.pvgam"; see summary.pvgam-class.

Warning

See sm.os.

See Also

vgam, summaryvgam, summary.pvgam-class, sm.os, sm.ps, summary.glm, summary.lm, summary.gam

from mgcv, summaryvgam for G1-VGAMs.

Examples
hfit2 <- vgam(agaaus ~ sm.os(altitude), binomialff, data = hunua)
coef(hfit2, matrix = TRUE)
summary(hfit2)

summaryvgam Summarizing Vector Generalized Additive Model Fits

Description

These functions are all methods for class vgam or summary.vgam objects.

Usage

summaryvgam(object, dispersion = NULL, digits = options()$digits - 2,

presid = TRUE, nopredictors = FALSE)
## S3 method for class 'summary.vgam'
show(x, quote = TRUE, prefix = "",
digits = options()$digits-2, nopredictors = NULL)

Arguments

object an object of class "vgam", which is the result of a call to vgam with at least one
s term.
x an object of class "summary.vgam", which is the result of a call to summaryvgam().
dispersion, digits, presid
See summaryvglm.
quote, prefix, nopredictors
See summaryvglm.
728 summaryvglm

Details
This methods function reports a summary more similar to summary.gam() from gam than summary.gam
from mgcv. It applies to G1-VGAMs using s and vector backfitting. In particular, an approximate
score test for linearity is conducted for each s termsee Section 4.3.4 of Yee (2015) for details.
The p-values from this type of test tend to be biased upwards (too large).

Value
summaryvgam returns an object of class "summary.vgam"; see summary.vgam-class.

See Also
vgam, summary.glm, summary.lm, summary.gam from mgcv, summarypvgam for P-VGAMs.

Examples
hfit <- vgam(agaaus ~ s(altitude, df = 2), binomialff, data = hunua)
summary(hfit)
summary(hfit)@anova # Table for (approximate) testing of linearity

summaryvglm Summarizing Vector Generalized Linear Model Fits

Description
These functions are all methods for class vglm or summary.vglm objects.

Usage
summaryvglm(object, correlation = FALSE,
dispersion = NULL, digits = NULL, presid = TRUE,
signif.stars = getOption("show.signif.stars"),
nopredictors = FALSE, ...)
## S3 method for class 'summary.vglm'
show(x, digits = max(3L, getOption("digits") - 3L),
quote = TRUE, prefix = "", presid = TRUE,
signif.stars = NULL, nopredictors = NULL,
top.half.only = FALSE, ...)

Arguments
object an object of class "vglm", usually, a result of a call to vglm.
x an object of class "summary.vglm", usually, a result of a call to summaryvglm().
dispersion used mainly for GLMs. See summary.glm.
correlation logical; if TRUE, the correlation matrix of the estimated parameters is returned
and printed.
summaryvglm 729

digits the number of significant digits to use when printing.

signif.stars logical; if TRUE, significance stars are printed for each coefficient.
presid Pearson residuals; print out some summary statistics of these?
quote Fed into print().
nopredictors logical; if TRUE the names of the linear predictors are not printed out. The default
is that they are.
top.half.only logical; if TRUE then only print out the top half of the usual output. Used for
P-VGAMs.
prefix Not used.
... Not used.

Details
show.summary.vglm() tries to be smart about formatting the coefficients, standard errors, etc. and
additionally gives significance stars if signif.stars is TRUE. The coefficients component of
the result gives the estimated coefficients and their estimated standard errors, together with their
ratio. This third column is labelled z value regardless of whether the dispersion is estimated or
known (or fixed by the family). A fourth column gives the two-tailed p-value corresponding to the
z ratio based on a Normal reference distribution.
In general, the t distribution is not used, but the normal distribution is used.
Correlations are printed to two decimal places (or symbolically): to see the actual correlations print
summary(object)@correlation directly.
It is possible for programmers to write a methods function to print out extra quantities when
summary(vglmObject) is called. The generic function is summaryvglmS4VGAM(), and one can use
the S4 function setMethod to compute the quantities needed. Also needed is the generic function
is showsummaryvglmS4VGAM() to actually print the quantities out.

Value
summaryvglm returns an object of class "summary.vglm"; see summary.vglm-class.

See Also
vglm, confintvglm, vcovvlm, summary.glm, summary.lm, summary.

Examples
## For examples see example(glm)
pneumo <- transform(pneumo, let = log(exposure.time))
(fit <- vglm(cbind(normal, mild, severe) ~ let, acat, data = pneumo))
coef(fit, matrix = TRUE)
summary(fit)
coef(summary(fit))
730 SURff

SURff Seemingly Unrelated Regressions Family Function

Description
Fits a system of seemingly unrelated regressions.

Usage
SURff(mle.normal = FALSE,
divisor = c("n", "n-max(pj,pk)", "sqrt((n-pj)*(n-pk))"),
parallel = FALSE, Varcov = NULL, matrix.arg = FALSE)

Arguments
mle.normal Logical. If TRUE then the MLE, assuming multivariate normal errors, is com-
puted; the effect is just to add a loglikelihood slot to the returned object.
Then it results in the maximum likelihood estimator.
divisor Character, partial matching allowed and the first choice is the default. The di-
visor for the estimate of the covariances. If "n" then the estimate will be bi-
ased. If the others then the estimate will be unbiased for some elements. If
mle.normal = TRUE and this argument is not "n" then a warning or an error
will result.
parallel See CommonVGAMffArguments. If parallel = TRUE then the constraint applies
to the intercept too.
Varcov Numeric. This may be assigned a variance-covariance of the errors. If matrix.arg
then this is a M M matrix. If !matrix.arg then this is a M M matrix in
matrix-band format (a vector with at least M and at most M*(M+1)/2 elements).
matrix.arg Logical. Of single length.

Details
Proposed by Zellner (1962), the basic seemingly unrelated regressions (SUR) model is a set of LMs
(M > 1 of them) tied together at the error term level. Each LMs model matrix may potentially
have its own set of predictor variables.
Zellners efficient (ZEF) estimator (also known as Zellners two-stage Aitken estimator) can be
obtained by setting maxit = 1 (and possibly divisor = "sqrt" or divisor = "n-max").
The default value of maxit (in vglm.control) probably means iterative GLS (IGLS) estimator is
computed because IRLS will probably iterate to convergence. IGLS means, at each iteration, the
residuals are used to estimate the error variance-covariance matrix, and then the matrix is used in
the GLS. The IGLS estimator is also known as Zellners iterative Aitken estimator, or IZEF.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm and vgam.
SURff 731

Warning
The default convergence criterion may be a little loose. Try setting epsilon = 1e-11, especially
with mle.normal = TRUE.

Note
The fitted object has slot @extra$ncols.X.lm which is a M vector with the number of parameters
for each LM. Also, @misc$values.divisor is the M -vector of divisor values.
Constraint matrices are needed in order to specify which response variables that each term on the
RHS of the formula is a regressor for. See the constraints argument of vglm for more information.

Author(s)
T. W. Yee.

References
Zellner, A. (1962) An Efficient Method of Estimating Seemingly Unrelated Regressions and Tests
for Aggregation Bias. J. Amer. Statist. Assoc., 57(298), 348368.
Kmenta, J. and Gilbert, R. F. (1968) Small Sample Properties of Alternative Estimators of Seem-
ingly Unrelated Regressions. J. Amer. Statist. Assoc., 63(324), 11801200.

round(coef(zef1, matrix = TRUE), digits = 4) # ZEF

zef1@extra$ncols.X.lm
zef1@misc$divisor
zef1@misc$values.divisor
round(sqrt(diag(vcov(zef1))), digits = 4) # SEs
nobs(zef1, type = "lm")
df.residual(zef1, type = "lm")

mle1 <- vglm(cbind(invest.g, invest.w) ~

capital.g + value.g + capital.w + value.w,
732 SurvS4

SURff(mle.normal = TRUE),
epsilon = 1e-11,
data = gew, trace = TRUE, constraints = clist)
round(coef(mle1, matrix = TRUE), digits = 4) # MLE
round(sqrt(diag(vcov(mle1))), digits = 4) # SEs

SurvS4 Create a Survival Object

Description
Create a survival object, usually used as a response variable in a model formula.

Usage
SurvS4(time, time2, event, type =, origin = 0)
is.SurvS4(x)

Arguments
time for right censored data, this is the follow up time. For interval data, the first
argument is the starting time for the interval.
x any R object.
event The status indicator, normally 0=alive, 1=dead. Other choices are TRUE/FALSE
(TRUE = death) or 1/2 (2=death). For interval censored data, the status indica-
tor is 0=right censored, 1=event at time, 2=left censored, 3=interval censored.
Although unusual, the event indicator can be omitted, in which case all subjects
are assumed to have an event.
time2 ending time of the interval for interval censored or counting process data only.
Intervals are assumed to be open on the left and closed on the right, (start, end].
For counting process data, event indicates whether an event occurred at the end
of the interval.
type character string specifying the type of censoring. Possible values are "right",
"left", "counting", "interval", or "interval2". The default is "right"
or "counting" depending on whether the time2 argument is absent or present,
respectively.
origin for counting process data, the hazard function origin. This is most often used in
conjunction with a model containing time dependent strata in order to align the
subjects properly when they cross over from one strata to another.

Details
Typical usages are

SurvS4(time, event)
SurvS4(time, time2, event, type=, origin=0)
SurvS4 733

In theory it is possible to represent interval censored data without a third column containing the
explicit status. Exact, right censored, left censored and interval censored observation would be
represented as intervals of (a,a), (a, infinity), (-infinity,b), and (a,b) respectively; each specifying
the interval within which the event is known to have occurred.
If type = "interval2" then the representation given above is assumed, with NA taking the place
of infinity. If type="interval" event must be given. If event is 0, 1, or 2, the relevant information
is assumed to be contained in time, the value in time2 is ignored, and the second column of the
result will contain a placeholder.
Presently, the only methods allowing interval censored data are the parametric models computed
by survreg, so the distinction between open and closed intervals is unimportant. The distinction is
important for counting process data and the Cox model.
The function tries to distinguish between the use of 0/1 and 1/2 coding for left and right censored
data using if (max(status)==2). If 1/2 coding is used and all the subjects are censored, it will
guess wrong. Use 0/1 coding in this case.

Value
An object of class SurvS4 (formerly Surv). There are methods for print, is.na, and subscripting
survival objects. SurvS4 objects are implemented as a matrix of 2 or 3 columns.
In the case of is.SurvS4, a logical value TRUE if x inherits from class "SurvS4", otherwise a FALSE.

Note
The purpose of having SurvS4 in VGAM is so that the same input can be fed into vglm as functions
in survival such as survreg. The class name has been changed from "Surv" to "SurvS4"; see
SurvS4-class.
The format J+ is interpreted in VGAM as J. If type="interval" then these should not be used
in VGAM: (L,U-] or (L,U+].

Author(s)
The code and documentation comes from survival. Slight modifications have been made for conver-
sion to S4 by T. W. Yee. Also, for "interval" data, as.character.SurvS4() has been modified
to print intervals of the form (start, end] and not [start, end] as previously. (This makes a
difference for discrete data, such as for cens.poisson). All VGAM family functions beginning
with "cen" require the packaging function Surv to format the input.

See Also
SurvS4-class, cens.poisson, survreg, leukemia.

Examples
with(leukemia, SurvS4(time, status))
class(with(leukemia, SurvS4(time, status)))
734 SurvS4-class

SurvS4-class Class "SurvS4"

Description
S4 version of the Surv class.

Objects from the Class

A virtual Class: No objects may be created from it.

Extends
Class "Surv", directly. Class "matrix", directly. Class "oldClass", by class "Surv", distance
2. Class "structure", by class "matrix", distance 2. Class "array", by class "matrix", distance
2. Class "vector", by class "matrix", distance 3, with explicit coerce. Class "vector", by class
"matrix", distance 4, with explicit coerce.

Methods
show signature(object = "SurvS4"): ...

Warning
This code has not been thoroughly tested.

Note
The purpose of having SurvS4 in VGAM is so that the same input can be fed into vglm as functions
in survival such as survreg.

Author(s)
T. W. Yee.

References
See survival.

Tikuv A Short-tailed Symmetric Distribution

Description
Density, cumulative distribution function, quantile function and random generation for the short-
tailed symmetric distribution of Tiku and Vaughan (1999).

Usage
dtikuv(x, d, mean = 0, sigma = 1, log = FALSE)
ptikuv(q, d, mean = 0, sigma = 1, lower.tail = TRUE, log.p = FALSE)
qtikuv(p, d, mean = 0, sigma = 1, lower.tail = TRUE, log.p = FALSE, ...)
rtikuv(n, d, mean = 0, sigma = 1, Smallno = 1.0e-6)

Arguments
x, q vector of quantiles.
p vector of probabilities.
n number of observations. Same as in runif.
d, mean, sigma
arguments for the parameters of the distribution. See tikuv for more details.
For rtikuv, arguments mean and sigma must be of length 1.
Smallno Numeric, a small value used by the rejection method for determining the lower
and upper limits of the distribution. That is, ptikuv(L) < Smallno and ptikuv(U) > 1-Smallno
where L and U are the lower and upper limits respectively.
... Arguments that can be passed into uniroot.
log Logical. If log = TRUE then the logarithm of the density is returned.
lower.tail, log.p
Same meaning as in pnorm or qnorm.

Details
See tikuv for more details.

Value
dtikuv gives the density, ptikuv gives the cumulative distribution function, qtikuv gives the quan-
tile function, and rtikuv generates random deviates.

Author(s)
T. W. Yee and Kai Huang

plot(x, pnorm(x), type = "l", col = "black", ylab = "", las = 1,

main = "Black is standard normal, others are ptikuv(x, d)")
lines(x, ptikuv(x, d = -10), col = "orange")
lines(x, ptikuv(x, d = -1 ), col = "blue")
lines(x, ptikuv(x, d = 1 ), col = "green")
legend("topleft", col = c("orange","blue","green"), lty = rep(1, len = 3),
legend = paste("d =", c(-10, -1, 1)))
## End(Not run)

probs <- seq(0.1, 0.9, by = 0.1)

ptikuv(qtikuv(p = probs, d = 1), d = 1) - probs # Should be all 0

tikuv Short-tailed Symmetric Distribution Family Function

Description
Fits the short-tailed symmetric distribution of Tiku and Vaughan (1999).

Usage
tikuv(d, lmean = "identitylink", lsigma = "loge", isigma = NULL,
zero = "sigma")

Arguments
d The d parameter. It must be a single numeric value less than 2. Then h =
2 d > 0 is another parameter.
lmean, lsigma Link functions for the mean and standard deviation parameters of the usual uni-
variate normal distribution (see Details below). They are and respectively.
See Links for more choices.
isigma Optional initial value for . A NULL means a value is computed internally.
zero A vector specifying which linear/additive predictors are modelled as intercept-
only. The values can be from the set {1,2}, corresponding respectively to ,
. If zero = NULL then all linear/additive predictors are modelled as a linear
combination of the explanatory variables. For many data sets having zero = 2
is a good idea. See CommonVGAMffArguments for information.
tikuv 737

Details
The short-tailed symmetric distribution of Tiku and Vaughan (1999) has a probability density func-
tion that can be written
" 2 #2
K 1 y 1 2 2
f (y) = 1+ exp (y ) /
2 2h 2

where h = 2 d > 0, K is a function of h, < y < , > 0. The mean of Y is E(Y ) =

and this is returned as the fitted values.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, and vgam.

Warning
Under- or over-flow may occur if the data is ill-conditioned, e.g., when d is very close to 2 or
approaches -Inf.

Note
The density function is the product of a univariate normal density and a polynomial in the response
y. The distribution is bimodal if d > 0, else is unimodal. A normal distribution arises as the limit as
d approaches , i.e., as h approaches . Fisher scoring is implemented. After fitting the value
of d is stored in @misc with component name d.

Author(s)
Thomas W. Yee

References
Akkaya, A. D. and Tiku, M. L. (2008) Short-tailed distributions and inliers. Test, 17, 282296.
Tiku, M. L. and Vaughan, D. C. (1999) A family of short-tailed symmetric distributions. Technical
report, McMaster University, Canada.

## Not run: with(tdata, hist(y, prob = TRUE))

lines(dtikuv(sy, d = 1, m = Cfit[1], s = Cfit[2]) ~ sy, data = tdata, col = "orange")
## End(Not run)

Tobit The Tobit Distribution

Description
Density, distribution function, quantile function and random generation for the Tobit model.

Usage
dtobit(x, mean = 0, sd = 1, Lower = 0, Upper = Inf, log = FALSE)
ptobit(q, mean = 0, sd = 1, Lower = 0, Upper = Inf,
lower.tail = TRUE, log.p = FALSE)
qtobit(p, mean = 0, sd = 1, Lower = 0, Upper = Inf,
lower.tail = TRUE, log.p = FALSE)
rtobit(n, mean = 0, sd = 1, Lower = 0, Upper = Inf)

Arguments
x, q vector of quantiles.
p vector of probabilities.
n number of observations. If length(n) > 1 then the length is taken to be the
number required.
Lower, Upper vector of lower and upper thresholds.
mean, sd, lower.tail, log, log.p
see rnorm.

Details
See tobit, the VGAM family function for estimating the parameters, for details. Note that the den-
sity at Lower and Upper is the the area to the left and right of those points. Thus there are two spikes
(but less in value); see the example below. Consequently, dtobit(Lower) + dtobit(Upper) + the
area in between equals unity.

Value
dtobit gives the density, ptobit gives the distribution function, qtobit gives the quantile function,
and rtobit generates random deviates.

Author(s)
T. W. Yee
tobit 739

integrate(dtobit, lower = Lower, upper = Upper,

mean = mu, Lower = Lower, Upper = Upper)$value +
dtobit(Lower, mean = mu, Lower = Lower, Upper = Upper) +
dtobit(Upper, mean = mu, Lower = Lower, Upper = Upper) # Adds to unity

## Not run:
plot(x, ptobit(x, m = mu, Lower = Lower, Upper = Upper),
type = "l", ylim = 0:1, las = 1, col = "orange",
ylab = paste("ptobit(m = ", mu, ", sd = 1, Lower =", Lower,
", Upper =", Upper, ")"),
main = "Orange is cumulative distribution function; blue is density",
sub = "Purple lines are the 10,20,...,90 percentiles")
abline(h = 0)
lines(x, dtobit(x, m = mu, Lower = Lower, Upper = Upper), col = "blue")

probs <- seq(0.1, 0.9, by = 0.1)

Q <- qtobit(probs, m = mu, Lower = Lower, Upper = Upper)
lines(Q, ptobit(Q, m = mu, Lower = Lower, Upper = Upper),
col = "purple", lty = "dashed", type = "h")
lines(Q, dtobit(Q, m = mu, Lower = Lower, Upper = Upper),
col = "darkgreen", lty = "dashed", type = "h")
abline(h = probs, col = "purple", lty = "dashed")
max(abs(ptobit(Q, m = mu, Lower = Lower, Upper = Upper) - probs)) # Should be 0

endpts <- c(Lower, Upper) # Endpoints have a spike (not quite, actually)
lines(endpts, dtobit(endpts, m = mu, Lower = Lower, Upper = Upper),
col = "blue", lwd = 3, type = "h")

## End(Not run)

tobit Tobit Model

Description
Fits a Tobit model.

Usage
tobit(Lower = 0, Upper = Inf, lmu = "identitylink", lsd = "loge",
imu = NULL, isd = NULL,
type.fitted = c("uncensored", "censored", "mean.obs"),
byrow.arg = FALSE, imethod = 1, zero = "sd")
740 tobit

Arguments
Lower Numeric. It is the value L described below. Any value of the linear model xTi
that is less than this lowerbound is assigned this value. Hence this should be the
smallest possible value in the response variable. May be a vector (see below for
more information).
Upper Numeric. It is the value U described below. Any value of the linear model xTi
that is greater than this upperbound is assigned this value. Hence this should be
the largest possible value in the response variable. May be a vector (see below
for more information).
lmu, lsd Parameter link functions for the mean and standard deviation parameters. See
Links for more choices. The standard deviation is a positive quantity, therefore
a log link is its default.
imu, isd, byrow.arg
See CommonVGAMffArguments for information.
type.fitted Type of fitted value returned. The first choice is default and is the ordinary un-
censored or unbounded linear model. If "censored" then the fitted values in the
interval [L, U ]. If "mean.obs" then the mean of the observations is returned;
this is a doubly truncated normal distribution augmented by point masses at the
truncation points (see dtobit). See CommonVGAMffArguments for more infor-
mation.
imethod Initialization method. Either 1 or 2 or 3, this specifies some methods for obtain-
ing initial values for the parameters. See CommonVGAMffArguments for informa-
tion.
zero A vector, e.g., containing the value 1 or 2. If so, the mean or standard deviation
respectively are modelled as an intercept-only. Setting zero = NULL means both
linear/additive predictors are modelled as functions of the explanatory variables.
See CommonVGAMffArguments for more information.

Details
The Tobit model can be written
yi = xTi + i
where the ei N (0, 2 ) independently and i = 1, . . . , n. However, we measure yi = yi only if
yi > L and yi < U for some cutpoints L and U . Otherwise we let yi = L or yi = U , whatever
is closer. The Tobit model is thus a multiple linear regression but with censored responses if it is
below or above certain cutpoints.
The defaults for Lower and Upper and lmu correspond to the standard Tobit model. Fisher scoring is
used for the standard and nonstandard models. By default, the mean xTi is the first linear/additive
predictor, and the log of the standard deviation is the second linear/additive predictor. The Fisher
information matrix for uncensored data is diagonal. The fitted values are the estimates of xTi .

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, and vgam.
tobit 741

Warning
If values of the response and Lower and/or Upper are not integers then there is the danger that
the value is wrongly interpreted as uncensored. For example, if the first 10 values of the response
were runif(10) and Lower was assigned these value then testing y[1:10] == Lower[1:10] is
numerically fraught. Currently, if any y < Lower or y > Upper then a warning is issued.

Note
The response can be a matrix. If so, then Lower and Upper are recycled into a matrix with
the number of columns equal to the number of responses, and the recycling is done row-wise if
byrow.arg = TRUE. The default order is as matrix, which is byrow.arg = FALSE. For example,
these are returned in fit4@misc$Lower and fit4@misc$Upper below.
If there is no censoring then uninormal is recommended instead. Any value of the response less
than Lower or greater than Upper will be assigned the value Lower and Upper respectively, and a
warning will be issued. The fitted object has components censoredL and censoredU in the extra
slot which specifies whether observations are censored in that direction. The function cens.normal
is an alternative to tobit().
When obtaining initial values, if the algorithm would otherwise want to fit an underdetermined
system of equations, then it uses the entire data set instead. This might result in rather poor quality
initial values, and consequently, monitoring convergence is advised.

Author(s)
Thomas W. Yee

References
Tobin, J. (1958) Estimation of relationships for limited dependent variables. Econometrica 26,
2436.

See Also
rtobit, cens.normal, uninormal, double.cens.normal, posnormal, CommonVGAMffArguments,
rnorm.

Examples
# Here, fit1 is a standard Tobit model and fit2 is a nonstandard Tobit model
tdata <- data.frame(x2 = seq(-1, 1, length = (nn <- 100)))
set.seed(1)
Lower <- 1; Upper <- 4 # For the nonstandard Tobit model
tdata <- transform(tdata,
Lower.vec = rnorm(nn, Lower, 0.5),
Upper.vec = rnorm(nn, Upper, 0.5))
meanfun1 <- function(x) 0 + 2*x
meanfun2 <- function(x) 2 + 2*x
meanfun3 <- function(x) 2 + 2*x
meanfun4 <- function(x) 3 + 2*x
tdata <- transform(tdata,
742 tobit

y1 = rtobit(nn, mean = meanfun1(x2)), # Standard Tobit model

y2 = rtobit(nn, mean = meanfun2(x2), Lower = Lower, Upper = Upper),
y3 = rtobit(nn, mean = meanfun3(x2), Lower = Lower.vec, Upper = Upper.vec),
y4 = rtobit(nn, mean = meanfun3(x2), Lower = Lower.vec, Upper = Upper.vec))
with(tdata, table(y1 == 0)) # How many censored values?
with(tdata, table(y2 == Lower | y2 == Upper)) # How many censored values?
with(tdata, table(attr(y2, "cenL")))
with(tdata, table(attr(y2, "cenU")))

fit1 <- vglm(y1 ~ x2, tobit, data = tdata, trace = TRUE)

coef(fit1, matrix = TRUE)
summary(fit1)

fit2 <- vglm(y2 ~ x2, tobit(Lower = Lower, Upper = Upper, type.f = "cens"),
data = tdata, trace = TRUE)
table(fit2@extra$censoredL)
table(fit2@extra$censoredU)
coef(fit2, matrix = TRUE)

fit3 <- vglm(y3 ~ x2, tobit(Lower = with(tdata, Lower.vec),

Upper = with(tdata, Upper.vec), type.f = "cens"),
data = tdata, trace = TRUE)
table(fit3@extra$censoredL)
table(fit3@extra$censoredU)
coef(fit3, matrix = TRUE)

# fit4 is fit3 but with type.fitted = "uncen".

fit4 <- vglm(cbind(y3, y4) ~ x2,
tobit(Lower = rep(with(tdata, Lower.vec), each = 2),
Upper = rep(with(tdata, Upper.vec), each = 2),
byrow.arg = TRUE),
data = tdata, crit = "coeff", trace = TRUE)
head(fit4@extra$censoredL) # A matrix
head(fit4@extra$censoredU) # A matrix
head(fit4@misc$Lower) # A matrix
head(fit4@misc$Upper) # A matrix
coef(fit4, matrix = TRUE)

## Not run: # Plot fit1--fit4

par(mfrow = c(2, 2))

plot(y1 ~ x2, tdata, las = 1, main = "Standard Tobit model",

col = as.numeric(attr(y1, "cenL")) + 3,
pch = as.numeric(attr(y1, "cenL")) + 1)
legend(x = "topleft", leg = c("censored", "uncensored"),
pch = c(2, 1), col = c("blue", "green"))
legend(-1.0, 2.5, c("Truth", "Estimate", "Naive"),
col = c("purple", "orange", "black"), lwd = 2, lty = c(1, 2, 2))
lines(meanfun1(x2) ~ x2, tdata, col = "purple", lwd = 2)
lines(fitted(fit1) ~ x2, tdata, col = "orange", lwd = 2, lty = 2)
lines(fitted(lm(y1 ~ x2, tdata)) ~ x2, tdata, col = "black",
lty = 2, lwd = 2) # This is simplest but wrong!
Tol 743

plot(y2 ~ x2, data = tdata, las = 1, main = "Tobit model",

col = as.numeric(attr(y2, "cenL")) + 3 +
as.numeric(attr(y2, "cenU")),
pch = as.numeric(attr(y2, "cenL")) + 1 +
as.numeric(attr(y2, "cenU")))
legend(x = "topleft", leg = c("censored", "uncensored"),
pch = c(2, 1), col = c("blue", "green"))
legend(-1.0, 3.5, c("Truth", "Estimate", "Naive"),
col = c("purple", "orange", "black"), lwd = 2, lty = c(1, 2, 2))
lines(meanfun2(x2) ~ x2, tdata, col = "purple", lwd = 2)
lines(fitted(fit2) ~ x2, tdata, col = "orange", lwd = 2, lty = 2)
lines(fitted(lm(y2 ~ x2, tdata)) ~ x2, tdata, col = "black",
lty = 2, lwd = 2) # This is simplest but wrong!

plot(y3 ~ x2, data = tdata, las = 1,

main = "Tobit model with nonconstant censor levels",
col = as.numeric(attr(y3, "cenL")) + 2 +
as.numeric(attr(y3, "cenU") * 2),
pch = as.numeric(attr(y3, "cenL")) + 1 +
as.numeric(attr(y3, "cenU") * 2))
legend(x = "topleft", leg = c("censored", "uncensored"),
pch = c(2, 1), col = c("blue", "green"))
legend(-1.0, 3.5, c("Truth", "Estimate", "Naive"),
col = c("purple", "orange", "black"), lwd = 2, lty = c(1, 2, 2))
lines(meanfun3(x2) ~ x2, tdata, col = "purple", lwd = 2)
lines(fitted(fit3) ~ x2, tdata, col = "orange", lwd = 2, lty = 2)
lines(fitted(lm(y3 ~ x2, tdata)) ~ x2, tdata, col = "black",
lty = 2, lwd = 2) # This is simplest but wrong!

plot(y3 ~ x2, data = tdata, las = 1,

main = "Tobit model with nonconstant censor levels",
col = as.numeric(attr(y3, "cenL")) + 2 +
as.numeric(attr(y3, "cenU") * 2),
pch = as.numeric(attr(y3, "cenL")) + 1 +
as.numeric(attr(y3, "cenU") * 2))
legend(x = "topleft", leg = c("censored", "uncensored"),
pch = c(2, 1), col = c("blue", "green"))
legend(-1.0, 3.5, c("Truth", "Estimate", "Naive"),
col = c("purple", "orange", "black"), lwd = 2, lty = c(1, 2, 2))
lines(meanfun3(x2) ~ x2, data = tdata, col = "purple", lwd = 2)
lines(fitted(fit4)[, 1] ~ x2, tdata, col = "orange", lwd = 2, lty = 2)
lines(fitted(lm(y3 ~ x2, tdata)) ~ x2, data = tdata, col = "black",
lty = 2, lwd = 2) # This is simplest but wrong!

## End(Not run)

Tol Tolerances

Description
Generic function for the tolerances of a model.
744 Tol

Usage
Tol(object, ...)

Arguments
object An object for which the computation or extraction of a tolerance or tolerances is
meaningful.
... Other arguments fed into the specific methods function of the model. Sometimes
they are fed into the methods function for Coef.

Details
Different models can define an optimum in different ways. Many models have no such notion or
definition.
Tolerances occur in quadratic ordination, i.e., CQO and UQO. They have ecological meaning be-
cause a high tolerance for a species means the species can survive over a large environmental
range (stenoecous species), whereas a small tolerance means the species niche is small (eurycous
species). Mathematically, the tolerance is like the variance of a normal distribution.

Value
The value returned depends specifically on the methods function invoked.

Warning
There is a direct inverse relationship between the scaling of the latent variables (site scores) and
the tolerances. One normalization is for the latent variables to have unit variance. Another nor-
malization is for all the tolerances to be unit. These two normalization cannot simultaneously hold
in general. For rank-R>1 models it becomes more complicated because the latent variables are
also uncorrelated. An important argument when fitting quadratic ordination models is whether
eq.tolerances is TRUE or FALSE. See Yee (2004) for details.

Note
Tolerances are undefined for linear and additive ordination models. They are well-defined for
quadratic ordination models.

Author(s)
Thomas W. Yee

See Also
Tol.qrrvglm. Max, Opt, cqo, rcim for UQO.

Examples
## Not run:
set.seed(111) # This leads to the global solution
hspider[,1:6] <- scale(hspider[, 1:6]) # Standardized environmental vars
p1 <- cqo(cbind(Alopacce, Alopcune, Alopfabr, Arctlute, Arctperi,
Auloalbi, Pardlugu, Pardmont, Pardnigr, Pardpull,
Trocterr, Zoraspin) ~
WaterCon + BareSand + FallTwig + CoveMoss + CoveHerb + ReflLux,
Bestof = 2, quasipoissonff, data = hspider, Crow1positive = FALSE)

Tol(p1)

## End(Not run)

Topple The Topp-Leone Distribution

Description
Density, distribution function, quantile function and random generation for the Topp-Leone distri-
bution.

Usage
dtopple(x, shape, log = FALSE)
ptopple(q, shape, lower.tail = TRUE, log.p = FALSE)
qtopple(p, shape)
rtopple(n, shape)

Arguments
x, q, p, n Same as Uniform.
shape the (shape) parameter, which lies in (0, 1).
log Logical. If log = TRUE then the logarithm of the density is returned.
lower.tail, log.p
Same meaning as in pnorm or qnorm.

Details
See topple, the VGAM family function for estimating the (shape) parameter s by maximum like-
lihood estimation, for the formula of the probability density function.
746 topple

Value
dtopple gives the density, ptopple gives the distribution function, qtopple gives the quantile
function, and rtopple generates random deviates.

Note
The Topp-Leone distribution is related to the triangle distribution.

Author(s)
T. W. Yee

References
Topp, C. W. and F. C. Leone (1955) A family of J-shaped frequency functions. Journal of the
American Statistical Association, 50, 209219.

topple Topp-Leone Distribution Family Function

Description
Estimating the parameter of the Topp-Leone distribution by maximum likelihood estimation.

Usage
topple(lshape = "logit", zero = NULL, gshape = ppoints(8))
topple 747

Arguments
lshape, zero, gshape
More information is at CommonVGAMffArguments.

Details

The Topple distribution has a probability density function that can be written

f (y; s) = 2s(1 y)[y(2 y)]s1

for 0 < y < 1 and shape parameter 0 < s < 1. The mean of Y is 1 4s [(1 + s)]2 /(2 + 2s)
(returned as the fitted values).

Value

An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, and vgam.

Note

Fisher-scoring and Newton-Raphson are the same here. A related distribution is the triangle distri-
bution. This VGAM family function handles multiple responses.

Author(s)

T. W. Yee

References

Topp, C. W. and F. C. Leone (1955) A family of J-shaped frequency functions. Journal of the
American Statistical Association, 50, 209219.

See Also

Topple, Triangle.

Examples

tdata <- data.frame(y = rtopple(1000, shape = logit(1, inverse = TRUE)))

tfit <- vglm(y ~ 1, topple, data = tdata, trace = TRUE, crit = "coef")
coef(tfit, matrix = TRUE)
Coef(tfit)
748 toxop

toxop Toxoplasmosis Data

Description
Toxoplasmosis data in 34 cities in El Salvador.

Usage
data(toxop)

Format
A data frame with 34 observations on the following 4 variables.

rainfall a numeric vector; the amount of rainfall in each city.

ssize a numeric vector; sample size.
cityNo a numeric vector; the city number.
positive a numeric vector; the number of subjects testing positive for the disease.

Details
See the references for details.

Source
See the references for details.

References
Efron, B. (1978) Regression and ANOVA With zero-one data: measures of residual variation. Jour-
nal of the American Statistical Association, 73, 113121.
Efron, B. (1986) Double exponential families and their use in generalized linear regression. Journal
of the American Statistical Association, 81, 709721.

Triangle The Triangle Distribution

Description
Density, distribution function, quantile function and random generation for the Triangle distribution
with parameter theta.

Usage
dtriangle(x, theta, lower = 0, upper = 1, log = FALSE)
ptriangle(q, theta, lower = 0, upper = 1, lower.tail = TRUE, log.p = FALSE)
qtriangle(p, theta, lower = 0, upper = 1, lower.tail = TRUE, log.p = FALSE)
rtriangle(n, theta, lower = 0, upper = 1)

Arguments
x, q vector of quantiles.
p vector of probabilities.
n number of observations. Same as runif.
theta the theta parameter which lies between lower and upper.
lower, upper lower and upper limits of the distribution. Must be finite.
log Logical. If log = TRUE then the logarithm of the density is returned.
lower.tail, log.p
Same meaning as in pnorm or qnorm.

Details
See triangle, the VGAM family function for estimating the parameter by maximum likelihood
estimation.

Value
dtriangle gives the density, ptriangle gives the distribution function, qtriangle gives the quan-
tile function, and rtriangle generates random deviates.

Author(s)
T. W. Yee and Kai Huang

triangle Triangle Distribution Family Function

Description
Estimating the parameter of the triangle distribution by maximum likelihood estimation.

Usage
triangle(lower = 0, upper = 1,
link = extlogit(min = 0, max = 1), itheta = NULL)

Arguments
lower, upper lower and upper limits of the distribution. Must be finite. Called A and B
respectively below.
link Parameter link function applied to the parameter , which lies in (A, B). See
Links for more choices. The default constrains the estimate to lie in the interval.
itheta Optional initial value for the parameter. The default is to compute the value
internally.

Details
The triangle distribution has a probability density function that consists of two lines joined at ,
which is the location of the mode. The lines intersect the y = 0 axis at A and B. Here, Fisher
scoring is used.
On fitting, the extra slot has components called lower and upper which contains the values of the
above arguments (recycled to the right length). The fitted values are the mean of the distribution,
which is (A + B + )/3.
triangle 751

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm and vgam.

Warning
The MLE regularity conditions do not hold for this distribution (e.g., the first derivative evaluated
at the mode does not exist because it is not continuous) so that misleading inferences may result,
e.g., in the summary and vcov of the object. Additionally, convergence to the MLE often appears to
fail.

Note
The response must contain values in (A, B). For most data sets (especially small ones) it is very
common for half-stepping to occur.
Arguments lower and upper and link must match. For example, setting lower = 0.2 and
upper = 4 and link = extlogit(min = 0.2, max = 4.1) will result in an error. Ideally
link = extlogit(min = lower, max = upper) ought to work but it does not (yet)! Minimal
error checking is done for this deficiency.

Author(s)
T. W. Yee

References
Kotz, S. and van Dorp, J. R. (2004) Beyond Beta: Other Continuous Families of Distributions with
Bounded Support and Applications. Chapter 1. World Scientific: Singapore.
Nguyen, H. D. and McLachlan, G. J. (2016) Maximum likelihood estimation of triangular and
polygon distributions. Computational Statistics and Data Analysis, 102, 2336.

See Also
Triangle, Topple, simulate.vlm.

Examples
# Example 1
tdata <- data.frame(y = rtriangle(n <- 3000, theta = 3/4))
fit <- vglm(y ~ 1, triangle(link = "identitylink"), data = tdata, trace = TRUE)
coef(fit, matrix = TRUE)
Coef(fit)
head(fit@extra$lower)
head(fitted(fit))
with(tdata, mean(y))

# Example 2; Kotz and van Dorp (2004), p.14

rdata <- data.frame(y = c(0.1, 0.25, 0.3, 0.4, 0.45, 0.6, 0.75, 0.8))
fit <- vglm(y ~ 1, triangle(link = "identitylink"), data = rdata, trace = TRUE,
crit = "coef", maxit = 1000)
752 trplot

Coef(fit) # The MLE is the 3rd order statistic, which is 0.3.

fit <- vglm(y ~ 1, triangle(link = "identitylink"), data = rdata, trace = TRUE,
crit = "coef", maxit = 1001)
Coef(fit) # The MLE is the 3rd order statistic, which is 0.3.

trplot Trajectory Plot

Description
Generic function for a trajectory plot.

Usage
trplot(object, ...)

Arguments
object An object for which a trajectory plot is meaningful.
... Other arguments fed into the specific methods function of the model. They
usually are graphical parameters, and sometimes they are fed into the methods
function for Coef.

Details
Trajectory plots can be defined in different ways for different models. Many models have no such
notion or definition.
For quadratic and additive ordination models they plot the fitted values of two species against each
other (more than two is theoretically possible, but not implemented in this software yet).

Value
The value returned depends specifically on the methods function invoked.

Author(s)
Thomas W. Yee

References
Yee, T. W. (2012) On constrained and unconstrained quadratic ordination. Manuscript in prepara-
tion.

See Also
trplot.qrrvglm, perspqrrvglm, lvplot.
trplot.qrrvglm 753

Examples
## Not run: set.seed(123)
hspider[, 1:6] <- scale(hspider[, 1:6]) # Standardized environmental vars
p1cqo <- cqo(cbind(Alopacce, Alopcune, Alopfabr, Arctlute, Arctperi,
Auloalbi, Pardlugu, Pardmont, Pardnigr, Pardpull,
Trocterr, Zoraspin) ~
WaterCon + BareSand + FallTwig + CoveMoss + CoveHerb + ReflLux,
quasipoissonff, data = hspider, Crow1positive = FALSE)

nos <- ncol(depvar(p1cqo))

clr <- 1:nos # OR (1:(nos+1))[-7] to omit yellow

trplot(p1cqo, which.species = 1:3, log = "xy",

col = c("blue", "orange", "green"), lwd = 2, label = TRUE) -> ii
legend(0.00005, 0.3, paste(ii$species[, 1], ii$species[, 2], sep = " and "),
lwd = 2, lty = 1, col = c("blue", "orange", "green"))
abline(a = 0, b = 1, lty = "dashed", col = "grey")
## End(Not run)

trplot.qrrvglm Trajectory plot for QRR-VGLMs

Description
Produces a trajectory plot for quadratic reduced-rank vector generalized linear models (QRR-
VGLMs). It is only applicable for rank-1 models with argument noRRR = ~ 1.

Usage
trplot.qrrvglm(object, which.species = NULL, add = FALSE, show.plot = TRUE,
label.sites = FALSE, sitenames = rownames(object@y),
axes.equal = TRUE, cex = par()$cex,
col = 1:(nos * (nos - 1)/2), log = "",
lty = rep_len(par()$lty, nos * (nos - 1)/2),
lwd = rep_len(par()$lwd, nos * (nos - 1)/2),
tcol = rep_len(par()$col, nos * (nos - 1)/2),
xlab = NULL, ylab = NULL,
main = "", type = "b", check.ok = TRUE, ...)

Arguments
object Object of class "qrrvglm", i.e., a CQO object.
which.species Integer or character vector specifying the species to be plotted. If integer, these
are the columns of the response matrix. If character, these must match exactly
with the species names. The default is to use all species.
add Logical. Add to an existing plot? If FALSE (default), a new plot is made.
show.plot Logical. Plot it?
754 trplot.qrrvglm

label.sites Logical. If TRUE, the points on the curves/trajectories are labelled with the
sitenames.
sitenames Character vector. The names of the sites.
axes.equal Logical. If TRUE, the x- and y-axes will be on the same scale.
cex Character expansion of the labelling of the site names. Used only if label.sites
is TRUE. See the cex argument in par.
col Color of the lines. See the col argument in par. Here, nos is the number of
species.
log Character, specifying which (if any) of the x- and y-axes are to be on a logarith-
mic scale. See the log argument in par.
lty Line type. See the lty argument of par.
lwd Line width. See the lwd argument of par.
tcol Color of the text for the site names. See the col argument in par. Used only if
label.sites is TRUE.
xlab Character caption for the x-axis. By default, a suitable caption is found. See the
xlab argument in plot or title.
ylab Character caption for the y-axis. By default, a suitable caption is found. See the
xlab argument in plot or title.
main Character, giving the title of the plot. See the main argument in plot or title.
type Character, giving the type of plot. A common option is to use type="l" for
lines only. See the type argument of plot.
check.ok Logical. Whether a check is performed to see that noRRR = ~ 1 was used. It
doesnt make sense to have a trace plot unless this is so.
... Arguments passed into the plot function when setting up the entire plot. Useful
arguments here include xlim and ylim.

Details
A trajectory plot plots the fitted values of a second species against a first species. The argument
which.species must therefore contain at least two species. By default, all of the species that were
fitted in object are plotted. With more than a few species the resulting plot will be very congested,
and so it is recommended that only a few species be selected for plotting.
In the above, M is the number of species selected for plotting, so there will be M (M 1)/2
curves/trajectories in total.
A trajectory plot will be fitted only if noRRR = ~ 1 because otherwise the trajectory will not be a
smooth function of the latent variables.

Value
A list with the following components.
species.names A matrix of characters giving the first and second species. The number of
different combinations of species is given by the number of rows. This is useful
for creating a legend.
sitenames A character vector of site names, sorted by the latent variable (from low to high).
Truncpareto 755

Note

Plotting the axes on a log scale is often a good idea. The use of xlim and ylim to control the axis
limits is also a good idea, so as to limit the extent of the curves at low abundances or probabilities.
Setting label.sites = TRUE is a good idea only if the number of sites is small, otherwise there is
too much clutter.

Author(s)

Thomas W. Yee

References

Yee, T. W. (2012) On constrained and unconstrained quadratic ordination. Manuscript in prepara-

tion.

See Also

cqo, par, title.

Examples
## Not run: set.seed(111) # This leads to the global solution
# hspider[,1:6] <- scale(hspider[,1:6]) # Standardize the environmental variables
p1 <- cqo(cbind(Alopacce, Alopcune, Alopfabr, Arctlute, Arctperi, Auloalbi,
Pardlugu, Pardmont, Pardnigr, Pardpull, Trocterr, Zoraspin) ~
WaterCon + BareSand + FallTwig + CoveMoss + CoveHerb + ReflLux,
poissonff, data = hspider, trace = FALSE)

trplot(p1, which.species = 1:3, log = "xy", type = "b", lty = 1,

main = "Trajectory plot of three hunting spiders species",
col = c("blue","red","green"), lwd = 2, label = TRUE) -> ii
legend(0.00005, 0.3, lwd = 2, lty = 1, col = c("blue", "red", "green"),
with(ii, paste(species.names[,1], species.names[,2], sep = " and ")))
abline(a = 0, b = 1, lty = "dashed", col = "grey") # Useful reference line

## End(Not run)

Truncpareto The Truncated Pareto Distribution

Description

Density, distribution function, quantile function and random generation for the upper truncated
Pareto(I) distribution with parameters lower, upper and shape.
756 Truncpareto

Usage
dtruncpareto(x, lower, upper, shape, log = FALSE)
ptruncpareto(q, lower, upper, shape, lower.tail = TRUE, log.p = FALSE)
qtruncpareto(p, lower, upper, shape)
rtruncpareto(n, lower, upper, shape)

Arguments
x, q vector of quantiles.
p vector of probabilities.
n, log Same meaning as runif.
lower, upper, shape
the lower, upper and shape (k) parameters. If necessary, values are recycled.
lower.tail, log.p
Same meaning as in pnorm or qnorm.

Details
See truncpareto, the VGAM family function for estimating the parameter k by maximum like-
lihood estimation, for the formula of the probability density function and the range restrictions
imposed on the parameters.

Value
dtruncpareto gives the density, ptruncpareto gives the distribution function, qtruncpareto
gives the quantile function, and rtruncpareto generates random deviates.

Author(s)
T. W. Yee and Kai Huang

References
Aban, I. B., Meerschaert, M. M. and Panorska, A. K. (2006) Parameter estimation for the truncated
Pareto distribution, Journal of the American Statistical Association, 101(473), 270277.

ptruncpareto(qq, lower = lower, upper = upper, shape = kay)

qtruncpareto(ptruncpareto(qq, lower = lower, upper = upper, shape = kay),
lower = lower, upper = upper, shape = kay) - qq # Should be all 0

truncweibull Truncated Weibull Distribution Family Function

Description
Maximum likelihood estimation of the 2-parameter Weibull distribution with lower truncation. No
observations should be censored.

Usage
truncweibull(lower.limit = 1e-5,
lAlpha = "loge", lBetaa = "loge",
iAlpha = NULL, iBetaa = NULL,
nrfs = 1, probs.y = c(0.2, 0.5, 0.8),
imethod = 1, zero = "Betaa")

Arguments
lower.limit Positive lower truncation limits. Recycled to the same dimension as the re-
sponse, going across rows first. The default, being close to 0, should mean
effectively the same results as weibullR if there are no response values that are
smaller.
lAlpha, lBetaa Parameter link functions applied to the (positive) parameters Alpha (called
below) and (positive) Betaa (called below). See Links for more choices.
iAlpha, iBetaa See CommonVGAMffArguments.
imethod, nrfs, zero, probs.y
Details at weibullR and CommonVGAMffArguments.

Details
MLE of the two parameters of the Weibull distribution are computed, subject to lower truncation.
That is, all response values are greater than lower.limit, element-wise. For a particular observa-
tion this is any known positive value. This function is currently based directly on Wingo (1989) and
his parameterization is used (it differs from weibullR.) In particular, = a and = (1/b)a where
a and b are as in weibullR and dweibull.
758 truncweibull

Upon fitting the extra slot has a component called lower.limit which is of the same dimension
as the response. The fitted values are the mean, which are computed using pgamma.deriv and
pgamma.deriv.unscaled.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, and vgam.

Warning
This function may be converted to the same parameterization as weibullR at any time. Yet to do:
one element of the EIM may be wrong (due to two interpretations of a formula; but it seems to
work). Convergence is slower than usual and this may imply something is wrong; use argument
maxit. In fact, its probably because pgamma.deriv.unscaled is inaccurate at q = 1 and q = 2.
Also, convergence should be monitored, especially if the truncation means that a large proportion
of the data is lost compared to an ordinary Weibull distribution.

Note
More improvements need to be made, e.g., initial values are currently based on no truncation. This
VGAM family function handles multiple responses.

Author(s)
T. W. Yee

References
Wingo, D. R. (1989) The left-truncated Weibull distribution: theory and computation. Statistical
Papers, 30(1), 3948.

See Also
weibullR, dweibull, pgamma.deriv, pgamma.deriv.unscaled.

Examples
nn <- 5000; prop.lost <- 0.40 # Proportion lost to truncation
wdata <- data.frame(x2 = runif(nn)) # Complete Weibull data
wdata <- transform(wdata,
Betaa = exp(1)) # > 2 is okay (satisfies regularity conds)
wdata <- transform(wdata, Alpha = exp(0.5 - 1 * x2))
wdata <- transform(wdata, Shape = Betaa,
# aaa = Betaa,
# bbb = 1 / Alpha^(1 / Betaa),
Scale = 1 / Alpha^(1 / Betaa))
wdata <- transform(wdata, y2 = rweibull(nn, shape = Shape, scale = Scale))
summary(wdata)

lower.limit2 <- with(wdata, quantile(y2, prob = prop.lost)) # Proportion lost

ucberk 759

wdata <- subset(wdata, y2 > lower.limit2) # Smaller due to truncation

fit1 <- vglm(y2 ~ x2, maxit = 100, trace = TRUE,

truncweibull(lower.limit = lower.limit2), data = wdata)
coef(fit1, matrix = TRUE)
summary(fit1)
vcov(fit1)
head(fit1@extra$lower.limit)

ucberk University California Berkeley Graduate Admissions

Description
University California Berkeley Graduate Admissions: counts cross-classified by acceptance/rejection
and gender, for the six largest departments.

Usage
data(ucberk)

Format
A data frame with 6 departmental groups with the following 5 columns.
m.deny Counts of men denied admission.
m.admit Counts of men admitted.
w.deny Counts of women denied admission.
w.admit Counts of women admitted.
dept Department (the six largest), called A, codeB, . . . , codeF.

Details
From Bickel et al. (1975), the data consists of applications for admission to graduate study at
the University of California, Berkeley, for the fall 1973 quarter. In the admissions cycle for that
quarter, the Graduate Division at Berkeley received approximately 15,000 applications, some of
which were later withdrawn or transferred to a different proposed entry quarter by the applicants.
Of the applications finally remaining for the fall 1973 cycle 12,763 were sufficiently complete
to permit a decision. There were about 101 graduate department and interdepartmental graduate
majors. There were 8442 male applicants and 4321 female applicants. About 44 percent of the
males and about 35 percent of the females were admitted. The data are well-known for illustrating
Simpsons paradox.

References
Bickel, P. J., Hammel, E. A. and OConnell, J. W. (1975) Sex bias in graduate admissions: data
from Berkeley. Science, 187(4175): 398404.
Freedman, D., Pisani, R. and Purves, R. (1998) Chapter 2 of Statistics, 3rd. ed., W. W. Norton &
Company.
760 uninormal

Examples
summary(ucberk)

uninormal Univariate Normal Distribution

Description
Maximum likelihood estimation of the two parameters of a univariate normal distribution.

Usage
uninormal(lmean = "identitylink", lsd = "loge", lvar = "loge",
var.arg = FALSE, imethod = 1, isd = NULL, parallel = FALSE,
smallno = 1e-05, zero = "sd")

Arguments
lmean, lsd, lvar
Link functions applied to the mean and standard deviation/variance. See Links
for more choices. Being positive quantities, a log link is the default for the
standard deviation and variance (see var.arg).
var.arg Logical. If TRUE then the second parameter is the variance and lsd and esd are
ignored, else the standard deviation is used and lvar and evar are ignored.
smallno Numeric, positive but close to 0. Used specifically for quasi-variances; if the
link for the mean is explink then any non-positive value of eta is replaced by
this quantity (hopefully, temporarily and only during early iterations).
imethod, parallel, isd, zero
See CommonVGAMffArguments for more information. If lmean = loge then try
imethod = 2. If parallel = TRUE then the parallelism constraint is not applied
to the intercept.

Details
This fits a linear model (LM) as the first linear/additive predictor. So, by default, this is just the
mean. By default, the log of the standard deviation is the second linear/additive predictor. The
Fisher information matrix is diagonal. This VGAM family function can handle multiple responses.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, and vgam.

Warning
uninormal() is the new name; normal1() is old and will be decommissioned soon.
UtilitiesVGAM 761

Note
Yet to do: allow an argument such as eq.sd that enables the standard devations to be the same.

Author(s)
T. W. Yee

References
Forbes, C., Evans, M., Hastings, N. and Peacock, B. (2011) Statistical Distributions, Hoboken, NJ,
USA: John Wiley and Sons, Fourth edition.

See Also
gaussianff, posnormal, mix2normal, normal.vcm, Qvar, tobit, cens.normal, foldnormal,
skewnormal, double.cens.normal, SURff, AR1, huber2, studentt, binormal, dnorm, simulate.vlm.

Examples
udata <- data.frame(x2 = rnorm(nn <- 200))
udata <- transform(udata,
y1 = rnorm(nn, m = 1 - 3*x2, sd = exp(1 + 0.2*x2)),
y2a = rnorm(nn, m = 1 + 2*x2, sd = exp(1 + 2.0*x2)^0.5),
y2b = rnorm(nn, m = 1 + 2*x2, sd = exp(1 + 2.0*x2)^0.5))
fit1 <- vglm(y1 ~ x2, uninormal(zero = NULL), data = udata, trace = TRUE)
coef(fit1, matrix = TRUE)
fit2 <- vglm(cbind(y2a, y2b) ~ x2, data = udata, trace = TRUE,
uninormal(var = TRUE, parallel = TRUE ~ x2,
zero = NULL))
coef(fit2, matrix = TRUE)

# Generate data from N(mu = theta = 10, sigma = theta) and estimate theta.
theta <- 10
udata <- data.frame(y3 = rnorm(100, m = theta, sd = theta))
fit3a <- vglm(y3 ~ 1, uninormal(lsd = "identitylink"), data = udata,
constraints = list("(Intercept)" = rbind(1, 1)))
fit3b <- vglm(y3 ~ 1, uninormal(lsd = "identitylink", parallel = TRUE ~ 1,
zero = NULL), data = udata)
coef(fit3a, matrix = TRUE)
coef(fit3b, matrix = TRUE) # Same as fit3a

UtilitiesVGAM Utility Functions for the VGAM Package

Description
A set of common utility functions used by VGAM family functions.
762 UtilitiesVGAM

Usage
param.names(string, S)
dimm(M, hbw = M)
interleave.VGAM(.M, M1, inverse = FALSE)

Arguments
string Character. Name of the parameter.
M, .M Numeric. The total number of linear/additive predictors, called M . By total, it
is meant summed over the number of responses. Often, M is the total number of
parameters to be estimated (but this is not the same as the number of regression
coefficients, unless the RHS of the formula is an intercept-only). The use of
.M is unfortunate, but it is a compromise solution to what is presented in Yee
(2015). Ideally, .M should be just M.
M1 Numeric. The number of linear/additive predictors for one response, called M1 .
This argument used to be called M, but is now renamed properly.
inverse Logical. Useful for the inverse function of interleave.VGAM().
S Numeric. The number of responses.
hbw Numeric. The half-bandwidth, which measures the number of bands emanating
from the central diagonal band.

Details
See Yee (2015) for some details about some of these functions.

Value
For param.names(), this function returns the parameter names for S responses, i.e., string is
returned unchanged if S = 1, else paste(string, 1:S, sep = "").
For dimm(), this function returns the number of elements to be stored for each of the working
weight matrices. They are represented as columns in the matrix wz in e.g., vglm.fit(). See the
matrix-band format described in Section 18.3.5 of Yee (2015).
For interleave.VGAM(), this function returns a reordering of the linear/additive predictors depend-
ing on the number of responses. The arguments presented in Table 18.5 may not be valid in your
version of Yee (2015).

Author(s)
T. W. Yee. Victor Miranda added the inverse argument to interleave.VGAM().

References
Yee, T. W. (2015) Vector Generalized Linear and Additive Models: With an Implementation in R.
New York, USA: Springer.

See Also
CommonVGAMffArguments, VGAM-package.
V1 763

Examples
param.names("shape", 1) # "shape"
param.names("shape", 3) # c("shape1", "shape2", "shape3")

dimm(3, hbw = 1) # Diagonal matrix; the 3 elements need storage.

dimm(3) # A general 3 x 3 symmetrix matrix has 6 unique elements.
dimm(3, hbw = 2) # Tridiagonal matrix; the 3-3 element is 0 and unneeded.

M1 <- 2; ncoly <- 3; M <- ncoly * M1

mynames1 <- param.names("location", ncoly)
mynames2 <- param.names("scale", ncoly)
(parameters.names <- c(mynames1, mynames2)[interleave.VGAM(M, M1 = M1)])
# The following is/was in Yee (2015) and has a poor/deceptive style:
(parameters.names <- c(mynames1, mynames2)[interleave.VGAM(M, M = M1)])
parameters.names[interleave.VGAM(M, M1 = M1, inverse = TRUE)]

V1 V1 Flying-Bombs Hits in London

Description
A small count data set. During WWII V1 flying-bombs were fired from sites in France (Pas-de-
Calais) and Dutch coasts towards London. The number of hits per square grid around London were
recorded.

Usage
data(V1)

Format
A data frame with the following variables.
hits Values between 0 and 4, and 7. Actually, the 7 is really imputed from the paper (it was recorded
as "5 and over").
ofreq Observed frequency, i.e., the number of grids with that many hits.

Details
The data concerns 576 square grids each of 0.25 square kms about south London. The area was
selected comprising 144 square kms over which the basic probability function of the distribution
was very nearly constant. V1s, which were one type of flying-bomb, were a Vergeltungswaffen
or vengeance weapon fired during the summer of 1944 at London. The V1s were informally called
Buzz Bombs or Doodlebugs, and they were pulse-jet-powered with a warhead of 850 kg of explo-
sives. Over 9500 were launched at London, and many were shot down by artillery and the RAF.
Over the period considered the total number of bombs within the area was 537.
It was asserted that the bombs tended to be grouped in clusters. However, a basic Poisson analysis
shows this is not the case. Their guidance system being rather primitive, the data is consistent with
a Poisson distribution (random).
764 vcovvlm

Source
Clarke, R. D. (1946). An application of the Poisson distribution. Journal of the Institute of Actuar-
ies, 72(3), 481.

References
Feller, W. (1970). An Introduction to Probability Theory and Its Applications, Vol. 1, Third Edition.
John Wiley and Sons: New York, USA.

vcovvlm Calculate Variance-Covariance Matrix for a Fitted VLM or RR-

VGLM Object

Description
Returns the variance-covariance matrix of the parameters of a fitted vlm-class object or a fitted
rrvglm-class object.

Usage
vcov(object, ...)
vcovvlm(object, dispersion = NULL, untransform = FALSE)

Arguments
object A fitted model object, having class vlm-class or rrvglm-class or a superclass
of such. The former includes a vglm object.
dispersion Numerical. A value may be specified, else it is estimated for quasi-GLMs (e.g.,
method of moments). For almost all other types of VGLMs it is usually unity.
The value is multiplied by the raw variance-covariance matrix.
venice 765

untransform logical. For intercept-only models with trivial constraints; if set TRUE then the
parameter link function is inverted to give the answer for the untransformed/raw
parameter.
... Same as vcov.

Details
This methods function is based on the QR decomposition of the (large) VLM model matrix and
working weight matrices. Currently vcovvlm operates on the fundamental vlm-class objects be-
cause pretty well all modelling functions in VGAM inherit from this. Currently vcovrrvglm is not
entirely reliable because the elements of the AC part of the matrix sometimes cannot be computed
very accurately, so that the entire matrix is not positive-definite.

Value
Same as vcov.

Author(s)
Thomas W. Yee

See Also
confintvglm, summaryvglm, vcov.

Examples
ndata <- data.frame(x2 = runif(nn <- 300))
ndata <- transform(ndata, y1 = rnbinom(nn, mu = exp(3+x2), size = exp(1)),
y2 = rnbinom(nn, mu = exp(2-x2), size = exp(0)))
fit1 <- vglm(cbind(y1, y2) ~ x2, negbinomial, data = ndata, trace = TRUE)
fit2 <- rrvglm(y1 ~ x2, negbinomial(zero = NULL), data = ndata)
coef(fit1, matrix = TRUE)
vcov(fit1)
vcov(fit2)

venice Venice Maximum Sea Levels Data

Description
Some sea levels data sets recorded at Venice, Italy.

Usage
data(venice)
data(venice90)
766 venice

Format
venice is a data frame with 51 observations on the following 11 variables. It concerns the maximum
heights of sea levels between 1931 and 1981.

year a numeric vector.

r1,r2,r3,r4,r5,r6,r7,r8,r9,r10 numeric vectors; r1 is the highest recorded value, r2 is the second
highest recorded value, etc.

venice90 is a data frame with 455 observations on the following 7 variables.

year, month, day, hour numeric vectors; actual time of the recording.
sealevel numeric; sea level.
ohour numeric; number of hours since the midnight of 31 Dec 1939 and 1 Jan 1940.
Year numeric vector; approximate year as a real number. The formula is start.year + ohour / (365.26 * 24)
where start.year is 1940. One can treat Year as continuous whereas year can be treated as
both continuous and discrete.

Details
Sea levels are in cm. For venice90, the value 0 corresponds to a fixed reference point (e.g., the
mean sea level in 1897 at an old palace of Venice). Clearly since the relative (perceived) mean sea
level has been increasing in trend over time (more than an overall 0.4 m increase by 2010), therefore
the value 0 is (now) a very low and unusual measurement.
For venice, in 1935 only the top six values were recorded.
For venice90, this is a subset of a data set provided by Paolo Pirazzoli consisting of hourly sea
levels from 1940 to 2009. Values greater than 90 cm were extracted, and then declustered (each
cluster provides no more than one value, and each value is at least 24 hours apart). Thus the values
are more likely to be independent. Of the original (2009-1940+1)*365.26*24 values about 7
percent of these comprise venice90.
Yet to do: check for consistency between the data sets. Some external data sets elsewhere have
some extremes recorded at times not exactly on the hour.

Source
Pirazzoli, P. (1982) Maree estreme a Venezia (periodo 18721981). Acqua Aria, 10, 10231039.
Thanks to Paolo Pirazzoli and Alberto Tomasin for the venice90 data.

References
Smith, R. L. (1986) Extreme value theory based on the r largest annual events. Journal of Hydrol-
ogy, 86, 2743.
Battistin, D. and Canestrelli, P. (2006). La serie storica delle maree a Venezia, 18722004 (in
Italian), Comune di Venezia. Istituzione Centro Previsione e Segnalazioni Maree.

ymat <- as.matrix(venice[, paste("r", 1:10, sep = "")])

fit1 <- vgam(ymat ~ s(year, df = 3), gumbel(R = 365, mpv = TRUE),
data = venice, trace = TRUE, na.action = na.pass)
head(fitted(fit1))

par(mfrow = c(2, 1), xpd = TRUE)

plot(fit1, se = TRUE, lcol = "blue", llwd = 2, slty = "dashed")

par(mfrow = c(1,1), bty = "l", xpd = TRUE, las = 1)

qtplot(fit1, mpv = TRUE, lcol = c(1, 2, 5), tcol = c(1, 2, 5),
llwd = 2, pcol = "blue", tadj = 0.1)

plot(sealevel ~ Year, data = venice90, type = "h", col = "blue")

summary(venice90)
dim(venice90)
round(100 * nrow(venice90) / ((2009 - 1940 + 1) * 365.26 * 24), digits = 3)

## End(Not run)

vgam Fitting Vector Generalized Additive Models

Description
Fit a vector generalized additive model (VGAM). Both 1st-generation VGAMs (based on backfit-
ting) and 2nd-generation VGAMs (based on P-splines, with automatic smoothing parameter selec-
tion) are implemented. This is a large class of models that includes generalized additive models
(GAMs) and vector generalized linear models (VGLMs) as special cases.

Usage
vgam(formula, family, data = list(), weights = NULL, subset = NULL,
na.action = na.fail, etastart = NULL, mustart = NULL,
coefstart = NULL, control = vgam.control(...), offset = NULL,
method = "vgam.fit", model = FALSE, x.arg = TRUE, y.arg = TRUE,
contrasts = NULL, constraints = NULL,
extra = list(), form2 = NULL, qr.arg = FALSE, smart = TRUE, ...)

Arguments
formula a symbolic description of the model to be fit. The RHS of the formula is applied
to each linear/additive predictor, and should include at least one sm.os term or
sm.ps term or s term. Mixing both together is not allowed. Different variables
in each linear/additive predictor can be chosen by specifying constraint matrices.
768 vgam

family Same as for vglm.

data an optional data frame containing the variables in the model. By default the vari-
ables are taken from environment(formula), typically the environment from
which vgam is called.
weights, subset, na.action
Same as for vglm. Note that subset may be unreliable and to get around this
problem it is best to use subset to create a new smaller data frame and feed
in the smaller data frame. See below for an example. This is a bug that needs
fixing.
etastart, mustart, coefstart
Same as for vglm.
control a list of parameters for controlling the fitting process. See vgam.control for
details.
method the method to be used in fitting the model. The default (and presently only)
method vgam.fit uses iteratively reweighted least squares (IRLS).
constraints, model, offset
Same as for vglm.
x.arg, y.arg logical values indicating whether the model matrix and response vector/matrix
used in the fitting process should be assigned in the x and y slots. Note the
model matrix is the LM model matrix; to get the VGAM model matrix type
model.matrix(vgamfit) where vgamfit is a vgam object.
contrasts, extra, form2, qr.arg, smart
Same as for vglm.
... further arguments passed into vgam.control.

Details
A vector generalized additive model (VGAM) is loosely defined as a statistical model that is a
function of M additive predictors. The central formula is given by
p
X
j = f(j)k (xk )
k=1

where xk is the kth explanatory variable (almost always x1 = 1 for the intercept term), and f(j)k
are smooth functions of xk that are estimated by smoothers. The first term in the summation is
just the intercept. Currently two types of smoothers are implemented: s represents the older and
more traditional one, called a vector (cubic smoothing spline) smoother and is based on Yee and
Wild (1996); it is more similar to the R package gam. The newer one is represented by sm.os and
sm.ps, and these are based on O-splines and P-splinesthey allow automatic smoothing parameter
selection; it is more similar to the R package mgcv.
In the above, j = 1, . . . , M where M is finite. If all the functions are constrained to be linear then
the resulting model is a vector generalized linear model (VGLM). VGLMs are best fitted with vglm.
Vector (cubic smoothing spline) smoothers are represented by s() (see s). Local regression via
lo() is not supported. The results of vgam will differ from the gam() (in the gam) because vgam()
uses a different knot selection algorithm. In general, fewer knots are chosen because the computa-
tion becomes expensive when the number of additive predictors M is large.
vgam 769

Second-generation VGAMs are based on the O-splines and P-splines. The latter is due to Eilers and
Marx (1996). Backfitting is not required, and estimation is performed using IRLS. The function
sm.os represents a smart implementation of O-splines. The function sm.ps represents a smart im-
plementation of P-splines. Written G2-VGAMs or P-VGAMs, this methodology should not be used
unless the sample size is reasonably large. Usually an UBRE predictive criterion is optimized (at
each IRLS iteration) because the scale parameter for VGAMs is usually assumed to be known. This
search for optimal smoothing parameters does not always converge, and neither is it totally reliable.
G2-VGAMs implicitly set criterion = "coefficients" so that convergence occurs when the
change in the regression coefficients between 2 IRLS iterations is sufficiently small. Otherwise the
search for the optimal smoothing parameters might cause the log-likelihood to decrease between
2 IRLS iterations. Currently outer iteration is implemented, by default, rather than performance
iteration because the latter is more easy to converge to a local solution; see Wood (2004) for details.
One can use performance iteration by setting Maxit.outer = 1 in vgam.control.
The underlying algorithm of VGAMs is IRLS. First-generation VGAMs (called G1-VGAMs) are
estimated by modified vector backfitting using vector splines. O-splines are used as the basis func-
tions for the vector (smoothing) splines, which are a lower dimensional version of natural B-splines.
The function vgam.fit() actually does the work. The smoothing code is based on F. OSullivans
BART code.
A closely related methodology based on VGAMs called constrained additive ordination (CAO)
first forms a linear combination of the explanatory variables (called latent variables) and then fits
a GAM to these. This is implemented in the function cao for a very limited choice of family
functions.

Value
For G1-VGAMs and G2-VGAMs, an object of class "vgam" or "pvgam" respectively (see vgam-class
and pvgam-class for further information).

WARNING
For G1-VGAMs, currently vgam can only handle constraint matrices cmat, say, such that crossprod(cmat)
is diagonal. It can be detected by is.buggy. VGAMs with constraint matrices that have non-
orthogonal columns should be fitted with sm.os or sm.ps terms instead of s.
See warnings in vglm.control.

The theory behind the scaling parameter is currently being made more rigorous, but it it should give
the same value as the scale parameter for GLMs.

Author(s)
Thomas W. Yee

References
Wood, S. N. (2004). Stable and efficient multiple smoothing parameter estimation for generalized
additive models. J. Amer. Statist. Assoc., 99(467): 673686.
Yee, T. W. and Wild, C. J. (1996) Vector generalized additive models. Journal of the Royal Statisti-
cal Society, Series B, Methodological, 58, 481493.
Yee, T. W. (2008) The VGAM Package. R News, 8, 2839.
Yee, T. W. (2015) Vector Generalized Linear and Additive Models: With an Implementation in R.
New York, USA: Springer.
Yee, T. W. (2016). Comments on Smoothing parameter and model selection for general smooth
models by Wood, S. N. and Pya, N. and Safken, N., J. Amer. Statist. Assoc., 110(516).

See Also
is.buggy, vgam.control, vgam-class, vglmff-class, plotvgam, summaryvgam, summarypvgam,
sm.os, sm.ps, s, magic. vglm, vsmooth.spline, cao.

Examples
# Nonparametric proportional odds model
pneumo <- transform(pneumo, let = log(exposure.time))
vgam(cbind(normal, mild, severe) ~ s(let),
cumulative(parallel = TRUE), data = pneumo, trace = TRUE)

# Nonparametric logistic regression

hfit <- vgam(agaaus ~ s(altitude, df = 2), binomialff, data = hunua)
## Not run: plot(hfit, se = TRUE)
phfit <- predict(hfit, type = "terms", raw = TRUE, se = TRUE)
names(phfit)
head(phfit$fitted)
head(phfit$se.fit)
phfit$df
phfit$sigma

# Fit two species simultaneously

hfit2 <- vgam(cbind(agaaus, kniexc) ~ s(altitude, df = c(2, 3)),
binomialff(multiple.responses = TRUE), data = hunua)
coef(hfit2, matrix = TRUE) # Not really interpretable
## Not run:
plot(hfit2, se = TRUE, overlay = TRUE, lcol = 3:4, scol = 3:4)
ooo <- with(hunua, order(altitude))
with(hunua, matplot(altitude[ooo], fitted(hfit2)[ooo,], ylim = c(0, 0.8),
xlab = "Altitude (m)", ylab = "Probability of presence", las = 1,
vgam-class 771

main = "Two plant species' response curves", type = "l", lwd = 2))
with(hunua, rug(altitude))

## End(Not run)

# The 'subset' argument does not work here. Use subset() instead.
set.seed(1)
zdata <- data.frame(x2 = runif(nn <- 500))
zdata <- transform(zdata, y = rbinom(nn, 1, 0.5))
zdata <- transform(zdata, subS = runif(nn) < 0.7)
sub.zdata <- subset(zdata, subS) # Use this instead
if (FALSE)
fit4a <- vgam(cbind(y, y) ~ s(x2, df = 2),
binomialff(multiple.responses = TRUE),
data = zdata, subset = subS) # This fails!!!
fit4b <- vgam(cbind(y, y) ~ s(x2, df = 2),
binomialff(multiple.responses = TRUE),
data = sub.zdata) # This succeeds!!!
fit4c <- vgam(cbind(y, y) ~ sm.os(x2),
binomialff(multiple.responses = TRUE),
data = sub.zdata) # This succeeds!!!
## Not run: par(mfrow = c(2, 2))
plot(fit4b, se = TRUE, shade = TRUE, shcol = "pink")
plot(fit4c, se = TRUE, shade = TRUE, shcol = "pink")

## End(Not run)

vgam-class Class vgam

Description
Vector generalized additive models.

Objects from the Class

Objects can be created by calls of the form vgam(...).

Slots
nl.chisq: Object of class "numeric". Nonlinear chi-squared values.
nl.df: Object of class "numeric". Nonlinear chi-squared degrees of freedom values.
spar: Object of class "numeric" containing the (scaled) smoothing parameters.
s.xargument: Object of class "character" holding the variable name of any s() terms.
var: Object of class "matrix" holding approximate pointwise standard error information.
Bspline: Object of class "list" holding the scaled (internal and boundary) knots, and the fitted
B-spline coefficients. These are used for prediction.
772 vgam-class

extra: Object of class "list"; the extra argument on entry to vglm. This contains any extra
information that might be needed by the family function.
family: Object of class "vglmff". The family function.
iter: Object of class "numeric". The number of IRLS iterations used.
predictors: Object of class "matrix" with M columns which holds the M linear predictors.
assign: Object of class "list", from class "vlm". This named list gives information matching
the columns and the (LM) model matrix terms.
call: Object of class "call", from class "vlm". The matched call.
coefficients: Object of class "numeric", from class "vlm". A named vector of coefficients.
constraints: Object of class "list", from class "vlm". A named list of constraint matrices
used in the fitting.
contrasts: Object of class "list", from class "vlm". The contrasts used (if any).
control: Object of class "list", from class "vlm". A list of parameters for controlling the fitting
process. See vglm.control for details.
criterion: Object of class "list", from class "vlm". List of convergence criterion evaluated at
the final IRLS iteration.
df.residual: Object of class "numeric", from class "vlm". The residual degrees of freedom.
df.total: Object of class "numeric", from class "vlm". The total degrees of freedom.
dispersion: Object of class "numeric", from class "vlm". The scaling parameter.
effects: Object of class "numeric", from class "vlm". The effects.
fitted.values: Object of class "matrix", from class "vlm". The fitted values. This is usually
the mean but may be quantiles, or the location parameter, e.g., in the Cauchy model.
misc: Object of class "list", from class "vlm". A named list to hold miscellaneous parameters.
model: Object of class "data.frame", from class "vlm". The model frame.
na.action: Object of class "list", from class "vlm". A list holding information about missing
values.
offset: Object of class "matrix", from class "vlm". If non-zero, a M -column matrix of offsets.
post: Object of class "list", from class "vlm" where post-analysis results may be put.
preplot: Object of class "list", from class "vlm" used by plotvgam; the plotting parameters
may be put here.
prior.weights: Object of class "matrix", from class "vlm" holding the initially supplied weights.
qr: Object of class "list", from class "vlm". QR decomposition at the final iteration.
R: Object of class "matrix", from class "vlm". The R matrix in the QR decomposition used in
the fitting.
rank: Object of class "integer", from class "vlm". Numerical rank of the fitted model.
residuals: Object of class "matrix", from class "vlm". The working residuals at the final IRLS
iteration.
ResSS: Object of class "numeric", from class "vlm". Residual sum of squares at the final IRLS
iteration with the adjusted dependent vectors and weight matrices.
vgam-class 773

smart.prediction: Object of class "list", from class "vlm". A list of data-dependent parame-
ters (if any) that are used by smart prediction.
terms: Object of class "list", from class "vlm". The terms object used.
weights: Object of class "matrix", from class "vlm". The weight matrices at the final IRLS
iteration. This is in matrix-band form.
x: Object of class "matrix", from class "vlm". The model matrix (LM, not VGLM).
xlevels: Object of class "list", from class "vlm". The levels of the factors, if any, used in
fitting.
y: Object of class "matrix", from class "vlm". The response, in matrix form.
Xm2: Object of class "matrix", from class "vlm". See vglm-class).
Ym2: Object of class "matrix", from class "vlm". See vglm-class).
callXm2: Object of class "call", from class "vlm". The matched call for argument form2.

Extends
Class "vglm", directly. Class "vlm", by class "vglm".

Methods
cdf signature(object = "vglm"): cumulative distribution function. Useful for quantile regres-
sion and extreme value data models.
deplot signature(object = "vglm"): density plot. Useful for quantile regression models.
deviance signature(object = "vglm"): deviance of the model (where applicable).
plot signature(x = "vglm"): diagnostic plots.
predict signature(object = "vglm"): extract the additive predictors or predict the additive
predictors at a new data frame.
print signature(x = "vglm"): short summary of the object.
qtplot signature(object = "vglm"): quantile plot (only applicable to some models).
resid signature(object = "vglm"): residuals. There are various types of these.
residuals signature(object = "vglm"): residuals. Shorthand for resid.
rlplot signature(object = "vglm"): return level plot. Useful for extreme value data models.
summary signature(object = "vglm"): a more detailed summary of the object.

Note
VGAMs have all the slots that vglm objects have (vglm-class), plus the first few slots described in
the section above.

Author(s)
Thomas W. Yee
774 vgam.control

References
Yee, T. W. and Wild, C. J. (1996) Vector generalized additive models. Journal of the Royal Statisti-
cal Society, Series B, Methodological, 58, 481493.

See Also
vgam.control, vglm, s, vglm-class, vglmff-class.

Examples
# Fit a nonparametric proportional odds model
pneumo <- transform(pneumo, let = log(exposure.time))
vgam(cbind(normal, mild, severe) ~ s(let),
cumulative(parallel = TRUE), data = pneumo)

vgam.control Control Function for vgam()

Description
Algorithmic constants and parameters for running vgam are set using this function.

Usage
vgam.control(all.knots = FALSE, bf.epsilon = 1e-07, bf.maxit = 30,
checkwz=TRUE, Check.rank = TRUE, Check.cm.rank = TRUE,
criterion = names(.min.criterion.VGAM),
epsilon = 1e-07, maxit = 30, Maxit.outer = 20,
noWarning = FALSE,
na.action = na.fail,
nk = NULL, save.weights = FALSE, se.fit = TRUE,
trace = FALSE, wzepsilon = .Machine$double.eps^0.75,
xij = NULL, gamma.arg = 1, ...)

Arguments
all.knots logical indicating if all distinct points of the smoothing variables are to be used
as knots. By default, all.knots=TRUE for n 40, and for n > 40, the number
of knots is approximately 40 + (n 40)0.25 . This increases very slowly with n
so that the number of knots is approximately between 50 and 60 for large n.
bf.epsilon tolerance used by the modified vector backfitting algorithm for testing conver-
gence. Must be a positive number.
bf.maxit maximum number of iterations allowed in the modified vector backfitting algo-
rithm. Must be a positive integer.
checkwz logical indicating whether the diagonal elements of the working weight matri-
ces should be checked whether they are sufficiently positive, i.e., greater than
wzepsilon. If not, any values less than wzepsilon are replaced with this value.
vgam.control 775

Check.rank, Check.cm.rank
See vglm.control.
criterion character variable describing what criterion is to be used to test for convergence.
The possibilities are listed in .min.criterion.VGAM, but most family functions
only implement a few of these.
epsilon positive convergence tolerance epsilon. Roughly speaking, the Newton-Raphson/Fisher-
scoring/local-scoring iterations are assumed to have converged when two suc-
cessive criterion values are within epsilon of each other.
maxit maximum number of Newton-Raphson/Fisher-scoring/local-scoring iterations
allowed.
Maxit.outer maximum number of outer iterations allowed when there are sm.os or sm.ps
terms. See vgam for a little information about the default outer iteration. Note
that one can use performance iteration by setting Maxit.outer = 1; then the
smoothing parameters will be automatically chosen at each IRLS iteration (some
specific programming allows this).
na.action how to handle missing values. Unlike the SPLUS gam function, vgam cannot
handle NAs when smoothing.
nk vector of length d containing positive integers. where d be the number of s terms
in the formula. Recycling is used if necessary. The ith value is the number of
B-spline coefficients to be estimated for each component function of the ith s()
term. nk differs from the number of knots by some constant. If specified, nk
overrides the automatic knot selection procedure.
save.weights logical indicating whether the weights slot of a "vglm" object will be saved on
the object. If not, it will be reconstructed when needed, e.g., summary.
se.fit logical indicating whether approximate pointwise standard errors are to be saved
on the object. If TRUE, then these can be plotted with plot(..., se = TRUE).
trace logical indicating if output should be produced for each iteration.
wzepsilon Small positive number used to test whether the diagonals of the working weight
matrices are sufficiently positive.
noWarning Same as vglm.control.
xij Same as vglm.control.
gamma.arg Numeric; same as gamma in magic. Inflation factor for optimizing the UBRE/GCV
criterion. If given, a suggested value is 1.4 to help avoid overfitting, based on
the work of Gu and co-workers (values between 1.2 and 1.4 appeared reason-
able, based on simulations). A warning may be given if the value is deemed
out-of-range.
... other parameters that may be picked up from control functions that are specific
to the VGAM family function.

Details
Most of the control parameters are used within vgam.fit and you will have to look at that to un-
derstand the full details. Many of the control parameters are used in a similar manner by vglm.fit
(vglm) because the algorithm (IRLS) is very similar.
776 vglm

Setting save.weights=FALSE is useful for some models because the weights slot of the object
is often the largest and so less memory is used to store the object. However, for some VGAM
family function, it is necessary to set save.weights=TRUE because the weights slot cannot be
reconstructed later.

Value
A list with components matching the input names. A little error checking is done, but not much.
The list is assigned to the control slot of vgam objects.

Warning
See vglm.control.

Note
vgam does not implement half-stepsizing, therefore parametric models should be fitted with vglm.
Also, vgam is slower than vglm too.

Author(s)
Thomas W. Yee

References
Yee, T. W. and Wild, C. J. (1996) Vector generalized additive models. Journal of the Royal Statisti-
cal Society, Series B, Methodological, 58, 481493.

See Also
vgam, vglm.control, vsmooth.spline, vglm.

Examples
pneumo <- transform(pneumo, let = log(exposure.time))
vgam(cbind(normal, mild, severe) ~ s(let, df = 2), multinomial,
data = pneumo, trace = TRUE, eps = 1e-4, maxit = 10)

vglm Fitting Vector Generalized Linear Models

Description
vglm is used to fit vector generalized linear models (VGLMs). This is a very large class of models
that includes generalized linear models (GLMs) as a special case.
vglm 777

Usage
vglm(formula, family, data = list(), weights = NULL, subset = NULL,
na.action = na.fail, etastart = NULL, mustart = NULL,
coefstart = NULL, control = vglm.control(...), offset = NULL,
method = "vglm.fit", model = FALSE, x.arg = TRUE, y.arg = TRUE,
contrasts = NULL, constraints = NULL, extra = list(),
form2 = NULL, qr.arg = TRUE, smart = TRUE, ...)

Arguments
formula a symbolic description of the model to be fit. The RHS of the formula is ap-
plied to each linear predictor. Different variables in each linear predictor can be
chosen by specifying constraint matrices.
family a function of class "vglmff" (see vglmff-class) describing what statistical
model is to be fitted. This is called a VGAM family function. See CommonVGAMffArguments
for general information about many types of arguments found in this type of
function. The argument name "family" is used loosely and for the ease of ex-
isting glm users; there is no concept of a formal error distribution for VGLMs.
Possibly the argument name should be better "model" but unfortunately that
name has already been taken.
data an optional data frame containing the variables in the model. By default the vari-
ables are taken from environment(formula), typically the environment from
which vglm is called.
weights an optional vector or matrix of (prior fixed and known) weights to be used in
the fitting process. If the VGAM family function handles multiple responses
(Q > 1 of them, say) then weights can be a matrix with Q columns. Each
column matches the respective response. If it is a vector (the usually case) then
it is recycled into a matrix with Q columns. The values of weights must be
positive; try setting a very small value such as 1.0e-8 to effectively delete an
observation.
Currently the weights argument does not support sampling weights from com-
plex sampling designs. And currently sandwich estimators are not computed in
any shape or form. The present weights are multiplied by the corresponding
log-likelihood contributions.
subset an optional logical vector specifying a subset of observations to be used in the
fitting process.
na.action a function which indicates what should happen when the data contain NAs. The
default is set by the na.action setting of options, and is na.fail if that is
unset. The factory-fresh default is na.omit.
etastart starting values for the linear predictors. It is a M -column matrix with the same
number of rows as the response. If M = 1 then it may be a vector. Note that
etastart and the output of predict(fit) should be comparable. Here, fit is
the fitted object.
mustart starting values for the fitted values. It can be a vector or a matrix; if a matrix,
then it has the same number of rows as the response. Usually mustart and the
output of fitted(fit) should be comparable. Some family functions do not
make use of this argument.
778 vglm

coefstart starting values for the coefficient vector. The length and order must match that
of coef(fit).
control a list of parameters for controlling the fitting process. See vglm.control for
details.
offset a vector or M -column matrix of offset values. These are a priori known and are
added to the linear/additive predictors during fitting.
method the method to be used in fitting the model. The default (and presently only)
method vglm.fit() uses iteratively reweighted least squares (IRLS).
model a logical value indicating whether the model frame should be assigned in the
model slot.
x.arg, y.arg logical values indicating whether the model matrix and response vector/matrix
used in the fitting process should be assigned in the x and y slots. Note the
model matrix is the LM model matrix; to get the VGLM model matrix type
model.matrix(vglmfit) where vglmfit is a vglm object.
contrasts an optional list. See the contrasts.arg of model.matrix.default.
constraints an optional list of constraint matrices. The components of the list must be named
with the term it corresponds to (and it must match in character format exactly).
There are two types of input: "lm"-type and "vlm"-type. The former is a subset
of the latter. The former has a matrix for each term of the LM matrix. The latter
has a matrix for each column of the VLM matrix. After fitting, the constraints
extractor function may be applied; it returns the "vlm"-type list of constraint
matrices by default. If "lm"-type are returned by constraints then these can
be fed into this argument and it should give the same model as before.
Each constraint matrix must have M rows, and be of full-column rank. By de-
fault, constraint matrices are the M by M identity matrix unless arguments in
the family function itself override these values, e.g., parallel (see CommonVGAMffArguments).
If constraints is used it must contain all the terms; an incomplete list is not
accepted.
extra an optional list with any extra information that might be needed by the VGAM
family function.
form2 The second (optional) formula. If argument xij is used (see vglm.control)
then form2 needs to have all terms in the model. Also, some VGAM family
functions such as micmen use this argument to input the regressor variable. If
given, the slots @Xm2 and @Ym2 may be assigned. Note that smart prediction
applies to terms in form2 too.
qr.arg logical value indicating whether the slot qr, which returns the QR decomposi-
tion of the VLM model matrix, is returned on the object.
smart logical value indicating whether smart prediction (smartpred) will be used.
... further arguments passed into vglm.control.

Details
A vector generalized linear model (VGLM) is loosely defined as a statistical model that is a function
of M linear predictors. The central formula is given by

j = jT x
vglm 779

where x is a vector of explanatory variables (sometimes just a 1 for an intercept), and j is a vector
of regression coefficients to be estimated. Here, j = 1, . . . , M , where M is finite. Then one can
write = (1 , . . . , M )T as a vector of linear predictors.
Most users will find vglm similar in flavour to glm. The function vglm.fit actually does the work.

Value
An object of class "vglm", which has the following slots. Some of these may not be assigned to
save space, and will be recreated if necessary later.

extra the list extra at the end of fitting.

family the family function (of class "vglmff").
iter the number of IRLS iterations used.
predictors a M -column matrix of linear predictors.
assign a named list which matches the columns and the (LM) model matrix terms.
call the matched call.
coefficients a named vector of coefficients.
constraints a named list of constraint matrices used in the fitting.
contrasts the contrasts used (if any).
control list of control parameter used in the fitting.
criterion list of convergence criterion evaluated at the final IRLS iteration.
df.residual the residual degrees of freedom.
df.total the total degrees of freedom.
dispersion the scaling parameter.
effects the effects.
fitted.values the fitted values, as a matrix. This is often the mean but may be quantiles, or the
location parameter, e.g., in the Cauchy model.
misc a list to hold miscellaneous parameters.
model the model frame.
na.action a list holding information about missing values.
offset if non-zero, a M -column matrix of offsets.
post a list where post-analysis results may be put.
preplot used by plotvgam, the plotting parameters may be put here.
prior.weights initially supplied weights (the weights argument). Also see weightsvglm.
qr the QR decomposition used in the fitting.
R the R matrix in the QR decomposition used in the fitting.
rank numerical rank of the fitted model.
residuals the working residuals at the final IRLS iteration.
ResSS residual sum of squares at the final IRLS iteration with the adjusted dependent
vectors and weight matrices.
780 vglm

smart.prediction
a list of data-dependent parameters (if any) that are used by smart prediction.
terms the terms object used.
weights the working weight matrices at the final IRLS iteration. This is in matrix-band
form.
x the model matrix (linear model LM, not VGLM).
xlevels the levels of the factors, if any, used in fitting.
y the response, in matrix form.
This slot information is repeated at vglm-class.

WARNING
See warnings in vglm.control. Also, see warnings under weights above regarding sampling
weights from complex sampling designs.

Note
This function can fit a wide variety of statistical models. Some of these are harder to fit than
others because of inherent numerical difficulties associated with some of them. Successful model
fitting benefits from cumulative experience. Varying the values of arguments in the VGAM family
function itself is a good first step if difficulties arise, especially if initial values can be inputted. A
second, more general step, is to vary the values of arguments in vglm.control. A third step is to
make use of arguments such as etastart, coefstart and mustart.
Some VGAM family functions end in "ff" to avoid interference with other functions, e.g., binomialff,
poissonff, gaussianff, gammaff. This is because VGAM family functions are incompatible with
glm (and also gam in the gam library and gam in the mgcv library).
The smart prediction (smartpred) library is incorporated within the VGAM library.
The theory behind the scaling parameter is currently being made more rigorous, but it it should give
the same value as the scale parameter for GLMs.
In Example 5 below, the xij argument to illustrate covariates that are specific to a linear predictor.
Here, lop/rop are the ocular pressures of the left/right eye (artificial data). Variables leye and
reye might be the presence/absence of a particular disease on the LHS/RHS eye respectively. See
vglm.control and fill for more details and examples.

Author(s)
Thomas W. Yee

Yee, T. W. (2014) Reduced-rank vector generalized linear models with two linear predictors. Com-
putational Statistics and Data Analysis, 71, 889902.
Yee, T. W. (2008) The VGAM Package. R News, 8, 2839.

See Also
vglm.control, vglm-class, vglmff-class, smartpred, vglm.fit, fill, rrvglm, vgam. Meth-
ods functions include coefvlm, constraints.vlm, hatvaluesvlm, linkfun.vglm, plotvglm,
predictvglm, summary.vglm, AIC.vglm, lrtest_vglm, etc.

Examples
# Example 1. See help(glm)
print(d.AD <- data.frame(treatment = gl(3, 3),
outcome = gl(3, 1, 9),
counts = c(18,17,15,20,10,20,25,13,12)))
vglm.D93 <- vglm(counts ~ outcome + treatment, family = poissonff,
data = d.AD, trace = TRUE)
summary(vglm.D93)

# Example 2. Multinomial logit model

pneumo <- transform(pneumo, let = log(exposure.time))
vglm(cbind(normal, mild, severe) ~ let, multinomial, data = pneumo)

# Example 3. Proportional odds model

fit3 <- vglm(cbind(normal, mild, severe) ~ let, propodds, data = pneumo)
coef(fit3, matrix = TRUE)
constraints(fit3)
model.matrix(fit3, type = "lm") # LM model matrix
model.matrix(fit3) # Larger VGLM (or VLM) model matrix

# Example 4. Bivariate logistic model

fit4 <- vglm(cbind(nBnW, nBW, BnW, BW) ~ age, binom2.or, coalminers)
coef(fit4, matrix = TRUE)
depvar(fit4) # Response are proportions
weights(fit4, type = "prior")

# Example 5. The use of the xij argument (simple case).

# The constraint matrix for 'op' has one column.
nn <- 1000
eyesdat <- round(data.frame(lop = runif(nn),
rop = runif(nn),
op = runif(nn)), digits = 2)
eyesdat <- transform(eyesdat, eta1 = -1 + 2 * lop,
eta2 = -1 + 2 * lop)
eyesdat <- transform(eyesdat,
leye = rbinom(nn, size = 1, prob = logit(eta1, inverse = TRUE)),
reye = rbinom(nn, size = 1, prob = logit(eta2, inverse = TRUE)))
782 vglm-class

head(eyesdat)
fit5 <- vglm(cbind(leye, reye) ~ op,
binom2.or(exchangeable = TRUE, zero = 3),
data = eyesdat, trace = TRUE,
xij = list(op ~ lop + rop + fill(lop)),
form2 = ~ op + lop + rop + fill(lop))
coef(fit5)
coef(fit5, matrix = TRUE)
constraints(fit5)

vglm-class Class vglm

Description
Vector generalized linear models.

Objects from the Class

Objects can be created by calls of the form vglm(...).

Slots
In the following, M is the number of linear predictors.

effects: Object of class "numeric", from class "vlm". The effects.

fitted.values: Object of class "matrix", from class "vlm". The fitted values.
misc: Object of class "list", from class "vlm". A named list to hold miscellaneous parameters.
model: Object of class "data.frame", from class "vlm". The model frame.
na.action: Object of class "list", from class "vlm". A list holding information about missing
values.
offset: Object of class "matrix", from class "vlm". If non-zero, a M -column matrix of offsets.
post: Object of class "list", from class "vlm" where post-analysis results may be put.
preplot: Object of class "list", from class "vlm" used by plotvgam; the plotting parameters
may be put here.
prior.weights: Object of class "matrix", from class "vlm" holding the initially supplied weights.
qr: Object of class "list", from class "vlm". QR decomposition at the final iteration.
R: Object of class "matrix", from class "vlm". The R matrix in the QR decomposition used in
the fitting.
rank: Object of class "integer", from class "vlm". Numerical rank of the fitted model.
residuals: Object of class "matrix", from class "vlm". The working residuals at the final IRLS
iteration.
ResSS: Object of class "numeric", from class "vlm". Residual sum of squares at the final IRLS
iteration with the adjusted dependent vectors and weight matrices.
smart.prediction: Object of class "list", from class "vlm". A list of data-dependent parame-
ters (if any) that are used by smart prediction.
terms: Object of class "list", from class "vlm". The terms object used.
weights: Object of class "matrix", from class "vlm". The weight matrices at the final IRLS
iteration. This is in matrix-band form.
x: Object of class "matrix", from class "vlm". The model matrix (LM, not VGLM).
xlevels: Object of class "list", from class "vlm". The levels of the factors, if any, used in
fitting.
y: Object of class "matrix", from class "vlm". The response, in matrix form.
Xm2: Object of class "matrix", from class "vlm". See vglm-class).
Ym2: Object of class "matrix", from class "vlm". See vglm-class).
callXm2: Object of class "call", from class "vlm". The matched call for argument form2.

Extends
Class "vlm", directly.

Methods
cdf signature(object = "vglm"): cumulative distribution function. Applicable to, e.g., quantile
regression and extreme value data models.
deplot signature(object = "vglm"): Applicable to, e.g., quantile regression.
784 vglm.control

deviance signature(object = "vglm"): deviance of the model (where applicable).

plot signature(x = "vglm"): diagnostic plots.
predict signature(object = "vglm"): extract the linear predictors or predict the linear predic-
tors at a new data frame.
print signature(x = "vglm"): short summary of the object.
qtplot signature(object = "vglm"): quantile plot (only applicable to some models).
resid signature(object = "vglm"): residuals. There are various types of these.
residuals signature(object = "vglm"): residuals. Shorthand for resid.
rlplot signature(object = "vglm"): return level plot. Useful for extreme value data models.
summary signature(object = "vglm"): a more detailed summary of the object.

Author(s)
Thomas W. Yee

See Also
vglm, vglmff-class, vgam-class.

Examples
# Multinomial logit model
pneumo <- transform(pneumo, let = log(exposure.time))
vglm(cbind(normal, mild, severe) ~ let, multinomial, data = pneumo)

vglm.control Control Function for vglm()

Description
Algorithmic constants and parameters for running vglm are set using this function.

Usage
vglm.control(checkwz = TRUE, Check.rank = TRUE, Check.cm.rank = TRUE,
criterion = names(.min.criterion.VGAM),
epsilon = 1e-07, half.stepsizing = TRUE,
maxit = 30, noWarning = FALSE,
stepsize = 1, save.weights = FALSE,
trace = FALSE, wzepsilon = .Machine$double.eps^0.75,
xij = NULL, ...)
vglm.control 785

Arguments
checkwz logical indicating whether the diagonal elements of the working weight matri-
ces should be checked whether they are sufficiently positive, i.e., greater than
wzepsilon. If not, any values less than wzepsilon are replaced with this value.
Check.rank logical indicating whether the rank of the VLM matrix should be checked. If
this is not of full column rank then the results are not to be trusted. The default
is to give an error message if the VLM matrix is not of full column rank.
Check.cm.rank logical indicating whether the rank of each constraint matrix should be checked.
If this is not of full column rank then an error will occur. Under no circumstances
should any constraint matrix have a rank less than the number of columns.
criterion character variable describing what criterion is to be used to test for convergence.
The possibilities are listed in .min.criterion.VGAM, but most family functions
only implement a few of these.
epsilon positive convergence tolerance epsilon. Roughly speaking, the Newton-Raphson/Fisher-
scoring iterations are assumed to have converged when two successive criterion
values are within epsilon of each other.
half.stepsizing
logical indicating if half-stepsizing is allowed. For example, in maximizing
a log-likelihood, if the next iteration has a log-likelihood that is less than the
current value of the log-likelihood, then a half step will be taken. If the log-
likelihood is still less than at the current position, a quarter-step will be taken etc.
Eventually a step will be taken so that an improvement is made to the conver-
gence criterion. half.stepsizing is ignored if criterion == "coefficients".
maxit maximum number of (usually Fisher-scoring) iterations allowed. Sometimes
Newton-Raphson is used.
noWarning logical indicating whether to suppress a warning if convergence is not obtained
within maxit iterations. This is ignored if maxit = 1 is set.
stepsize usual step size to be taken between each Newton-Raphson/Fisher-scoring itera-
tion. It should be a value between 0 and 1, where a value of unity corresponds
to an ordinary step. A value of 0.5 means half-steps are taken. Setting a value
near zero will cause convergence to be generally slow but may help increase the
chances of successful convergence for some family functions.
save.weights logical indicating whether the weights slot of a "vglm" object will be saved on
the object. If not, it will be reconstructed when needed, e.g., summary. Some
family functions have save.weights = TRUE and others have save.weights = FALSE
in their control functions.
trace logical indicating if output should be produced for each iteration. Setting trace = TRUE
is recommended in general because VGAM fits a very broad variety of models
and distributions, and for some of them, convergence is intrinsically more diffi-
cult. Monitoring convergence can help check that the solution is reasonable or
that a problem has occurred. It may suggest better initial values are needed, the
making of invalid assumptions, or that the model is inappropriate for the data,
etc.
wzepsilon small positive number used to test whether the diagonals of the working weight
matrices are sufficiently positive.
786 vglm.control

xij A formula or a list of formulas. Each formula has a RHS giving M terms making
up a covariate-dependent term (whose name is the response). That is, it creates a
variable that takes on different values for each linear/additive predictor, e.g., the
ocular pressure of each eye. The M terms must be unique; use fill1, fill2,
fill3, etc. if necessary. Each formula should have a response which is taken as
the name of that variable, and the M terms are enumerated in sequential order.
Each of the M terms multiply each successive row of the constraint matrix.
When xij is used, the use of form2 is also required to give every term used by
the model.
The function Select can be used to select variables beginning with the same
character string.
... other parameters that may be picked up from control functions that are specific
to the VGAM family function.

Details
Most of the control parameters are used within vglm.fit and you will have to look at that to
understand the full details.
Setting save.weights = FALSE is useful for some models because the weights slot of the object is
the largest and so less memory is used to store the object. However, for some VGAM family func-
tion, it is necessary to set save.weights = TRUE because the weights slot cannot be reconstructed
later.

Value
A list with components matching the input names. A little error checking is done, but not much.
The list is assigned to the control slot of vglm objects.

Warning
For some applications the default convergence criterion should be tightened. Setting something like
criterion = "coef", epsilon = 1e-09 is one way to achieve this, and also add trace = TRUE to
monitor the convergence. Setting maxit to some higher number is usually not needed, and needing
to do so suggests something is wrong, e.g., an ill-conditioned model, over-fitting or under-fitting.

Note
Reiterating from above, setting trace = TRUE is recommended in general.
In Example 2 below there are two covariates that have linear/additive predictor specific values.
These are handled using the xij argument.

Author(s)
Thomas W. Yee

References
Yee, T. W. and Hastie, T. J. (2003) Reduced-rank vector generalized linear models. Statistical
Modelling, 3, 1541.
vglm.control 787

See Also
vglm, fill1. The authors homepage has further documentation about the xij argument; see also
Select.

Examples
# Example 1.
pneumo <- transform(pneumo, let = log(exposure.time))
vglm(cbind(normal, mild, severe) ~ let, multinomial, data = pneumo,
crit = "coef", step = 0.5, trace = TRUE, epsil = 1e-8, maxit = 40)

# Example 2. The use of the xij argument (simple case).

ymat <- rdiric(n <- 1000, shape = rep(exp(2), len = 4))
mydat <- data.frame(x1 = runif(n), x2 = runif(n), x3 = runif(n),
x4 = runif(n),
z1 = runif(n), z2 = runif(n), z3 = runif(n),
z4 = runif(n))
mydat <- transform(mydat, X = x1, Z = z1)
mydat <- round(mydat, digits = 2)
fit2 <- vglm(ymat ~ X + Z,
dirichlet(parallel = TRUE), data = mydat, trace = TRUE,
xij = list(Z ~ z1 + z2 + z3 + z4,
X ~ x1 + x2 + x3 + x4),
form2 = ~ Z + z1 + z2 + z3 + z4 +
X + x1 + x2 + x3 + x4)
head(model.matrix(fit2, type = "lm")) # LM model matrix
head(model.matrix(fit2, type = "vlm")) # Big VLM model matrix
coef(fit2)
coef(fit2, matrix = TRUE)
max(abs(predict(fit2)-predict(fit2, new = mydat))) # Predicts correctly
summary(fit2)
## Not run:
# plotvgam(fit2, se = TRUE, xlab = "x1", which.term = 1) # Bug!
# plotvgam(fit2, se = TRUE, xlab = "z1", which.term = 2) # Bug!
plotvgam(fit2, xlab = "x1") # Correct
plotvgam(fit2, xlab = "z1") # Correct

## End(Not run)

# Example 3. The use of the xij argument (complex case).

set.seed(123)
coalminers <- transform(coalminers,
Age = (age - 42) / 5,
dum1 = round(runif(nrow(coalminers)), digits = 2),
dum2 = round(runif(nrow(coalminers)), digits = 2),
dum3 = round(runif(nrow(coalminers)), digits = 2),
dumm = round(runif(nrow(coalminers)), digits = 2))
BS <- function(x, ..., df = 3)
sm.bs(c(x,...), df = df)[1:length(x),,drop = FALSE]
NS <- function(x, ..., df = 3)
788 vglmff-class

sm.ns(c(x,...), df = df)[1:length(x),,drop = FALSE]

# Equivalently...
BS <- function(x, ..., df = 3)
head(sm.bs(c(x,...), df = df), length(x), drop = FALSE)
NS <- function(x, ..., df = 3)
head(sm.ns(c(x,...), df = df), length(x), drop = FALSE)

fit3 <- vglm(cbind(nBnW,nBW,BnW,BW) ~ Age + NS(dum1, dum2),

fam = binom2.or(exchangeable = TRUE, zero = 3),
xij = list(NS(dum1, dum2) ~ NS(dum1, dum2) +
NS(dum2, dum1) +
fill(NS( dum1))),
form2 = ~ NS(dum1, dum2) + NS(dum2, dum1) + fill(NS(dum1)) +
dum1 + dum2 + dum3 + Age + age + dumm,
data = coalminers, trace = TRUE)
head(model.matrix(fit3, type = "lm")) # LM model matrix
head(model.matrix(fit3, type = "vlm")) # Big VLM model matrix
coef(fit3)
coef(fit3, matrix = TRUE)
## Not run:
plotvgam(fit3, se = TRUE, lcol = "red", scol = "blue", xlab = "dum1")

## End(Not run)

vglmff-class Class vglmff

Description
Family functions for the VGAM package

Objects from the Class

Objects can be created by calls of the form new("vglmff", ...).

Slots
In the following, M is the number of linear/additive predictors.

blurb: Object of class "character" giving a small description of the model. Important arguments
such as parameter link functions can be expressed here.
constraints: Object of class "expression" which sets up any constraint matrices defined by
arguments in the family function. A zero argument is always fed into cm.zero.vgam, whereas
other constraints are fed into cm.vgam.
deviance: Object of class "function" returning the deviance of the model. This slot is optional. If
present, the function must have arguments function(mu, y, w, residuals = FALSE, eta, extra = NULL).
Deviance residuals are returned if residuals = TRUE.
vglmff-class 789

fini: Object of class "expression" to insert code at a special position in vglm.fit or vgam.fit.
This code is evaluated immediately after the fitting.
first: Object of class "expression" to insert code at a special position in vglm or vgam.
infos: Object of class "function" which returns a list with components such as M1. At present
only a very few VGAM family functions have this feature implemented. Those that do do not
require specifying the M1 argument when used with rcim.
initialize: Object of class "expression" used to perform error checking (especially for the
variable y) and obtain starting values for the model. In general, etastart or mustart are
assigned values based on the variables y, x and w.
linkinv: Object of class "function" which returns the fitted values, given the linear/additive
predictors. The function must have arguments function(eta, extra = NULL).
last: Object of class "expression" to insert code at a special position (at the very end) of
vglm.fit() or vgam.fit(). This code is evaluated after the fitting. The list misc is often
assigned components in this slot, which becomes the misc slot on the fitted object.
linkfun: Object of class "function" which, given the fitted values, returns the linear/additive
predictors. If present, the function must have arguments function(mu, extra = NULL).
Most VGAM family functions do not have a linkfun function. They largely are for classical
exponential families, i.e., GLMs.
loglikelihood: Object of class "function" returning the log-likelihood of the model. This slot is
optional. If present, the function must have arguments function(mu, y, w, residuals = FALSE, eta, extra = NUL
The argument residuals can be ignored because log-likelihood residuals arent defined.
middle: Object of class "expression" to insert code at a special position in vglm.fit or vgam.fit.
middle2: Object of class "expression" to insert code at a special position in vglm.fit or vgam.fit.
simslot: Object of class "function" to allow simulate to work.
summary.dispersion: Object of class "logical" indicating whether the general VGLM formula
(based on a residual sum of squares) can be used for computing the scaling/dispersion param-
eter. It is TRUE for most models except for nonlinear regression models.
vfamily: Object of class "character" giving class information about the family function. Al-
though not developed at this stage, more flexible classes are planned in the future. For exam-
ple, family functions sratio, cratio, cumulative, and acat all operate on categorical data,
therefore will have a special class called "VGAMcat", say. Then if fit was a vglm object, then
coef(fit) would print out the vglm coefficients plus "VGAMcat" information as well.
deriv: Object of class "expression" which returns a M -column matrix of first derivatives of the
log-likelihood function with respect to the linear/additive predictors, i.e., the score vector. In
Yee and Wild (1996) this is the di vector. Thus each row of the matrix returned by this slot is
such a vector.
weight: Object of class "expression" which returns the second derivatives of the log-likelihood
function with respect to the linear/additive predictors. This can be either the observed or
expected information matrix, i.e., Newton-Raphson or Fisher-scoring respectively. In Yee and
Wild (1996) this is the Wi matrix. Thus each row of the matrix returned by this slot is such a
matrix. Like the weights slot of vglm/vgam, it is stored in matrix-band form, whereby the first
M columns of the matrix are the diagonals, followed by the upper-diagonal band, followed
by the band above that, etc. In this case, there can be up to M (M + 1) columns, with the last
column corresponding to the (1,M ) elements of the weight matrices.
validfitted, validparams: Functions that test that the fitted values and all parameters are within
range. These functions can issue a warning if violations are detected.
790 vglmff-class

Methods

print signature(x = "vglmff"): short summary of the family function.

Warning

VGAM family functions are not compatible with glm, nor gam (from either gam or mgcv packages).

Note

With link functions etc., one must use substitute to embed the options into the code. There are
two different forms: eval(substitute(expression({...}), list(...))) for expressions, and
eval(substitute( function(...) { ... }, list(...) )) for functions.
The extra argument in linkinv, linkfun, deviance, loglikelihood, etc. matches with the
argument extra in vglm, vgam and rrvglm. This allows input to be fed into all slots of a VGAM
family function.
The expression derivative is evaluated immediately prior to weight, so there is provision for re-
use of variables etc. Programmers must be careful to choose variable names that do not interfere
with vglm.fit, vgam.fit() etc.
Programmers of VGAM family functions are encouraged to keep to previous conventions regarding
the naming of arguments, e.g., link is the argument for parameter link functions, zero for allowing
some of the linear/additive predictors to be an intercept term only, etc.
In general, Fisher-scoring is recommended over Newton-Raphson where tractable. Although usu-
ally slightly slower in convergence, the weight matrices from using the expected information are
positive-definite over a larger parameter space.

Author(s)

Thomas W. Yee

References

Yee, T. W. and Wild, C. J. (1996) Vector generalized additive models. Journal of the Royal Statisti-
cal Society, Series B, Methodological, 58, 481493.

See Also

vglm, vgam, rrvglm, rcim.

Examples
cratio()
cratio(link = "cloglog")
cratio(link = "cloglog", reverse = TRUE)
vonmises 791

vonmises von Mises Distribution Family Function

Description
Estimates the location and scale parameters of the von Mises distribution by maximum likelihood
estimation.

Usage
vonmises(llocation = extlogit(min = 0, max = 2 * pi), lscale = "loge",
ilocation = NULL, iscale = NULL, imethod = 1, zero = NULL)

Arguments
llocation, lscale
Parameter link functions applied to the location a parameter and scale parameter
k, respectively. See Links for more choices. For k, a log link is the default
because the parameter is positive.
ilocation Initial value for the location a parameter. By default, an initial value is cho-
sen internally using imethod. Assigning a value will override the argument
imethod.
iscale Initial value for the scale k parameter. By default, an initial value is chosen in-
ternally using imethod. Assigning a value will override the argument imethod.
imethod An integer with value 1 or 2 which specifies the initialization method. If failure
to converge occurs try the other value, or else specify a value for ilocation and
iscale.
zero An integer-valued vector specifying which linear/additive predictors are mod-
elled as intercepts only. The default is none of them. If used, one can choose
one value from the set {1,2}. See CommonVGAMffArguments for more informa-
tion.

Details
The (two-parameter) von Mises is the most commonly used distribution in practice for circular data.
It has a density that can be written as

exp[k cos(y a)]

f (y; a, k) =
2I0 (k)

where 0 y < 2, k > 0 is the scale parameter, a is the location parameter, and I0 (k) is the
modified Bessel function of order 0 evaluated at k. The mean of Y (which is the fitted value) is a
and the circular variance is 1 I1 (k)/I0 (k) where I1 (k) is the modified Bessel function of order 1.
By default, 1 = log(a/(2 a)) and 2 = log(k) for this family function.
792 vonmises

Value

An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, rrvglm and vgam.

Warning

Numerically, the von Mises can be difficult to fit because of a log-likelihood having multiple maxi-
mums. The user is therefore encouraged to try different starting values, i.e., make use of ilocation
and iscale.

Note

The response and the fitted values are scaled so that 0 y < 2. The linear/additive predictors are
left alone. Fisher scoring is used.

Author(s)

T. W. Yee

References

Forbes, C., Evans, M., Hastings, N. and Peacock, B. (2011) Statistical Distributions, Hoboken, NJ,
USA: John Wiley and Sons, Fourth edition.

See Also

Bessel, cardioid.
CircStats and circular currently have a lot more R functions for circular data than the VGAM
package.

Examples

vdata <- data.frame(x2 = runif(nn <- 1000))

vdata <- transform(vdata, y = rnorm(nn, m = 2+x2, sd = exp(0.2))) # Bad data!!
fit <- vglm(y ~ x2, vonmises(zero = 2), data = vdata, trace = TRUE)
coef(fit, matrix = TRUE)
Coef(fit)
with(vdata, range(y)) # Original data
range(depvar(fit)) # Processed data is in [0,2*pi)
vsmooth.spline 793

vsmooth.spline Vector cubic smoothing spline

Description
Fits a vector cubic smoothing spline.

Usage
vsmooth.spline(x, y, w = NULL, df = rep(5, M), spar = NULL,
i.constraint = diag(M),
x.constraint = diag(M),
constraints = list("(Intercepts)" = i.constraint,
x = x.constraint),
all.knots = FALSE, var.arg = FALSE, scale.w = TRUE,
nk = NULL, control.spar = list())

Arguments
x A vector, matrix or a list. If a list, the x component is used. If a matrix, the first
column is used. x may also be a complex vector, in which case the real part is
used, and the imaginary part is used for the response. In this help file, n is the
number of unique values of x.
y A vector, matrix or a list. If a list, the y component is used. If a matrix, all but
the first column is used. In this help file, M is the number of columns of y if there
are no constraints on the functions.
w The weight matrices or the number of observations. If the weight matrices, then
this must be a n-row matrix with the elements in matrix-band form (see iam). If
a vector, then these are the number of observations. By default, w is the M by M
identity matrix, denoted by matrix(1, n, M).
df Numerical vector containing the degrees of freedom for each component func-
tion (smooth). If necessary, the vector is recycled to have length equal to the
number of component functions to be estimated (M if there are no constraints),
which equals the number of columns of the x-constraint matrix. A value of 2
means a linear fit, and each element of df should lie between 2 and n. The larger
the values of df the more wiggly the smooths.
spar Numerical vector containing the non-negative smoothing parameters for each
component function (smooth). If necessary, the vector is recycled to have length
equal to the number of component functions to be estimated (M if there are no
constraints), which equals the number of columns of the x-constraint matrix. A
value of zero means the smooth goes through the data and hence is wiggly. A
value of Inf may be assigned, meaning the smooth will be linear. By default,
the NULL value of spar means df is used to determine the smoothing parameters.
all.knots Logical. If TRUE then each distinct value of x will be a knot. By default, only
a subset of the unique values of x are used; typically, the number of knots is
O(n^0.25) for n large, but if n <= 40 then all the unique values of x are used.
794 vsmooth.spline

i.constraint A M-row constraint matrix for the intercepts. It must be of full column rank.
By default, the constraint matrix for the intercepts is the M by M identity matrix,
meaning no constraints.
x.constraint A M-row constraint matrix for x. It must be of full column rank. By default,
the constraint matrix for the intercepts is the M by M identity matrix, meaning no
constraints.
constraints An alternative to specifying i.constraint and x.constraint, this is a list with
two components corresponding to the intercept and x respectively. They must
both be a M-row constraint matrix with full column rank.
var.arg Logical: return the pointwise variances of the fit? Currently, this corresponds
only to the nonlinear part of the fit, and may be wrong.
scale.w Logical. By default, the weights w are scaled so that the diagonal elements have
mean 1.
nk Number of knots. If used, this argument overrides all.knots, and must lie
between 6 and n+2 inclusive.
control.spar See smooth.spline.

Details
The algorithm implemented is detailed in Yee (2000). It involves decomposing the component
functions into a linear and nonlinear part, and using B-splines. The cost of the computation is
O(n M^3).
The argument spar contains scaled smoothing parameters.

Value
An object of class "vsmooth.spline" (see vsmooth.spline-class).

WARNING
See vgam for information about an important bug.

Note
This function is quite similar to smooth.spline but offers less functionality. For example, cross
validation is not implemented here. For M = 1, the results will be generally different, mainly due to
the different way the knots are selected.
The vector cubic smoothing spline which s() represents is computationally demanding for large
M . The cost is approximately O(nM 3 ) where n is the number of unique abscissae.
Yet to be done: return the unscaled smoothing parameters.

Author(s)
Thomas W. Yee
waitakere 795

References
Yee, T. W. (2000) Vector Splines and Other Vector Smoothers. Pages 529534. In: Bethlehem,
J. G. and van der Heijde, P. G. M. Proceedings in Computational Statistics COMPSTAT 2000.
Heidelberg: Physica-Verlag.

See Also
vsmooth.spline-class, plot.vsmooth.spline, predict.vsmooth.spline, iam, sm.os, s, smooth.spline.

Examples
nn <- 20; x <- 2 + 5*(nn:1)/nn
x[2:4] <- x[5:7] # Allow duplication
y1 <- sin(x) + rnorm(nn, sd = 0.13)
y2 <- cos(x) + rnorm(nn, sd = 0.13)
y3 <- 1 + sin(x) + rnorm(nn, sd = 0.13) # Run this for constraints
y <- cbind(y1, y2, y3)
ww <- cbind(rep(3, nn), 4, (1:nn)/nn)

(fit <- vsmooth.spline(x, y, w = ww, df = 5))

## Not run:
plot(fit) # The 1st and 3rd functions do not differ by a constant

## End(Not run)

mat <- matrix(c(1,0,1, 0,1,0), 3, 2)

(fit2 <- vsmooth.spline(x, y, w = ww, df = 5, i.constr = mat,
x.constr = mat))
# The 1st and 3rd functions do differ by a constant:
mycols <- c("orange", "blue", "orange")
## Not run: plot(fit2, lcol = mycols, pcol = mycols, las = 1)

p <- predict(fit, x = model.matrix(fit, type = "lm"), deriv = 0)

max(abs(depvar(fit) - with(p, y))) # Should be 0; and fit@y is not good

par(mfrow = c(3, 1))

ux <- seq(1, 8, len = 100)
for (dd in 1:3) {
pp <- predict(fit, x = ux, deriv = dd)
## Not run: with(pp, matplot(x, y, type = "l", main = paste("deriv =", dd),
lwd = 2, ylab = "", cex.axis = 1.5,
cex.lab = 1.5, cex.main = 1.5))
## End(Not run)
}

waitakere Waitakere Ranges Data

796 waitakere

Description
The waitakere data frame has 579 rows and 18 columns. Altitude is explanatory, and there are
binary responses (presence/absence = 1/0 respectively) for 17 plant species.

Usage
data(waitakere)

Format
This data frame contains the following columns:

agaaus Agathis australis, or Kauri

beitaw Beilschmiedia tawa, or Tawa
corlae Corynocarpus laevigatus
cyadea Cyathea dealbata
cyamed Cyathea medullaris
daccup Dacrydium cupressinum
dacdac Dacrycarpus dacrydioides
eladen Elaecarpus dentatus
hedarb Hedycarya arborea
hohpop Species name unknown
kniexc Knightia excelsa, or Rewarewa
kuneri Kunzea ericoides
lepsco Leptospermum scoparium
metrob Metrosideros robusta
neslan Nestegis lanceolata
rhosap Rhopalostylis sapida
vitluc Vitex lucens, or Puriri
altitude meters above sea level

Details
These were collected from the Waitakere Ranges, a small forest in northern Auckland, New Zealand.
At 579 sites in the forest, the presence/absence of 17 plant species was recorded, as well as the alti-
tude. Each site was of area size 200m2 .

Source
Dr Neil Mitchell, University of Auckland.

waldff Wald Distribution Family Function

Description
Estimates the parameter of the standard Wald distribution by maximum likelihood estimation.

Usage
waldff(llambda = "loge", ilambda = NULL)

Arguments
llambda,ilambda
See CommonVGAMffArguments for information.

Details
The standard Wald distribution is a special case of the inverse Gaussian distribution with = 1. It
has a density that can be written as
p
f (y; ) = /(2y 3 ) exp (y 1)2 /(2y)

where y > 0 and > 0. The mean of Y is 1 (returned as the fitted values) and its variance is 1/.
By default, = log().

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, and vgam.

Note
The VGAM family function inv.gaussianff estimates the location parameter too.

Author(s)
T. W. Yee

References
Johnson, N. L. and Kotz, S. and Balakrishnan, N. (1994) Continuous Univariate Distributions, 2nd
edition, Volume 1, New York: Wiley.
798 weibull.mean

weibull.mean Weibull Distribution Family Function, Parameterized by the Mean

Description
Maximum likelihood estimation of the 2-parameter Weibull distribution. The mean is one of the
parameters. No observations should be censored.

Usage
weibull.mean(lmean = "loge", lshape = "loge", imean = NULL,
ishape = NULL, probs.y = c(0.2, 0.5, 0.8),
imethod = 1, zero = "shape")

Arguments
lmean, lshape Parameter link functions applied to the (positive) mean parameter (called mu
below) and (positive) shape parameter (called a below). See Links for more
choices.
imean, ishape Optional initial values for the mean and shape parameters.
imethod, zero, probs.y
Details at CommonVGAMffArguments.

Details
See weibullR for most of the details for this family function too. The mean of Y is b (1 + 1/a)
(returned as the fitted values), and this is the first parameter (a loge link is the default because it is
positive). The other parameter is the positive shape paramter a, also having a default loge link.
This VGAM family function currently does not handle censored data. Fisher scoring is used to
estimate the two parameters. Although the expected information matrices used here are valid in
all regions of the parameter space, the regularity conditions for maximum likelihood estimation are
satisfied only if a > 2 (according to Kleiber and Kotz (2003)). If this is violated then a warning
message is issued. One can enforce a > 2 by choosing lshape = logoff(offset = -2).
Common values of the shape parameter lie between 0.5 and 3.5.
weibullR 799

Value

An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, and vgam.

Note

See weibullR for more details. This VGAM family function handles multiple responses.

Author(s)

T. W. Yee

See Also

weibullR, dweibull, truncweibull, gev, lognormal, expexpff, maxwell, rayleigh, gumbelII.

Examples
wdata <- data.frame(x2 = runif(nn <- 1000)) # Complete data
wdata <- transform(wdata, mu = exp(-1 + 1 * x2),
x3 = rnorm(nn),
shape1 = exp(1),
shape2 = exp(2))
wdata <- transform(wdata,
y1 = rweibull(nn, shape = shape1, scale = mu / gamma(1 + 1/shape1)),
y2 = rweibull(nn, shape = shape2, scale = mu / gamma(1 + 1/shape2)))
fit <- vglm(cbind(y1, y2) ~ x2 + x3, weibull.mean, data = wdata, trace = TRUE)
coef(fit, matrix = TRUE)
sqrt(diag(vcov(fit))) # SEs
summary(fit, presid = FALSE)

weibullR Weibull Distribution Family Function

Description

Maximum likelihood estimation of the 2-parameter Weibull distribution. No observations should

be censored.

Usage

weibullR(lscale = "loge", lshape = "loge",

iscale = NULL, ishape = NULL, lss = TRUE, nrfs = 1,
probs.y = c(0.2, 0.5, 0.8), imethod = 1, zero = "shape")
800 weibullR

Arguments
lshape, lscale Parameter link functions applied to the (positive) shape parameter (called a
below) and (positive) scale parameter (called b below). See Links for more
choices.
ishape, iscale Optional initial values for the shape and scale parameters.
nrfs Currently this argument is ignored. Numeric, of length one, with value in [0, 1].
Weighting factor between Newton-Raphson and Fisher scoring. The value 0
means pure Newton-Raphson, while 1 means pure Fisher scoring. The default
value uses a mixture of the two algorithms, and retaining positive-definite work-
ing weights.
imethod Initialization method used if there are censored observations. Currently only the
values 1 and 2 are allowed.
zero, probs.y, lss
Details at CommonVGAMffArguments.

Details
The Weibull density for a response Y is

f (y; a, b) = ay a1 exp[(y/b)a ]/(ba )

for a > 0, b > 0, y > 0. The cumulative distribution function is

F (y; a, b) = 1 exp[(y/b)a ].

The mean of Y is b (1 + 1/a) (returned as the fitted values), and the mode is at b (1 1/a)1/a
when a > 1. The density is unbounded for a < 1. The kth moment about the origin is E(Y k ) =
bk (1 + k/a). The hazard function is ata1 /ba .
This VGAM family function currently does not handle censored data. Fisher scoring is used to
estimate the two parameters. Although the expected information matrices used here are valid in
all regions of the parameter space, the regularity conditions for maximum likelihood estimation are
satisfied only if a > 2 (according to Kleiber and Kotz (2003)). If this is violated then a warning
message is issued. One can enforce a > 2 by choosing lshape = logoff(offset = -2).
Common values of the shape parameter lie between 0.5 and 3.5.
Summarized in Harper et al. (2011), for inference, there are 4 cases to consider. If a 1 then the
MLEs are not consistent (and the smallest observation becomes a hyperefficient solution for the lo-
cation parameter in the 3-parameter case). If 1 < a < 2 then MLEs exist but are not asymptotically
normal. If a = 2 then the MLEs exist and are normal and asymptotically efficient but with a slower
convergence rate than when a > 2. If a > 2 then MLEs have classical asymptotic properties.
The 3-parameter (location is the third parameter) Weibull can be estimated by maximizing a profile
log-likelihood (see, e.g., Harper et al. (2011) and Lawless (2003)), else try gev which is a better
parameterization.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, and vgam.
weibullR 801

Warning

This function is under development to handle other censoring situations. The version of this function
which will handle censored data will be called cenweibull(). It is currently being written and will
use SurvS4 as input. It should be released in later versions of VGAM.
If the shape parameter is less than two then misleading inference may result, e.g., in the summary
and vcov of the object.

Note

Successful convergence depends on having reasonably good initial values. If the initial values
chosen by this function are not good, make use the two initial value arguments.
This VGAM family function handles multiple responses.
The Weibull distribution is often an alternative to the lognormal distribution. The inverse Weibull
distribution, which is that of 1/Y where Y has a Weibull(a, b) distribution, is known as the log-
Gompertz distribution.
There are problems implementing the three-parameter Weibull distribution. These are because the
classical regularity conditions for the asymptotic properties of the MLEs are not satisfied because
the support of the distribution depends on one of the parameters.
Other related distributions are the Maxwell and Rayleigh distributions.

Author(s)

T. W. Yee

References

Kleiber, C. and Kotz, S. (2003) Statistical Size Distributions in Economics and Actuarial Sciences,
Hoboken, NJ, USA: Wiley-Interscience.
Johnson, N. L. and Kotz, S. and Balakrishnan, N. (1994) Continuous Univariate Distributions, 2nd
edition, Volume 1, New York: Wiley.
Lawless, J. F. (2003) Statistical Models and Methods for Lifetime Data, 2nd ed. Hoboken, NJ,
USA: John Wiley & Sons.
Rinne, Horst. (2009) The Weibull Distribution: A Handbook. Boca Raton, FL, USA: CRC Press.
Gupta, R. D. and Kundu, D. (2006) On the comparison of Fisher information of the Weibull and
GE distributions, Journal of Statistical Planning and Inference, 136, 31303144.
Harper, W. V. and Eschenbach, T. G. and James, T. R. (2011) Concerns about Maximum Likelihood
Estimation for the Three-Parameter Weibull Distribution: Case Study of Statistical Software, The
American Statistician, 65(1), 4454.
Smith, R. L. (1985) Maximum likelihood estimation in a class of nonregular cases. Biometrika, 72,
6790.
Smith, R. L. and Naylor, J. C. (1987) A comparison of maximum likelihood and Bayesian estimators
for the three-parameter Weibull distribution. Applied Statistics, 36, 358369.
802 weightsvglm

See Also

weibull.mean, dweibull, truncweibull, gev, lognormal, expexpff, maxwell, rayleigh, gumbelII.

Examples
wdata <- data.frame(x2 = runif(nn <- 1000)) # Complete data
wdata <- transform(wdata,
y1 = rweibull(nn, shape = exp(1), scale = exp(-2 + x2)),
y2 = rweibull(nn, shape = exp(2), scale = exp( 1 - x2)))
fit <- vglm(cbind(y1, y2) ~ x2, weibullR, data = wdata, trace = TRUE)
coef(fit, matrix = TRUE)
vcov(fit)
summary(fit)

weightsvglm Prior and Working Weights of a VGLM fit

Description

Returns either the prior weights or working weights of a VGLM object.

Usage

weightsvglm(object, type = c("prior", "working"),

matrix.arg = TRUE, ignore.slot = FALSE,
deriv.arg = FALSE, ...)

Arguments

object a model object from the VGAM R package that inherits from a vector general-
ized linear model (VGLM), e.g., a model of class "vglm".
type Character, which type of weight is to be returned? The default is the first one.
matrix.arg Logical, whether the answer is returned as a matrix. If not, it will be a vector.
ignore.slot Logical. If TRUE then object@weights is ignored even if it has been assigned,
and the long calculation for object@weights is repeated. This may give a
slightly different answer because of the final IRLS step at convergence may
or may not assign the latest value of quantities such as the mean and weights.
deriv.arg Logical. If TRUE then a list with components deriv and weights is returned.
See below for more details.
... Currently ignored.
weightsvglm 803

Details
Prior weights are usually inputted with the weights argument in functions such as vglm and vgam.
It may refer to frequencies of the individual data or be weight matrices specified beforehand.
Working weights are used by the IRLS algorithm. They correspond to the second derivatives of the
log-likelihood function with respect to the linear predictors. The working weights correspond to
positive-definite weight matrices and are returned in matrix-band form, e.g., the first M columns
correspond to the diagonals, etc.

Value
If type = "working" and deriv = TRUE then a list is returned with the two components described
below. Otherwise the prior or working weights are returned depending on the value of type.

deriv Typically the first derivative of the log-likelihood with respect to the linear pre-
dictors. For example, this is the variable deriv.mu in vglm.fit(), or equiva-
lently, the matrix returned in the "deriv" slot of a VGAM family function.
weights The working weights.

Note
This function is intended to be similar to weights.glm (see glm).

Author(s)
Thomas W. Yee

References
Yee, T. W. and Hastie, T. J. (2003) Reduced-rank vector generalized linear models. Statistical
Modelling, 3, 1541.
Chambers, J. M. and T. J. Hastie (eds) (1992) Statistical Models in S. Wadsworth & Brooks/Cole.

# Look at the working residuals

nn <- nrow(model.matrix(fit, type = "lm"))
M <- ncol(predict(fit))

wwt <- weights(fit, type = "working", deriv = TRUE) # In matrix-band format

804 wine

wz <- m2a(wwt$weights, M = M) # In array format

wzinv <- array(apply(wz, 3, solve), c(M, M, nn))
wresid <- matrix(NA, nn, M) # Working residuals
for (ii in 1:nn)
wresid[ii, ] <- wzinv[, , ii, drop = TRUE] %*% wwt$deriv[ii, ]
max(abs(c(resid(fit, type = "work")) - c(wresid))) # Should be 0

(zedd <- predict(fit) + wresid) # Adjusted dependent vector

wine Bitterness in Wine Data

Description
This oenological data frame concerns the amount of bitterness in 78 bottles of white wine.

Usage
data(wine)

Format
A data frame with 4 rows on the following 7 variables.
temp temperature, with levels cold and warm.
contact whether contact of the juice with the skin was allowed or avoided, for a specified period.
Two levels: no or yes.
bitter1, bitter2, bitter3, bitter4, bitter5 numeric vectors, the counts. The order is none to most
intense.

Details
The data set comes from Randall (1989) and concerns a factorial experiment for investigating factors
that affect the bitterness of white wines. There are two factors in the experiment: temperature at
the time of crushing the grapes and contact of the juice with the skin. Two bottles of wine were
fermented for each of the treatment combinations. A panel of 9 judges were selected and trained
for the ability to detect bitterness. Thus there were 72 bottles in total. Originally, the bitterness of
the wine were taken on a continuous scale in the interval from 0 (none) to 100 (intense) but later
they were grouped using equal lengths into five ordered categories 1, 2, 3, 4 and 5.

Source
Christensen, R. H. B. (2013) Analysis of ordinal data with cumulative link modelsestimation with
the R-package ordinal. R Package Version 2013.9-30. https://CRAN.R-project.org/package=
ordinal.
Randall, J. H. (1989) The analysis of sensory data by generalized linear model. Biometrical Journal
31(7), 781793.
Kosmidis, I. (2014) Improved estimation in cumulative link models. Journal of the Royal Statistical
Society, Series B, Methodological, 76, in press.
wrapup.smart 805

Examples
wine
summary(wine)

wrapup.smart Cleans Up After Smart Prediction

Description
wrapup.smart deletes any variables used by smart prediction. Needed by both the modelling func-
tion and the prediction function.

Usage
wrapup.smart()

Details
The variables to be deleted are .smart.prediction, .smart.prediction.counter, and .smart.prediction.mode.
The function wrapup.smart is useful in R because these variables are held in smartpredenv.

yeo.johnson Yeo-Johnson Transformation

Description
Computes the Yeo-Johnson transformation, which is a normalizing transformation.

Usage
yeo.johnson(y, lambda, derivative = 0,
epsilon = sqrt(.Machine$double.eps), inverse = FALSE)
806 yeo.johnson

Arguments
y Numeric, a vector or matrix.
lambda Numeric. It is recycled to the same length as y if necessary.
derivative Non-negative integer. The default is the ordinary function evaluation, otherwise
the derivative with respect to lambda.
epsilon Numeric and positive value. The tolerance given to values of lambda when
comparing it to 0 or 2.
inverse Logical. Return the inverse transformation?

Details
The Yeo-Johnson transformation can be thought of as an extension of the Box-Cox transformation.
It handles both positive and negative values, whereas the Box-Cox transformation only handles
positive values. Both can be used to transform the data so as to improve normality. They can be
used to perform LMS quantile regression.

Value
The Yeo-Johnson transformation or its inverse, or its derivatives with respect to lambda, of y.

Note
If inverse = TRUE then the argument derivative = 0 is required.

Author(s)
Thomas W. Yee

yip88 Zero-Inflated Poisson Distribution (Yip (1988) algorithm)

Description
Fits a zero-inflated Poisson distribution based on Yip (1988).

Usage
yip88(link = "loge", n.arg = NULL, imethod = 1)

Arguments
link Link function for the usual parameter. See Links for more choices.
n.arg The total number of observations in the data set. Needed when the response
variable has all the zeros deleted from it, so that the number of zeros can be
determined.
imethod Details at CommonVGAMffArguments.

Details
The method implemented here, Yip (1988), maximizes a conditional likelihood. Consequently, the
methodology used here deletes the zeros from the data set, and is thus related to the positive Poisson
distribution (where P (Y = 0) = 0).
The probability function of Y is 0 with probability , and Poisson() with probability 1 . Thus

P (Y = 0) = + (1 )P (W = 0)

where W is Poisson(). The mean, (1 ), can be obtained by the extractor function fitted
applied to the object.
This family function treats as a scalar. If you want to model both and as a function of
covariates, try zipoisson.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, rrvglm and vgam.
808 yip88

Warning
Under- or over-flow may occur if the data is ill-conditioned. Yip (1988) only considered being a
scalar and not modelled as a function of covariates. To get around this limitation, try zipoisson.
Inference obtained from summary.vglm and summary.vgam may or may not be correct. In particu-
lar, the p-values, standard errors and degrees of freedom may need adjustment. Use simulation on
artificial data to check that these are reasonable.

Note
The data may be inputted in two ways. The first is when the response is a vector of positive values,
with the argument n in yip88 specifying the total number of observations. The second is simply
include all the data in the response. In this case, the zeros are trimmed off during the computation,
and the x and y slots of the object, if assigned, will reflect this.
The estimate of is placed in the misc slot as @misc$pstr0. However, this estimate is computed
only for intercept models, i.e., the formula is of the form y ~ 1.

Author(s)
Thomas W. Yee

References
Yip, P. (1988) Inference about the mean of a Poisson distribution in the presence of a nuisance
parameter. The Australian Journal of Statistics, 30, 299306.
Angers, J-F. and Biswas, A. (2003) A Bayesian analysis of zero-inflated generalized Poisson model.
Computational Statistics & Data Analysis, 42, 3746.

See Also
zipoisson, Zipois, zapoisson, pospoisson, poissonff, dzipois.

Examples
phi <- 0.35; lambda <- 2 # Generate some artificial data
y <- rzipois(n <- 1000, lambda, phi)
table(y)

# Two equivalent ways of fitting the same model

fit1 <- vglm(y ~ 1, yip88(n = length(y)), subset = y > 0)
fit2 <- vglm(y ~ 1, yip88, trace = TRUE, crit = "coef")
(true.mean <- (1-phi) * lambda)
mean(y)
head(fitted(fit1))
fit1@misc$pstr0 # The estimate of phi

# Compare the ZIP with the positive Poisson distribution

pp <- vglm(y ~ 1, pospoisson, subset = y > 0, crit = "c")
coef(pp)
Coef(pp)
Yules 809

coef(fit1) - coef(pp) # Same

head(fitted(fit1) - fitted(pp)) # Different

# Another example (Angers and Biswas, 2003) ---------------------

abdata <- data.frame(y = 0:7, w = c(182, 41, 12, 2, 2, 0, 0, 1))
abdata <- subset(abdata, w > 0)

yy <- with(abdata, rep(y, w))

fit3 <- vglm(yy ~ 1, yip88(n = length(yy)), subset = yy > 0)
fit3@misc$pstr0 # Estimate of phi (they get 0.5154 with SE 0.0707)
coef(fit3, matrix = TRUE)
Coef(fit3) # Estimate of lambda (they get 0.6997 with SE 0.1520)
head(fitted(fit3))
mean(yy) # Compare this with fitted(fit3)

Yules Yule-Simon Distribution

Description
Density, distribution function, quantile function and random generation for the Yule-Simon distri-
bution.

Usage
dyules(x, shape, log = FALSE)
pyules(q, shape, lower.tail = TRUE, log.p = FALSE)
qyules(p, shape)
ryules(n, shape)

Arguments
x, q, p, n Same meaning as in Normal.
shape See yulesimon.
log, lower.tail, log.p
Same meaning as in pnorm or qnorm.

Details
See yulesimon, the VGAM family function for estimating the parameter, for the formula of the
probability density function and other details.

Value
dyules gives the density, pyules gives the distribution function, qyules gives the quantile function,
and ryules generates random deviates.
810 yulesimon

Note

Numerical problems may occur with qyules() when p is very close to 1.

Author(s)

T. W. Yee

See Also

yulesimon.

Examples
dyules(1:20, 2.1)
ryules(20, 2.1)

round(1000 * dyules(1:8, 2))

table(ryules(1000, 2))

## Not run: x <- 0:6

plot(x, dyules(x, shape = 2.2), type = "h", las = 1, col = "blue")
## End(Not run)

yulesimon Yule-Simon Family Function

Description

Estimating the shape parameter of the Yule-Simon distribution.

Usage

yulesimon(lshape = "loge", ishape = NULL, nsimEIM = 200, zero = NULL)

Arguments

lshape Link function for the shape parameter, called below. See Links for more
choices and for general information.
ishape Optional initial value for the (positive) parameter. See CommonVGAMffArguments
for more information. The default is to obtain an initial value internally. Use this
argument if the default fails.
nsimEIM, zero See CommonVGAMffArguments for more information.
Zabinom 811

Details
The probability function is
f (y; ) = beta(y, + 1),
where the parameter > 0, beta is the beta function, and y = 1, 2, . . .. The function dyules
computes this probability function. The mean of Y , which is returned as fitted values, is /( 1)
provided > 1. The variance of Y is 2 /(( 1)2 ( 2)) provided > 2.
The distribution was named after Udny Yule and Herbert A. Simon. Simon originally called it the
Yule distribution. This family function can handle multiple responses.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm and vgam.

Author(s)
T. W. Yee

References
Simon, H. A. (1955) On a class of skew distribution functions. Biometrika, 42, 425440.

Zabinom Zero-Altered Binomial Distribution

Description
Density, distribution function, quantile function and random generation for the zero-altered bino-
mial distribution with parameter pobs0.

Usage
dzabinom(x, size, prob, pobs0 = 0, log = FALSE)
pzabinom(q, size, prob, pobs0 = 0)
qzabinom(p, size, prob, pobs0 = 0)
rzabinom(n, size, prob, pobs0 = 0)
812 Zabinom

Arguments
x, q vector of quantiles.
p vector of probabilities.
n number of observations. If length(n) > 1 then the length is taken to be the
number required.
size, prob, log
Parameters from the ordinary binomial distribution (see dbinom).
pobs0 Probability of (an observed) zero, called pobs0. The default value of pobs0 = 0
corresponds to the response having a positive binomial distribution.

Details
The probability function of Y is 0 with probability pobs0, else a positive binomial(size, prob)
distribution.

Value
dzabinom gives the density and pzabinom gives the distribution function, qzabinom gives the quan-
tile function, and rzabinom generates random deviates.

Note
The argument pobs0 is recycled to the required length, and must have values which lie in the interval
[0, 1].

Author(s)
T. W. Yee

## Not run: x <- 0:10

barplot(rbind(dzabinom(x, size = size, prob = prob, pobs0 = pobs0),
dbinom(x, size = size, prob = prob)),
beside = TRUE, col = c("blue", "orange"), cex.main = 0.7, las = 1,
ylab = "Probability", names.arg = as.character(x),
main = paste("ZAB(size = ", size, ", prob = ", prob, ", pobs0 = ", pobs0,
") [blue] vs", " Binom(size = ", size, ", prob = ", prob,
") [orange] densities", sep = ""))
## End(Not run)
zabinomial 813

zabinomial Zero-Altered Binomial Distribution

Description
Fits a zero-altered binomial distribution based on a conditional model involving a Bernoulli distri-
bution and a positive-binomial distribution.

Usage
zabinomial(lpobs0 = "logit", lprob = "logit",
type.fitted = c("mean", "prob", "pobs0"),
ipobs0 = NULL, iprob = NULL, imethod = 1, zero = NULL)
zabinomialff(lprob = "logit", lonempobs0 = "logit",
type.fitted = c("mean", "prob", "pobs0", "onempobs0"),
iprob = NULL, ionempobs0 = NULL, imethod = 1, zero = "onempobs0")

Arguments
lprob Parameter link function applied to the probability parameter of the binomial
distribution. See Links for more choices.
lpobs0 Link function for the parameter p0 , called pobs0 here. See Links for more
choices.
type.fitted See CommonVGAMffArguments and fittedvlm for information.
iprob, ipobs0 See CommonVGAMffArguments.
lonempobs0, ionempobs0
Corresponding argument for the other parameterization. See details below.
imethod, zero See CommonVGAMffArguments.

Details
The response Y is zero with probability p0 , else Y has a positive-binomial distribution with prob-
ability 1 p0 . Thus 0 < p0 < 1, which may be modelled as a function of the covariates. The
zero-altered binomial distribution differs from the zero-inflated binomial distribution in that the
former has zeros coming from one source, whereas the latter has zeros coming from the binomial
distribution too. The zero-inflated binomial distribution is implemented in zibinomial. Some
people call the zero-altered binomial a hurdle model.
The input is currently a vector or one-column matrix. By default, the two linear/additive predictors
for zabinomial() are (logit(p0 ), log(p))T .
The VGAM family function zabinomialff() has a few changes compared to zabinomial().
These are: (i) the order of the linear/additive predictors is switched so the binomial probability
comes first; (ii) argument onempobs0 is now 1 minus the probability of an observed 0, i.e., the
probability of the positive binomial distribution, i.e., onempobs0 is 1-pobs0; (iii) argument zero
has a new default so that the onempobs0 is intercept-only by default. Now zabinomialff() is
generally recommended over zabinomial(). Both functions implement Fisher scoring and neither
can handle multiple responses.
814 zabinomial

Value

An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, and vgam.
The fitted.values slot of the fitted object, which should be extracted by the generic function
fitted, returns the mean (default) which is given by

= (1 p0 )b /[1 (1 b )N ]

where b is the usual binomial mean. If type.fitted = "pobs0" then p0 is returned.

Note

The response should be a two-column matrix of counts, with first column giving the number of
successes.
Note this family function allows p0 to be modelled as functions of the covariates by having zero = NULL.
It is a conditional model, not a mixture model.
These family functions effectively combine posbinomial and binomialff into one family func-
tion.

Author(s)

T. W. Yee

See Also

dzabinom, zibinomial, posbinomial, binomialff, dbinom, CommonVGAMffArguments.

Examples
zdata <- data.frame(x2 = runif(nn <- 1000))
zdata <- transform(zdata, size = 10,
prob = logit(-2 + 3*x2, inverse = TRUE),
pobs0 = logit(-1 + 2*x2, inverse = TRUE))
zdata <- transform(zdata,
y1 = rzabinom(nn, size = size, prob = prob, pobs0 = pobs0))
with(zdata, table(y1))

zfit <- vglm(cbind(y1, size - y1) ~ x2, zabinomial(zero = NULL),

data = zdata, trace = TRUE)
coef(zfit, matrix = TRUE)
head(fitted(zfit))
head(predict(zfit))
summary(zfit)
Zageom 815

Zageom Zero-Altered Geometric Distribution

Description
Density, distribution function, quantile function and random generation for the zero-altered geo-
metric distribution with parameter pobs0.

Usage
dzageom(x, prob, pobs0 = 0, log = FALSE)
pzageom(q, prob, pobs0 = 0)
qzageom(p, prob, pobs0 = 0)
rzageom(n, prob, pobs0 = 0)

Arguments
x, q vector of quantiles.
p vector of probabilities.
n number of observations. If length(n) > 1 then the length is taken to be the
number required.
prob, log Parameters from the ordinary geometric distribution (see dgeom).
pobs0 Probability of (an observed) zero, called pobs0. The default value of pobs0 = 0
corresponds to the response having a positive geometric distribution.

Details
The probability function of Y is 0 with probability pobs0, else a positive geometric(prob) distribu-
tion.

Value
dzageom gives the density and pzageom gives the distribution function, qzageom gives the quantile
function, and rzageom generates random deviates.

Note
The argument pobs0 is recycled to the required length, and must have values which lie in the interval
[0, 1].

Author(s)
T. W. Yee

See Also
zageometric, zigeometric, rposgeom.
816 zageometric

Examples
prob <- 0.35; pobs0 <- 0.05; x <- (-1):7
dzageom(x, prob = prob, pobs0 = pobs0)
table(rzageom(100, prob = prob, pobs0 = pobs0))

## Not run: x <- 0:10

barplot(rbind(dzageom(x, prob = prob, pobs0 = pobs0),
dgeom(x, prob = prob)),
beside = TRUE, col = c("blue", "orange"), cex.main = 0.7, las = 1,
ylab = "Probability", names.arg = as.character(x),
main = paste("ZAG(prob = ", prob, ", pobs0 = ", pobs0,
") [blue] vs", " Geometric(prob = ", prob,
") [orange] densities", sep = ""))
## End(Not run)

zageometric Zero-Altered Geometric Distribution

Description
Fits a zero-altered geometric distribution based on a conditional model involving a Bernoulli distri-
bution and a positive-geometric distribution.

Usage
zageometric(lpobs0 = "logit", lprob = "logit",
type.fitted = c("mean", "prob", "pobs0", "onempobs0"),
imethod = 1, ipobs0 = NULL, iprob = NULL, zero = NULL)
zageometricff(lprob = "logit", lonempobs0 = "logit",
type.fitted = c("mean", "prob", "pobs0", "onempobs0"),
imethod = 1, iprob = NULL, ionempobs0 = NULL, zero = "onempobs0")

Arguments
lpobs0 Link function for the parameter p0 or , called pobs0 or phi here. See Links
for more choices.
lprob Parameter link function applied to the probability of success, called prob or p.
See Links for more choices.
type.fitted See CommonVGAMffArguments and fittedvlm for information.
ipobs0, iprob Optional initial values for the parameters. If given, they must be in range. For
multi-column responses, these are recycled sideways.
lonempobs0, ionempobs0
Corresponding argument for the other parameterization. See details below.
zero, imethod See CommonVGAMffArguments.
zageometric 817

Details
The response Y is zero with probability p0 , or Y has a positive-geometric distribution with proba-
bility 1 p0 . Thus 0 < p0 < 1, which is modelled as a function of the covariates. The zero-altered
geometric distribution differs from the zero-inflated geometric distribution in that the former has
zeros coming from one source, whereas the latter has zeros coming from the geometric distribution
too. The zero-inflated geometric distribution is implemented in the VGAM package. Some people
call the zero-altered geometric a hurdle model.
The input can be a matrix (multiple responses). By default, the two linear/additive predictors of
zageometric are (logit(), logit(p))T .
The VGAM family function zageometricff() has a few changes compared to zageometric().
These are: (i) the order of the linear/additive predictors is switched so the geometric probability
comes first; (ii) argument onempobs0 is now 1 minus the probability of an observed 0, i.e., the
probability of the positive geometric distribution, i.e., onempobs0 is 1-pobs0; (iii) argument zero
has a new default so that the pobs0 is intercept-only by default. Now zageometricff() is gener-
ally recommended over zageometric(). Both functions implement Fisher scoring and can handle
multiple responses.

= (1 )/p.

If type.fitted = "pobs0" then p0 is returned.

Warning
Convergence for this VGAM family function seems to depend quite strongly on providing good
initial values.
Inference obtained from summary.vglm and summary.vgam may or may not be correct. In particu-
lar, the p-values, standard errors and degrees of freedom may need adjustment. Use simulation on
artificial data to check that these are reasonable.

Note
Note this family function allows p0 to be modelled as functions of the covariates. It is a conditional
model, not a mixture model.
This family function effectively combines binomialff and posgeometric() and geometric into
one family function. However, posgeometric() is not written because it is trivially related to
geometric.

Author(s)
T. W. Yee
818 Zanegbin

See Also
dzageom, geometric, zigeometric, dgeom, CommonVGAMffArguments, simulate.vlm.

Examples
zdata <- data.frame(x2 = runif(nn <- 1000))
zdata <- transform(zdata, pobs0 = logit(-1 + 2*x2, inverse = TRUE),
prob = logit(-2 + 3*x2, inverse = TRUE))
zdata <- transform(zdata, y1 = rzageom(nn, prob = prob, pobs0 = pobs0),
y2 = rzageom(nn, prob = prob, pobs0 = pobs0))
with(zdata, table(y1))

fit <- vglm(cbind(y1, y2) ~ x2, zageometric, data = zdata, trace = TRUE)
coef(fit, matrix = TRUE)
head(fitted(fit))
head(predict(fit))
summary(fit)

Zanegbin Zero-Altered Negative Binomial Distribution

Description
Density, distribution function, quantile function and random generation for the zero-altered negative
binomial distribution with parameter pobs0.

Usage
dzanegbin(x, size, prob = NULL, munb = NULL, pobs0 = 0, log = FALSE)
pzanegbin(q, size, prob = NULL, munb = NULL, pobs0 = 0)
qzanegbin(p, size, prob = NULL, munb = NULL, pobs0 = 0)
rzanegbin(n, size, prob = NULL, munb = NULL, pobs0 = 0)

Arguments
x, q vector of quantiles.
p vector of probabilities.
n number of observations. If length(n) > 1 then the length is taken to be the
number required.
size, prob, munb, log
Parameters from the ordinary negative binomial distribution (see dnbinom). Some
arguments have been renamed slightly.
pobs0 Probability of zero, called pobs0. The default value of pobs0 = 0 corresponds
to the response having a positive negative binomial distribution.
zanegbinomial 819

Details
The probability function of Y is 0 with probability pobs0, else a positive negative binomial(nb ,
size) distribution.

Value
dzanegbin gives the density and pzanegbin gives the distribution function, qzanegbin gives the
quantile function, and rzanegbin generates random deviates.

Note
The argument pobs0 is recycled to the required length, and must have values which lie in the interval
[0, 1].

Author(s)
T. W. Yee

## Not run: x <- 0:10

barplot(rbind(dzanegbin(x, munb = munb, size = size, pobs0 = pobs0),
dnbinom(x, mu = munb, size = size)),
beside = TRUE, col = c("blue", "green"), cex.main = 0.7, las = 1,
ylab = "Probability", names.arg = as.character(x),
main = paste("ZANB(munb = ", munb, ", size = ", size,",
pobs0 = ", pobs0,
") [blue] vs", " NB(mu = ", munb, ", size = ", size,
") [green] densities", sep = ""))
## End(Not run)

zanegbinomial Zero-Altered Negative Binomial Distribution

Description
Fits a zero-altered negative binomial distribution based on a conditional model involving a binomial
distribution and a positive-negative binomial distribution.
820 zanegbinomial

Usage
zanegbinomial(zero = "size", type.fitted = c("mean", "munb", "pobs0"),
mds.min = 1e-3, nsimEIM = 500, cutoff.prob = 0.999,
eps.trig = 1e-7, max.support = 4000, max.chunk.MB = 30,
lpobs0 = "logit", lmunb = "loge", lsize = "loge",
imethod = 1, ipobs0 = NULL,
imunb = NULL, iprobs.y = NULL, gprobs.y = (0:9)/10,
isize = NULL, gsize.mux = exp(c(-30, -20, -15, -10, -6:3)))
zanegbinomialff(lmunb = "loge", lsize = "loge", lonempobs0 = "logit",
type.fitted = c("mean", "munb", "pobs0", "onempobs0"),
isize = NULL, ionempobs0 = NULL, zero = c("size",
"onempobs0"), mds.min = 1e-3, iprobs.y = NULL, gprobs.y = (0:9)/10,
cutoff.prob = 0.999, eps.trig = 1e-7, max.support = 4000,
max.chunk.MB = 30, gsize.mux = exp(c(-30, -20, -15, -10, -6:3)),
imethod = 1, imunb = NULL,
nsimEIM = 500)

Arguments
lpobs0 Link function for the parameter p0 , called pobs0 here. See Links for more
choices.
lmunb Link function applied to the munb parameter, which is the mean nb of an ordi-
nary negative binomial distribution. See Links for more choices.
lsize Parameter link function applied to the reciprocal of the dispersion parameter,
called k. That is, as k increases, the variance of the response decreases. See
Links for more choices.
type.fitted See CommonVGAMffArguments and fittedvlm for information.
lonempobs0, ionempobs0
Corresponding argument for the other parameterization. See details below.
ipobs0, imunb, isize
Optional initial values for p0 and munb and k. If given then it is okay to give
one value for each response/species by inputting a vector whose length is the
number of columns of the response matrix.
zero Specifies which of the three linear predictors are modelled as intercept-only.
All parameters can be modelled as a function of the explanatory variables by
setting zero = NULL (not recommended). A negative value means that the value
is recycled, e.g., setting 3 means all k are intercept-only for zanegbinomial.
See CommonVGAMffArguments for more information.
nsimEIM, imethod
See CommonVGAMffArguments.
iprobs.y, gsize.mux, gprobs.y
See negbinomial.
cutoff.prob, eps.trig
See negbinomial.
mds.min, max.support, max.chunk.MB
See negbinomial.
zanegbinomial 821

Details
The response Y is zero with probability p0 , or Y has a positive-negative binomial distribution with
probability 1 p0 . Thus 0 < p0 < 1, which is modelled as a function of the covariates. The zero-
altered negative binomial distribution differs from the zero-inflated negative binomial distribution
in that the former has zeros coming from one source, whereas the latter has zeros coming from the
negative binomial distribution too. The zero-inflated negative binomial distribution is implemented
in the VGAM package. Some people call the zero-altered negative binomial a hurdle model.
For one response/species, by default, the three linear/additive predictors for zanegbinomial() are
(logit(p0 ), log(nb ), log(k))T . This vector is recycled for multiple species.
The VGAM family function zanegbinomialff() has a few changes compared to zanegbinomial().
These are: (i) the order of the linear/additive predictors is switched so the negative binomial mean
comes first; (ii) argument onempobs0 is now 1 minus the probability of an observed 0, i.e., the
probability of the positive negative binomial distribution, i.e., onempobs0 is 1-pobs0; (iii) argument
zero has a new default so that the pobs0 is intercept-only by default. Now zanegbinomialff() is
generally recommended over zanegbinomial(). Both functions implement Fisher scoring and can
handle multiple responses.

= (1 p0 )nb /[1 (k/(k + nb ))k ].

If type.fitted = "pobs0" then p0 is returned.

Warning
This family function is fragile; it inherits the same difficulties as posnegbinomial. Convergence
for this VGAM family function seems to depend quite strongly on providing good initial values.
This VGAM family function is computationally expensive and usually runs slowly; setting trace = TRUE
is useful for monitoring convergence.
Inference obtained from summary.vglm and summary.vgam may or may not be correct. In particu-
lar, the p-values, standard errors and degrees of freedom may need adjustment. Use simulation on
artificial data to check that these are reasonable.

Note
Note this family function allows p0 to be modelled as functions of the covariates provided zero
is set correctly. It is a conditional model, not a mixture model. Simulated Fisher scoring is the
algorithm.
This family function effectively combines posnegbinomial and binomialff into one family func-
tion.
This family function can handle multiple responses, e.g., more than one species.
822 Zapois

Author(s)
T. W. Yee

References
Welsh, A. H., Cunningham, R. B., Donnelly, C. F. and Lindenmayer, D. B. (1996) Modelling the
abundances of rare species: statistical models for counts with extra zeros. Ecological Modelling,
88, 297308.
Yee, T. W. (2014) Reduced-rank vector generalized linear models with two linear predictors. Com-
putational Statistics and Data Analysis, 71, 889902.

See Also
dzanegbin, posnegbinomial, negbinomial, binomialff, rposnegbin, zinegbinomial, zipoisson,
dnbinom, CommonVGAMffArguments, simulate.vlm.

Examples
## Not run:
zdata <- data.frame(x2 = runif(nn <- 2000))
zdata <- transform(zdata, pobs0 = logit(-1 + 2*x2, inverse = TRUE))
zdata <- transform(zdata,
y1 = rzanegbin(nn, munb = exp(0+2*x2), size = exp(1), pobs0 = pobs0),
y2 = rzanegbin(nn, munb = exp(1+2*x2), size = exp(1), pobs0 = pobs0))
with(zdata, table(y1))
with(zdata, table(y2))

fit <- vglm(cbind(y1, y2) ~ x2, zanegbinomial, data = zdata, trace = TRUE)
coef(fit, matrix = TRUE)
head(fitted(fit))
head(predict(fit))

## End(Not run)

Zapois Zero-Altered Poisson Distribution

Description
Density, distribution function, quantile function and random generation for the zero-altered Poisson
distribution with parameter pobs0.

Usage
dzapois(x, lambda, pobs0 = 0, log = FALSE)
pzapois(q, lambda, pobs0 = 0)
qzapois(p, lambda, pobs0 = 0)
rzapois(n, lambda, pobs0 = 0)
Zapois 823

Arguments
x, q vector of quantiles.
p vector of probabilities.
n number of observations. If length(n) > 1 then the length is taken to be the
number required.
lambda Vector of positive means.
pobs0 Probability of zero, called pobs0. The default value of pobs0 = 0 corresponds
to the response having a positive Poisson distribution.
log Logical. Return the logarithm of the answer?

Details
The probability function of Y is 0 with probability pobs0, else a positive P oisson().

Value
dzapois gives the density, pzapois gives the distribution function, qzapois gives the quantile
function, and rzapois generates random deviates.

Note
The argument pobs0 is recycled to the required length, and must have values which lie in the interval
[0, 1].

Author(s)
T. W. Yee

## Not run: x <- 0:10

barplot(rbind(dzapois(x, lambda, pobs0), dpois(x, lambda)),
beside = TRUE, col = c("blue", "green"), las = 1,
main = paste("ZAP(", lambda, ", pobs0 = ", pobs0, ") [blue] vs",
" Poisson(", lambda, ") [green] densities", sep = ""),
names.arg = as.character(x), ylab = "Probability")
## End(Not run)
824 zapoisson

zapoisson Zero-Altered Poisson Distribution

Description
Fits a zero-altered Poisson distribution based on a conditional model involving a Bernoulli distribu-
tion and a positive-Poisson distribution.

Usage
zapoisson(lpobs0 = "logit", llambda = "loge", type.fitted =
c("mean", "lambda", "pobs0", "onempobs0"), imethod = 1,
ipobs0 = NULL, ilambda = NULL, ishrinkage = 0.95, probs.y = 0.35,
zero = NULL)
zapoissonff(llambda = "loge", lonempobs0 = "logit", type.fitted =
c("mean", "lambda", "pobs0", "onempobs0"), imethod = 1,
ilambda = NULL, ionempobs0 = NULL, ishrinkage = 0.95,
probs.y = 0.35, zero = "onempobs0")

Arguments
lpobs0 Link function for the parameter p0 , called pobs0 here. See Links for more
choices.
llambda Link function for the usual parameter. See Links for more choices.
type.fitted See CommonVGAMffArguments and fittedvlm for information.
lonempobs0 Corresponding argument for the other parameterization. See details below.
imethod, ipobs0, ionempobs0, ilambda, ishrinkage
See CommonVGAMffArguments for information.
probs.y, zero See CommonVGAMffArguments for information.

Details
The response Y is zero with probability p0 , else Y has a positive-Poisson() distribution with
probability 1 p0 . Thus 0 < p0 < 1, which is modelled as a function of the covariates. The zero-
altered Poisson distribution differs from the zero-inflated Poisson distribution in that the former has
zeros coming from one source, whereas the latter has zeros coming from the Poisson distribution
too. Some people call the zero-altered Poisson a hurdle model.
For one response/species, by default, the two linear/additive predictors for zapoisson() are (logit(p0 ), log())T .
The VGAM family function zapoissonff() has a few changes compared to zapoisson(). These
are: (i) the order of the linear/additive predictors is switched so the Poisson mean comes first; (ii)
argument onempobs0 is now 1 minus the probability of an observed 0, i.e., the probability of the
positive Poisson distribution, i.e., onempobs0 is 1-pobs0; (iii) argument zero has a new default so
that the onempobs0 is intercept-only by default. Now zapoissonff() is generally recommended
over zapoisson(). Both functions implement Fisher scoring and can handle multiple responses.
zapoisson 825

= (1 p0 )/[1 exp()].

If type.fitted = "pobs0" then p0 is returned.

Note
There are subtle differences between this family function and zipoisson and yip88. In particular,
zipoisson is a mixture model whereas zapoisson() and yip88 are conditional models.
Note this family function allows p0 to be modelled as functions of the covariates.
This family function effectively combines pospoisson and binomialff into one family function.
This family function can handle multiple responses, e.g., more than one species.

Author(s)
T. W. Yee

References
Welsh, A. H., Cunningham, R. B., Donnelly, C. F. and Lindenmayer, D. B. (1996) Modelling the
abundances of rare species: statistical models for counts with extra zeros. Ecological Modelling,
88, 297308.
Angers, J-F. and Biswas, A. (2003) A Bayesian analysis of zero-inflated generalized Poisson model.
Computational Statistics & Data Analysis, 42, 3746.
Yee, T. W. (2014) Reduced-rank vector generalized linear models with two linear predictors. Com-
putational Statistics and Data Analysis, 71, 889902.

See Also
rzapois, zipoisson, pospoisson, posnegbinomial, binomialff, rpospois, CommonVGAMffArguments,
simulate.vlm.

Examples
zdata <- data.frame(x2 = runif(nn <- 1000))
zdata <- transform(zdata, pobs0 = logit( -1 + 1*x2, inverse = TRUE),
lambda = loge(-0.5 + 2*x2, inverse = TRUE))
zdata <- transform(zdata, y = rzapois(nn, lambda, pobs0 = pobs0))

with(zdata, table(y))
fit <- vglm(y ~ x2, zapoisson, data = zdata, trace = TRUE)
fit <- vglm(y ~ x2, zapoisson, data = zdata, trace = TRUE, crit = "coef")
head(fitted(fit))
head(predict(fit))
826 zero

head(predict(fit, untransform = TRUE))

coef(fit, matrix = TRUE)
summary(fit)

# Another example ------------------------------

# Data from Angers and Biswas (2003)
abdata <- data.frame(y = 0:7, w = c(182, 41, 12, 2, 2, 0, 0, 1))
abdata <- subset(abdata, w > 0)
Abdata <- data.frame(yy = with(abdata, rep(y, w)))
fit3 <- vglm(yy ~ 1, zapoisson, data = Abdata, trace = TRUE, crit = "coef")
coef(fit3, matrix = TRUE)
Coef(fit3) # Estimate lambda (they get 0.6997 with SE 0.1520)
head(fitted(fit3), 1)
with(Abdata, mean(yy)) # Compare this with fitted(fit3)

zero The zero Argument in VGAM Family Functions

Description
The zero argument allows users to conveniently model certain linear/additive predictors as intercept-
only.

Details
Often a certain parameter needs to be modelled simply while other parameters in the model may
be more complex, for example, the parameter in LMS-Box-Cox quantile regression should be
modelled more simply compared to its parameter. Another example is the parameter in a GEV
distribution which is should be modelled simpler than its parameter. Using the zero argument
allows this to be fitted conveniently without having to input all the constraint matrices explicitly.
The zero argument can be assigned an integer vector from the set {1:M} where M is the number of
linear/additive predictors. Full details about constraint matrices can be found in the references. See
CommonVGAMffArguments for more information.

Value
Nothing is returned. It is simply a convenient argument for constraining certain linear/additive
predictors to be an intercept only.

Warning
The use of other arguments may conflict with the zero argument. For example, using constraints
to input constraint matrices may conflict with the zero argument. Another example is the argument
parallel. In general users should not assume any particular order of precedence when there is
potential conflict of definition. Currently no checking for consistency is made.
The argument zero may be renamed in the future to something better.
Zeta 827

Side Effects

The argument creates the appropriate constraint matrices internally.

Note

In all VGAM family functions zero = NULL means none of the linear/additive predictors are
modelled as intercepts-only. Almost all VGAM family function have zero = NULL as the default,
but there are some exceptions, e.g., binom2.or.
Typing something like coef(fit, matrix = TRUE) is a useful way to ensure that the zero argu-
ment has worked as expected.

Author(s)

T. W. Yee

References

Yee, T. W. and Wild, C. J. (1996) Vector generalized additive models. Journal of the Royal Statisti-
cal Society, Series B, Methodological, 58, 481493.
Yee, T. W. and Hastie, T. J. (2003) Reduced-rank vector generalized linear models. Statistical
Modelling, 3, 1541.

See Also

CommonVGAMffArguments, constraints.

Examples
args(multinomial)
args(binom2.or)
args(gpd)

#LMS quantile regression example

fit <- vglm(BMI ~ sm.bs(age, df = 4), lms.bcg(zero = c(1, 3)),
data = bmi.nz, trace = TRUE)
coef(fit, matrix = TRUE)

Zeta The Zeta Distribution

Description

Density, distribution function, quantile function and random generation for the zeta distribution.
828 Zeta

Usage

dzeta(x, shape, log = FALSE)

pzeta(q, shape, lower.tail = TRUE)
qzeta(p, shape)
rzeta(n, shape)

Arguments

x, q, p, n Same as Poisson.
shape The positive shape parameter p.
lower.tail, log
Same meaning as in Normal.

Details

The density function of the zeta distribution is given by

y s1 /(s + 1)

where s > 0, y = 1, 2, . . ., and is Riemanns zeta function.

Value

dzeta gives the density, pzeta gives the distribution function, qzeta gives the quantile function,
and rzeta generates random deviates.

Note

qzeta() runs slower and slower as shape approaches 0 and p approaches 1. The VGAM family
function zetaff estimates the shape parameter s.

Author(s)

T. W. Yee

References

Johnson N. L., Kotz S., and Balakrishnan N. (1993) Univariate Discrete Distributions, 2nd ed. New
York: Wiley.

See Also

zeta, zetaff, Oazeta, Oizeta, Otzeta.

zeta 829

Examples
dzeta(1:20, shape = 2)
myshape <- 0.5
max(abs(pzeta(1:200, myshape) -
cumsum(1/(1:200)^(1+myshape)) / zeta(myshape+1))) # Should be 0

## Not run: plot(1:6, dzeta(1:6, 2), type = "h", las = 1,

col = "orange", ylab = "Probability",
main = "zeta probability function; orange: shape = 2; blue: shape = 1")
points(0.10 + 1:6, dzeta(1:6, 1), type = "h", col = "blue")
## End(Not run)

zeta Riemanns Zeta Function

Description
Computes Riemanns zeta function and its first two derivatives. Also can computes Hurwitzs zeta
function.

Usage
zeta(x, deriv = 0, shift = 1)

Arguments
x A complex-valued vector/matrix whose real values must be 1. Otherwise, if
x may be real. If deriv is 1 or 2 then x must be real and positive.
deriv An integer equalling 0 or 1 or 2, which is the order of the derivative. The default
means it is computed ordinarily.
shift Positive and numeric, called A below. Allows for the Hurwitz zeta to be re-
turned. The default corresponds to the Riemann formula.

Details
The (Riemann) formula for real s is

X
1/ns .
n=1

While the usual definition involves an infinite series that converges when the real part of the argu-
ment is > 1, more efficient methods have been devised to compute the value. In particular, this
function uses Euler-Maclaurin summation. Theoretically, the zeta function can be computed over
the whole complex plane because of analytic continuation.
The (Riemann) formula used here for analytic continuation is

(s) = 2s s1 sin(s/2)(1 s)(1 s).

830 zeta

This is actually one of several formulas, but this one was discovered by Riemann himself and is
called the functional equation.
The Hurwitz zeta function for real 0 < s is

X
1/(A + n)s .
n=0

where 0 < A is known here as the shift. Since A = 1 by default, this function will therefore
return Riemanns zeta function by default. Currently derivatives are unavailable.

Value
The default is a vector/matrix of computed values of Riemanns zeta function. If shift contains
values not equal to 1, then this is Hurwitzs zeta function.

Warning
This function has not been fully tested, especially the derivatives. In particular, analytic continuation
does not work here for complex x with Re(x)<1 because currently the gamma function does not
handle complex arguments.

Note
Estimation of the parameter of the zeta distribution can be achieved with zetaff.

Author(s)
T. W. Yee, with the help of Garry J. Tee.

References
Riemann, B. (1859) Ueber die Anzahl der Primzahlen unter einer gegebenen Grosse. Monats-
berichte der Berliner Akademie, November 1859.
Edwards, H. M. (1974) Riemanns Zeta Function. Academic Press: New York.
Markman, B. (1965) The Riemann zeta function. BIT, 5, 138141.
Abramowitz, M. and Stegun, I. A. (1972) Handbook of Mathematical Functions with Formulas,
Graphs, and Mathematical Tables, New York: Dover Publications Inc.

See Also
zetaff, oazeta, oizeta, otzeta, lerch, gamma.

Examples
zeta(2:10)

## Not run:
curve(zeta, -13, 0.8, xlim = c(-12, 10), ylim = c(-1, 4), col = "orange",
las = 1, main = expression({zeta}(x)))
zetaff 831

curve(zeta, 1.2, 12, add = TRUE, col = "orange")

abline(v = 0, h = c(0, 1), lty = "dashed", col = "gray")

curve(zeta, -14, -0.4, col = "orange", main = expression({zeta}(x)))

abline(v = 0, h = 0, lty = "dashed", col = "gray") # Close up plot

x <- seq(0.04, 0.8, len = 100) # Plot of the first derivative

plot(x, zeta(x, deriv = 1), type = "l", las = 1, col = "blue",
xlim = c(0.04, 3), ylim = c(-6, 0), main = "zeta'(x)")
x <- seq(1.2, 3, len = 100)
lines(x, zeta(x, deriv = 1), col = "blue")
abline(v = 0, h = 0, lty = "dashed", col = "gray")
## End(Not run)

zeta(2) - pi^2 / 6 # Should be 0

zeta(4) - pi^4 / 90 # Should be 0
zeta(6) - pi^6 / 945 # Should be 0
zeta(8) - pi^8 / 9450 # Should be 0
# zeta(0, deriv = 1) + 0.5 * log(2*pi) # Should be 0

zetaff Zeta Distribution Family Function

Description
Estimates the parameter of the zeta distribution.

Usage
zetaff(lshape = "loge", ishape = NULL, gshape = exp(-3:4)/4, zero = NULL)

Arguments
lshape, ishape, zero
These arguments apply to the (positive) parameter p. See Links for more choices.
Choosing loglog constrains p > 1, but may fail if the maximum likelihood es-
timate is less than one. See CommonVGAMffArguments for more information.
gshape See CommonVGAMffArguments for more information.

Details
In this long tailed distribution the response must be a positive integer. The probability function for
a response Y is
P (Y = y) = 1/[y p+1 (p + 1)], p > 0, y = 1, 2, ...
where is Riemanns zeta function. The parameter p is positive, therefore a log link is the default.
The mean of Y is = (p)/(p + 1) (provided p > 1) and these are the fitted values. The variance
of Y is (p 1)/(p + 1) 2 provided p > 2.
832 zetaff

It appears that good initial values are needed for successful convergence. If convergence is not
obtained, try several values ranging from values near 0 to values about 10 or more.
Multiple responses are handled.

Value

An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, and vgam.

Note

The zeta function may be used to compute values of the zeta function.

Author(s)

T. W. Yee

References

pp.527 of Chapter 11 of Johnson N. L., Kemp, A. W. and Kotz S. (2005) Univariate Discrete
Distributions, 3rd edition, Hoboken, New Jersey: Wiley.
Knight, K. (2000) Mathematical Statistics. Boca Raton: Chapman & Hall/CRC Press.

See Also

zeta, oazeta, oizeta, otzeta, diffzeta, dzeta, hzeta, zipf.

Examples

zdata <- data.frame(y = 1:5, w = c(63, 14, 5, 1, 2)) # Knight, p.304

fit <- vglm(y ~ 1, zetaff, data = zdata, trace = TRUE, weight = w, crit = "c")
(phat <- Coef(fit)) # 1.682557
with(zdata, cbind(round(dzeta(y, phat) * sum(w), 1), w))

with(zdata, weighted.mean(y, w))

fitted(fit, matrix = FALSE)
predict(fit)

# The following should be zero at the MLE:

with(zdata, mean(log(rep(y, w))) + zeta(1+phat, deriv = 1) / zeta(1+phat))
Zibinom 833

Zibinom Zero-Inflated Binomial Distribution

Description
Density, distribution function, quantile function and random generation for the zero-inflated bino-
mial distribution with parameter pstr0.

Usage
dzibinom(x, size, prob, pstr0 = 0, log = FALSE)
pzibinom(q, size, prob, pstr0 = 0)
qzibinom(p, size, prob, pstr0 = 0)
rzibinom(n, size, prob, pstr0 = 0)

Arguments
x, q vector of quantiles.
p vector of probabilities.
size number of trials. It is the N symbol in the formula given in zibinomial.
prob probability of success on each trial.
n Same as in runif.
log Same as pbinom.
pstr0 Probability of a structural zero (i.e., ignoring the binomial distribution), called
. The default value of = 0 corresponds to the response having an ordinary
binomial distribution.

Details
The probability function of Y is 0 with probability , and Binomial(size, prob) with probability
1 . Thus
P (Y = 0) = + (1 )P (W = 0)
where W is distributed Binomial(size, prob).

Value
dzibinom gives the density, pzibinom gives the distribution function, qzibinom gives the quantile
function, and rzibinom generates random deviates.

Note
The argument pstr0 is recycled to the required length, and must have values which lie in the interval
[0, 1].
These functions actually allow for zero-deflation. That is, the resulting probability of a zero count
is less than the nominal value of the parent distribution. See Zipois for more information.
834 zibinomial

Author(s)

T. W. Yee

See Also

zibinomial, dbinom.

Examples

prob <- 0.2; size <- 10; pstr0 <- 0.5

(ii <- dzibinom(0:size, size, prob, pstr0 = pstr0))
max(abs(cumsum(ii) - pzibinom(0:size, size, prob, pstr0 = pstr0))) # Should be 0
table(rzibinom(100, size, prob, pstr0 = pstr0))

table(qzibinom(runif(100), size, prob, pstr0 = pstr0))

round(dzibinom(0:10, size, prob, pstr0 = pstr0) * 100) # Should be similar

## Not run: x <- 0:size

barplot(rbind(dzibinom(x, size, prob, pstr0 = pstr0),
dbinom(x, size, prob)),
beside = TRUE, col = c("blue", "green"), ylab = "Probability",
main = paste("ZIB(", size, ", ", prob, ", pstr0 = ", pstr0, ") (blue) vs",
" Binomial(", size, ", ", prob, ") (green)", sep=""),
names.arg = as.character(x), las = 1, lwd = 2)
## End(Not run)

zibinomial Zero-Inflated Binomial Distribution Family Function

Description

Fits a zero-inflated binomial distribution by maximum likelihood estimation.

Usage

zibinomial(lpstr0 = "logit", lprob = "logit",

type.fitted = c("mean", "prob", "pobs0", "pstr0", "onempstr0"),
ipstr0 = NULL, zero = NULL, multiple.responses = FALSE,
imethod = 1)
zibinomialff(lprob = "logit", lonempstr0 = "logit",
type.fitted = c("mean", "prob", "pobs0", "pstr0", "onempstr0"),
ionempstr0 = NULL, zero = "onempstr0",
multiple.responses = FALSE, imethod = 1)
zibinomial 835

Arguments
lpstr0, lprob Link functions for the parameter and the usual binomial probability param-
eter. See Links for more choices. For the zero-deflated model see below.
type.fitted See CommonVGAMffArguments and fittedvlm.
ipstr0 Optional initial values for , whose values must lie between 0 and 1. The default
is to compute an initial value internally. If a vector then recyling is used.
lonempstr0, ionempstr0
Corresponding arguments for the other parameterization. See details below.
multiple.responses
Logical. Currently it must be FALSE to mean the function does not handle
multiple responses. This is to remain compatible with the same argument in
binomialff.
zero, imethod See CommonVGAMffArguments for information. Argument zero changed its de-
fault value for version 0.9-2.

Details
These functions are based on

P (Y = 0) = + (1 )(1 )N ,

for y = 0, and
N
P (Y = y) = (1 ) N y (1 )N (1y) .
Ny
for y = 1/N, 2/N, . . . , 1. That is, the response is a sample proportion out of N trials, and the
argument size in rzibinom is N here. The parameter is the probability of a structural zero,
and it satisfies 0 < < 1. The mean of Y is E(Y ) = (1 ) and these are returned as
the fitted values by default. By default, the two linear/additive predictors for zibinomial() are
(logit(), logit())T .
The VGAM family function zibinomialff() has a few changes compared to zibinomial().
These are: (i) the order of the linear/additive predictors is switched so the binomial probability
comes first; (ii) argument onempstr0 is now 1 minus the probability of a structural zero, i.e., the
probability of the parent (binomial) component, i.e., onempstr0 is 1-pstr0; (iii) argument zero
has a new default so that the onempstr0 is intercept-only by default. Now zibinomialff() is
generally recommended over zibinomial(). Both functions implement Fisher scoring.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm and vgam.

Warning
Numerical problems can occur. Half-stepping is not uncommon. If failure to converge occurs, make
use of the argument ipstr0 or ionempstr0, or imethod.
836 zibinomial

Note

The response variable must have one of the formats described by binomialff, e.g., a factor or two
column matrix or a vector of sample proportions with the weights argument specifying the values
of N .
To work well, one needs large values of N and > 0, i.e., the larger N and are, the better. If
N = 1 then the model is unidentifiable since the number of parameters is excessive.
Setting stepsize = 0.5, say, may aid convergence.
Estimated probabilities of a structural zero and an observed zero are returned, as in zipoisson.
The zero-deflated binomial distribution might be fitted by setting lpstr0 = identitylink, albeit,
not entirely reliably. See zipoisson for information that can be applied here. Else try the zero-
altered binomial distribution (see zabinomial).

Author(s)

T. W. Yee

References

Welsh, A. H., Lindenmayer, D. B. and Donnelly, C. F. (2013) Fitting and interpreting occupancy
models. PLOS One, 8, 121.

See Also

rzibinom, binomialff, posbinomial, rbinom.

Examples
size <- 10 # Number of trials; N in the notation above
nn <- 200
zdata <- data.frame(pstr0 = logit( 0, inverse = TRUE), # 0.50
mubin = logit(-1, inverse = TRUE), # Mean of usual binomial
sv = rep(size, length = nn))
zdata <- transform(zdata,
y = rzibinom(nn, size = sv, prob = mubin, pstr0 = pstr0))
with(zdata, table(y))
fit <- vglm(cbind(y, sv - y) ~ 1, zibinomialff, data = zdata, trace = TRUE)
fit <- vglm(cbind(y, sv - y) ~ 1, zibinomialff, data = zdata, trace = TRUE,
stepsize = 0.5)

coef(fit, matrix = TRUE)

Coef(fit) # Useful for intercept-only models
head(fitted(fit, type = "pobs0")) # Estimate of P(Y = 0)
head(fitted(fit))
with(zdata, mean(y)) # Compare this with fitted(fit)
summary(fit)
Zigeom 837

Zigeom Zero-Inflated Geometric Distribution

Description
Density, and random generation for the zero-inflated geometric distribution with parameter pstr0.

Usage
dzigeom(x, prob, pstr0 = 0, log = FALSE)
pzigeom(q, prob, pstr0 = 0)
qzigeom(p, prob, pstr0 = 0)
rzigeom(n, prob, pstr0 = 0)

Arguments
x, q vector of quantiles.
p vector of probabilities.
prob see dgeom.
n Same as in runif.
pstr0 Probability of structural zero (ignoring the geometric distribution), called . The
default value corresponds to the response having an ordinary geometric distri-
bution.
log Logical. Return the logarithm of the answer?

Details
The probability function of Y is 0 with probability , and geometric(prob) with probability 1 .
Thus
P (Y = 0) = + (1 )P (W = 0)
where W is distributed geometric(prob).

Value
dzigeom gives the density, pzigeom gives the distribution function, qzigeom gives the quantile
function, and rzigeom generates random deviates.

Author(s)
T. W. Yee

## Not run: x <- 0:10

barplot(rbind(dzigeom(x, prob, pstr0), dgeom(x, prob)),
beside = TRUE, col = c("blue","orange"),
ylab = "P[Y = y]", xlab = "y", las = 1,
main = paste("zigeometric(", prob, ", pstr0 = ", pstr0,
") (blue) vs",
" geometric(", prob, ") (orange)", sep = ""),
names.arg = as.character(x))
## End(Not run)

zigeometric Zero-Inflated Geometric Distribution Family Function

Description
Fits a zero-inflated geometric distribution by maximum likelihood estimation.

Usage
zigeometric(lpstr0 = "logit", lprob = "logit",
type.fitted = c("mean", "prob", "pobs0", "pstr0", "onempstr0"),
ipstr0 = NULL, iprob = NULL,
imethod = 1, bias.red = 0.5, zero = NULL)
zigeometricff(lprob = "logit", lonempstr0 = "logit",
type.fitted = c("mean", "prob", "pobs0", "pstr0", "onempstr0"),
iprob = NULL, ionempstr0 = NULL,
imethod = 1, bias.red = 0.5, zero = "onempstr0")

Arguments
lpstr0, lprob Link functions for the parameters and p (prob). The usual geometric proba-
bility parameter is the latter. The probability of a structural zero is the former.
See Links for more choices. For the zero-deflated model see below.
zigeometric 839

lonempstr0, ionempstr0
Corresponding arguments for the other parameterization. See details below.
bias.red A constant used in the initialization process of pstr0. It should lie between 0
and 1, with 1 having no effect.
type.fitted See CommonVGAMffArguments and fittedvlm for information.
ipstr0, iprob See CommonVGAMffArguments for information.
zero, imethod See CommonVGAMffArguments for information.

Details
Function zigeometric() is based on

P (Y = 0) = + (1 )p,

for y = 0, and
P (Y = y) = (1 )p(1 p)y .
for y = 1, 2, . . .. The parameter satisfies 0 < < 1. The mean of Y is E(Y ) = (1 )p/(1 p)
and these are returned as the fitted values by default. By default, the two linear/additive predictors
are (logit(), logit(p))T . Multiple responses are handled.
Estimated probabilities of a structural zero and an observed zero can be returned, as in zipoisson;
see fittedvlm for information.
The VGAM family function zigeometricff() has a few changes compared to zigeometric().
These are: (i) the order of the linear/additive predictors is switched so the geometric probability
comes first; (ii) argument onempstr0 is now 1 minus the probability of a structural zero, i.e., the
probability of the parent (geometric) component, i.e., onempstr0 is 1-pstr0; (iii) argument zero
has a new default so that the onempstr0 is intercept-only by default. Now zigeometricff() is
generally recommended over zigeometric(). Both functions implement Fisher scoring and can
handle multiple responses.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm and vgam.

Note
The zero-deflated geometric distribution might be fitted by setting lpstr0 = identitylink, albeit,
not entirely reliably. See zipoisson for information that can be applied here. Else try the zero-
altered geometric distribution (see zageometric).

Author(s)
T. W. Yee

See Also
rzigeom, geometric, zageometric, rgeom, simulate.vlm.
840 Zinegbin

Examples
gdata <- data.frame(x2 = runif(nn <- 1000) - 0.5)
gdata <- transform(gdata, x3 = runif(nn) - 0.5,
x4 = runif(nn) - 0.5)
gdata <- transform(gdata, eta1 = 1.0 - 1.0 * x2 + 2.0 * x3,
eta2 = -1.0,
eta3 = 0.5)
gdata <- transform(gdata, prob1 = logit(eta1, inverse = TRUE),
prob2 = logit(eta2, inverse = TRUE),
prob3 = logit(eta3, inverse = TRUE))
gdata <- transform(gdata, y1 = rzigeom(nn, prob1, pstr0 = prob3),
y2 = rzigeom(nn, prob2, pstr0 = prob3),
y3 = rzigeom(nn, prob2, pstr0 = prob3))
with(gdata, table(y1))
with(gdata, table(y2))
with(gdata, table(y3))
head(gdata)

fit1 <- vglm(y1 ~ x2 + x3 + x4, zigeometric(zero = 1), data = gdata, trace = TRUE)
coef(fit1, matrix = TRUE)
head(fitted(fit1, type = "pstr0"))

fit2 <- vglm(cbind(y2, y3) ~ 1, zigeometric(zero = 1), data = gdata, trace = TRUE)
coef(fit2, matrix = TRUE)
summary(fit2)

Zinegbin Zero-Inflated Negative Binomial Distribution

Description
Density, distribution function, quantile function and random generation for the zero-inflated nega-
tive binomial distribution with parameter pstr0.

Usage
dzinegbin(x, size, prob = NULL, munb = NULL, pstr0 = 0, log = FALSE)
pzinegbin(q, size, prob = NULL, munb = NULL, pstr0 = 0)
qzinegbin(p, size, prob = NULL, munb = NULL, pstr0 = 0)
rzinegbin(n, size, prob = NULL, munb = NULL, pstr0 = 0)

Arguments
x, q vector of quantiles.
p vector of probabilities.
n Same as in runif.
Zinegbin 841

size, prob, munb, log

Arguments matching dnbinom. The argument munb corresponds to mu in dnbinom
and has been renamed to emphasize the fact that it is the mean of the negative
binomial component.
pstr0 Probability of structural zero (i.e., ignoring the negative binomial distribution),
called .

Details
The probability function of Y is 0 with probability , and a negative binomial distribution with
probability 1 . Thus
P (Y = 0) = + (1 )P (W = 0)
where W is distributed as a negative binomial distribution (see rnbinom.) See negbinomial, a
VGAM family function, for the formula of the probability density function and other details of the
negative binomial distribution.

Value
dzinegbin gives the density, pzinegbin gives the distribution function, qzinegbin gives the quan-
tile function, and rzinegbin generates random deviates.

Author(s)
T. W. Yee

See Also
zinegbinomial, rnbinom, rzipois.

Examples
munb <- 3; pstr0 <- 0.2; size <- k <- 10; x <- 0:10
(ii <- dzinegbin(x, pstr0 = pstr0, mu = munb, size = k))
max(abs(cumsum(ii) - pzinegbin(x, pstr0 = pstr0, mu = munb, size = k))) # 0
table(rzinegbin(100, pstr0 = pstr0, mu = munb, size = k))

table(qzinegbin(runif(1000), pstr0 = pstr0, mu = munb, size = k))

round(dzinegbin(x, pstr0 = pstr0, mu = munb, size = k) * 1000) # Should be similar

## Not run: barplot(rbind(dzinegbin(x, pstr0 = pstr0, mu = munb, size = k),

dnbinom(x, mu = munb, size = k)), las = 1,
beside = TRUE, col = c("blue", "green"), ylab = "Probability",
main = paste("ZINB(mu = ", munb, ", k = ", k, ", pstr0 = ", pstr0,
842 zinegbinomial

") (blue) vs NB(mu = ", munb,

", size = ", k, ") (green)", sep = ""),
names.arg = as.character(x))
## End(Not run)

zinegbinomial Zero-Inflated Negative Binomial Distribution Family Function

Description
Fits a zero-inflated negative binomial distribution by full maximum likelihood estimation.

Usage
zinegbinomial(zero = "size",
type.fitted = c("mean", "munb", "pobs0", "pstr0",
"onempstr0"),
mds.min = 1e-3, nsimEIM = 500, cutoff.prob = 0.999,
eps.trig = 1e-7, max.support = 4000, max.chunk.MB = 30,
lpstr0 = "logit", lmunb = "loge", lsize = "loge",
imethod = 1, ipstr0 = NULL, imunb = NULL,
iprobs.y = NULL, isize = NULL,
gprobs.y = (0:9)/10,
gsize.mux = exp(c(-30, -20, -15, -10, -6:3)))
zinegbinomialff(lmunb = "loge", lsize = "loge", lonempstr0 = "logit",
type.fitted = c("mean", "munb", "pobs0", "pstr0",
"onempstr0"), imunb = NULL, isize = NULL, ionempstr0 =
NULL, zero = c("size", "onempstr0"), imethod = 1,
iprobs.y = NULL, cutoff.prob = 0.999,
eps.trig = 1e-7, max.support = 4000, max.chunk.MB = 30,
gprobs.y = (0:9)/10, gsize.mux = exp((-12:6)/2),
mds.min = 1e-3, nsimEIM = 500)

Arguments
lpstr0, lmunb, lsize
Link functions for the parameters , the mean and k; see negbinomial for de-
tails, and Links for more choices. For the zero-deflated model see below.
type.fitted See CommonVGAMffArguments and fittedvlm for more information.
ipstr0, isize, imunb
Optional initial values for and k and . The default is to compute an initial
value internally for both. If a vector then recycling is used.
lonempstr0, ionempstr0
Corresponding arguments for the other parameterization. See details below.
imethod An integer with value 1 or 2 or 3 which specifies the initialization method
for the mean parameter. If failure to converge occurs try another value. See
CommonVGAMffArguments for more information.
zinegbinomial 843

zero Specifies which linear/additive predictors are to be modelled as intercept-only.

They can be such that their absolute values are either 1 or 2 or 3. The default is
the and k parameters (both for each response). See CommonVGAMffArguments
for more information.
nsimEIM See CommonVGAMffArguments for information.
iprobs.y, cutoff.prob, max.support, max.chunk.MB
See negbinomial and/or posnegbinomial for details.
mds.min, eps.trig
See negbinomial for details.
gprobs.y, gsize.mux
These arguments relate to grid searching in the initialization process. See negbinomial
and/or posnegbinomial for details.

Details
These functions are based on

P (Y = 0) = + (1 )(k/(k + ))k ,

and for y = 1, 2, . . .,
P (Y = y) = (1 ) dnbinom(y, , k).
The parameter satisfies 0 < < 1. The mean of Y is (1 ) (returned as the fitted values). By
default, the three linear/additive predictors for zinegbinomial() are (logit(), log(), log(k))T .
See negbinomial, another VGAM family function, for the formula of the probability density func-
tion and other details of the negative binomial distribution.
Independent multiple responses are handled. If so then arguments ipstr0 and isize may be vectors
with length equal to the number of responses.
The VGAM family function zinegbinomialff() has a few changes compared to zinegbinomial().
These are: (i) the order of the linear/additive predictors is switched so the NB mean comes first;
(ii) onempstr0 is now 1 minus the probability of a structural 0, i.e., the probability of the parent
(NB) component, i.e., onempstr0 is 1-pstr0; (iii) argument zero has a new default so that the
onempstr0 is intercept-only by default. Now zinegbinomialff() is generally recommended over
zinegbinomial(). Both functions implement Fisher scoring and can handle multiple responses.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, and vgam.

Warning
This model can be difficult to fit to data, and this family function is fragile. The model is especially
difficult to fit reliably when the estimated k parameter is very large (so the model approaches a
zero-inflated Poisson distribution) or much less than 1 (and gets more difficult as it approaches
0). Numerical problems can also occur, e.g., when the probability of a zero is actually less than,
and not more than, the nominal probability of zero. Similarly, numerical problems can occur if
there is little or no 0-inflation, or when the sample size is small. Half-stepping is not uncommon.
Successful convergence is sensitive to the initial values, therefore if failure to converge occurs, try
844 zinegbinomial

using combinations of arguments stepsize (in vglm.control), imethod, imunb, ipstr0, isize,
and/or zero if there are explanatory variables. Else try fitting an ordinary negbinomial model or a
zipoisson model.
This VGAM family function can be computationally expensive and can run slowly; setting trace = TRUE
is useful for monitoring convergence.

Note
Estimated probabilities of a structural zero and an observed zero can be returned, as in zipoisson;
see fittedvlm for more information.
If k is large then the use of VGAM family function zipoisson is probably preferable. This follows
because the Poisson is the limiting distribution of a negative binomial as k tends to infinity.
The zero-deflated negative binomial distribution might be fitted by setting lpstr0 = identitylink,
albeit, not entirely reliably. See zipoisson for information that can be applied here. Else try the
zero-altered negative binomial distribution (see zanegbinomial).

Author(s)
T. W. Yee

See Also
Zinegbin, negbinomial, rpois, CommonVGAMffArguments.

Examples
## Not run:
# Example 1
ndata <- data.frame(x2 = runif(nn <- 1000))
ndata <- transform(ndata, pstr0 = logit(-0.5 + 1 * x2, inverse = TRUE),
munb = exp( 3 + 1 * x2),
size = exp( 0 + 2 * x2))
ndata <- transform(ndata,
y1 = rzinegbin(nn, mu = munb, size = size, pstr0 = pstr0))
with(ndata, table(y1)["0"] / sum(table(y1)))
nfit <- vglm(y1 ~ x2, zinegbinomial(zero = NULL), data = ndata)
coef(nfit, matrix = TRUE)
summary(nfit)
head(cbind(fitted(nfit), with(ndata, (1 - pstr0) * munb)))
round(vcov(nfit), 3)

# Example 2: RR-ZINB could also be called a COZIVGLM-ZINB-2

ndata <- data.frame(x2 = runif(nn <- 2000))
ndata <- transform(ndata, x3 = runif(nn))
ndata <- transform(ndata, eta1 = 3 + 1 * x2 + 2 * x3)
ndata <- transform(ndata, pstr0 = logit(-1.5 + 0.5 * eta1, inverse = TRUE),
munb = exp(eta1),
size = exp(4))
ndata <- transform(ndata,
zipebcom 845

y1 = rzinegbin(nn, pstr0 = pstr0, mu = munb, size = size))

with(ndata, table(y1)["0"] / sum(table(y1)))
rrzinb <- rrvglm(y1 ~ x2 + x3, zinegbinomial(zero = NULL), data = ndata,
Index.corner = 2, str0 = 3, trace = TRUE)
coef(rrzinb, matrix = TRUE)
Coef(rrzinb)

## End(Not run)

zipebcom Exchangeable Bivariate cloglog Odds-ratio Model From a Zero-

inflated Poisson Distribution

Description
Fits an exchangeable bivariate odds-ratio model to two binary responses with a complementary
log-log link. The data are assumed to come from a zero-inflated Poisson distribution that has been
converted to presence/absence.

Usage
zipebcom(lmu12 = "cloglog", lphi12 = "logit", loratio = "loge",
imu12 = NULL, iphi12 = NULL, ioratio = NULL,
zero = c("phi12", "oratio"), tol = 0.001, addRidge = 0.001)

Arguments
lmu12, imu12 Link function, extra argument and optional initial values for the first (and sec-
ond) marginal probabilities. Argument lmu12 should be left alone. Argument
imu12 may be of length 2 (one element for each response).
lphi12 Link function applied to the parameter of the zero-inflated Poisson distribution
(see zipoisson). See Links for more choices.
loratio Link function applied to the odds ratio. See Links for more choices.
iphi12, ioratio
Optional initial values for and the odds ratio. See CommonVGAMffArguments
for more details. In general, good initial values (especially for iphi12) are
often required, therefore use these arguments if convergence failure occurs. If
inputted, the value of iphi12 cannot be more than the sample proportions of
zeros in either response.
zero Which linear/additive predictor is modelled as an intercept only? A NULL means
none. The default has both and the odds ratio as not being modelled as a
function of the explanatory variables (apart from an intercept).
tol Tolerance for testing independence. Should be some small positive numerical
value.
addRidge Some small positive numerical value. The first two diagonal elements of the
working weight matrices are multiplied by 1+addRidge to make it diagonally
dominant, therefore positive-definite.
846 zipebcom

Details

This VGAM family function fits an exchangeable bivariate odds ratio model (binom2.or) with a
cloglog link. The data are assumed to come from a zero-inflated Poisson (ZIP) distribution that
has been converted to presence/absence. Explicitly, the default model is

cloglog[P (Yj = 1)/(1 )] = 1 , j = 1, 2

for the (exchangeable) marginals, and

logit[] = 2 ,

for the mixing parameter, and

log[P (Y00 = 1)P (Y11 = 1)/(P (Y01 = 1)P (Y10 = 1))] = 3 ,

specifies the dependency between the two responses. Here, the responses equal 1 for a success and
a 0 for a failure, and the odds ratio is often written = p00 p11 /(p10 p01 ). We have p10 = p01
because of the exchangeability.
The second linear/additive predictor models the parameter (see zipoisson). The third lin-
ear/additive predictor is the same as binom2.or, viz., the log odds ratio.
Suppose a dataset1 comes from a Poisson distribution that has been converted to presence/absence,
and that both marginal probabilities are the same (exchangeable). Then binom2.or("cloglog", exch=TRUE)
is appropriate. Now suppose a dataset2 comes from a zero-inflated Poisson distribution. The first
linear/additive predictor of zipebcom() applied to dataset2 is the same as that of binom2.or("cloglog", exch=TRUE)
applied to dataset1. That is, the has been taken care of by zipebcom() so that it is just like the
simpler binom2.or.
Note that, for 1 , mu12 = prob12 / (1-phi12) where prob12 is the probability of a 1 under the
ZIP model. Here, mu12 correspond to mu1 and mu2 in the binom2.or-Poisson model.
If = 0 then zipebcom() should be equivalent to binom2.or("cloglog", exch=TRUE). Full
details are given in Yee and Dirnbock (2009).
The leading 2 2 submatrix of the expected information matrix (EIM) is of rank-1, not 2! This
is due to the fact that the parameters corresponding to the first two linear/additive predictors are
unidentifiable. The quick fix around this problem is to use the addRidge adjustment. The model
is fitted by maximum likelihood estimation since the full likelihood is specified. Fisher scoring is
implemented.
The default models 2 and 3 as single parameters only, but this can be circumvented by setting
zero=NULL in order to model the and odds ratio as a function of all the explanatory variables.

Value

An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm and vgam.
When fitted, the fitted.values slot of the object contains the four joint probabilities, labelled as
(Y1 , Y2 ) = (0,0), (0,1), (1,0), (1,1), respectively. These estimated probabilities should be extracted
with the fitted generic function.
zipebcom 847

Warning
The fact that the EIM is not of full rank may mean the model is naturally ill-conditioned. Not
sure whether there are any negative consequences wrt theory. For now it is certainly safer to fit
binom2.or to bivariate binary responses.

Note
The "12" in the argument names reinforce the user about the exchangeability assumption. The
name of this VGAM family function stands for zero-inflated Poisson exchangeable bivariate com-
plementary log-log odds-ratio model or ZIP-EBCOM.
See binom2.or for details that are pertinent to this VGAM family function too. Even better initial
values are usually needed here.
The xij (see vglm.control) argument enables environmental variables with different values at the
two time points to be entered into an exchangeable binom2.or model. See the authors webpage
for sample code.

References
Yee, T. W. and Dirnbock, T. (2009) Models for analysing species presence/absence data at two
time points. Journal of Theoretical Biology, 259(4), 684694.

See Also
binom2.or, zipoisson, cloglog, CommonVGAMffArguments.

Examples
zdata <- data.frame(x2 = seq(0, 1, len = (nsites <- 2000)))
zdata <- transform(zdata, eta1 = -3 + 5 * x2,
phi1 = logit(-1, inverse = TRUE),
oratio = exp(2))
zdata <- transform(zdata, mu12 = cloglog(eta1, inverse = TRUE) * (1-phi1))
tmat <- with(zdata, rbinom2.or(nsites, mu1 = mu12, oratio = oratio, exch = TRUE))
zdata <- transform(zdata, ybin1 = tmat[, 1], ybin2 = tmat[, 2])

with(zdata, table(ybin1, ybin2)) / nsites # For interest only

## Not run:
# Various plots of the data, for interest only
par(mfrow = c(2, 2))
plot(jitter(ybin1) ~ x2, data = zdata, col = "blue")

plot(jitter(ybin2) ~ jitter(ybin1), data = zdata, col = "blue")

plot(mu12 ~ x2, data = zdata, col = "blue", type = "l", ylim = 0:1,
ylab = "Probability", main = "Marginal probability and phi")
with(zdata, abline(h = phi1[1], col = "red", lty = "dashed"))

tmat2 <- with(zdata, dbinom2.or(mu1 = mu12, oratio = oratio, exch = TRUE))

with(zdata, matplot(x2, tmat2, col = 1:4, type = "l", ylim = 0:1,
ylab = "Probability", main = "Joint probabilities"))
848 Zipf

## End(Not run)

# Now fit the model to the data.

fit <- vglm(cbind(ybin1, ybin2) ~ x2, zipebcom, data = zdata, trace = TRUE)
coef(fit, matrix = TRUE)
summary(fit)
vcov(fit)

Zipf The Zipf Distribution

Description
Density, distribution function, quantile function and random generation for the Zipf distribution.

Usage
dzipf(x, N, shape, log = FALSE)
pzipf(q, N, shape, log.p = FALSE)
qzipf(p, N, shape)
rzipf(n, N, shape)

Arguments
x, q, p, n Same as Poisson.
N, shape the number of elements, and the exponent characterizing the distribution. See
zipf for more details.
log, log.p Same meaning as in Normal.

Details
This is a finite version of the zeta distribution. See zetaff for more details. In general, these
functions runs slower and slower as N increases.

Value
dzipf gives the density, pzipf gives the cumulative distribution function, qzipf gives the quantile
function, and rzipf generates random deviates.

Author(s)
T. W. Yee

zipf Zipf Distribution Family Function

Description
Estimates the parameter of the Zipf distribution.

Usage
zipf(N = NULL, lshape = "loge", ishape = NULL)

Arguments
N Number of elements, an integer satisfying 1 < N < Inf. The default is to use
the maximum value of the response. If given, N must be no less that the largest
response value. If N = Inf and s > 1 then this is the zeta distribution (use
zetaff instead).
lshape Parameter link function applied to the (positive) shape parameter s. See Links
for more choices.
ishape Optional initial value for the parameter s. The default is to choose an initial
value internally. If converge failure occurs use this argument to input a value.

Details
The probability function for a response Y is
N
X
P (Y = y) = y s / is , s > 0, y = 1, 2, . . . , N,
i=1

where s is the exponent characterizing the distribution.PThe mean of Y , which are returned as the
n m
fitted values, is = HN,s1 /HN,s where Hn,m = i=1 i is the nth generalized harmonic
number.
Zipfs law is an experimental law which is often applied to the study of the frequency of words
in a corpus of natural language utterances. It states that the frequency of any word is inversely
proportional to its rank in the frequency table. For example, "the" and "of" are first two most
common words, and Zipfs law states that "the" is twice as common as "of". Many other natural
phenomena conform to Zipfs law.
850 Zipois

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm and vgam.

Note
Upon convergence, the N is stored as @misc$N.

Author(s)
T. W. Yee

References
pp.526 of Chapter 11 of Johnson N. L., Kemp, A. W. and Kotz S. (2005) Univariate Discrete
Distributions, 3rd edition, Hoboken, New Jersey, USA: Wiley.

Zipois Zero-Inflated Poisson Distribution

Description
Density, distribution function, quantile function and random generation for the zero-inflated and
zero-deflated Poisson distribution with parameter pstr0.

Usage
dzipois(x, lambda, pstr0 = 0, log = FALSE)
pzipois(q, lambda, pstr0 = 0)
qzipois(p, lambda, pstr0 = 0)
rzipois(n, lambda, pstr0 = 0)
Zipois 851

Arguments

x, q vector of quantiles.
p vector of probabilities.
n number of observations. Must be a single positive integer.
lambda Vector of positive means.
pstr0 Probability of a structural zero (i.e., ignoring the Poisson distribution), called
. The default value of = 0 corresponds to the response having an ordinary
Poisson distribution. This argument may be negative to allow for 0-deflation,
hence its interpretation as a probability ceases.
log Logical. Return the logarithm of the answer?

Details

The probability function of Y is 0 with probability , and P oisson() with probability 1 . Thus

P (Y = 0) = + (1 )P (W = 0)

where W is distributed P oisson().

Value

dzipois gives the density, pzipois gives the distribution function, qzipois gives the quantile
function, and rzipois generates random deviates.

Note

The argument pstr0 is recycled to the required length, and must have values which lie in the interval
[0, 1].
These functions actually allow for the zero-deflated Poisson distribution. Here, pstr0 is also per-
mitted to lie in the interval [-1/expm1(lambda), 0]. The resulting probability of a zero count is
less than the nominal Poisson value, and the use of pstr0 to stand for the probability of a structural
zero loses its meaning. When pstr0 equals -1/expm1(lambda) this corresponds to the positive-
Poisson distribution (e.g., see dpospois).

Author(s)

T. W. Yee

See Also

zipoisson, dpois, rzinegbin.

852 zipoisson

Examples
lambda <- 3; pstr0 <- 0.2; x <- (-1):7
(ii <- dzipois(x, lambda, pstr0 = pstr0))
max(abs(cumsum(ii) - pzipois(x, lambda, pstr0 = pstr0))) # Should be 0
table(rzipois(100, lambda, pstr0 = pstr0))

table(qzipois(runif(100), lambda, pstr0))

round(dzipois(0:10, lambda, pstr0 = pstr0) * 100) # Should be similar

## Not run: x <- 0:10

par(mfrow = c(2, 1)) # Zero-inflated Poisson
barplot(rbind(dzipois(x, lambda, pstr0 = pstr0), dpois(x, lambda)),
beside = TRUE, col = c("blue", "orange"),
main = paste("ZIP(", lambda, ", pstr0 = ", pstr0, ") (blue) vs",
" Poisson(", lambda, ") (orange)", sep = ""),
names.arg = as.character(x))

deflat.limit <- -1 / expm1(lambda) # Zero-deflated Poisson

newpstr0 <- round(deflat.limit / 1.5, 3)
barplot(rbind(dzipois(x, lambda, pstr0 = newpstr0),
dpois(x, lambda)),
beside = TRUE, col = c("blue","orange"),
main = paste("ZDP(", lambda, ", pstr0 = ", newpstr0, ") (blue) vs",
" Poisson(", lambda, ") (orange)", sep = ""),
names.arg = as.character(x))
## End(Not run)

zipoisson Zero-Inflated Poisson Distribution Family Function

Description

Fits a zero-inflated or zero-deflated Poisson distribution by full maximum likelihood estimation.

Usage

zipoisson(lpstr0 = "logit", llambda = "loge", type.fitted =

c("mean", "lambda", "pobs0", "pstr0", "onempstr0"), ipstr0 =
NULL, ilambda = NULL, gpstr0 = NULL, imethod = 1,
ishrinkage = 0.95, probs.y = 0.35, zero = NULL)
zipoissonff(llambda = "loge", lonempstr0 = "logit", type.fitted =
c("mean", "lambda", "pobs0", "pstr0", "onempstr0"),
ilambda = NULL, ionempstr0 = NULL, gonempstr0 = NULL,
imethod = 1, ishrinkage = 0.95, probs.y = 0.35, zero =
"onempstr0")
zipoisson 853

Arguments
lpstr0, llambda
Link function for the parameter and the usual parameter. See Links for
more choices; see CommonVGAMffArguments for more information. For the zero-
deflated model see below.
ipstr0, ilambda
Optional initial values for , whose values must lie between 0 and 1. Optional
initial values for , whose values must be positive. The defaults are to compute
an initial value internally for each. If a vector then recycling is used.
lonempstr0, ionempstr0
Corresponding arguments for the other parameterization. See details below.
type.fitted Character. The type of fitted value to be returned. The first choice (the expected
value) is the default. The estimated probability of an observed 0 is an alterna-
tive, else the estimated probability of a structural 0, or one minus the estimated
probability of a structural 0. See CommonVGAMffArguments and fittedvlm for
more information.
imethod An integer with value 1 or 2 which specifies the initialization method for .
If failure to converge occurs try another value and/or else specify a value for
ishrinkage and/or else specify a value for ipstr0. See CommonVGAMffArguments
for more information.
ishrinkage How much shrinkage is used when initializing . The value must be between
0 and 1 inclusive, and a value of 0 means the individual response values are
used, and a value of 1 means the median or mean is used. This argument is used
in conjunction with imethod. See CommonVGAMffArguments for more informa-
tion.
zero Specifies which linear/additive predictors are to be modelled as intercept-only.
If given, the value can be either 1 or 2, and the default is none of them. Setting
zero = 1 makes a single parameter. See CommonVGAMffArguments for more
information.
gpstr0, gonempstr0, probs.y
Details at CommonVGAMffArguments.

Details
These models are a mixture of a Poisson distribution and the value 0; it has value 0 with probability
else is Poisson() distributed. Thus there are two sources for zero values, and is the probability
of a structural zero. The model for zipoisson() can be written

P (Y = 0) = + (1 ) exp(),

and for y = 1, 2, . . .,
P (Y = y) = (1 ) exp()y /y!.
Here, the parameter satisfies 0 < < 1. The mean of Y is (1 ) and these are returned as the
fitted values, by default. The variance of Y is (1 )(1 + ). By default, the two linear/additive
predictors of zipoisson() are (logit(), log())T .
The VGAM family function zipoissonff() has a few changes compared to zipoisson(). These
are: (i) the order of the linear/additive predictors is switched so the Poisson mean comes first;
854 zipoisson

(ii) onempstr0 is now 1 minus the probability of a structural 0, i.e., the probability of the parent
(Poisson) component, i.e., onempstr0 is 1-pstr0; (iii) argument zero has a new default so that
the onempstr0 is intercept-only by default. Now zipoissonff() is generally recommended over
zipoisson() (and definitely recommended over yip88). Both functions implement Fisher scoring
and can handle multiple responses.

Value
An object of class "vglmff" (see vglmff-class). The object is used by modelling functions such
as vglm, rrvglm and vgam.

Warning
Numerical problems can occur, e.g., when the probability of zero is actually less than, not more than,
the nominal probability of zero. For example, in the Angers and Biswas (2003) data below, replac-
ing 182 by 1 results in nonconvergence. Half-stepping is not uncommon. If failure to converge
occurs, try using combinations of imethod, ishrinkage, ipstr0, and/or zipoisson(zero = 1)
if there are explanatory variables. The default for zipoissonff() is to model the structural zero
probability as an intercept-only.

Note
This family function can be used to estimate the 0-deflated model, hence pstr0 is not to be inter-
preted as a probability. One should set, e.g., lpstr0 = "identitylink". Likewise, the functions
in Zipois can handle the zero-deflated Poisson distribution too. Although the iterations might fall
outside the parameter space, the validparams slot should keep them inside. A (somewhat) similar
alternative for zero-deflation is to try the zero-altered Poisson model (see zapoisson).
The use of this VGAM family function with rrvglm can result in a so-called COZIGAM or
COZIGLM. That is, a reduced-rank zero-inflated Poisson model (RR-ZIP) is a constrained zero-
inflated generalized linear model. See COZIGAM. A RR-ZINB model can also be fitted easily;
see zinegbinomial. Jargon-wise, a COZIGLM might be better described as a COZIVGLM-ZIP.

Author(s)
T. W. Yee

References
Thas, O. and Rayner, J. C. W. (2005) Smooth tests for the zero-inflated Poisson distribution. Bio-
metrics, 61, 808815.
Data: Angers, J-F. and Biswas, A. (2003) A Bayesian analysis of zero-inflated generalized Poisson
model. Computational Statistics & Data Analysis, 42, 3746.
Cameron, A. C. and Trivedi, P. K. (1998) Regression Analysis of Count Data. Cambridge University
Press: Cambridge.
Yee, T. W. (2014) Reduced-rank vector generalized linear models with two linear predictors. Com-
putational Statistics and Data Analysis, 71, 889902.
zipoisson 855

See Also
zapoisson, Zipois, yip88, rrvglm, zipebcom, rpois, simulate.vlm.

Examples
# Example 1: simulated ZIP data
zdata <- data.frame(x2 = runif(nn <- 1000))
zdata <- transform(zdata, pstr01 = logit(-0.5 + 1*x2, inverse = TRUE),
pstr02 = logit( 0.5 - 1*x2, inverse = TRUE),
Ps01 = logit(-0.5 , inverse = TRUE),
Ps02 = logit( 0.5 , inverse = TRUE),
lambda1 = loge(-0.5 + 2*x2, inverse = TRUE),
lambda2 = loge( 0.5 + 2*x2, inverse = TRUE))
zdata <- transform(zdata, y1 = rzipois(nn, lambda = lambda1, pstr0 = Ps01),
y2 = rzipois(nn, lambda = lambda2, pstr0 = Ps02))

with(zdata, table(y1)) # Eyeball the data

with(zdata, table(y2))
fit1 <- vglm(y1 ~ x2, zipoisson(zero = 1), data = zdata, crit = "coef")
fit2 <- vglm(y2 ~ x2, zipoisson(zero = 1), data = zdata, crit = "coef")
coef(fit1, matrix = TRUE) # These should agree with the above values
coef(fit2, matrix = TRUE) # These should agree with the above values

# Fit all two simultaneously, using a different parameterization:

fit12 <- vglm(cbind(y1, y2) ~ x2, zipoissonff, data = zdata, crit = "coef")
coef(fit12, matrix = TRUE) # These should agree with the above values

# For the first observation compute the probability that y1 is

# due to a structural zero.
(fitted(fit1, type = "pstr0") / fitted(fit1, type = "pobs0"))[1]

# Example 2: McKendrick (1926). Data from 223 Indian village households

cholera <- data.frame(ncases = 0:4, # Number of cholera cases,
wfreq = c(168, 32, 16, 6, 1)) # Frequencies
fit <- vglm(ncases ~ 1, zipoisson, wei = wfreq, cholera, trace = TRUE)
coef(fit, matrix = TRUE)
with(cholera, cbind(actual = wfreq,
fitted = round(dzipois(ncases, lambda = Coef(fit)[2],
pstr0 = Coef(fit)[1]) *
sum(wfreq), digits = 2)))

# Example 3: data from Angers and Biswas (2003)

abdata <- data.frame(y = 0:7, w = c(182, 41, 12, 2, 2, 0, 0, 1))
abdata <- subset(abdata, w > 0)
fit <- vglm(y ~ 1, zipoisson(lpstr0 = probit, ipstr0 = 0.8),
data = abdata, weight = w, trace = TRUE)
fitted(fit, type = "pobs0") # Estimate of P(Y = 0)
coef(fit, matrix = TRUE)
Coef(fit) # Estimate of pstr0 and lambda
fitted(fit)
with(abdata, weighted.mean(y, w)) # Compare this with fitted(fit)
856 Zoabeta

summary(fit)

# Example 4: zero-deflated model for intercept-only data

zdata <- transform(zdata, lambda3 = loge(0.0, inverse = TRUE))
zdata <- transform(zdata, deflat.limit = -1 / expm1(lambda3)) # Boundary
# The 'pstr0' parameter is negative and in parameter space:
zdata <- transform(zdata, usepstr0 = deflat.limit / 2) # Not too near the boundary
zdata <- transform(zdata, y3 = rzipois(nn, lambda3, pstr0 = usepstr0))
head(zdata)
with(zdata, table(y3)) # A lot of deflation
fit3 <- vglm(y3 ~ 1, zipoisson(zero = -1, lpstr0 = "identitylink"),
data = zdata, trace = TRUE, crit = "coef")
coef(fit3, matrix = TRUE)
# Check how accurate it was:
zdata[1, "usepstr0"] # Answer
coef(fit3)[1] # Estimate
Coef(fit3)
vcov(fit3) # Is positive-definite

# Example 5: This RR-ZIP is known as a COZIGAM or COZIVGLM-ZIP

set.seed(123)
rrzip <- rrvglm(Alopacce ~ sm.bs(WaterCon, df = 3), zipoisson(zero = NULL),
data = hspider, trace = TRUE, Index.corner = 2)
coef(rrzip, matrix = TRUE)
Coef(rrzip)
summary(rrzip)
## Not run: plotvgam(rrzip, lcol = "blue")

Zoabeta The Zero/One-Inflated Beta Distribution

Description
Density, distribution function, and random generation for the zero/one-inflated beta distribution.

Usage
dzoabeta(x, shape1, shape2, pobs0 = 0, pobs1 = 0, log = FALSE,
tol = .Machine$double.eps)
pzoabeta(q, shape1, shape2, pobs0 = 0, pobs1 = 0,
lower.tail = TRUE, log.p = FALSE, tol = .Machine$double.eps)
qzoabeta(p, shape1, shape2, pobs0 = 0, pobs1 = 0,
lower.tail = TRUE, log.p = FALSE, tol = .Machine$double.eps)
rzoabeta(n, shape1, shape2, pobs0 = 0, pobs1 = 0,
tol = .Machine$double.eps)

Arguments
x, q, p, n Same as Beta.
Zoabeta 857

pobs0, pobs1 vector of probabilities that 0 and 1 are observed (0 and 1 ).

shape1, shape2 Same as Beta. They are called a and b in beta respectively.
lower.tail, log, log.p
Same as Beta.
tol Numeric, tolerance for testing equality with 0 and 1.

Details
This distribution is a mixture of a discrete distribution with a continuous distribution. The cumula-
tive distribution function of Y is

F (y) = (1 0 1 )B(y) + 0 I[0 y] + 1 I[1 y]

where B(y) is the cumulative distribution function of the beta distribution with the same shape
parameters (pbeta), 0 is the inflated probability at 0 and 1 is the inflated probability at 1. The
default values of j mean that these functions behave like the ordinary Beta when only the essential
arguments are inputted.

Value
dzoabeta gives the density, pzoabeta gives the distribution function, qzoabeta gives the quantile,
and rzoabeta generates random deviates.

Author(s)
Xiangjie Xue and T. W. Yee

See Also
zoabetaR, beta, betaR, Betabinom.

Examples
## Not run:
N <- 1000; y <- rzoabeta(N, 2, 3, 0.2, 0.2)
hist(y, probability = TRUE, border = "blue", las = 1,
main = "Blue = 0- and 1-altered; orange = ordinary beta")
sum(y == 0) / N # Proportion of 0s
sum(y == 1) / N # Proportion of 1s
Ngrid <- 1000
lines(seq(0, 1, length = Ngrid),
dbeta(seq(0, 1, length = Ngrid), 2, 3), col = "orange")
lines(seq(0, 1, length = Ngrid), col = "blue",
dzoabeta(seq(0, 1, length = Ngrid), 2 , 3, 0.2, 0.2))

## End(Not run)
858 zoabetaR

zoabetaR Zero- and One-Inflated Beta Distribution Family Function

Description
Estimation of the shape parameters of the two-parameter beta distribution plus the probabilities of
a 0 and/or a 1.

Usage
zoabetaR(lshape1 = "loge", lshape2 = "loge", lpobs0 = "logit",
lpobs1 = "logit", ishape1 = NULL, ishape2 = NULL, trim = 0.05,
type.fitted = c("mean", "pobs0", "pobs1", "beta.mean"),
parallel.shape = FALSE, parallel.pobs = FALSE, zero = NULL)

Arguments
lshape1, lshape2, lpobs0, lpobs1
Details at CommonVGAMffArguments. See Links for more choices.
ishape1, ishape2
Details at CommonVGAMffArguments.
trim, zero Same as betaR.
parallel.shape, parallel.pobs
See CommonVGAMffArguments for more information.
type.fitted The choice "beta.mean" mean to return the mean of the beta distribution; the
0s and 1s are ignored. See CommonVGAMffArguments for more information.

Details
The standard 2-parameter beta distribution has a support on (0,1), however, many datasets have 0
and/or 1 values too. This family function handles 0s and 1s (at least one of them must be present) in
the data set by modelling the probability of a 0 by a logistic regression (default link is the logit), and
similarly for the probability of a 1. The remaining proportion, 1-pobs0-pobs1, of the data comes
from a standard beta distribution. This family function therefore extends betaR. One has M = 3 or
M = 4 per response. Multiple responses are allowed.

Value
Similar to betaR.

Author(s)
Thomas W. Yee and Xiangjie Xue.

See Also
Zoabeta, betaR, betaff, Beta, zipoisson.
zoabetaR 859

Examples
nn <- 1000; set.seed(1)
bdata <- data.frame(x2 = runif(nn))
bdata <- transform(bdata,
pobs0 = logit(-2 + x2, inverse = TRUE),
pobs1 = logit(-2 + x2, inverse = TRUE))
bdata <- transform(bdata,
y1 = rzoabeta(nn, shape1 = exp(1 + x2), shape2 = exp(2 - x2),
pobs0 = pobs0, pobs1 = pobs1))
summary(bdata)
fit1 <- vglm(y1 ~ x2, zoabetaR(parallel.pobs = TRUE),
data = bdata, trace = TRUE)
coef(fit1, matrix = TRUE)
summary(fit1)
Index

Topic classes lirat, 409

biplot-methods, 115 machinists, 460
calibrate-methods, 132 marital.nz, 466
Coef.qrrvglm-class, 170 melbmaxtemp, 472
Coef.rrvglm-class, 173 olympics, 539
concoef-methods, 186 oxtemp, 551
rrvglm-class, 679 pneumo, 583
SurvS4-class, 734 prats, 622
vgam-class, 771 prinia, 628
vglm-class, 782 ruge, 686
vglmff-class, 788 toxop, 748
Topic datagen ucberk, 759
posbernUC, 602 V1, 763
rcqo, 657 venice, 765
simulate.vlm, 698 waitakere, 795
Topic datasets wine, 804
auuc, 46 Topic distribution
backPain, 48 alaplaceUC, 28
beggs, 49 Benford, 50
bmi.nz, 122 Benini, 51
cfibrosis, 158 Betabinom, 54
chest.nz, 160 Betageom, 65
chinese.nz, 161 Betanorm, 69
coalminers, 166 Biamhcop, 74
corbet, 190 Biclaytoncop, 77
crashes, 198 Bifgmcop, 81
deermice, 210 bilogis, 90
ducklings, 231 Binom2.or, 94
enzyme, 232 Binom2.rho, 99
finney44, 266 Binorm, 106
flourbeetle, 273 Binormcop, 111
gew, 317 Biplackett, 112
grain.us, 328 Bisa, 116
hormone, 347 Bistudentt, 119
hspider, 349 Bort, 124
Huggins89.t1, 352 Card, 144
hunua, 354 Dagum, 205
lakeO, 383 dAR1, 208
leukemia, 393 dgenpois, 214

860
INDEX 861

dhuber, 216 Otzeta, 549

Diffzeta, 217 Paralogistic, 552
dlogF, 226 Pareto, 555
Expectiles-Exponential, 235 ParetoIV, 558
Expectiles-Normal, 237 Perks, 563
Expectiles-sc.t2, 238 PoissonPoints, 588
Expectiles-Uniform, 240 Polono, 591
expgeom, 246 posbernUC, 602
explog, 252 Posbinom, 604
exppois, 256 Posgeom, 608
Felix, 259 Posnegbin, 609
Fisk, 268 Posnorm, 614
Foldnorm, 274 Pospois, 617
Frank, 280 Rayleigh, 652
Frechet, 282 rcqo, 657
GenbetaII, 297 rdiric, 661
gengammaUC, 303 Rice, 668
genray, 306 Simplex, 695
gevUC, 315 Sinmad, 700
Gompertz, 320 Skellam, 703
gpdUC, 326 skewnorm, 706
Gumbel-II, 337 Slash, 709
gumbelUC, 340 Tikuv, 735
Hzeta, 359 Tobit, 738
Inv.gaussian, 366 Topple, 745
Inv.lomax, 369 Triangle, 749
Inv.paralogistic, 372 Truncpareto, 755
Kumar, 380 UtilitiesVGAM, 761
laplaceUC, 387 Yules, 809
lgammaUC, 397 Zabinom, 811
Lindley, 398 Zageom, 815
Lino, 406 Zanegbin, 818
Log, 418 Zapois, 822
loglapUC, 435 Zeta, 827
Lomax, 445 Zibinom, 833
Makeham, 461 Zigeom, 837
Maxwell, 468 Zinegbin, 840
Nakagami, 495 Zipf, 848
Oalog, 515 Zipois, 850
Oapospois, 518 Zoabeta, 856
Oazeta, 521 Topic dplot
Oilog, 523 plotdeplot.lmscreg, 571
Oiposbinom, 526 plotqrrvglm, 573
Oipospois, 530 plotvgam.control, 581
Oizeta, 533 Topic graphs
Oizipf, 536 deplot.lmscreg, 211
Otlog, 544 lvplot.qrrvglm, 453
Otpospois, 546 lvplot.rrvglm, 457
862 INDEX

perspqrrvglm, 566 Topic models

plotqtplot.lmscreg, 574 A1A2A3, 15
plotvgam, 578 AA.Aa.aa, 16
plotvglm, 582 AB.Ab.aB.ab, 18
prplot, 632 ABO, 19
qtplot.gumbel, 640 acat, 20
qtplot.lmscreg, 642 AICvlm, 22
rlplot.gevff, 672 alaplace, 24
trplot.qrrvglm, 753 amlbinomial, 30
Topic htest amlexponential, 32
lrtest, 450 amlnormal, 34
Topic manip amlpoisson, 37
iam, 361 AR1, 39
Topic math aux.posbernoulli.t, 47
cauchit, 147 benini1, 53
cloglog, 164 betabinomial, 57
erf, 233 betabinomialff, 60
expint, 249 betaff, 63
explink, 250 betageometric, 66
fisherz, 267 betaII, 68
foldsqrt, 277 betaprime, 71
golf, 318 betaR, 72
identitylink, 363 biamhcop, 75
kendall.tau, 379 biclaytoncop, 78
lambertW, 384 BICvlm, 80
lerch, 391 bifgmcop, 83
logc, 420 bifgmexp, 84
loge, 422 bifrankcop, 86
logit, 428 bigamma.mckay, 87
logitoffsetlink, 430 bigumbelIexp, 89
loglog, 441 bilogistic, 92
logoff, 444 binom2.or, 95
multilogit, 489 binom2.rho, 100
nbcanlink, 498 binomialff, 103
nbolf, 500 binormal, 108
ordpoisson, 542 binormalcop, 109
pgamma.deriv, 569 biplackettcop, 114
pgamma.deriv.unscaled, 570 biplot-methods, 115
polf, 589 bisa, 117
powerlink, 620 bistudentt, 120
probit, 629 borel.tanner, 123
reciprocal, 665 Brat, 125
rhobit, 667 brat, 127
zeta, 829 bratt, 129
Topic methods calibrate, 131
biplot-methods, 115 calibrate-methods, 132
calibrate-methods, 132 calibrate.qrrvglm, 132
concoef-methods, 186 calibrate.qrrvglm.control, 135
INDEX 863

cao, 136 fisherz, 267

cao.control, 140 fisk, 269
cardioid, 145 fittedvlm, 271
cauchit, 147 foldnormal, 275
cauchy, 149 foldsqrt, 277
cdf.lmscreg, 151 formulavlm, 279
cens.gumbel, 152 frechet, 283
cens.normal, 154 freund61, 284
cens.poisson, 156 gamma1, 287
cgo, 159 gamma2, 288
chisq, 162 gammahyperbola, 290
clo, 163 gammaR, 291
cloglog, 164 garma, 293
Coef, 167 gaussianff, 295
Coef.qrrvglm, 168 genbetaII, 298
Coef.rrvglm, 172 gengamma.stacy, 301
Coef.vlm, 174 genpoisson, 304
coefvgam, 175 genrayleigh, 307
coefvlm, 176 geometric, 309
CommonVGAMffArguments, 177 get.smart, 310
concoef, 184 get.smart.prediction, 311
concoef-methods, 186 gev, 312
confintvglm, 186 golf, 318
constraints, 188 gompertz, 322
cqo, 191 gpd, 323
cratio, 200 grc, 329
cumulative, 201 gumbel, 334
dagum, 206 gumbelII, 339
deplot.lmscreg, 211 guplot, 342
depvar, 212 has.interceptvlm, 344
df.residual, 213 huber2, 350
diffzeta, 219 hyperg, 356
dirichlet, 220 hypersecant, 357
dirmul.old, 221 hzeta, 360
dirmultinomial, 223 identitylink, 363
double.cens.normal, 227 inv.binomial, 364
double.expbinomial, 228 inv.gaussianff, 367
erlang, 234 inv.lomax, 370
expexpff, 242 inv.paralogistic, 373
expexpff1, 244 is.buggy, 375
expgeometric, 248 is.parallel, 376
explink, 250 is.smart, 377
explogff, 253 is.zero, 378
exponential, 254 kumar, 382
exppoisson, 257 laplace, 385
felix, 260 latvar, 389
fff, 261 leipnik, 390
fill, 262 levy, 394
864 INDEX

lgamma1, 395 normal.vcm, 510

lindley, 400 nparam.vlm, 514
linkfun, 401 oalog, 516
linkfun.vglm, 402 oapospoisson, 519
Links, 403 oazeta, 522
lino, 407 oilog, 525
lms.bcg, 410 oiposbinomial, 528
lms.bcn, 412 oipospoisson, 532
lms.yjn, 415 oizeta, 535
logc, 420 oizipf, 537
loge, 422 Opt, 540
logF, 423 ordpoisson, 542
logff, 424 otlog, 545
logistic, 426 otpospoisson, 548
logit, 428 otzeta, 550
logitoffsetlink, 430 paralogistic, 553
loglaplace, 432 paretoff, 556
logLik.vlm, 437 paretoIV, 560
loglinb2, 438 perks, 564
loglinb3, 440 perspqrrvglm, 566
loglog, 441 plotdeplot.lmscreg, 571
lognormal, 442 plotqrrvglm, 573
logoff, 444 plotqtplot.lmscreg, 574
lomax, 446 plotvgam, 578
lqnorm, 448 plotvgam.control, 581
lvplot, 452 plotvglm, 582
lvplot.qrrvglm, 453 poisson.points, 584
lvplot.rrvglm, 457 poissonff, 586
makeham, 462 polf, 589
margeff, 464 posbernoulli.b, 593
Max, 467 posbernoulli.t, 596
maxwell, 470 posbernoulli.tb, 599
mccullagh89, 471 posbinomial, 605
meplot, 473 posnegbinomial, 611
micmen, 475 posnormal, 615
mix2exp, 477 pospoisson, 619
mix2normal, 479 powerlink, 620
mix2poisson, 481 predictqrrvglm, 623
MNSs, 483 predictvglm, 624
model.framevlm, 485 prentice74, 627
model.matrixvlm, 486 probit, 629
multilogit, 489 propodds, 631
multinomial, 491 prplot, 632
nakagami, 496 put.smart, 634
nbcanlink, 498 qrrvglm.control, 635
nbolf, 500 qtplot.gumbel, 640
negbinomial, 502 qtplot.lmscreg, 642
negbinomial.size, 509 quasibinomialff, 644
INDEX 865

quasipoissonff, 646 vgam, 767

Qvar, 648 vgam-class, 771
qvar, 651 VGAM-package, 12
rayleigh, 653 vgam.control, 774
rec.exp1, 663 vglm, 776
rec.normal, 664 vglm.control, 784
reciprocal, 665 vonmises, 791
rhobit, 667 waldff, 797
riceff, 669 weibull.mean, 798
rigff, 671 weibullR, 799
rlplot.gevff, 672 weightsvglm, 802
rrar, 674 wrapup.smart, 805
rrvglm, 676 yeo.johnson, 805
rrvglm.control, 682 yip88, 807
rrvglm.optim.control, 685 yulesimon, 810
s, 687 zabinomial, 813
sc.studentt2, 689 zageometric, 816
Select, 690 zanegbinomial, 819
seq2binomial, 692 zapoisson, 824
setup.smart, 694 zero, 826
simplex, 697 zetaff, 831
simulate.vlm, 698 zibinomial, 834
sinmad, 701 zigeometric, 838
skellam, 704 zinegbinomial, 842
skewnormal, 707 zipebcom, 845
slash, 710 zipf, 849
sm.os, 712 zipoisson, 852
sm.ps, 716 Topic package
smart.expression, 718 VGAM-package, 12
smart.mode.is, 719 Topic programming
smartpred, 720 get.smart, 310
sratio, 722 get.smart.prediction, 311
studentt, 724 iam, 361
summarypvgam, 726 is.smart, 377
summaryvgam, 727 put.smart, 634
summaryvglm, 728 setup.smart, 694
SURff, 730 smart.expression, 718
tikuv, 736 smart.mode.is, 719
tobit, 739 smartpred, 720
Tol, 743 UtilitiesVGAM, 761
topple, 746 wrapup.smart, 805
triangle, 750 zero, 826
trplot, 752 Topic regression
trplot.qrrvglm, 753 A1A2A3, 15
truncweibull, 757 AA.Aa.aa, 16
uninormal, 760 AB.Ab.aB.ab, 18
UtilitiesVGAM, 761 ABO, 19
vcovvlm, 764 acat, 20
866 INDEX

AICvlm, 22 cens.gumbel, 152

alaplace, 24 cens.normal, 154
amlbinomial, 30 cens.poisson, 156
amlexponential, 32 cgo, 159
amlnormal, 34 chisq, 162
amlpoisson, 37 clo, 163
AR1, 39 cloglog, 164
aux.posbernoulli.t, 47 Coef, 167
benini1, 53 Coef.qrrvglm, 168
betabinomial, 57 Coef.rrvglm, 172
betabinomialff, 60 Coef.vlm, 174
betaff, 63 coefvgam, 175
betageometric, 66 coefvlm, 176
betaII, 68 concoef, 184
betaprime, 71 concoef-methods, 186
betaR, 72 confintvglm, 186
biamhcop, 75 constraints, 188
biclaytoncop, 78 cqo, 191
BICvlm, 80 cratio, 200
bifgmcop, 83 cumulative, 201
bifgmexp, 84 dagum, 206
bifrankcop, 86 deplot.lmscreg, 211
bigamma.mckay, 87 depvar, 212
bigumbelIexp, 89 df.residual, 213
bilogistic, 92 diffzeta, 219
binom2.or, 95 dirichlet, 220
binom2.rho, 100 dirmul.old, 221
binomialff, 103 dirmultinomial, 223
binormal, 108 double.cens.normal, 227
binormalcop, 109 double.expbinomial, 228
biplackettcop, 114 erlang, 234
biplot-methods, 115 expexpff, 242
bisa, 117 expexpff1, 244
bistudentt, 120 expgeometric, 248
borel.tanner, 123 explink, 250
Brat, 125 explogff, 253
brat, 127 exponential, 254
bratt, 129 exppoisson, 257
calibrate, 131 felix, 260
calibrate-methods, 132 fff, 261
calibrate.qrrvglm, 132 fill, 262
calibrate.qrrvglm.control, 135 fisherz, 267
cao, 136 fisk, 269
cao.control, 140 fittedvlm, 271
cardioid, 145 foldnormal, 275
cauchit, 147 foldsqrt, 277
cauchy, 149 formulavlm, 279
cdf.lmscreg, 151 frechet, 283
INDEX 867

freund61, 284 lms.bcn, 412

gamma1, 287 lms.yjn, 415
gamma2, 288 logc, 420
gammahyperbola, 290 loge, 422
gammaR, 291 logF, 423
garma, 293 logff, 424
gaussianff, 295 logistic, 426
genbetaII, 298 logit, 428
gengamma.stacy, 301 logitoffsetlink, 430
genpoisson, 304 loglaplace, 432
genrayleigh, 307 logLik.vlm, 437
geometric, 309 loglinb2, 438
get.smart, 310 loglinb3, 440
get.smart.prediction, 311 loglog, 441
gev, 312 lognormal, 442
golf, 318 logoff, 444
gompertz, 322 lomax, 446
gpd, 323 lqnorm, 448
grc, 329 lvplot, 452
gumbel, 334 lvplot.qrrvglm, 453
gumbelII, 339 lvplot.rrvglm, 457
guplot, 342 makeham, 462
has.interceptvlm, 344 margeff, 464
hatvalues, 345 Max, 467
huber2, 350 maxwell, 470
hyperg, 356 mccullagh89, 471
hypersecant, 357 meplot, 473
hzeta, 360 micmen, 475
identitylink, 363 mix2exp, 477
inv.binomial, 364 mix2normal, 479
inv.gaussianff, 367 mix2poisson, 481
inv.lomax, 370 MNSs, 483
inv.paralogistic, 373 multilogit, 489
is.buggy, 375 multinomial, 491
is.parallel, 376 nakagami, 496
is.smart, 377 nbcanlink, 498
is.zero, 378 nbolf, 500
kumar, 382 negbinomial, 502
laplace, 385 negbinomial.size, 509
latvar, 389 normal.vcm, 510
leipnik, 390 nparam.vlm, 514
levy, 394 oalog, 516
lgamma1, 395 oapospoisson, 519
lindley, 400 oazeta, 522
linkfun, 401 oilog, 525
linkfun.vglm, 402 oiposbinomial, 528
lino, 407 oipospoisson, 532
lms.bcg, 410 oizeta, 535
868 INDEX

oizipf, 537 rigff, 671

Opt, 540 rlplot.gevff, 672
ordpoisson, 542 rrar, 674
otlog, 545 rrvglm, 676
otpospoisson, 548 rrvglm.control, 682
otzeta, 550 rrvglm.optim.control, 685
paralogistic, 553 s, 687
paretoff, 556 sc.studentt2, 689
paretoIV, 560 Select, 690
perks, 564 seq2binomial, 692
perspqrrvglm, 566 setup.smart, 694
plotdeplot.lmscreg, 571 simplex, 697
plotqrrvglm, 573 sinmad, 701
plotqtplot.lmscreg, 574 skellam, 704
plotvgam, 578 skewnormal, 707
plotvgam.control, 581 slash, 710
plotvglm, 582 sm.os, 712
poisson.points, 584 sm.ps, 716
poissonff, 586 smart.expression, 718
polf, 589 smart.mode.is, 719
posbernoulli.b, 593 smartpred, 720
posbernoulli.t, 596 sratio, 722
posbernoulli.tb, 599 studentt, 724
posbinomial, 605 summarypvgam, 726
posnegbinomial, 611 summaryvgam, 727
posnormal, 615 summaryvglm, 728
pospoisson, 619 SURff, 730
powerlink, 620 tikuv, 736
predictqrrvglm, 623 tobit, 739
predictvglm, 624 Tol, 743
prentice74, 627 topple, 746
probit, 629 triangle, 750
propodds, 631 trplot, 752
prplot, 632 trplot.qrrvglm, 753
put.smart, 634 truncweibull, 757
qrrvglm.control, 635 uninormal, 760
qtplot.gumbel, 640 UtilitiesVGAM, 761
qtplot.lmscreg, 642 vcovvlm, 764
quasibinomialff, 644 vgam, 767
quasipoissonff, 646 vgam-class, 771
Qvar, 648 VGAM-package, 12
qvar, 651 vgam.control, 774
rayleigh, 653 vglm, 776
rec.exp1, 663 vglm.control, 784
rec.normal, 664 vonmises, 791
reciprocal, 665 vsmooth.spline, 793
rhobit, 667 waldff, 797
riceff, 669 weibull.mean, 798
INDEX 869

weibullR, 799 alaplace, 24

weightsvglm, 802 alaplace1, 31, 33, 36, 38, 414, 432, 433, 699
wrapup.smart, 805 alaplace1 (alaplace), 24
yeo.johnson, 805 alaplace2, 331, 387, 699
yip88, 807 alaplace2 (alaplace), 24
yulesimon, 810 alaplace3, 29, 436
zabinomial, 813 alaplace3 (alaplace), 24
zageometric, 816 alaplaceUC, 28
zanegbinomial, 819 alclevels (crashes), 198
zapoisson, 824 alcoff, 332, 488, 489
zero, 826 alcoff (crashes), 198
zetaff, 831 amlbinomial, 30, 33, 35, 36, 38, 105
zibinomial, 834 amlexponential, 31, 32, 33, 36, 255, 411
zigeometric, 838 amlnormal, 26, 31, 33, 34, 38, 238, 414, 417
zinegbinomial, 842 amlpoisson, 31, 33, 35, 36, 37, 588
zipebcom, 845 anova, 451
zipf, 849 AR1, 39, 4345, 209, 761
zipoisson, 852 AR1EIM, 40, 41, 42
zoabetaR, 858 arima.sim, 41
Topic smooth array, 734
plotvgam, 578 as.character.SurvS4 (SurvS4), 732
plotvgam.control, 581 as.data.frame.SurvS4 (SurvS4), 732
plotvglm, 582 atanh, 268
s, 687 auuc, 46, 332
sm.os, 712 aux.posbernoulli.t, 47, 595
sm.ps, 716
vgam, 767 backPain, 48
vgam-class, 771 beggs, 49
vgam.control, 774 Benford, 50
vsmooth.spline, 793 Benini, 51, 54
Topic survival benini1, 52, 53, 558
SurvS4, 732 Bessel, 792
Topic ts besselI, 670
rrar, 674 Beta, 64, 66, 72, 74, 856858
.Random.seed, 659 beta, 55, 58, 64, 65, 70, 73, 408, 423, 811, 857
[.SurvS4 (SurvS4), 732 Betabinom, 54, 55, 59, 62, 857
betabinomial, 55, 56, 57, 61, 62, 105, 224,
A1A2A3, 15, 17, 18, 20, 484 225, 622, 699
AA.Aa.aa, 16, 16, 18, 20, 484 betabinomialff, 55, 56, 59, 60, 64, 74, 223,
AB.Ab.aB.ab, 16, 17, 18, 20, 484 225, 622, 699
ABO, 1618, 19, 484 betaff, 59, 62, 63, 66, 67, 69, 7274, 300,
acat, 20, 179, 201, 203, 204, 376, 465, 466, 383, 699, 858
493, 724 Betageom, 65
AIC, 23, 81, 437 betageometric, 64, 66, 74, 310
AICqrrvglm (AICvlm), 22 betaII, 64, 68, 74, 208, 271, 300, 371, 374,
AICrrvgam (AICvlm), 22 448, 554, 703
AICrrvglm (AICvlm), 22 Betanorm, 69
AICvgam (AICvlm), 22 betaprime, 64, 71, 74
AICvlm, 22, 80, 81, 515, 598, 606, 607 betaR, 64, 72, 699, 857, 858
870 INDEX

Biamhcop, 74 borel.tanner, 123, 124, 125, 261

biamhcop, 74, 75, 75, 91, 699 Bort, 124
BIC, 81 boxcox, 806
Biclaytoncop, 77 Brat, 125, 128, 130
biclaytoncop, 77, 78, 699 brat, 125, 126, 127, 128130
BICvgam (BICvlm), 80 bratt, 125128, 129
BICvlm, 23, 80, 598, 606, 607 bs, 263, 314, 325, 375, 451, 465, 688, 714,
Bifgmcop, 81 717, 719721
bifgmcop, 76, 82, 83, 85, 87, 699
bifgmexp, 84, 84, 90 calibrate, 131, 134
bifrankcop, 84, 86, 113, 115, 281, 699 calibrate,Coef.qrrvglm-method
bigamma.mckay, 87, 289, 292 (calibrate-methods), 132
calibrate,rrvglm-method
bigumbelIexp, 76, 85, 89
(calibrate-methods), 132
bilogis, 90, 91
calibrate-methods, 132
bilogistic, 91, 92, 427, 699
calibrate.qrrvglm, 132, 132, 135, 136
Binom2.or, 94
calibrate.qrrvglm.control, 133, 134, 135
binom2.or, 94, 95, 95, 97, 102, 439, 827, 846,
cao, 12, 104, 105, 134, 136, 140, 143, 165,
847
193, 196, 289, 356, 405, 451, 506,
Binom2.rho, 99
507, 587, 588, 639, 645, 647, 769,
binom2.Rho (binom2.rho), 100
770
binom2.rho, 97100, 100, 439, 668 cao.control, 137139, 140
binomial, 104, 105, 182 Card, 144
binomialff, 30, 58, 59, 61, 62, 98, 102, 103, cardioid, 144, 145, 145, 792
127, 128, 130, 139, 165, 191, 193, cauchit, 96, 98, 147, 150, 165, 201, 202, 204,
196, 229, 230, 273, 278, 357, 517, 293, 404, 429, 512, 630, 645, 724
520, 523, 527529, 588, 594, 607, Cauchy, 150
640, 645, 647, 660, 694, 698, 699, cauchy, 148, 149, 699
769, 780, 814, 817, 821, 822, 825, cauchy1, 147, 148, 699, 725
835, 836 cauchy1 (cauchy), 149
Binorm, 106 cbind, 189, 691, 692
binormal, 77, 107, 108, 112, 121, 297, 699, cdf.lmscreg, 151, 411, 414, 417
761 cennormal (cens.normal), 154
binormalcop, 77, 109, 112, 380, 699 cens.gumbel, 152, 336
Binormcop, 111 cens.normal, 154, 228, 741, 761
Biplackett, 112 cens.poisson, 156, 588, 733
biplackettcop, 113, 114 cens.rayleigh (rayleigh), 653
biplot,qrrvglm-method (biplot-methods), cfibrosis, 158, 694
115 cgo, 159
biplot,rrvglm-method (biplot-methods), chest.nz, 160, 467
115 chinese.nz, 161
biplot-methods, 115 chisq, 162, 699
biplot.rrvglm, 459, 678 Chisquare, 163
biplot.rrvglm (lvplot.rrvglm), 457 choose, 222, 224
Bisa, 116 clo, 163
bisa, 116, 117, 368 cloglog, 96, 98, 104, 138, 148, 164, 165, 191,
Bistudentt, 119 201, 202, 204, 293, 404, 421, 429,
bistudentt, 109, 119, 120, 120 432, 512, 630, 645, 724, 846, 847
bmi.nz, 36, 122, 411, 417, 467 coalminers, 98, 102, 166
INDEX 871

Coef, 139, 167, 175, 389, 452, 467, 540, 744, (concoef-methods), 186
752 concoef,Coef.rrvglm-method
coef, 167, 168, 174177, 185, 187 (concoef-methods), 186
coef,vgam-method (coefvgam), 175 concoef,qrrvglm-method
Coef.qrrvglm, 135, 136, 168, 168, 169, 171, (concoef-methods), 186
196, 454456, 566, 568, 623, 640 concoef,rrvglm-method
Coef.qrrvglm-class, 170 (concoef-methods), 186
Coef.rrvglm, 168, 172, 174, 459, 678 concoef-method (concoef-methods), 186
Coef.rrvglm-class, 173 concoef-methods, 186
Coef.vlm, 168, 174 confint, 186, 187
coefficients,vgam-method (coefvgam), 175 confint.default, 187
confint.lm, 187
coefvgam, 175, 177
confintrrvglm (confintvglm), 186
coefvlm, 175, 176, 176, 781
confintvgam (confintvglm), 186
colMeans, 600
confintvglm, 186, 729, 765
colnames, 332 constraints, 188, 376, 377, 379, 778, 827
CommonVGAMffArguments, 14, 17, 19, 20, constraints.vlm, 781
2426, 30, 32, 35, 37, 39, 53, 57, 60, cor, 380
63, 68, 71, 73, 76, 78, 83, 86, 87, 92, corbet, 190, 613
96, 101, 103, 104, 108, 110, 114, cqo, 12, 14, 104, 105, 134, 137139, 159, 160,
118, 120, 123, 126, 137, 145, 149, 164, 165, 170, 171, 191, 288, 289,
155, 156, 162, 177, 190, 191, 200, 331, 332, 350, 356, 405, 451, 456,
202, 207, 220, 227, 229, 230, 234, 503, 504, 506, 507, 568, 574, 587,
244, 246, 248, 253, 255, 258, 260, 588, 624, 635, 640, 644, 645, 647,
261, 270, 271, 275, 276, 283, 285, 659, 660, 677, 678, 684, 721, 745,
287289, 292, 299301, 304, 307, 755
309, 313, 315, 322, 324, 335, 339, crashbc (crashes), 198
351, 360, 364, 367, 370, 373, 382, crashes, 198
386, 396, 400, 408, 413, 414, 416, crashf (crashes), 198
423, 424, 426, 427, 438, 440, 443, crashi, 332, 489, 678
447, 448, 463, 470, 471, 476, 478, crashi (crashes), 198
480, 482, 490492, 501, 503505, crashmc (crashes), 198
507, 511, 517, 519, 520, 522, 523, crashp (crashes), 198
525, 528, 532, 535, 538, 553, 556, crashtr (crashes), 198
564, 586, 587, 593, 596, 599, 603, cratio, 21, 179, 200, 203, 204, 376, 465, 466,
606, 611613, 616, 619, 627, 644, 493, 723, 724
646, 654, 664, 670, 689, 697, 702, cumsum, 474
705, 707, 711, 723, 724, 730, 736, cumulative, 21, 80, 179, 187, 201, 201, 278,
740, 741, 747, 757, 760, 762, 777, 318, 319, 346, 376, 427, 465, 466,
778, 791, 797, 798, 800, 807, 810, 492, 493, 500, 501, 584, 589, 590,
813, 814, 816, 818, 820, 822, 631633, 724
824827, 831, 835, 839, 842845, cut, 542, 657, 659, 660
847, 853, 858
concoef, 139, 184, 185 Dagum, 205, 208
concoef,cao-method (concoef-methods), dagum, 69, 205, 206, 206, 271, 300, 371, 374,
186 448, 554, 699, 703
concoef,Coef.cao-method dalap, 436
(concoef-methods), 186 dalap (alaplaceUC), 28
concoef,Coef.qrrvglm-method dAR1, 41, 208
872 INDEX

data.frame, 485 dfelix (Felix), 259

dbenf (Benford), 50 dfisk (Fisk), 268
dbenini (Benini), 51 dfoldnorm (Foldnorm), 274
dbetabinom (Betabinom), 54 dfrechet (Frechet), 282
dbetageom (Betageom), 65 dgenbetaII, 300
dbetanorm (Betanorm), 69 dgenbetaII (GenbetaII), 297
dbiamhcop (Biamhcop), 74 dgengamma.stacy (gengammaUC), 303
dbiclaytoncop, 79 dgenpois, 214, 305
dbiclaytoncop (Biclaytoncop), 77 dgenray, 308
dbifgmcop (Bifgmcop), 81 dgenray (genray), 306
dbifrankcop (Frank), 280 dgeom, 815, 818, 837, 838
dbilogis (bilogis), 90 dgev, 313
dbinom, 55, 527, 604, 812, 814, 834 dgev (gevUC), 315
dbinom2.or (Binom2.or), 94 dgompertz, 323, 342
dbinom2.rho (Binom2.rho), 99 dgompertz (Gompertz), 320
dbinorm (Binorm), 106 dgpd, 324, 325
dbinormcop (Binormcop), 111 dgpd (gpdUC), 326
dbiplackcop (Biplackett), 112 dgumbel, 321, 338
dbisa (Bisa), 116 dgumbel (gumbelUC), 340
dbistudentt, 121 dgumbelII, 336, 340
dbistudentt (Bistudentt), 119 dgumbelII (Gumbel-II), 337
dbort (Bort), 124 dhuber, 216
dcard (Card), 144 dhyper, 356, 357
ddagum (Dagum), 205 dhzeta, 361
ddiffzeta (Diffzeta), 217 dhzeta (Hzeta), 359
deermice, 47, 210, 595, 598, 601 Diffzeta, 217, 219
deexp, 33, 238, 241 diffzeta, 218, 219, 535, 550, 832
deexp (Expectiles-Exponential), 235 digamma, 627
denorm, 31, 36, 236, 241, 414 dimm (UtilitiesVGAM), 761
denorm (Expectiles-Normal), 237 dinv.gaussian (Inv.gaussian), 366
deplot.lmscreg, 152, 211, 411, 414, 417, dinv.lomax (Inv.lomax), 369
572, 643 dinv.paralogistic (Inv.paralogistic),
depvar, 168, 212 372
deunif, 236238 dirichlet, 220, 223, 225, 232, 362, 493, 662,
deunif (Expectiles-Uniform), 240 698, 699
deviance, 214 dirmul.old, 221, 225
dexp, 236 dirmultinomial, 59, 62, 221223, 223, 493
dexpgeom, 249 dkumar, 383
dexpgeom (expgeom), 246 dkumar (Kumar), 380
dexplog, 254 dlaplace (laplaceUC), 387
dexplog (explog), 252 dlgamma (lgammaUC), 397
dexppois, 258 dlind, 401
dexppois (exppois), 256 dlind (Lindley), 398
df.residual, 213 dlino (Lino), 406
df.residual_vlm (df.residual), 213 dlog (Log), 418
dfbeta (hatvalues), 345 dlogF, 226, 424
dfbetavlm (hatvalues), 345 dlogis, 426
dfelix, 261 dloglap, 433
INDEX 873

dloglap (loglapUC), 435 dsinmad (Sinmad), 700

dlomax (Lomax), 445 dskellam, 705
dmakeham, 321, 463, 464 dskellam (Skellam), 703
dmakeham (Makeham), 461 dskewnorm (skewnorm), 706
dmaxwell (Maxwell), 468 dslash (Slash), 709
dnaka (Nakagami), 495 dt, 119, 120, 239, 426
dnbinom, 609, 818, 822, 841 dtikuv, 737
dnorm, 238, 277, 616, 761 dtikuv (Tikuv), 735
doalog (Oalog), 515 dtobit, 740
doapospois (Oapospois), 518 dtobit (Tobit), 738
doazeta (Oazeta), 521 dtopple (Topple), 745
doilog (Oilog), 523 dtriangle (Triangle), 749
doiposbinom (Oiposbinom), 526 dtruncpareto (Truncpareto), 755
doipospois (Oipospois), 530 ducklings, 231
doizeta (Oizeta), 533 dunif, 241
doizipf (Oizipf), 536 dweibull, 757, 758, 799, 802
dotlog (Otlog), 544 dyules, 811
dotpospois (Otpospois), 546 dyules (Yules), 809
dotzeta (Otzeta), 549 dzabinom, 814
double.cens.normal, 155, 227, 665, 741, dzabinom (Zabinom), 811
761 dzageom, 818
double.expbinomial, 105, 228, 748 dzageom (Zageom), 815
dparalogistic (Paralogistic), 552 dzanegbin, 822
dpareto (Pareto), 555 dzanegbin (Zanegbin), 818
dzapois (Zapois), 822
dparetoI (ParetoIV), 558
dzeta, 550, 832
dparetoII (ParetoIV), 558
dzeta (Zeta), 827
dparetoIII (ParetoIV), 558
dzibinom (Zibinom), 833
dparetoIV (ParetoIV), 558
dzigeom (Zigeom), 837
dperks, 565
dzinegbin (Zinegbin), 840
dperks (Perks), 563
dzipf, 850
dpois, 215, 305, 482, 531, 589, 704, 705, 851
dzipf (Zipf), 848
dpois.points (PoissonPoints), 588
dzipois, 808, 823
dpolono (Polono), 591
dzipois (Zipois), 850
dposbern, 595, 598, 605
dzoabeta, 64
dposbern (posbernUC), 602 dzoabeta (Zoabeta), 856
dposbinom (Posbinom), 604 dzoibetabinom (Betabinom), 54
dposgeom (Posgeom), 608
dposnegbin (Posnegbin), 609 edhuber (dhuber), 216
dposnorm (Posnorm), 614 eexp (Expectiles-Exponential), 235
dpospois, 851 enorm (Expectiles-Normal), 237
dpospois (Pospois), 617 enzyme, 232, 477
drayleigh (Rayleigh), 652 erf, 233
drice, 670 erfc (erf), 233
drice (Rice), 668 erlang, 234, 699
dsc.t2, 241, 689, 690 eunif (Expectiles-Uniform), 240
dsc.t2 (Expectiles-sc.t2), 238 exp, 250, 385, 420
dsimplex, 698 Expectiles-Exponential, 235
dsimplex (Simplex), 695 Expectiles-Normal, 237
874 INDEX

Expectiles-sc.t2, 238 freund61, 255, 284

Expectiles-Uniform, 240
expexpff, 242, 245, 246, 289, 292, 799, 802 gam, 139, 769, 780
expexpff1, 243, 244, 244 gamma, 88, 234, 287, 289, 292, 830
expexpint (expint), 249 gamma1, 287, 289, 292, 302, 396, 699
expgeom, 246 gamma2, 88, 139, 191, 196, 288, 292, 302, 318,
expgeometric, 247, 248, 255, 310 319, 497, 637, 640, 658, 660, 699
expint, 249 GammaDist, 289
explink, 250, 423, 648, 649, 651, 760 gammahyperbola, 290
explog, 252 gammaR, 235, 244, 287289, 291, 401, 699
explogff, 252, 253, 255, 425 garma, 293
expm1, 419, 420 gaussian, 296
Exponential, 255, 328 gaussianff, 80, 109, 139, 191, 289, 295, 351,
exponential, 32, 33, 235, 247, 249, 252, 254, 449, 640, 660, 761, 769, 780
254, 258, 286, 291, 387, 478, 663, GenbetaII, 297
699 genbetaII, 64, 69, 74, 206, 208, 269271,
exppois, 256 298, 298, 370, 371, 373, 374, 409,
exppoisson, 257, 257 446448, 553, 554, 701703
extlogit, 63, 64, 146, 305, 315, 358, 404 gengamma.stacy, 301, 303, 304, 396, 628, 699
extlogit (logit), 428 gengammaUC, 303
genpoisson, 215, 304, 588
FDist, 262 genray, 306
Felix, 259 genrayleigh, 306, 307, 307, 655
felix, 124, 259, 260 Geometric, 310
fff, 261 geometric, 66, 67, 247, 249, 309, 699, 817,
fill, 262, 493, 692, 780, 781 818, 839
fill1, 210, 493, 601, 786, 787 get.smart, 310, 311, 312, 634, 719721
fill1 (fill), 262 get.smart.prediction, 311, 311, 721
finney44, 266 gev, 154, 284, 312, 316, 325, 335, 336, 340,
fisherz, 102, 267, 404, 512, 667, 668 342, 343, 766, 799, 800, 802
Fisk, 268, 271 gevff, 316, 336, 363, 674
fisk, 69, 208, 269, 269, 300, 371, 374, 448, gevff (gev), 312
554, 699, 703 gevUC, 315
fitted, 272 gew, 317, 731
fitted.values.vlm (fittedvlm), 271 glm, 177, 296, 311, 404, 507, 695, 720, 769,
fittedvlm, 180, 271, 517, 519, 522, 528, 625, 777, 779, 780, 790, 803
813, 816, 820, 824, 835, 839, 842, golf, 202, 204, 289, 318, 404, 501, 590
844, 853 Gompertz, 320
flourbeetle, 273 gompertz, 321, 322, 464, 699
Foldnorm, 274 gpd, 255, 315, 323, 327, 328, 474, 475, 558,
foldnormal, 274, 275, 275, 708, 761 562, 766
foldsqrt, 277, 404 gpdUC, 326
format.SurvS4 (SurvS4), 732 grain.us, 328, 675
formula, 280, 450, 691, 692 grc, 49, 199, 329, 539, 678
formula.vlm (formulavlm), 279 grep, 181
formulavlm, 279, 344 gumbel, 153, 154, 315, 334, 340343, 641, 642
Frank, 280 Gumbel-II, 337
Frechet, 282 gumbelff, 154, 315, 342, 343, 641
frechet, 282, 283, 283, 315 gumbelff (gumbel), 334
INDEX 875

gumbelII, 338, 339, 699, 799, 802 kendall.tau, 79, 110, 379
gumbelUC, 340 Kumar, 380
guplot, 154, 315, 336, 342, 766 kumar, 64, 74, 381, 382, 699

has.intercept (has.interceptvlm), 344 lakeO, 383

has.interceptvlm, 280, 344 lambertW, 384, 423, 461, 462
hatplot (hatvalues), 345 laplace, 26, 255, 351, 385, 388
hatvalues, 345 laplaceUC, 387
hatvaluesvlm, 781 latvar, 139, 389, 389, 453
hatvaluesvlm (hatvalues), 345 lbeta, 55
hist, 478, 482 leipnik, 390, 472
hormone, 347 lerch, 391, 830
leukemia, 393, 733
hspider, 196, 349
levy, 394
huber1 (huber2), 350
lgamma, 356, 396, 628
huber2, 216, 217, 297, 350, 725, 761
lgamma1, 288, 395, 397, 398, 699
Huggins89.t1, 352, 598, 601
lgamma3, 398, 627, 628, 699
Huggins89table1, 598, 601
lgamma3 (lgamma1), 395
Huggins89table1 (Huggins89.t1), 352
lgammaUC, 397
hunua, 354, 441, 796
Lindley, 398
hyperg, 356
lindley, 288, 399, 400, 699
hypersecant, 226, 357
linkfun, 401, 403, 405
hypersecant01 (hypersecant), 357
linkfun.vglm, 401, 402, 781
Hzeta, 359, 361
Links, 14, 15, 1720, 24, 39, 53, 57, 60, 66,
hzeta, 359, 360, 360, 550, 699, 832 68, 71, 73, 75, 78, 83, 84, 86, 87, 89,
92, 96, 101, 103, 105, 108, 110, 114,
I, 687 117, 120, 123, 145, 147, 149, 153,
iam, 361 154, 156, 157, 165, 178, 182, 183,
identity, 404, 512, 587, 666, 670 200, 202, 204, 207, 220, 222, 224,
identitylink, 363 227, 228, 234, 242, 245, 248, 251,
influence.measures, 346 255, 258, 260, 261, 267, 268, 270,
integrate, 592, 709, 710 275, 277, 278, 283, 285, 287, 288,
interleave.VGAM (UtilitiesVGAM), 761 290, 292, 299, 301, 304, 307, 309,
inv.binomial, 364, 507, 588 312, 318, 319, 322, 324, 335, 339,
Inv.gaussian, 366, 368 351, 356, 358, 360, 363, 364, 367,
inv.gaussianff, 118, 366, 367, 367, 797, 798 370, 373, 382, 386, 390, 394, 395,
Inv.lomax, 369 400, 403, 408, 413, 421424, 426,
inv.lomax, 69, 208, 271, 300, 369, 370, 370, 428432, 441445, 447, 448, 463,
371, 374, 448, 554, 699, 703 470, 471, 476, 477, 479, 482, 484,
Inv.paralogistic, 372, 374 490, 497501, 503, 517, 519, 522,
inv.paralogistic, 69, 208, 271, 300, 528, 553, 556, 561, 564, 585, 586,
371373, 373, 448, 554, 699, 703 588590, 611, 616, 619, 621, 627,
iris, 493 630, 644, 646, 654, 663, 664,
is.buggy, 375, 688, 769, 770 666668, 670, 671, 689, 693, 697,
is.na.SurvS4 (SurvS4), 732 702, 705, 707, 711, 722, 724, 736,
is.parallel, 190, 376 740, 750, 757, 760, 791, 798, 800,
is.smart, 377, 714, 717 807, 810, 813, 816, 820, 824, 831,
is.SurvS4 (SurvS4), 732 835, 838, 842, 845, 849, 853, 858
is.zero, 190, 378 Lino, 406, 409
876 INDEX

lino, 300, 406, 407, 407, 699 446, 554, 699, 703
lirat, 59, 62, 409 lqnorm, 297, 448
lm, 35, 177, 214, 311, 312, 512, 695, 720 lrtest, 450
lms.bcg, 33, 152, 212, 410, 414, 417, 643 lrtest_vglm, 781
lms.bcn, 26, 36, 152, 212, 238, 411, 412, 416, lrtest_vglm (lrtest), 450
417, 473, 643 lv, 389
lms.yjn, 152, 211, 212, 411, 414, 415, 575, lv (latvar), 389
643, 806 lvplot, 115, 139, 195, 390, 452, 456, 459, 752
lms.yjn2 (lms.yjn), 415 lvplot.qrrvglm, 169, 170, 196, 453, 453,
Log, 418 567, 568, 574, 639
log, 250, 385, 420, 422, 423, 425 lvplot.rrvglm, 457, 459, 678, 681
log10, 273
log1mexp, 419 machinists, 460
log1p, 419, 420 magic, 713, 714, 717, 770, 775
log1pexp (log1mexp), 419 Makeham, 461
logc, 404, 420, 423 makeham, 322, 323, 461, 462, 462, 699
loge, 148, 251, 293, 364, 404, 408, 421, 422, margeff, 21, 201, 204, 464, 493, 724
425, 429, 432, 442, 444, 445, 465, marital.nz, 466
499, 512, 587, 588, 621, 647, 798 match.call, 718
logF, 226, 423 Math.SurvS4 (SurvS4), 732
logff, 418, 419, 424, 424, 517, 524, 526, matrix, 94, 99, 182, 234, 509, 734, 741
544546, 613, 699 Max, 139, 467, 541, 745
logistic, 93, 426, 429, 699, 725 Maxwell, 468, 471, 589
logistic1, 204, 426, 429, 699 maxwell, 469, 470, 585, 653, 655, 799, 802
logistic1 (logistic), 426 mccullagh89, 391, 471
logit, 98, 104, 138, 148, 165, 191, 201, 204, median, 387
268, 293, 364, 404, 423, 427, 428, medpolish, 331, 332
431, 432, 490, 512, 630, 645, 724 melbmaxtemp, 472
logitlaplace1, 433 meplot, 325, 473
logitlaplace1 (loglaplace), 432 methods, 726728
logitoffsetlink, 165, 429, 430 micmen, 232, 475, 778
loglaplace, 432 mix2exp, 255, 477
loglaplace1, 436 mix2normal, 479, 483, 761
loglaplace1 (loglaplace), 432 mix2poisson, 478, 481, 481, 588
loglapUC, 435 MNSs, 1618, 20, 483
logLik, 451 model.frame, 485
logLik.vlm, 437 model.framevlm, 485, 486, 487
loglinb2, 98, 102, 438, 441 model.matrix, 213, 486, 487
loglinb3, 439, 440 model.matrix.default, 138, 778
loglog, 361, 404, 421, 423, 441, 503, 512, model.matrixvlm, 485, 486
724, 831 moffset, 332, 488, 488, 578, 656
logneg (loge), 422 multilogit, 178, 183, 401403, 429, 489,
Lognormal, 592 493, 511
lognormal, 442, 592, 699, 799, 802 multinom, 493
logoff, 312, 324, 404, 421, 423, 425, 442, Multinomial, 493
444, 472, 512 multinomial, 21, 127, 128, 201, 203, 204,
Lomax, 445, 448 221, 223, 225, 263, 332, 362, 376,
lomax, 69, 208, 271, 300, 371, 374, 445, 446, 458, 465, 466, 490, 491, 723, 724
INDEX 877

Nakagami, 495 optim, 133, 135, 137, 141, 142, 636, 637, 640,
nakagami, 495, 496, 496 685, 686
nbcanlink, 498, 507, 510 options, 137, 192, 777
nbolf, 202, 204, 319, 404, 500, 507, 590 order, 171
nef.hs (hypersecant), 357 ordered, 21, 200, 203, 543, 631, 723
negbinomial, 139, 181, 191, 196, 309, 310, ordpoisson, 542, 588, 590
331, 365, 461, 498501, 502, 509, Otlog, 419, 515, 516, 524, 544, 546
510, 592, 611613, 637, 640, 647, otlog, 425, 517, 545, 545
658, 660, 678, 699, 820, 822, Otpospois, 518, 519, 546, 548
841844 otpospoisson, 520, 531, 533, 547, 548, 588,
negbinomial.size, 498, 499, 507, 509, 699 620
negidentity, 404 Otzeta, 521, 534, 549, 549, 550, 828
negidentity (identitylink), 363 otzeta, 523, 535, 549, 550, 830, 832
negloge, 289, 292, 404, 503, 505, 561 oxtemp, 315, 551
negloge (loge), 422
negreciprocal, 404 palap (alaplaceUC), 28
negreciprocal (reciprocal), 665 par, 343, 345, 454456, 458, 459, 474,
Normal, 70, 481, 809, 828, 848 566568, 572, 573, 575, 577, 583,
normal.vcm, 178, 183, 490, 510, 761 633, 641, 672, 673, 754, 755
normal1 (uninormal), 760 Paralogistic, 552, 554
nparam (nparam.vlm), 514 paralogistic, 69, 208, 271, 300, 371, 374,
nparam.vlm, 514 448, 552, 553, 553, 699, 703
ns, 263, 314, 325, 375, 451, 688, 720, 721 param.names (UtilitiesVGAM), 761
Pareto, 555, 558, 560
Oalog, 515, 517 paretoff, 325, 555, 556, 556, 561, 562
oalog, 425, 516, 516, 546 ParetoI (ParetoIV), 558
Oapospois, 518, 520 ParetoII (ParetoIV), 558
oapospoisson, 519, 519, 531, 533 paretoII (paretoIV), 560
Oazeta, 521, 523, 828 ParetoIII (ParetoIV), 558
oazeta, 521, 522, 535, 830, 832 paretoIII (paretoIV), 560
Oilog, 419, 523, 526, 545 ParetoIV, 556, 558, 562
oilog, 425, 516, 517, 524, 525, 546 paretoIV, 557560, 560
Oiposbinom, 526 pbenf (Benford), 50
oiposbinomial, 528 pbenini (Benini), 51
Oipospois, 519, 530, 533, 547 pbeta, 857
oipospoisson, 520, 526, 531, 532, 548, 588 pbetabinom (Betabinom), 54
Oizeta, 218, 521, 526, 533, 535, 537, 538, pbetabinom.ab, 55
549, 828 pbetageom (Betageom), 65
oizeta, 523, 535, 536, 537, 550, 830, 832 pbetanorm (Betanorm), 69
Oizipf, 535, 536, 538 pbiamhcop (Biamhcop), 74
oizipf, 537 pbifgmcop (Bifgmcop), 81
oldClass, 734 pbifrankcop (Frank), 280
olym08, 332 pbilogis (bilogis), 90
olym08 (olympics), 539 pbinom, 833
olym12, 332 pbinorm, 102, 109, 110
olym12 (olympics), 539 pbinorm (Binorm), 106
olympics, 539 pbinormcop (Binormcop), 111
Ops.SurvS4 (SurvS4), 732 pbiplackcop (Biplackett), 112
Opt, 139, 468, 540, 745 pbisa, 118
878 INDEX

pbisa (Bisa), 116 plotqrrvglm, 573

pcard (Card), 144 plotqtplot.lmscreg, 574, 643
pdagum (Dagum), 205 plotrcim0, 332, 488, 489, 576, 656
pdiffzeta (Diffzeta), 217 plotvgam, 578, 582, 583, 680, 770, 772, 779,
peexp (Expectiles-Exponential), 235 783
penorm (Expectiles-Normal), 237 plotvgam.control, 580, 581, 583
Perks, 563 plotvglm, 580, 582, 781
perks, 563, 564, 699 pmakeham, 461
persp, 567, 568 pmakeham (Makeham), 461
perspqrrvglm, 196, 456, 566, 752 pmaxwell (Maxwell), 468
peunif (Expectiles-Uniform), 240 pnaka (Nakagami), 495
pexp, 236 pnbinom, 610
pexpgeom (expgeom), 246 pneumo, 21, 201, 203, 204, 492, 583, 724
pexplog (explog), 252 pnorm, 29, 50, 52, 107, 110, 116, 118, 144,
pexppois (exppois), 256 205, 216, 233, 234, 237, 256, 268,
pfisk (Fisk), 268 274, 276, 303, 306, 321, 338, 359,
pfoldnorm (Foldnorm), 274 369, 372, 381, 387, 397, 399, 406,
pfrechet (Frechet), 282 436, 445, 461, 469, 495, 552, 555,
pgamma, 569571 559, 563, 616, 652, 668, 700, 709,
pgamma.deriv, 569, 570, 571, 758 735, 745, 749, 756, 809
pgamma.deriv.unscaled, 570, 570, 758 pnorm2 (Binorm), 106
pgengamma.stacy (gengammaUC), 303 poalog (Oalog), 515
pgenray (genray), 306 poapospois (Oapospois), 518
pgev (gevUC), 315 poazeta (Oazeta), 521
pgompertz (Gompertz), 320 poilog (Oilog), 523
pgpd (gpdUC), 326 poiposbinom (Oiposbinom), 526
pgumbel (gumbelUC), 340 poipospois (Oipospois), 530
pgumbelII (Gumbel-II), 337 Poisson, 828, 848
phuber (dhuber), 216 poisson, 258, 588
phzeta (Hzeta), 359 poisson.points, 471, 584, 588, 589, 654, 655
pinv.gaussian (Inv.gaussian), 366 poissonff, 37, 124, 139, 156, 157, 191, 193,
pinv.lomax (Inv.lomax), 369 196, 255, 305, 332, 365, 461, 483,
pinv.paralogistic (Inv.paralogistic), 507, 510, 531, 533, 542, 543, 585,
372 586, 590, 592, 620, 640, 645, 647,
pkumar (Kumar), 380 660, 699, 705, 764, 769, 780, 808
plaplace (laplaceUC), 387 PoissonPoints, 588
plgamma (lgammaUC), 397 poizeta (Oizeta), 533
plind (Lindley), 398 poizipf (Oizipf), 536
plino (Lino), 406 polf, 202, 204, 319, 404, 501, 543, 588, 589
plog, 218, 545 Polono, 591
plog (Log), 418 poly, 465, 719721
plogis, 429 polya, 699
ploglap (loglapUC), 435 polya (negbinomial), 502
plomax (Lomax), 445 polyaR, 699
plot, 343, 474, 567, 577, 582, 754 polyaR (negbinomial), 502
plot.vgam (plotvgam), 578 posbernoulli.b, 210, 422, 423, 593, 597,
plot.window, 577 598, 601603, 607
plotdeplot.lmscreg, 211, 212, 571 posbernoulli.t, 47, 210, 594, 595, 596, 597,
INDEX 879

599603, 606, 607, 691 645, 724

posbernoulli.tb, 23, 594, 595, 597, 598, profile.glm, 187
599, 603, 606, 607 propodds, 203, 204, 376, 466, 631, 633
posbernUC, 602 prplot, 204, 632
Posbinom, 527, 604 psc.t2 (Expectiles-sc.t2), 238
posbinomial, 23, 105, 182, 527529, 594, psinmad (Sinmad), 700
595, 597, 598, 601, 604, 605, 605, pslash, 710
699, 814, 836 pslash (Slash), 709
Posgeom, 608 pt, 121, 239
Posnegbin, 609 ptikuv (Tikuv), 735
posnegbinomial, 507, 610, 611, 620, 699, ptobit (Tobit), 738
821, 822, 825, 843 ptopple (Topple), 745
Posnorm, 614 ptriangle (Triangle), 749
posnormal, 614, 615, 615, 699, 741, 761 ptruncpareto (Truncpareto), 755
Pospois, 530, 531, 547, 617, 620 punif, 240, 241, 282, 316, 327, 341
pospoisson, 519, 520, 531, 533, 547, 548, put.smart, 634, 719721
588, 613, 618, 619, 699, 808, 825 pyules (Yules), 809
potlog (Otlog), 544 pzabinom (Zabinom), 811
potpospois (Otpospois), 546 pzageom (Zageom), 815
potzeta (Otzeta), 549 pzanegbin (Zanegbin), 818
powerlink, 364, 404, 620, 666 pzapois (Zapois), 822
pparalogistic (Paralogistic), 552 pzeta (Zeta), 827
ppareto (Pareto), 555 pzibinom (Zibinom), 833
pparetoI (ParetoIV), 558 pzigeom (Zigeom), 837
pparetoII (ParetoIV), 558 pzinegbin (Zinegbin), 840
pparetoIII (ParetoIV), 558 pzipf (Zipf), 848
pzipois (Zipois), 850
pparetoIV (ParetoIV), 558
pzoabeta (Zoabeta), 856
pperks (Perks), 563
pzoibetabinom (Betabinom), 54
ppoints, 507
ppolono (Polono), 591 qalap (alaplaceUC), 28
pposbinom (Posbinom), 604 qbenf (Benford), 50
pposgeom (Posgeom), 608 qbenini (Benini), 51
pposnegbin (Posnegbin), 609 qbetanorm (Betanorm), 69
pposnorm (Posnorm), 614 qbisa (Bisa), 116
ppospois (Pospois), 617 qcard (Card), 144
prats, 622 qdagum (Dagum), 205
prayleigh (Rayleigh), 652 qdiffzeta (Diffzeta), 217
predict, 131, 132, 626 qeexp (Expectiles-Exponential), 235
predict.bs, 721 qenorm (Expectiles-Normal), 237
predict.lm, 695 qeunif (Expectiles-Uniform), 240
predict.poly, 721 qexp, 236
predictqrrvglm, 196, 623 qexpgeom (expgeom), 246
predictvglm, 272, 485, 487, 623, 624, 781 qexplog (explog), 252
prentice74, 302, 396, 398, 627 qexppois (exppois), 256
price (Rice), 668 qfisk (Fisk), 268
prinia, 595, 598, 601, 628 qfoldnorm (Foldnorm), 274
probit, 96, 98, 148, 165, 201, 202, 204, 273, qfrechet (Frechet), 282
293, 364, 404, 429, 432, 512, 629, qgengamma.stacy (gengammaUC), 303
880 INDEX

qgenray (genray), 306 qposgeom (Posgeom), 608

qgev (gevUC), 315 qposnegbin (Posnegbin), 609
qgompertz (Gompertz), 320 qposnorm (Posnorm), 614
qgpd (gpdUC), 326 qpospois (Pospois), 617
qgumbel (gumbelUC), 340 qrayleigh (Rayleigh), 652
qgumbelII (Gumbel-II), 337 qrice (Rice), 668
qhuber (dhuber), 216 qrrvglm.control, 141, 142, 192, 193, 195,
qhzeta (Hzeta), 359 196, 288, 503, 635, 659, 660
qinv.lomax (Inv.lomax), 369 qsc.t2 (Expectiles-sc.t2), 238
qinv.paralogistic (Inv.paralogistic), qsinmad (Sinmad), 700
372 qt, 239
qkumar (Kumar), 380 qtikuv (Tikuv), 735
qlaplace (laplaceUC), 387 qtobit (Tobit), 738
qlgamma (lgammaUC), 397 qtopple (Topple), 745
qlino (Lino), 406 qtplot.gumbel, 640
qlog (Log), 418 qtplot.gumbelff (qtplot.gumbel), 640
qloglap (loglapUC), 435 qtplot.lmscreg, 152, 212, 411, 414, 417,
qlomax (Lomax), 445 576, 642
qmakeham (Makeham), 461 qtriangle (Triangle), 749
qmaxwell (Maxwell), 468 qtruncpareto (Truncpareto), 755
qnaka (Nakagami), 495 quantile, 179, 180, 478, 479, 482, 505, 711,
qnbinom, 503, 507 713
quasibinomial, 182, 645
qnorm, 29, 50, 52, 116, 144, 205, 216, 237,
quasibinomialff, 105, 191, 193, 588, 644,
256, 268, 274, 303, 306, 321, 338,
647
359, 369, 372, 381, 387, 397, 399,
quasipoisson, 647
406, 436, 445, 461, 469, 495, 552,
quasipoissonff, 191, 193, 504, 506, 507,
555, 559, 563, 652, 668, 700, 709,
510, 588, 645, 646
735, 745, 749, 756, 809
qunif, 240, 241, 282, 316, 327, 341
qoalog (Oalog), 515
Qvar, 251, 332, 648, 651, 761
qoapospois (Oapospois), 518
qvar, 649, 651
qoazeta (Oazeta), 521
qyules (Yules), 809
qoilog (Oilog), 523
qzabinom (Zabinom), 811
qoiposbinom (Oiposbinom), 526
qzageom (Zageom), 815
qoipospois (Oipospois), 530
qzanegbin (Zanegbin), 818
qoizeta (Oizeta), 533
qzapois (Zapois), 822
qoizipf (Oizipf), 536 qzeta (Zeta), 827
qotlog (Otlog), 544 qzibinom (Zibinom), 833
qotpospois (Otpospois), 546 qzigeom (Zigeom), 837
qotzeta (Otzeta), 549 qzinegbin (Zinegbin), 840
qparalogistic (Paralogistic), 552 qzipf (Zipf), 848
qpareto (Pareto), 555 qzipois (Zipois), 850
qparetoI (ParetoIV), 558 qzoabeta (Zoabeta), 856
qparetoII (ParetoIV), 558
qparetoIII (ParetoIV), 558 ralap, 26
qparetoIV, 561 ralap (alaplaceUC), 28
qparetoIV (ParetoIV), 558 Rayleigh, 469, 652, 655
qperks (Perks), 563 rayleigh, 307, 308, 469, 471, 497, 585, 652,
qposbinom (Posbinom), 604 653, 653, 654, 670, 699, 799, 802
INDEX 881

rbenf (Benford), 50 rexplog (explog), 252

rbenini (Benini), 51 rexppois (exppois), 256
rbetabinom (Betabinom), 54 rfisk (Fisk), 268
rbetageom, 64, 67, 74, 310 rfoldnorm, 277
rbetageom (Betageom), 65 rfoldnorm (Foldnorm), 274
rbetanorm, 64, 74 rfrechet, 284
rbetanorm (Betanorm), 69 rfrechet (Frechet), 282
rbiamhcop, 76 rgamma, 235, 287, 289, 292
rbiamhcop (Biamhcop), 74 rgengamma.stacy, 302
rbiclaytoncop, 79 rgengamma.stacy (gengammaUC), 303
rbiclaytoncop (Biclaytoncop), 77 rgenray (genray), 306
rbifgmcop, 84 rgeom, 608, 839
rbifgmcop (Bifgmcop), 81 rgev, 315
rbifrankcop, 86, 87 rgev (gevUC), 315
rbifrankcop (Frank), 280 rgompertz (Gompertz), 320
rbilogis, 76, 93 rgpd, 325
rbilogis (bilogis), 90 rgpd (gpdUC), 326
rbinom, 529, 605, 836 rgumbel, 154, 336
rbinom2.or, 97, 98 rgumbel (gumbelUC), 340
rbinom2.or (Binom2.or), 94 rgumbelII (Gumbel-II), 337
rbinom2.rho, 102 rhobit, 102, 108, 267, 268, 304, 305, 404,
rbinom2.rho (Binom2.rho), 99 472, 512, 667
rbinorm (Binorm), 106 rhuber, 351
rbinormcop, 110 rhuber (dhuber), 216
rbinormcop (Binormcop), 111 rhzeta (Hzeta), 359
rbiplackcop, 114, 115 Rice, 668
rbiplackcop (Biplackett), 112 riceff, 655, 668, 669, 669, 699
rbisa (Bisa), 116 rig, 698
rbort, 124, 125 rigff, 671
rbort (Bort), 124 rinv.gaussian, 798
rcard, 146 rinv.gaussian (Inv.gaussian), 366
rcard (Card), 144 rinv.lomax (Inv.lomax), 369
Rcim, 332, 488, 489, 578, 655 rinv.paralogistic (Inv.paralogistic),
rcim, 12, 14, 49, 199, 251, 488, 489, 577, 578, 372
648, 649, 651, 656, 745, 789, 790 rkumar (Kumar), 380
rcim (grc), 329 rlaplace, 387
rcqo, 196, 640, 657 rlaplace (laplaceUC), 387
rdagum (Dagum), 205 rlgamma, 396
rdiffzeta (Diffzeta), 217 rlgamma (lgammaUC), 397
rdiric, 221, 661 rlind (Lindley), 398
rec.exp1, 663 rlino (Lino), 406
rec.normal, 664 rlnorm, 443
reciprocal, 293, 404, 503, 505, 665 rlog, 425, 524, 545
reexp (Expectiles-Exponential), 235 rlog (Log), 418
renorm (Expectiles-Normal), 237 rlogis, 91, 427
reunif (Expectiles-Uniform), 240 rloglap (loglapUC), 435
rexp, 236, 255, 478 rlomax (Lomax), 445
rexpgeom (expgeom), 246 rlplot.gev (rlplot.gevff), 672
882 INDEX

rlplot.gevff, 315, 672 rrvglm, 12, 14, 21, 49, 71, 73, 93, 104, 105,
rmakeham (Makeham), 461 138, 146, 163, 164, 172174, 196,
rmaxwell (Maxwell), 468 199, 200, 202, 203, 221, 222, 224,
rnaka, 497 296, 297, 330, 332, 347, 356, 368,
rnaka (Nakagami), 495 391, 405, 411, 414, 427, 439, 440,
rnbinom, 503, 505507, 509, 510, 610, 613, 451, 459, 470, 472, 492, 493, 506,
841 507, 509, 526, 532, 535, 538, 579,
RNG, 699 585, 587, 588, 612, 619, 645, 647,
rnorm, 77, 107, 111, 237, 274, 614, 738, 739, 654, 675, 676, 679, 681, 683, 684,
741 689, 721, 723, 781, 790, 792, 807,
roalog (Oalog), 515 854, 855
roapospois (Oapospois), 518 rrvglm-class, 679
roazeta (Oazeta), 521 rrvglm.control, 195, 330, 332, 459, 639,
roilog (Oilog), 523 676678, 682, 686
roiposbinom, 529 rrvglm.optim.control, 683, 684, 685
roiposbinom (Oiposbinom), 526 rsc.t2 (Expectiles-sc.t2), 238
roipospois (Oipospois), 530 rsimplex (Simplex), 695
roizeta (Oizeta), 533 rsinmad, 180
roizipf (Oizipf), 536 rsinmad (Sinmad), 700
rotlog (Otlog), 544 rskellam (Skellam), 703
rotpospois (Otpospois), 546 rskewnorm (skewnorm), 706
rotzeta (Otzeta), 549 rslash, 712
Round, 25, 30, 32, 35, 37 rslash (Slash), 709
rownames, 332 rtikuv (Tikuv), 735
rparalogistic (Paralogistic), 552 rtobit, 741
rpareto (Pareto), 555 rtobit (Tobit), 738
rparetoI (ParetoIV), 558 rtopple (Topple), 745
rparetoII (ParetoIV), 558 rtriangle (Triangle), 749
rparetoIII (ParetoIV), 558 rtruncpareto (Truncpareto), 755
rparetoIV (ParetoIV), 558 ruge, 588, 686
rperks (Perks), 563 runif, 52, 55, 65, 70, 74, 82, 94, 99, 113, 116,
rpois, 255, 483, 618, 844, 855 144, 218, 239241, 281, 282, 303,
rpois.points (PoissonPoints), 588 321, 338, 359, 387, 397, 399, 406,
rpolono (Polono), 591 418, 461, 495, 544, 549, 555, 559,
rposbern, 595, 598 563, 602, 604, 608, 609, 618, 652,
rposbern (posbernUC), 602 668, 696, 704, 706, 709, 735, 749,
rposbinom, 812 756, 833, 837, 840
rposbinom (Posbinom), 604 rweibull, 180
rposgeom, 815 ryules, 811
rposgeom (Posgeom), 608 ryules (Yules), 809
rposnegbin, 613, 819, 822 rzabinom (Zabinom), 811
rposnegbin (Posnegbin), 609 rzageom (Zageom), 815
rposnorm (Posnorm), 614 rzanegbin (Zanegbin), 818
rpospois, 825 rzapois, 825
rpospois (Pospois), 617 rzapois (Zapois), 822
rrar, 674 rzeta (Zeta), 827
rrayleigh (Rayleigh), 652 rzibinom, 835, 836
rrice (Rice), 668 rzibinom (Zibinom), 833
INDEX 883

rzigeom, 839 sm.ns (smartpred), 720

rzigeom (Zigeom), 837 sm.os, 12, 688, 712, 716, 717, 726, 727,
rzinegbin, 851 767770, 775, 795
rzinegbin (Zinegbin), 840 sm.poly, 311
rzipf (Zipf), 848 sm.poly (smartpred), 720
rzipois, 841 sm.ps, 12, 375, 688, 713, 714, 716, 721, 726,
rzipois (Zipois), 850 727, 767770, 775
rzoabeta (Zoabeta), 856 sm.scale (smartpred), 720
rzoibetabinom (Betabinom), 54 smart.expression, 718, 720, 721
smart.mode.is, 719, 721
s, 12, 176, 263, 314, 325, 375, 687, 687, 713, smartpred, 138, 192, 485, 487, 625, 626, 677,
714, 717, 727, 728, 767770, 774, 678, 714, 717, 720, 769, 778, 780,
775, 795 781
sc.studentt2, 26, 239, 689, 725 smooth.spline, 713, 714, 794, 795
scale, 138, 577, 639, 720, 721 splineDesign, 713, 714, 717
Select, 263, 332, 598, 601, 690, 786, 787 sratio, 21, 179, 200, 201, 203, 204, 376, 465,
seq2binomial, 105, 158, 692 466, 493, 722
set.seed, 139, 193, 196, 659 structure, 734
setMethod, 465, 729 studentt, 150, 426, 699, 724, 761
setup.smart, 694, 721, 805 studentt2, 689, 690, 699
ships, 649, 651 studentt2 (studentt), 724
show,SurvS4-method (SurvS4-class), 734 studentt3, 699
show.summary.pvgam (summarypvgam), 726 studentt3 (studentt), 724
show.summary.vgam (summaryvgam), 727 subset, 691, 692, 768
show.summary.vglm (summaryvglm), 728 subsetcol (Select), 690
show.SurvS4 (SurvS4), 732 sum, 437
Simplex, 695 summary, 139, 729
simplex, 105, 221, 672, 696, 697, 699 summary.gam, 726728
simulate, 698, 789 summary.glm, 727729
simulate.vlm, 26, 59, 62, 64, 74, 76, 84, 87, summary.lm, 727729
105, 150, 208, 235, 255, 271, 288, Summary.SurvS4 (SurvS4), 732
289, 292, 302, 310, 323, 360, 361, summarypvgam, 714, 717, 726, 728, 770
371, 374, 383, 401, 425, 427, 443, summaryvgam, 727, 727, 770
448, 464, 507, 510, 517, 520, 523, summaryvglm, 187, 726, 727, 728, 765
533, 546, 548, 565, 588, 607, 613, SURff, 297, 318, 730, 761
620, 655, 670, 698, 703, 712, 725, Surv, 156
751, 761, 811, 818, 822, 825, 839, survreg, 733, 734
850, 855 SurvS4, 156, 157, 732, 734, 801
Sinmad, 700, 703 SurvS4-class, 734
sinmad, 69, 180, 208, 271, 300, 371, 374, 448,
554, 699701, 701 TDist, 725
Skellam, 703 term.names (formulavlm), 279
skellam, 588, 704, 704 term.namesvlm (formulavlm), 279
skewnorm, 706, 708 terms, 680, 773, 780, 783
skewnormal, 277, 706, 707, 707, 761 Tikuv, 735
Slash, 709 tikuv, 735, 736
slash, 699, 709, 710, 710 title, 567, 568, 641, 672, 754, 755
sm.bs, 311 Tobit, 738
sm.bs (smartpred), 720 tobit, 155, 228, 617, 738, 739, 739, 761
884 INDEX

Tol, 468, 541, 743 150, 151, 153, 155, 156, 163, 175,
Topple, 745, 747, 751 176, 182, 189, 200, 202, 203, 207,
topple, 745, 746, 746, 749 211, 219, 221, 222, 224, 227, 235,
toxop, 230, 748 243, 245, 248, 254, 255, 258, 260,
trapO, 139, 196, 384 261, 263, 270, 276, 284, 285, 287,
Triangle, 746, 747, 749, 751 289, 291, 292, 296, 297, 299, 301,
triangle, 699, 749, 750 305, 308, 309, 313, 314, 319, 322,
trigamma, 504 325, 335, 339, 351, 356, 358, 361,
trplot, 453, 752 365, 368, 371, 374, 375, 382, 386,
trplot.qrrvglm, 196, 752, 753 389, 391, 394, 396, 400, 405, 408,
truncgeometric (geometric), 309 411, 413, 414, 417, 424, 425, 427,
Truncpareto, 558, 755 433, 439, 440, 443, 447, 449, 451,
truncpareto, 756 452, 463, 465, 470, 472, 476, 478,
truncpareto (paretoff), 556 480, 482, 484, 492, 497, 501, 506,
truncweibull, 571, 757, 799, 802 509, 512, 514, 517, 520, 523, 526,
TypicalVGAMfamilyFunction, 14, 405 529, 532, 535, 538, 543, 546, 548,
TypicalVGAMfamilyFunction 550, 554, 557, 562, 565, 575, 579,
(CommonVGAMffArguments), 177 580, 585, 587, 590, 594, 597, 600,
TypicalVGAMlink (Links), 403 606, 612, 616, 619, 627, 631, 641,
643, 645, 647, 654, 663, 664, 670,
ucberk, 759 671, 675, 677, 687689, 693, 697,
Unif, 515, 518, 521 699, 702, 705, 708, 711, 713, 714,
Uniform, 469, 524, 533, 536, 745 716, 717, 721, 723, 725728, 730,
uninormal, 39, 107, 109, 155, 163, 227, 228, 737, 740, 747, 751, 758, 760, 767,
251, 276, 277, 297, 331, 347, 351, 774776, 781, 789, 790, 792, 794,
443, 481, 511, 512, 617, 630, 648, 797, 799, 800, 803, 807, 811, 814,
649, 651, 665, 699, 708, 725, 731, 817, 821, 825, 832, 835, 839, 843,
737, 741, 760 846, 850, 854
uniroot, 35, 274, 275, 495, 735 vgam-class, 771
update, 451
VGAM-package, 12
uqo, 193, 196
uqo (grc), 329 vgam.control, 141, 768770, 774, 774
UtilitiesVGAM, 183, 761 vglm, 12, 14, 1619, 21, 26, 31, 33, 35, 37, 40,
43, 53, 58, 61, 64, 67, 68, 71, 73, 76,
V1, 588, 763 78, 80, 83, 85, 86, 88, 89, 93, 97,
valt.control, 683 102, 104, 108, 110, 114, 118, 121,
vcov, 765 123, 127, 129, 146, 150, 151, 153,
vcov (vcovvlm), 764 155, 156, 163, 168, 176, 177, 179,
vcovrrvglm, 765 182, 186, 187, 189, 195, 196, 200,
vcovrrvglm (vcovvlm), 764 202, 203, 207, 211, 213, 214, 219,
vcovvlm, 187, 729, 764, 765 221, 222, 224, 227, 229, 235, 243,
vector, 734 245, 248, 254, 255, 258, 260, 261,
venice, 154, 315, 336, 343, 765 263, 270, 276, 284, 285, 287, 289,
venice90 (venice), 765 291294, 296, 297, 299, 301, 305,
VGAM (VGAM-package), 12 308, 309, 313, 314, 319, 322, 325,
vgam, 12, 14, 1619, 21, 26, 31, 33, 35, 37, 40, 330, 332, 335, 339, 344346, 351,
53, 64, 67, 68, 71, 73, 76, 78, 83, 85, 356, 358, 361, 365, 368, 371,
86, 88, 89, 93, 97, 102, 104, 108, 374379, 382, 386, 389, 391, 394,
110, 114, 118, 121, 123, 138, 146, 396, 400, 401, 403, 405, 408, 411,
INDEX 885

414, 417, 424, 425, 427, 433, 439, zageometric, 310, 608, 699, 815, 816, 839
440, 443, 447, 449452, 463, 465, zageometricff, 699
466, 470, 472, 476, 478, 480, 482, zageometricff (zageometric), 816
484, 492, 497, 501, 506, 507, 509, Zanegbin, 818
512, 517, 520, 523, 526, 529, 532, zanegbinomial, 610, 613, 699, 819, 819, 844
535, 538, 543, 546, 548, 550, 554, zanegbinomialff, 699
557, 562, 565, 575, 579, 580, 583, zanegbinomialff (zanegbinomial), 819
585, 587, 590, 594, 597, 600, 606, Zapois, 822
612, 616, 619, 625627, 631, 641, zapoisson, 618, 620, 699, 808, 823, 824, 854,
643, 645, 647649, 654, 663, 664, 855
670672, 674678, 684, 687689, zapoissonff, 699
692, 693, 697, 699, 702, 705, 708, zapoissonff (zapoisson), 824
711, 714, 720, 721, 723, 725, zero, 190, 826
728731, 733, 734, 737, 740, 747, Zeta, 534, 827
751, 758, 760, 764, 768, 770, zeta, 219, 360, 361, 393, 521, 522, 535, 550,
773776, 776, 784, 787, 789, 790, 828, 829, 832
792, 797, 799, 800, 803, 807, 811, zetaff, 218, 219, 360, 361, 523, 534, 535,
814, 817, 821, 825, 832, 835, 839, 549, 550, 828, 830, 831, 848850
843, 846, 850, 854 Zibinom, 833
vglm-class, 782 zibinomial, 105, 605, 812814, 833, 834, 834
vglm.control, 12, 41, 97, 187, 263, 300, 316, zibinomialff (zibinomial), 834
332, 493, 580, 639, 680, 683, 684, Zigeom, 837
730, 769, 772, 775, 776, 778, zigeometric, 310, 608, 699, 815, 818, 838,
780782, 784, 844, 847 838
vglmff-class, 788 zigeometricff, 699
vonmises, 146, 791 zigeometricff (zigeometric), 838
vsmooth.spline, 25, 413, 688, 770, 776, 793 Zinegbin, 840, 844
zinegbinomial, 181, 507, 610, 678, 699, 822,
waitakere, 355, 795 841, 842, 854
waldff, 367, 368, 797 zinegbinomialff (zinegbinomial), 842
waldtest, 451 zipebcom, 98, 845, 855
weibull.mean, 798, 802 Zipf, 536, 537, 848
weibullR, 180, 244, 315, 339, 340, 655, 757, zipf, 218, 219, 537, 538, 550, 699, 832, 848,
758, 798, 799, 799 849
weightsvglm, 779, 802 Zipois, 808, 833, 837, 841, 850, 854, 855
wine, 804 zipoisson, 330, 332, 525, 529, 532, 533, 535,
wrapup.smart, 721, 805 538, 588, 618620, 678, 699, 807,
808, 822, 825, 836, 839, 844847,
xs.nz, 492 851, 852, 858
zipoissonff, 330332, 699
yeo.johnson, 805 zipoissonff (zipoisson), 852
yip88, 807, 825, 854, 855 Zoabeta, 56, 856, 858
Yules, 809 zoabetaR, 857, 858
yulesimon, 699, 809, 810, 810

Zabinom, 811
zabinomial, 605, 813, 836
zabinomialff (zabinomial), 813
Zageom, 815

GCDkit Manual
No ratings yet
GCDkit Manual
342 pages
Mess PDF
100% (1)
Mess PDF
94 pages
Reading Comprehension 1
100% (4)
Reading Comprehension 1
9 pages
Delta Neutral Vega Long
No ratings yet
Delta Neutral Vega Long
6 pages
General Description: ISO 17987/LIN 2.x/SAE J2602 Transceiver
100% (1)
General Description: ISO 17987/LIN 2.x/SAE J2602 Transceiver
24 pages
Reading Comprehension 1
No ratings yet
Reading Comprehension 1
12 pages
Bharat Petroleum Corporation Limited: Sv/Tv/Ta-Form
100% (2)
Bharat Petroleum Corporation Limited: Sv/Tv/Ta-Form
1 page
Pediatric Demyelinating Diseases of The Central Nervous System and Their Mimics
100% (1)
Pediatric Demyelinating Diseases of The Central Nervous System and Their Mimics
338 pages
Boom Placer Spare Parts Manual Sp1420 RMC - 80009056 - 0, Edition - Dec '18
No ratings yet
Boom Placer Spare Parts Manual Sp1420 RMC - 80009056 - 0, Edition - Dec '18
104 pages
The College Walkthrough Ver 0.39
No ratings yet
The College Walkthrough Ver 0.39
22 pages
Actuar
No ratings yet
Actuar
142 pages
Safe Work Procedure
No ratings yet
Safe Work Procedure
2 pages
Car
No ratings yet
Car
169 pages
Pracma
No ratings yet
Pracma
392 pages
Pracma
No ratings yet
Pracma
362 pages
Vares
No ratings yet
Vares
163 pages
Evd PDF
No ratings yet
Evd PDF
88 pages
Vegan
No ratings yet
Vegan
295 pages
GCDkit Manual PDF
No ratings yet
GCDkit Manual PDF
282 pages
Package Astsa': R Topics Documented
No ratings yet
Package Astsa': R Topics Documented
99 pages
Car PDF
No ratings yet
Car PDF
156 pages
Package SM': R Topics Documented
No ratings yet
Package SM': R Topics Documented
74 pages
Daewr
No ratings yet
Daewr
79 pages
Package Car': October 9, 2012
No ratings yet
Package Car': October 9, 2012
154 pages
Rob Compositions
No ratings yet
Rob Compositions
187 pages
Data Mining 12
No ratings yet
Data Mining 12
94 pages
Actuar
No ratings yet
Actuar
145 pages
GCDkit Manual
No ratings yet
GCDkit Manual
272 pages
AgroReg 1.2.7
No ratings yet
AgroReg 1.2.7
110 pages
Package Boot': August 29, 2013
No ratings yet
Package Boot': August 29, 2013
117 pages
AICcmodavg PDF
No ratings yet
AICcmodavg PDF
212 pages
Labdsv
No ratings yet
Labdsv
67 pages
Lesson Planning in Teaching
No ratings yet
Lesson Planning in Teaching
10 pages
Asbio Manual
No ratings yet
Asbio Manual
145 pages
Physiology Pneumonics
No ratings yet
Physiology Pneumonics
9 pages
Name: - Date: - Grade & Section: - Score
No ratings yet
Name: - Date: - Grade & Section: - Score
2 pages
Sim Diffproc
No ratings yet
Sim Diffproc
176 pages
Vikas Gurjar20241226045412
No ratings yet
Vikas Gurjar20241226045412
1 page
40 câu hỏi giao tiếp
No ratings yet
40 câu hỏi giao tiếp
17 pages
Facts On Water Supply and Sanitation in Sweden (English)
0% (1)
Facts On Water Supply and Sanitation in Sweden (English)
27 pages
Forrester - Enabling Smarter Procurement
No ratings yet
Forrester - Enabling Smarter Procurement
15 pages
WELDING Presentation
No ratings yet
WELDING Presentation
21 pages
Synthetic
No ratings yet
Synthetic
6 pages
Cambridge First B2, Test Book 1, Test 2, Speaking Script For Exam Practice.
No ratings yet
Cambridge First B2, Test Book 1, Test 2, Speaking Script For Exam Practice.
4 pages
A FAREWELL TO VIROLOGY (EXPERT EDITION) DR Mark Bailey
No ratings yet
A FAREWELL TO VIROLOGY (EXPERT EDITION) DR Mark Bailey
67 pages
ASN RB Nov Dec 2023 1
No ratings yet
ASN RB Nov Dec 2023 1
51 pages
Major Assignment 1
No ratings yet
Major Assignment 1
4 pages
Arroyo Housing Project
No ratings yet
Arroyo Housing Project
20 pages
ML22164A439
No ratings yet
ML22164A439
12 pages
JPPPF June2025 111 02 13 26 Dwi+Ambar
No ratings yet
JPPPF June2025 111 02 13 26 Dwi+Ambar
14 pages
Lecture 1a
No ratings yet
Lecture 1a
22 pages
Basic Tools in Routine Evaluation of Cardiac Patients
No ratings yet
Basic Tools in Routine Evaluation of Cardiac Patients
26 pages
Catalogue Wireframe
No ratings yet
Catalogue Wireframe
11 pages
Master Copy - ARCH 2023-2024
No ratings yet
Master Copy - ARCH 2023-2024
1 page
FAC3761 - Exam Prep - Mock Question Paper - Suggested Solution
No ratings yet
FAC3761 - Exam Prep - Mock Question Paper - Suggested Solution
9 pages
Yorrick - Player Sheet
No ratings yet
Yorrick - Player Sheet
2 pages
Ficha Técnica de Balatas-001 Noviembre 2011
No ratings yet
Ficha Técnica de Balatas-001 Noviembre 2011
4 pages
Airlangga Career Fair Online Ticketing
No ratings yet
Airlangga Career Fair Online Ticketing
1 page
Homework 6: Math 308 Due: 8 March
No ratings yet
Homework 6: Math 308 Due: 8 March
3 pages
Back in the Real World (Stone Angel #2)
From Everand
Back in the Real World (Stone Angel #2)
Marvin H. Albert
4.5/5 (2)
Fly Fishing Guide to the Battenkill: Complete Guide to Locations, Hatches, and History
From Everand
Fly Fishing Guide to the Battenkill: Complete Guide to Locations, Hatches, and History
Doug Lyons
No ratings yet
Coaching in the Helix. How self-development impacts the quality of our coaching
From Everand
Coaching in the Helix. How self-development impacts the quality of our coaching
Elisabeth Goodman
No ratings yet
Deadline Istanbul (The Elizabeth Darcy Series)
From Everand
Deadline Istanbul (The Elizabeth Darcy Series)
Peggy Hanson
5/5 (1)
Deadline Yemen (The Elizabeth Darcy Series)
From Everand
Deadline Yemen (The Elizabeth Darcy Series)
Peggy Hanson
5/5 (1)
Trouble in Tahiti: Blood on the Hibiscus
From Everand
Trouble in Tahiti: Blood on the Hibiscus
Hayford Peirce
No ratings yet
The Celestial Womb: Poetry for the Divine Feminine
From Everand
The Celestial Womb: Poetry for the Divine Feminine
Ayleen Guzman
No ratings yet
The Unstuck Book
From Everand
The Unstuck Book
Conor McCarthy
No ratings yet
The Gracious Lily Affair
From Everand
The Gracious Lily Affair
Van Wyck Mason
5/5 (1)
Alienist: A Gerald Knave Science Fiction Adventure
From Everand
Alienist: A Gerald Knave Science Fiction Adventure
Laurence M. Janifer
No ratings yet
Intrusion Detection Honeypots
From Everand
Intrusion Detection Honeypots
Chris Sanders
3/5 (2)
Knit Soxx for Everyone: 25 Colorful Sock Patterns for the Whole Family
From Everand
Knit Soxx for Everyone: 25 Colorful Sock Patterns for the Whole Family
Kerstin Balke
4.5/5 (2)
Between River and Mountain
From Everand
Between River and Mountain
Sally Walker Brinkmann
No ratings yet
Osama the Gun
From Everand
Osama the Gun
Norman Spinrad
5/5 (1)
Lords of Creation
From Everand
Lords of Creation
Eando Binder
2.5/5 (3)
Gray Hat Hacking the Ethical Hacker's
From Everand
Gray Hat Hacking the Ethical Hacker's
Çağatay Şanlı
5/5 (1)
Life Ain't A Dress Rehearsal: Lives in Poetry
From Everand
Life Ain't A Dress Rehearsal: Lives in Poetry
Cecil D. Haas
No ratings yet
Bimbo Heaven: Stone Angel #7
From Everand
Bimbo Heaven: Stone Angel #7
Marvin H. Albert
No ratings yet
The Last Smile: Stone Angel #5
From Everand
The Last Smile: Stone Angel #5
Marvin H. Albert
No ratings yet
Knots, Bends, Splices With tables of strengths of ropes, etc. and wire rigging
From Everand
Knots, Bends, Splices With tables of strengths of ropes, etc. and wire rigging
Captain Jutsum
No ratings yet
The Deathguards
From Everand
The Deathguards
Phyllis Ann Karr
No ratings yet
Keys to Better Reading
From Everand
Keys to Better Reading
Judy McFall
No ratings yet
Oneness
From Everand
Oneness
Carmeline Pusateri
No ratings yet
Brass Methods: An Essential Resource for Educators, Conductors, and Students
From Everand
Brass Methods: An Essential Resource for Educators, Conductors, and Students
David Kish
No ratings yet
Operation Longlife
From Everand
Operation Longlife
E. Hoffmann Price
3.5/5 (3)
The Stock Market from A to See - 2nd Edition
From Everand
The Stock Market from A to See - 2nd Edition
John Nunez
No ratings yet
Old Breed General: How Marine Corps General William H. Rupertus Broke the Back of the Japanese in World War II from Guadalcanal to Peleliu
From Everand
Old Breed General: How Marine Corps General William H. Rupertus Broke the Back of the Japanese in World War II from Guadalcanal to Peleliu
Amy Rupertus Peacock
No ratings yet
The Future Is Ours: The Collected Science Fiction of Edward D. Hoch
From Everand
The Future Is Ours: The Collected Science Fiction of Edward D. Hoch
Edward D. Hoch
No ratings yet
Operation Exile
From Everand
Operation Exile
E. Hoffmann Price
3.5/5 (1)
Murder in the Willett Family: A Lt. Valcour Mystery
From Everand
Murder in the Willett Family: A Lt. Valcour Mystery
Rufus King
No ratings yet
Bath Knits: 30 Projects Made to Pamper
From Everand
Bath Knits: 30 Projects Made to Pamper
Mary Beth Temple
No ratings yet
Never Walk Alone
From Everand
Never Walk Alone
Rufus King
No ratings yet
Hamlet Had an Uncle: A Comedy of Honor
From Everand
Hamlet Had an Uncle: A Comedy of Honor
James Branch Cabell
4.5/5 (7)
Duenna to a Murder
From Everand
Duenna to a Murder
Rufus King
No ratings yet
A Man's Guide: Navigating the menstrual mood swing
From Everand
A Man's Guide: Navigating the menstrual mood swing
Meghan Kurts-Forrester
No ratings yet