Chap 7
Chap 7
Multivariate linear regression model is essentially several univariate linear regression models putting
together, with the errors being related with each. Here univariate (multivariate) means the response
variables are univariate (multivariate). The notations of this chapter may be nasty.
where
1 z11 ··· z1r
Y1 β0 ²1
. 1 z21 ··· z2r ..
Y = .. , Z = .. , β = and ² = ... .
... ..
.
..
. .
.
Yn βr ²n
1 zn1 ··· znr
where alm , 0 ≤ l, m ≤ r are the entries of the matrix (Z0 Z)−1 , i.e.,
a00 a01 · · · a0r
a10 a11 · · · a1r
(Z0 Z)−1 = ... .. .. .. .
. . .
ar0 ar1 ··· arr
This confidence interval is based on
β̂k − βk
√ ∼ tn−r−1 , k = 0, ..., r.
akk s
One might also construct T 2 -type confidence region at confidence level 1 − α for β as a vector of
r + 1 dimension:
© o
β ∈ Rr+1 : (β̂ − β)0 (Z0 Z)(β̂ − β)/s2 ≤ (r + 1)Fr+1,n−r−1 (α)
(v). Prediction.
Let
1
z01
Z0 = z
02
...
z0r
be any given value of covariates. Let Y0 = Z00 β + ²0
be the response of an experiment not yet carried
out. We wish to predict the response Y0 or estimate its mean Z00 β with accuracy justification. The
predictor or estimator is naturally Z00 β̂, which is unbiased. Moreover,
³ ´ ³ ´
Z00 β̂ − Z00 β ∼ N 0, σ 2 Z00 (Z0 Z)−1 Z0 and Z00 β̂ − Y0 ∼ N 0, σ 2 [1 + Z00 (Z0 Z)−1 Z0 ] ,
by noticing that
Z00 β̂ − Z00 β = Z00 (Z0 Z)−1 Z² and Z00 β̂ − Y0 = Z00 (Z0 Z)−1 Z² + ²0 .
Then, the prediction interval (or just call it confidence interval if you like) for Y0 at confidence level
1 − α is q
Z00 β̂ ± tn−r−1 (α/2)s 1 + Z00 (Z0 Z)−1 Z0 .
And confidence interval for Z00 β, the mean of Y0 , at confidence level 1 − α is
q
Z00 β̂ ± tn−r−1 (α/2)s Z00 (Z0 Z)−1 Z0 .
28
We note that this confidence interval still holds when the first component of Z0 takes any value,
not just 1.
Another simple example is a multivariate linear regression model relating students academic perfor-
mance measured by midterm and final exam scores with diligence measured by weekly time devoted
to study and ingenuity measured by IQ score:
½
Yi1 = β01 + β11 zi1 + β21 zi2 + ²i1 ,
i = 1, ..., n
Yi2 = β02 + β12 zi1 + β22 zi2 + ²i2
where Yi1 , Yi2 zi1 and zi2 are, respectively, midterm exam score, final exam score, weekly hours of
time devoted to study and IQ score of the i-th randomly selected student.
(i). Model description:
The k-th univariate linear regression model:
where Yik is the k-th response for subject i. Writing in matrix form:
where
Y1k β0k ²1k
. .
Y(k) = .. , β(k) = ... and ²(k) = ..
Ynk βrk ²nk
The so called multivariate linear regression model is to put the m univariate linear regression model
together as
Yn×m = Zn×(r+1) β(r+1)×m + ²n×m ,
which is the same as
. . . . . .
[Y(1) .. · · · ..Y(m) ] = Z[β(1) .. · · · ..β(m) ] + [²(1) .. · · · ..²(m) ]
One distinct feature here is that the errors, ²i1 , ..., ²im , within one observation/subject may be
related with each other. This suggest that the m univariate linear regression models should be
treated together as one model rather than separately treated as m individual univariate models.
(ii). Model assumptions:
The errors (²i1 , ..., ²im )0 , as random vectors of m dimension, are iid ∼ M N (0, Σ) and are independent
of the covariates. Note that here Σ is an m × m matrix.
29
(iii). Estimation:
The least squares criterion for the multivariate linear regression model is to minimize over β
n h
m X
X i2 X
m
Yik − (β0k + zi1 β1k + · · · + zir βrk ) = kY(k) − Zβ(k) k2 .
k=1 i=1 k=1
i.e.,
X n
m X r
X m h
X n
X r
X i
2
min (Yik − [β0k + zil βlk ]) = min (Yik − [β0k + zil βlk ])2 .
β β(k)
k=1 i=1 l=1 k=1 i=1 l=1
Hence β̂(k) is the same as the LSE of β(k) of the univariate linear regression model: Y(k) = Zβ(k) +
²(k) . Therefore,
which is formally the same as the expression of the LSE of univariate linear regression model. The
residuals are
²̂n×m = Yn×m − Zn×(r+1) β̂(r+1)×m .
and the estimator of Σ is
s11 ··· s1m
.. .. .. 1
S≡ . . . = ²̂0 ²̂.
n−r−1
sm1 ··· smm
where skl is the (k, l)-th entry of S. Then, a confidence region at confidence level 1 − α for β(k) , as
a vector in Rr+1 , can be constructed as
n (β̂(k) − β(k) )(Z0 Z)(β̂(k) − β(k) ) o
β(k) ∈ Rr+1 : ∼ (r + 1)Fr+1,n−r−1 (α) .
skk
For more detail, such as confidence intervals for βjk , simultaneous confidence intervals for β0k , ..., βrk ,
please refer to Section 7.1 about univariate linear regression models.
(v). Prediction.
30
Let
1
²01
z01 ..
Y0 = β 0 Z0 + ²0 = β 0
... + .
²0m
z0r
be a new experiment with covariate Z0 which has not been carried out. We are interested in
predicting its response Y0 , an m-vector, or estimating its mean β 0 Z0 , with accuracy justification.
The predictor of Y0 or estimator of its mean β 0 Z0 is naturally β̂ 0 Z0 . To compute the variance,
observe that, for any non-random n-vector V = (v1 , ..., vn )0 ,
Xn n
X n
X
0 2
var(² V ) = var( vi ² i ) = vi var(²i ) = Σ vi2 = kV k2 Σ == V 0 V Σ,
i=1 i=1 j=1
where ²i , an m-vector, is the i-row of ² representing the m errors for the i-th subject/observation.
Let V = Z(Z0 Z)−1 Z0 . Then,
Moreover,
1 0 (n − r − 1)m
T2 ≡ [β̂ Z0 − β 0 Z0 ]0 S−1 [β̂ 0 Z0 − β 0 Z0 ] ∼ Fm,n−r−m
κ n−r−m
1 (n − r − 1)m
and [Y0 − β 0 Z0 ]0 S−1 [Y0 − β 0 Z0 ] ∼ Fm,n−r−m ,
1+κ n−r−m
which provide theoretical foundation for the following inference procedures:
(1). Confidence region for β 0 Z0 = E(Y0 ) at level 1 − α:
n (n − r − 1)m o
β 0 Z0 ∈ Rm : [β̂ 0 Z0 − β 0 Z0 ]0 S−1 [β̂ 0 Z0 − β 0 Z0 ] ≤ κ Fm,n−r−m (α)
n−r−m
(2). Prediction region (or just call it confidence region) for Y0 at level 1 − α:
n (n − r − 1)m o
Y0 ∈ Rm : [Y0 − β 0 Z0 ]0 S−1 [Y0 − β 0 Z0 ] ∼ (1 + κ) Fm,n−r−m (α) .
n−r−m
,
0
(3). Simultaneous confidence intervals for β(k) Z0 = E(Y0k ), k = 1, ..., m at confidence level 1 − α:
r
0 0 (n − r − 1)m
for β(k) Z0 : β̂(k) Z0 ± skk · κ Fm,n−r−m (α) k = 1, ..., m.
n−r−m
31
Example 1.1 (continued) Ten American Companies Sales (Y1 ) and profit (Y2 ) are two responses
regressed on the covariate asset (Z). The multivariate linear regression is the two univariate linear
regression models putting together:
and a confidence region at confidence level 95% for β(1) = (β01 , β11 )0 is
Suppose company ABC has 50 billion dollars of asset. We wish to predict both sales and profit
of the company with accuracy justification. Set Z0 = (1 50)0 and denote by Y01 and Y02 the yet
unknown sales and profit of the company. The estimator is
µ ¶ µ ¶
β̂01 + 50β̂11 46.45
β̂ 0 Z0 = =
β̂02 + 50β̂12 2.45
where S stands for sales and P for profit of the company. And simultaneous prediction intervals at
(nominal) confidence level 95% is
On the other hand, suppose we are interested in providing an estimation and inference for the mean
sales and profit for all companies with asset 50 billion dollars, which is
µ ¶ µ ¶
EY01 β01 + 50β11
=
EY02 β02 + 50β12
The estimator is still same as (46.45, 2.45)0 . A confidence region at confidence level 95% is
n µ ¶ µ ¶ µ ¶ µ ¶ o
£ S 46.45 ¤0 −1 £ S 46.45 ¤
(S, P) : − S − ≤ 1.477
P 2.45 P 2.45