Projection Matrices
Projection Matrices
Projection Matrices
2.1
Definition
(2.1)
25
26
and
Ker(P ) = Sp(I n P ).
(2.4)
(2.5)
Because the equation above has to hold for any x E n , it must hold that
I n = P V W + P W V .
Let a square matrix P be the projection matrix onto V along W . Then,
Q = I n P satisfies Q2 = (I n P )2 = I n 2P + P 2 = I n P = Q,
indicating that Q is the projection matrix onto W along V . We also have
P Q = P (I n P ) = P P 2 = O,
(2.6)
2.1. DEFINITION
27
implying that Sp(Q) constitutes the null space of P (i.e., Sp(Q) = Ker(P )).
Similarly, QP = O, implying that Sp(P ) constitutes the null space of Q
(i.e., Sp(P ) = Ker(Q)).
Theorem 2.2 Let E n = V W . The necessary and sufficient conditions
for a square matrix P of order n to be the projection matrix onto V along
W are:
(i) P x = x for x V, (ii) P x = 0 for x W.
(2.7)
along Sp(y) (that is, OA= P Sp(x)Sp(y ) z), where P Sp(x)Sp(y ) indicates the
projection matrix onto Sp(x) along Sp(y). Clearly, OB= (I 2 P Sp(y )Sp(x) )
z.
Sp(y) = {y}
z
Sp(x) = {x}
O P {x}{y} z A
{x|x = 1 x1 +2 x2 } along Sp(y) (that is, OA= P V Sp(y ) z), where P V Sp(y )
indicates the projection matrix onto V along Sp(y).
28
V = {1 x1 + 2 x2 }
O P V {y} z A
Figure 2.2: Projection onto a two-dimensional space V along Sp(y) = {y}.
Theorem 2.3 The necessary and sufficient condition for a square matrix
P of order n to be a projector onto V of dimensionality r (dim(V ) = r) is
given by
P = T r T 1 ,
(2.8)
where T is a square nonsingular matrix of order n and
1 0 0
. .
. . ... ... . . .
..
0 1 0
r =
0 0 0
. . .
.. . .
.
. .. .. . .
0 0 0
0
..
.
0
0
..
.
=T
,
x = A = [A, B]
0
0
y = A = [A, B]
=T
2.1. DEFINITION
29
Thus, we obtain
P x = x = P T
P y = 0 = P T
=T
0
0
= T r
= T r
PT
Since
= T r
that
P T = T r = P = T r T 1 .
Furthermore, T can be an arbitrary nonsingular matrix since V = Sp(A)
and W = Sp(B) such that E n = V W can be chosen arbitrarily.
(Sufficiency) P is a projection matrix, since P 2 = P , and rank(P ) = r
from Theorem 2.1. (Theorem 2.2 can also be used to prove the theorem
above.)
Q.E.D.
Lemma 2.2 Let P be a projection matrix. Then,
rank(P ) = tr(P ).
(2.9)
(2.10)
rank(P ) + rank(I n P ) = n,
(2.11)
E n = Sp(P ) Sp(I n P ).
(2.12)
30
2.2
(2.13)
Q.E.D.
(2.14)
Theorem 2.5 The necessary and sufficient condition for a square matrix
P of order n to be an orthogonal projection matrix (an orthogonal projector)
is given by
(i) P 2 = P and (ii) P 0 = P .
Proof. (Necessity) That P 2 = P is clear from the definition of a projection
matrix. That P 0 = P is as shown above.
(Sufficiency) Let x = P Sp(P ). Then, P x = P 2 = P = x. Let
y Sp(P ) . Then, P y = 0 since (P x, y) = x0 P 0 y = x0 P y = 0 must
31
Qy (where Q = I P )
Sp(A) = Sp(Q)
Sp(A) = Sp(P )
Py
Figure 2.3: Orthogonal projection.
Note A projection matrix that does not satisfy P 0 = P is called an oblique projector as opposed to an orthogonal projector.
(2.15)
32
..
.
1
n
1
n
.. .
.
(2.16)
1
n
In P M =
1
n1
..
.
n1
1
n
n1
1
..
.
1
n
n1
n1
n1
..
..
.
.
1 n1
(2.17)
Let
QM = I n P M .
(2.18)
Clearly, P M and QM are both symmetric, and the following relation holds:
P 2M = P M , Q2M = QM , and P M QM = QM P M = O.
(2.19)
xR =
x1
x2
..
.
xn
, x =
x1 x
x2 x
..
.
1X
, where x
=
xj .
n j=1
xn x
Then,
x = QM xR ,
and so
n
X
(xj x
)2 = ||x||2 = x0 x = x0R QM xR .
j=1
(2.20)
2.3
33
In this section, we consider the relationships between subspaces and projectors when the n-dimensional space E n is decomposed into the sum of several
subspaces.
2.3.1
(2.21)
(2.22)
(2.23)
Q.E.D.
Theorem 2.7 Let P 1 and P 2 denote the projection matrices onto V1 along
W1 and onto V2 along W2 , respectively. Then the following three statements
are equivalent:
(i) P 1 + P 2 is the projector onto V1 V2 along W1 W2 .
(ii) P 1 P 2 = P 2 P 1 = O.
(iii) V1 W2 and V2 W1 . (In this case, V1 and V2 are disjoint spaces.)
Proof. (i) (ii): From (P 1 +P 2 )2 = P 1 +P 2 , P 21 = P 1 , and P 22 = P 2 , we
34
35
Theorem 2.9 When the decompositions in (2.21) and (2.22) hold, and if
P 1P 2 = P 2P 1,
(2.24)
36
V2
(2.25)
(2.26)
(2.27)
P 2i = P i (i = 1, m).
(2.28)
(2.29)
37
rank(P i ) =
m
X
tr(P i ) = tr
i=1
m
X
Pi
= tr(I n ) = n.
i=1
(2.30)
Then, E n = Vi V(i) . Let P i(i) denote the projector onto Vi along V(i) . This matrix
coincides with the P i that satisfies the four equations given in (2.26) through (2.29).
(2.31)
(2.32)
(2.33)
Corollary 2 Let P (i)i denote the projector onto V(i) along Vi . Then the
following relation holds:
P (i)i = P 1(1) + + P i1(i1) + P i+1(i+1) + + P m(m) .
(2.34)
Q.E.D.
38
Note The projection matrix P i(i) onto Vi along V(i) is uniquely determined. Assume that there are two possible representations, P i(i) and P i(i) . Then,
P 1(1) + P 2(2) + + P m(m) = P 1(1) + P 2(2) + + P m(m) ,
from which
(P 1(1) P 1(1) ) + (P 2(2) P 2(2) ) + + (P m(m) P m(m) ) = O.
Each term in the equation above belongs to one of the respective subspaces V1 , V2 ,
, Vm , which are mutually disjoint. Hence, from Theorem 1.4, we obtain P i(i) =
P i(i) . This indicates that when a direct-sum of E n is given, an identity matrix I n
of order n is decomposed accordingly, and the projection matrices that constitute
the decomposition are uniquely determined.
(2.35)
(i = 1, m),
(i 6= j), and rank(P 2i ) = rank(P i ),
(iii) P 2 = P ,
(iv) rank(P ) = rank(P 1 ) + + rank(P m ).
All other propositions can be derived from any two of (i), (ii), and (iii), and
(i) and (ii) can be derived from (iii) and (iv).
Proof. That (i) and (ii) imply (iii) is obvious. To show that (ii) and (iii)
imply (iv), we may use
P 2 = P 21 + P 22 + + P 2m and P 2 = P ,
which follow from (2.35).
(ii), (iii) (i): Postmultiplying (2.35) by P i , we obtain P P i = P 2i , from
which it follows that P 3i = P 2i . On the other hand, rank(P 2i ) = rank(P i )
implies that there exists W such that P 2i W i = P i . Hence, P 3i = P 2i
P 3i W i = P 2i W i P i (P 2i W i ) = P 2i W P 2i = P i .
39
Q.E.D.
2.3.2
40
We first consider the case in which there are two direct-sum decompositions of E n , namely
E n = V1 W1 = V2 W2 ,
as given in (2.21). Let V12 = V1 V2 denote the product space between V1
and V2 , and let V3 denote a complement subspace to V1 + V2 in E n . Furthermore, let P 1+2 denote the projection matrix onto V1+2 = V1 + V2 along
V3 , and let P j (j = 1, 2) represent the projection matrix onto Vj (j = 1, 2)
along Wj (j = 1, 2). Then the following theorem holds.
Theorem 2.16 (i) The necessary and sufficient condition for P 1+2 = P 1 +
P 2 P 1 P 2 is
(V1+2 W2 ) (V1 V3 ).
(2.36)
(ii) The necessary and sufficient condition for P 1+2 = P 1 + P 2 P 2 P 1 is
(V1+2 W1 ) (V2 V3 ).
(2.37)
(2.38)
41
and
P 1+2 = P 1 + P 2 P 1 P 2 .
(2.39)
2.3.3
Commutative projectors
42
(2.40)
(2.41)
(2.42)
(2.43)
43
to hold is
P 1 P 2 = P 2 P 1 , P 2 P 3 = P 3 P 2 , and P 1 P 3 = P 3 P 1 .
(2.44)
(2.45)
44
(2.46)
s
X
Pj
j=1
X
i<j
P iP j +
P i P j P k + + (1)s1 P 1 P 2 P 3 P s
i<j<k
(2.47)
to hold is
P i P j = P j P i (i 6= j).
2.3.4
(2.48)
Noncommutative projectors
We now consider the case in which two subspaces V1 and V2 and the corresponding projectors P 1 and P 2 are given but P 1 P 2 = P 2 P 1 does not
necessarily hold. Let Qj = I n P j (j = 1, 2). Then the following lemma
holds.
Lemma 2.4
V1 + V2 = Sp(P 1 ) Sp(Q1 P 2 )
(2.49)
= Sp(Q2 P 1 ) Sp(P 2 ).
(2.50)
[P 1 , Q1 P 2 ] = [P 1 , P 2 ]
and
"
[Q2 P 1 , P 2 ] = [P 1 , P 2 ]
I n P 2
O
In
In
O
P 1 I n
= [P 1 , P 2 ]S
#
= [P 1 , P 2 ]T .
45
(2.51)
V1[2] = {x|x = Q2 y, y V1 }.
(2.52)
and
Let Qj = I n P j (j = 1, 2), where P j is the orthogonal projector onto
Vj , and let P , P 1 , P 2 , P 1[2] , and P 2[1] denote the projectors onto V1 + V2
along W , onto V1 along V2[1] W , onto V2 along V1[2] W , onto V1[2] along
V2 W , and onto V2[1] along V1 W , respectively. Then,
P = P 1 + P 2[1]
(2.53)
P = P 1[2] + P 2
(2.54)
or
holds.
Note When W = (V1 + V2 ) , P j is the orthogonal projector onto Vj , while P j[i]
is the orthogonal projector onto Vj [i].
(2.55)
46
2.4
(2.56)
P0 = P.
(2.57)
47
(2.60)
M PM
. Then, P = P , and (2.58) can be rewritten as ||P y||2
2
||y|| . By Theorem 2.22, the necessary and sufficient condition for (2.59) to
hold is given by
= (M 1/2 P M 1/2 )0 = M 1/2 P M 1/2 ,
2 = P
P
leading to (2.60).
(2.61)
Q.E.D.
Note The theorem above implies that with an oblique projector P (P 2 = P , but
P 0 6= P ) it is possible to have ||P x|| ||x||. For example, let
1 1
1
P =
and x =
.
0 0
1
(2.63)
48
The above identity and Theorem 2.23 lead to the following corollary.
Corollary
(i) If V2 V1 , tr(X 0 P 2 X) tr(X 0 P 1 X) tr(X 0 X).
(ii) Let P denote an orthogonal projector onto an arbitrary subspace in E n .
If V1 V2 ,
tr(P 1 P ) tr(P 2 P ).
Proof. (i): Obvious from Theorem 2.23. (ii): We have tr(P j P ) = tr(P j P 2 )
= tr(P P j P ) (j = 1, 2), and (P 1 P 2 )2 = P 1 P 2 , so that
tr(P P 1 P ) tr(P P 2 P ) = tr(SS 0 ) 0,
where S = (P 1 P 2 )P . It follows that tr(P 1 P ) tr(P 2 P ).
Q.E.D.
We next present a theorem on the trace of two orthogonal projectors.
Theorem 2.24 Let P 1 and P 2 be orthogonal projectors of order n. Then
the following relations hold:
tr(P 1 P 2 ) = tr(P 2 P 1 ) min(tr(P 1 ), tr(P 2 )).
(2.65)
p
tr(P 1 )tr(P 2 ).
(2.66)
p
tr(P 1 )tr(P 2 ) min(tr(P 1 ),
2.5
49
(2.67)
i=1 j=1
(2.68)
(2.69)
(2.70)
(2.71)
Proof. Relations (2.68) and (2.69) are trivial. Relation (2.70) follows immediately from (1.20). Relation (2.71) is obvious from
tr(V 0 A0 U 0 U AV ) = tr(A0 AV V 0 ) = tr(A0 A).
Q.E.D.
Note Let M be a symmetric nnd matrix of order n. Then the norm defined in
(2.67) can be generalized as
||A||M = {tr(A0 M A)}1/2 .
(2.72)
This is called the norm of A with respect to M (sometimes called a metric matrix).
Properties analogous to those given in Lemma 2.6 hold for this generalized norm.
There are other possible definitions of the norm of A. For example,
Pn
(i) ||A||1 = maxj i=1 |aij |,
(ii) ||A||2 = 1 (A), where 1 (A) is the largest singular value of A (see Chapter 5),
and
Pp
(iii) ||A||3 = maxi j=1 |aij |.
All of these norms satisfy (2.68), (2.69), and (2.70). (However, only ||A||2 satisfies
(2.71).)
50
(2.74)
= A).
(the equality holds if and only if AP
Proof. (2.73): Square both sides and subtract the right-hand side from the
left. Then,
tr(A0 A) tr(A0 P A) = tr{A0 (I n P )A}
= tr(A0 QA) = tr(QA)0 (QA) 0 (where Q = I n P ).
The equality holds when QA = O P A = A.
A0 AP
)
||2 = tr(P
(2.74): This can be proven similarly by noting that ||AP
0
0
0
0
2
A || . The equality holds when QA
A =
A ) = ||P
= O P
= tr(AP
0
Q.E.D.
A AP = A, where Q = I n P .
The two lemmas above lead to the following theorem.
Theorem 2.25 Let A be an n by p matrix, B and Y n by r matrices, and
C and X r by p matrices. Then,
||A BX|| ||(I n P B )A||,
(2.75)
where P B is the orthogonal projector onto Sp(B). The equality holds if and
only if BX = P B A. We also have
||A Y C|| ||A(I p P C 0 )||,
(2.76)
(2.77)
(2.78)
51
or
(A BX)P C 0 = Y C and P B A(I p P C 0 ) = BX(I n P C 0 ).
(2.79)
52
2.6
The projectors we have been discussing so far are based on Definition 2.1,
namely square matrices that satisfy P 2 = P (idempotency). In this section,
we introduce a generalized form of projection matrices that do not necessarily satisfy P 2 = P , based on Rao (1974) and Rao and Yanai (1979).
Definition 2.3 Let V E n (but V 6= E n ) be decomposed as a direct-sum
of m subspaces, namely V = V1 V2 Vm . A square matrix P j of
order n that maps an arbitrary vector y in V into Vj is called the projection
matrix onto Vj along V(j) = V1 Vj1 Vj+1 Vm if and only if
P j x = x x Vj (j = 1, , m)
(2.80)
P j x = 0 x V(j) (j = 1, , m).
(2.81)
and
(2.82)
a 0 0
P = b 0 0 .
c 0 1
Then, P e1 = e1 and P e2 = 0, so that P is the projector onto V1 along V2
according to Definition 2.3. However, (P )2 6= P except when a = b = 0,
or a = 1 and c = 0. That is, when V does not cover the entire space E n ,
the projector P j in the sense of Definition 2.3 is not idempotent. However,
by specifying a complement subspace of V , we can construct an idempotent
matrix from P j as follows.
53
(2.83)
2.7
=
1. Let A
A1
O
O
A2
and A =
A1
. Show that P A P A = P A .
A2
2. Let P A and P B denote the orthogonal projectors onto Sp(A) and Sp(B), respectively. Show that the necessary and sufficient condition for Sp(A) = {Sp(A)
54
http://www.springer.com/978-1-4419-9886-6