Sym Herm
Herm(n) = {A ∈ Mn (C) : A∗ = A}
λx + iλy = λz = Az = Ax + iAy.
Collecting the real and imaginary parts of each component of the above
equality gives Ax = λx and Ay = λy, and since z ≠ 0 at least one of x or y
is a non-zero real eigenvector.
Diagonalization Theorem (i) If A ∈ Herm(n), then the eigenvalues of
A are real. Furthermore, for any two distinct eigenvalues λ, µ of A with
corresponding eigenvectors x and y in C^n, we have (x, y) = 0.
(ii) If A ∈ Mn(R) (respectively, is in Mn(C)) then A ∈ Symn(R)
(respectively, is in Herm(n)) if and only if there is u in O(n)
(respectively, u in U(n)) for which
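\[
u^* A u = \begin{pmatrix}
\lambda_1 & 0 & \cdots & 0 \\
0 & \lambda_2 & \ddots & \vdots \\
\vdots & \ddots & \ddots & 0 \\
0 & \cdots & 0 & \lambda_n
\end{pmatrix}
\]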
where λ1 , . . . , λn are the eigenvalues of A (with multiplicity).
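As a small numerical illustration of the theorem, the following sketch checks both parts on one 2×2 hermitian matrix. The matrix A and the hand-computed eigenvectors x, y are hypothetical choices made for this example, not taken from the notes; the inner product is taken linear in the first variable.

```python
import math

def matmul(X, Y):
    """Product of two matrices stored as lists of rows."""
    return [[sum(X[i][k] * Y[k][j] for k in range(len(Y)))
             for j in range(len(Y[0]))] for i in range(len(X))]

def adjoint(X):
    """Conjugate transpose X*."""
    return [[complex(X[j][i]).conjugate() for j in range(len(X))]
            for i in range(len(X[0]))]

# Hypothetical example: A equals its own conjugate transpose.
A = [[2, 1 - 1j],
     [1 + 1j, 3]]

# Eigenvectors computed by hand for the two eigenvalues 1 and 4.
x = [1 - 1j, -1]   # A x = 1 * x
y = [1 - 1j, 2]    # A y = 4 * y

# (x, y) = sum_k x_k * conj(y_k); part (i) says this vanishes.
inner_xy = sum(a * complex(b).conjugate() for a, b in zip(x, y))

# Normalize the eigenvectors and place them in the columns of u.
nx = math.sqrt(sum(abs(a) ** 2 for a in x))
ny = math.sqrt(sum(abs(b) ** 2 for b in y))
u = [[x[0] / nx, y[0] / ny],
     [x[1] / nx, y[1] / ny]]

# Part (ii): u* A u should be diag(1, 4) up to rounding.
D = matmul(adjoint(u), matmul(A, u))
```

Both eigenvalues are real although A has non-real entries, and the columns of u are orthonormal precisely because eigenvectors for distinct eigenvalues are orthogonal.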
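Proof. (i) Let λ be an eigenvalue of A with corresponding eigenvector x ≠ 0
in C^n. Then
λ(x, x) = (Ax, x) = (x, A∗x) = (x, Ax) = λ̄(x, x),
and (x, x) > 0, so λ = λ̄ is real. If µ ≠ λ is another eigenvalue of A with
eigenvector y, then µ is real as well and
λ(x, y) = (Ax, y) = (x, Ay) = µ(x, y),
so (λ − µ)(x, y) = 0. Since λ ≠ µ, this gives (x, y) = 0.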
(ii) Notice that sufficiency in both the real symmetric and hermitian cases
is trivial.
Let us consider necessity. Let µ1, . . . , µk denote the full set of distinct
eigenvalues of A with respective eigenspaces Ej = ker_{F^n}(µj I − A), j = 1, . . . , k.
Since Ei ⊥ Ej for i ≠ j, as observed in (i), we may find an orthonormal basis
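\[
u^{(1)} = \begin{pmatrix} u_{11} \\ \vdots \\ u_{1n} \end{pmatrix}, \ \dots, \
u^{(m)} = \begin{pmatrix} u_{m1} \\ \vdots \\ u_{mn} \end{pmatrix}
\]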
for E = E1 + · · · + Ek such that u(1), . . . , u(n1) is a basis for E1,
u(n1+1), . . . , u(n1+n2) is a basis for E2, . . . , and
u(n1+···+nk−1+1), . . . , u(m) is a basis for Ek. (Here each
nj = dimC Ej.) For j = 1, . . . , k, let Pj in Mn(F) be the matrix corresponding
to the orthogonal projection onto Ej, i.e. with entries
\[
P_{j,i'j'} = \Bigg( \sum_{l=1+n_1+\cdots+n_{j-1}}^{n_1+\cdots+n_j} (e_{j'}, u^{(l)})\, u^{(l)},\ e_{i'} \Bigg)
= \sum_{l=1+n_1+\cdots+n_{j-1}}^{n_1+\cdots+n_j} (e_{j'}, u^{(l)})\,(u^{(l)}, e_{i'}).
\]
Since Ei ⊥ Ej if i ≠ j, Pi Pj = 0 = Pj Pi.
Hence if $p(X) = \sum_{l=0}^{\deg p} a_l X^l$ is any polynomial, we have
\[
p(A) = \sum_{l=0}^{\deg p} a_l A^l = \sum_{j=1}^{k} p(\mu_j) P_j
\]
where A^0 = I, by convention.
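The formula p(A) = Σ p(µj) Pj can be checked by hand on a small case. The sketch below does this in exact rational arithmetic for a hypothetical 2×2 symmetric matrix with eigenvalues 1 and 3 and the hypothetical polynomial p(X) = X² + 1; the projections P1 and P3 were computed by hand from the unit eigenvectors (1, −1)/√2 and (1, 1)/√2.

```python
from fractions import Fraction as F

def matmul(X, Y):
    """Product of two matrices stored as lists of rows."""
    return [[sum(X[i][k] * Y[k][j] for k in range(len(Y)))
             for j in range(len(Y[0]))] for i in range(len(X))]

def add(X, Y):
    return [[a + b for a, b in zip(r, s)] for r, s in zip(X, Y)]

def scale(c, X):
    return [[c * a for a in r] for r in X]

# Hypothetical example: A is symmetric with eigenvalues 1 and 3.
A = [[F(2), F(1)], [F(1), F(2)]]
I2 = [[F(1), F(0)], [F(0), F(1)]]
h = F(1, 2)
P1 = [[h, -h], [-h, h]]   # orthogonal projection onto ker(1*I - A)
P3 = [[h, h], [h, h]]     # orthogonal projection onto ker(3*I - A)

def p(t):
    """Hypothetical polynomial p(X) = X^2 + 1."""
    return t * t + 1

# Left side: p(A) = A^2 + A^0, with A^0 = I.
p_of_A = add(matmul(A, A), I2)

# Right side: p(1) P1 + p(3) P3.
spectral = add(scale(p(1), P1), scale(p(3), P3))

# The projections annihilate each other, as used in the derivation.
P1P3 = matmul(P1, P3)
```

Both sides come out as the matrix with rows (6, 4) and (4, 6), and P1 P3 is the zero matrix.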
Lemma. Given A and P1 , . . . , Pk as above, another matrix B commutes with
A, i.e. [A, B] = AB − BA = 0, if and only if [Pj , B] = 0 for each j.
Proof. Sufficiency is evident from (†).
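To see necessity, for each j let pj be a polynomial with pj(µj) = 1 and
pj(µi) = 0 for i ≠ j, for instance pj(X) = ∏_{i≠j} (X − µi)/(µj − µi). The
formula for p(A) above gives Pj = pj(A), and any B commuting with A commutes
with every polynomial in A, so [Pj, B] = 0.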
\[
\dots\ \operatorname{diag}(\lambda_1, \dots, \lambda_n), \qquad \dots\ \operatorname{diag}(\nu_1, \dots, \nu_n)
\]
Since [A, B] = 0, the lemma above provides that [A, P′i] = 0 = [Pj, B] for
any i, j, and the lemma again provides that [Pj, P′i] = 0 for any i, j. Hence
each Pj P′i is self-adjoint and squares to itself, and is hence the orthogonal
projection onto Eij = ker_{F^n}(A − µj I) ∩ ker_{F^n}(B − µ′i I). Further (†) provides
that
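\[
\sum_{j=1}^{k} \sum_{i=1}^{k'} P_j P'_i
= \Bigg( \sum_{j=1}^{k} P_j \Bigg) \Bigg( \sum_{i=1}^{k'} P'_i \Bigg) = I \qquad (\ast)
\]
so
\[
A = AI = \sum_{j=1}^{k} \sum_{i=1}^{k'} \mu_j P_j P'_i
\quad \text{and} \quad
B = IB = \sum_{j=1}^{k} \sum_{i=1}^{k'} \mu'_i P_j P'_i .
\]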
Take orthonormal bases for each of the non-zero spaces Eij and combine them
into an orthonormal basis
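\[
v^{(1)} = \begin{pmatrix} v_{11} \\ \vdots \\ v_{1n} \end{pmatrix}, \ \dots, \
v^{(n)} = \begin{pmatrix} v_{n1} \\ \vdots \\ v_{nn} \end{pmatrix}
\]
for F^n (this is possible by (∗)). Let v be the matrix with columns v(1), . . . , v(n),
and we obtain the desired diagonal forms.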
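The double-sum decomposition used in this proof can be seen on a concrete commuting pair. In the sketch below the matrices A and B, and the dictionaries P and Q holding their spectral projections keyed by eigenvalue, are hypothetical choices for illustration; Q plays the role of the projections P′i for B. Note that the standard basis diagonalizes A but not B, while the products Pj P′i pick out the joint eigenspaces.

```python
from fractions import Fraction as F

def matmul(X, Y):
    """Product of two matrices stored as lists of rows."""
    return [[sum(X[i][k] * Y[k][j] for k in range(len(Y)))
             for j in range(len(Y[0]))] for i in range(len(X))]

def add(X, Y):
    return [[a + b for a, b in zip(r, s)] for r, s in zip(X, Y)]

def scale(c, X):
    return [[c * a for a in r] for r in X]

def zeros(n):
    return [[F(0)] * n for _ in range(n)]

# Hypothetical commuting symmetric pair.
A = [[1, 0, 0], [0, 1, 0], [0, 0, 2]]
B = [[0, 1, 0], [1, 0, 0], [0, 0, 5]]

h = F(1, 2)
# Spectral projections of A, keyed by eigenvalue.
P = {1: [[1, 0, 0], [0, 1, 0], [0, 0, 0]],
     2: [[0, 0, 0], [0, 0, 0], [0, 0, 1]]}
# Spectral projections of B, keyed by eigenvalue.
Q = {-1: [[h, -h, 0], [-h, h, 0], [0, 0, 0]],
     1: [[h, h, 0], [h, h, 0], [0, 0, 0]],
     5: [[0, 0, 0], [0, 0, 0], [0, 0, 1]]}

sum_I, sum_A, sum_B = zeros(3), zeros(3), zeros(3)
for mu, Pj in P.items():
    for nu, Qi in Q.items():
        PQ = matmul(Pj, Qi)              # projection onto a joint eigenspace
        sum_I = add(sum_I, PQ)           # should total the identity
        sum_A = add(sum_A, scale(mu, PQ))
        sum_B = add(sum_B, scale(nu, PQ))
```

Here three of the six products Pj P′i are zero; the remaining three are rank-one projections onto the lines spanned by (1, −1, 0), (1, 1, 0) and (0, 0, 1), which together give an orthonormal basis diagonalizing both matrices once normalized.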
Corollary. A in Mn (C) is normal, i.e. [A, A∗ ] = 0, if and only if there is v
in U(n) for which
\[
v^* A v = \begin{pmatrix}
\lambda_1 & 0 & \cdots & 0 \\
0 & \lambda_2 & \ddots & \vdots \\
\vdots & \ddots & \ddots & 0 \\
0 & \cdots & 0 & \lambda_n
\end{pmatrix}
\]
where λ1 , . . . , λn are the eigenvalues of A (with multiplicity).
Proof. Sufficiency being evident, we show only necessity. Let
\[
\operatorname{Re} A = \tfrac{1}{2}(A + A^*) \quad \text{and} \quad \operatorname{Im} A = \tfrac{1}{2i}(A - A^*)
\]
so ReA, ImA ∈ Herm(n) and A = ReA + iImA. It is easy to verify that A is
normal if and only if [ReA, ImA] = 0. Hence simultaneous diagonalization,
above, provides the necessary unitary diagonalizing matrix v.
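The equivalence between normality and [ReA, ImA] = 0 follows from expanding the commutator, which gives [ReA, ImA] = (i/2)[A, A∗]. The sketch below checks this identity numerically on two hypothetical test matrices: a cyclic permutation matrix C (unitary, hence normal) and a nilpotent matrix N (not normal). All function names are illustrative.

```python
def matmul(X, Y):
    """Product of two matrices stored as lists of rows."""
    return [[sum(X[i][k] * Y[k][j] for k in range(len(Y)))
             for j in range(len(Y[0]))] for i in range(len(X))]

def adjoint(X):
    """Conjugate transpose X*."""
    return [[complex(X[j][i]).conjugate() for j in range(len(X))]
            for i in range(len(X[0]))]

def commutator(X, Y):
    XY, YX = matmul(X, Y), matmul(Y, X)
    n = len(X)
    return [[XY[i][j] - YX[i][j] for j in range(n)] for i in range(n)]

def re_im(A):
    """Return (Re A, Im A) = ((A + A*)/2, (A - A*)/(2i))."""
    As, n = adjoint(A), len(A)
    re = [[(A[i][j] + As[i][j]) / 2 for j in range(n)] for i in range(n)]
    im = [[(A[i][j] - As[i][j]) / 2j for j in range(n)] for i in range(n)]
    return re, im

def max_abs(X):
    return max(abs(e) for row in X for e in row)

def identity_gap(A):
    """Largest entry of [Re A, Im A] - (i/2) [A, A*]."""
    lhs = commutator(*re_im(A))
    rhs = commutator(A, adjoint(A))
    n = len(A)
    return max(abs(lhs[i][j] - 0.5j * rhs[i][j])
               for i in range(n) for j in range(n))

C = [[0, 1, 0], [0, 0, 1], [1, 0, 0]]   # normal (unitary)
N = [[0, 1], [0, 0]]                    # not normal

defect_C = max_abs(commutator(*re_im(C)))   # vanishes for normal C
defect_N = max_abs(commutator(*re_im(N)))   # non-zero for N
```

For N the commutator [Re N, Im N] has diagonal entries i/2 and −i/2, so the defect is 1/2, while for C it vanishes.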
No real analogue. The real matrix $J_2 = \begin{pmatrix} 0 & 1 \\ -1 & 0 \end{pmatrix}$ is normal, but admits
only purely imaginary eigenvalues, and hence cannot be diagonalized by
orthogonal matrices (i.e. unitary matrices with real entries).
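A quick check of both claims about J2, using only the trace and determinant of a 2×2 matrix (the eigenvalues are the roots of t² − tr·t + det):

```python
import cmath

J2 = [[0, 1],
      [-1, 0]]

def matmul(X, Y):
    """Product of two matrices stored as lists of rows."""
    return [[sum(X[i][k] * Y[k][j] for k in range(len(Y)))
             for j in range(len(Y[0]))] for i in range(len(X))]

def transpose(X):
    return [list(col) for col in zip(*X)]

# Normality: J2 commutes with its transpose (both products equal I).
JJt = matmul(J2, transpose(J2))
JtJ = matmul(transpose(J2), J2)

# Eigenvalues from the characteristic polynomial t^2 - tr*t + det.
tr = J2[0][0] + J2[1][1]
det = J2[0][0] * J2[1][1] - J2[0][1] * J2[1][0]
root = cmath.sqrt(tr * tr - 4 * det)
eigs = [(tr + root) / 2, (tr - root) / 2]   # the pair i, -i
```

Since neither eigenvalue is real, no real matrix u can satisfy uᵀ J2 u = diagonal with real entries.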
Real skew-symmetric matrices. A matrix B in Mn(R) is skew-symmetric
if B^T = −B.
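Real Skew-symmetric Block Diagonalization Theorem. If B^T = −B
in Mn(R) then there are u in O(n) and λ1, . . . , λm > 0 in R such that
\[
u^T B u = \begin{pmatrix}
\lambda_1 J_2 & 0 & \cdots & \cdots & \cdots & 0 \\
0 & \ddots & & & & \vdots \\
\vdots & & \lambda_m J_2 & & & \vdots \\
\vdots & & & 0 & & \vdots \\
\vdots & & & & \ddots & \vdots \\
0 & \cdots & \cdots & \cdots & \cdots & 0
\end{pmatrix}
\]
where $J_2 = \begin{pmatrix} 0 & 1 \\ -1 & 0 \end{pmatrix}$; in particular, 2m ≤ n.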
Proof. First, notice that iB ∈ Herm(n) and hence has real eigenvalues, so B
has purely imaginary eigenvalues (possibly including 0). In particular, any
real eigenvector of B must lie in ker B.
Consider B^T B, which is in Symn(R). Any eigenvalue µ of B^T B with
eigenvector x in R^n \ {0} satisfies
B(Bx) = B^2 x = −B^T Bx = −µj x ∈ Vx
so Vy ⊥ Vx. Hence we may build an orthonormal basis $u^{(j)}_1, \dots, u^{(j)}_{l_j}$ for Ej
such that each $u^{(j)}_{2i} \in V_{u^{(j)}_{2i-1}}$ and $V_{u^{(j)}_{2i-1+2i'}} \perp V_{u^{(j)}_{2i-1}}$ for $i' = 1, \dots, \lfloor l_j/2 \rfloor$, and,
in particular, lj is even.
Putting everything together we have that $\dim E = \sum_{j=1}^{l} \dim E_j$ is even,
and we can find an orthonormal basis u1, . . . , u2m, u2m+1, . . . , un for R^n for
which the spaces Vj = V_{u_{2j−1}} are pairwise orthogonal and span E, and
u2m+1, . . . , un ∈ ker B. Letting u be the matrix whose columns are u1, . . . , un we
find that
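\[
u^T B u = \begin{pmatrix}
B_1 & 0 & \cdots & \cdots & \cdots & 0 \\
0 & \ddots & & & & \vdots \\
\vdots & & B_m & & & \vdots \\
\vdots & & & 0 & & \vdots \\
\vdots & & & & \ddots & \vdots \\
0 & \cdots & \cdots & \cdots & \cdots & 0
\end{pmatrix}
\]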
where each Bj ∈ M2(R). Since (u^T Bu)^T = u^T B^T u = −u^T Bu we find that
each block must have the form Bj = λj J2 with λj in R \ {0}, and by applying
block permutations we may assume λj > 0. (One may further check that the
values λ1, . . . , λm are the values √µ1, . . . , √µk with multiplicities.)
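The construction in the proof can be followed by hand in dimension 3. The sketch below uses a hypothetical skew-symmetric B (the matrix of x ↦ w × x for w = (1, 2, 2)), a hand-picked unit vector x orthogonal to ker B, and exact rational arithmetic; the second basis vector is taken as −Bx/|Bx| so that the 2×2 block comes out as +λ J2 rather than −λ J2.

```python
from fractions import Fraction as F

def matvec(M, v):
    return [sum(M[i][k] * v[k] for k in range(len(v))) for i in range(len(M))]

def dot(a, b):
    return sum(s * t for s, t in zip(a, b))

# Hypothetical example: B is skew-symmetric with ker B spanned by (1, 2, 2).
B = [[0, -2, 2],
     [2, 0, -1],
     [-2, 1, 0]]

x = [F(2, 3), F(1, 3), F(-2, 3)]    # unit vector orthogonal to ker B
Bx = matvec(B, x)                   # equals (-2, 2, -1)
lam = F(3)                          # |Bx| = 3, because |Bx|^2 = 9

u1 = x
u2 = [-c / lam for c in Bx]         # second unit vector of V_x = span{x, Bx}
u3 = [F(1, 3), F(2, 3), F(2, 3)]    # unit vector spanning ker B

cols = [u1, u2, u3]                 # the columns of u

# Gram matrix of the columns: the identity exactly when u is orthogonal.
gram = [[dot(a, b) for b in cols] for a in cols]

# Entry (i, j) of u^T B u is (u_i, B u_j).
block_form = [[dot(cols[i], matvec(B, cols[j])) for j in range(3)]
              for i in range(3)]
```

The result is the block matrix with 3·J2 in the top-left corner and zeros elsewhere, so m = 1 and λ1 = 3. This agrees with the closing remark of the proof: BᵀB x = 9x, and √9 = 3.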
Remark. The complex analogue of this result is much easier. If B ∈ Mn(C)
with B∗ = −B we call B skew-hermitian. Notice that iB ∈ Herm(n) and
is hence unitarily diagonalizable with real eigenvalues, so B too is unitarily
diagonalizable but with purely imaginary eigenvalues.
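A minimal numerical check of the remark on a hypothetical 2×2 skew-hermitian matrix, again reading eigenvalues off the trace and determinant; the helper name eig2 is illustrative.

```python
import cmath

def eig2(M):
    """Eigenvalues of a 2x2 matrix as roots of t^2 - tr*t + det."""
    tr = M[0][0] + M[1][1]
    det = M[0][0] * M[1][1] - M[0][1] * M[1][0]
    root = cmath.sqrt(tr * tr - 4 * det)
    return [(tr + root) / 2, (tr - root) / 2]

# Hypothetical example with B* = -B.
B = [[1j, 1],
     [-1, -1j]]

# Conjugate transpose of B, to confirm that B is skew-hermitian.
B_star = [[complex(B[j][i]).conjugate() for j in range(2)] for i in range(2)]
is_skew_hermitian = all(B_star[i][j] == -B[i][j]
                        for i in range(2) for j in range(2))

iB = [[1j * e for e in row] for row in B]   # hermitian

eigs_iB = eig2(iB)   # real: the pair sqrt(2), -sqrt(2)
eigs_B = eig2(B)     # purely imaginary: the pair i*sqrt(2), -i*sqrt(2)
```

Multiplying the eigenvalues of B by i recovers those of iB, which is the whole content of the reduction to the hermitian case.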