Lecture 3
Lecture 3
Lecture 3
Lecturer: David P. Williamson Scribe: Seung Won (Wilson) Yoo
1 Eigenvalue Identities
We first present a few useful eigenvalue identities. Let A ∈ Rn×n be a symmetric matrix
with real eigenvalues λ1 ≥ λ2 ≥ ...λn with corresponding eigenvectors x1 , x2 , . . . , xn such
that the xi are orthonormal.
Proof: Let
X = x1 x2 . . . xn
λ1 0 0
D = 0 ... 0
0 0 λn
Then,
AX = XD
since the columns of X are the eigenvectors. Interestingly, X T X = I as the columns are
orthonormal, so X T = X −1 . This implies A = XDX −1 (right multiplying AX = DX by
X −1 ). Then, Ak = (XDX −1 )k = XDk X −1 , where Dk has the form
k
λ1 0 0
Dk = 0 . . . 0
0 0 λkn
3-1
Lemma 2
n
Y
det(A) = λi .
i=1
Proof:
det(A) = det(XDX −1 )
= det(X) det(D) det(X −1 )
= det(D)
Yn
= λi .
i=1
2
Recall that in the first lecture, we defined the trace of A to be T r(A) = ni=1 aii . We
P
used the following without proof in the first lecture, and now we can prove it.
Lemma 3
n
X
T r(A) = λi .
i=1
Proof: Consider the characteristic polynomial of A, which we defined in the first lecture
to be det(λI − A); it is a degree n polynomial in λ. Then we can rewrite it as:
X n
Y
det(Z) = sgn(σ) aiσ(i) .
σ∈Sn i=1
The only permutation thatQcan produce a λn−1 term is the identity, so the only
P term in the
sum with a λn−1 term is ni=1 (λ − aii ). The coefficient of λn−1 here is − ni=1 aii . Then,
we are done, with
n
X X n
T r(A) = aii = λi .
i=1 i=1
2
3-2
2 The Perron-Frobenius Theorem
Recall for an undirected graph G, its adjacency matrix is defined as A = (aij ) where
(
1 if (i, j) ∈ E
aij =
0 otherwise
(i) λ1 ≥ −λn ;
(ii) λ1 > λ2 ;
(iii) There exists an eigenvector x1 > 0 (that is, every coordinate is strictly positive).
Proof: First we will prove (iii). Let x1 , . . . , xn be the corresponding eigenvectors, and
assume they are orthonormal. Recall that
xT Ax
λ1 = maxn = xT1 Ax1 .
x∈R xT x
Define a vector y such that ∀i, y(i) = |x1 (i)|; then y T y = xT1 x1 = 1. We show that y is also
an eigenvector corresponding to λ1 . To see this, we have
λ1 = xT1 Ax1
X
= aij x1 (i)x1 (j)
ij
X
≤ aij |x1 (i)||x1 (j)|
ij
X
= aij y(i)y(j)
ij
= y T Ay
≤ λ1 .
The last inequality follows by the definition of λ1 , and by the fact that y has unit norm.
Since λ1 = λ1 , all the inequalities must be equalities, so y is an eigenvector of λ1 .
We now argue that none of the entries of y can be zero. We have y ≥ 0 by definition
and y 6= 0 since it is an eigenvector. To show that none of the entires are zero, we use the
fact that the graph is connected, so if there is some j such that y(j) = 0, this means that
there is an edge (i, k) ∈ E such that y(i) = 0 and y(k) 6= 0. Then we have:
X
(Ay)(i) = y(j) ≥ y(k) > 0
j:(i,j)∈E
3-3
Now we prove (i). Let ∀i, y(i) = |xn (i)|. Again, we have
y T y = xTn xn = 1
= y T Ay
≤ λ1 ,
as desired.
For (ii), let ∀i, y(i) = |x2 (i)|. Then y y = xT2 x2 = 1. Then we have that
λ2 = xT2 Ax2
X
≤ aij |x2 (i)||x2 (j)|
ij
X
= aij y(i)y(j)
ij
= y T Ay
≤ λ1 ,
Now we show that somewhere along the way, the inequality is strict. Assume x1 > 0, as
we can from (iii). Since hx1 , x2 i = 0, and both are nonzero, some of the entries of x2 are
positive and some are negative. We split into two cases:
• Case 1: All of the entries of x2 are nonzero. Then, since G is connected, ∃(i, j) ∈ E
such that x2 (i) < 0, x2 (j) > 0. Then, x2 (i)x2 (j) < |x2 (i)||x2 (j)| which gives us the
strict inequality that we wanted. Hence, λ2 < λ1 .
3 Bipartite Graphs
We now turn to showing how all the various identities we’ve proven over this lecture and
the last can be applied to showing something about the structure of graphs. In particular,
we show that the spectrum of the adjacency matrix tells us whether the graph is bipartite
or not.
3-4
Lemma 5 If G is bipartite, and λ is an eigenvalue of adjacency matrix A, then so is −λ.
3-5
Proof: By Perron-Frobenius, λ1 ≥ −λn , and by the previous theorem, the graph being
bipartite implies that λ1 = −λn .
For the other direction, Let xn be the eigenvector corresponding to λn with xTn xn . Let
y(i) = |xn (i)| for all i. Again, we have y T y = xTn xn = 1. Also,
= y T Ay
≤ λ1 .
The assumption λn = −λ1 implies that all the inequalities are equalities. This implies that
y is an eigenvector corresponding to λ1 , with y ≥ 0. By our proof of Perron-Frobenius,
since y ≥ 0, we have y > 0 and this implies that xn (i) 6= 0 for all i.
If all the inequalities are equalities, xn (i)xn (j) has the same sign whenever aij > 0.
Since λn = xTn Axn < 0, all of these products must be negative. This implies that for any
edge in the graph, either xn (i) > 0, xn (j) < 0 or xn (i) < 0, xn (j) > 0. This induces the
bipartition
V = {i : xn (i) < 0},
W = {i : xn (i) < 0}.
2
This brings about an interesting research question. What happens when λ1 is close to
−λn ? Does this mean that the graph is “almost bipartite”, in the sense that if we remove
some of the edges, it would become bipartite? Possibly the answer to this question is already
well known.
3-6