Lecture 3 Linear Algebra
Lecture 3 Linear Algebra
1 Vector Spaces
Definition 1.1. A vector space V over R, also called a linear space, is a set V
with operations vector addition (∀u, v ∈ V , u + v defines a vector in V ) and scalar
multiplication (∀α ∈ R, ∀u ∈ V , α · u defines a vector in V ) such that ∀α, β ∈
R, ∀u, v, w ∈ V , the following axioms hold:
A1 (u + v) + w = u + (v + w),
A2 u + v = v + u,
A3 ∃o ∈ V s.t. u + o = u ∀u ∈ V (a zero vector o exists),
A4 ∀u ∈ V, ∃v ∈ V s.t. u + v = o (use −u to denote v),
A5 α · (u + v) = α · u + α · v,
A6 (α + β) · u = α · u + β · u,
A7 α · (β · u) = (α · β) · u,
A8 1 · u = u.
An element of a vector space is called a vector or a point.
1
Theorem 1.1. If V is a vector space, then ∀α ∈ R, u ∈ V,
(i) o is unique,
(ii) 0 · u = o,
(iii) α · o = o,
(iv) −u = (−1) · u.
2
Remark 1.1. To prove that V is a linear space, we need to check (i) V is closed
under + and ·, (ii) A1-A8.
Example 1.2. (i) V = R2 , linear subspaces include R2 itself and {(x, y) : y = αx}
for some fixed α ∈ R.
(ii) V = C([a, b]). U = {α · f : α ∈ R} for some fixed f ∈ C([a, b]) is a linear
subspace.
Pn
What does i=1 αi vi mean? To simplify the exposition, we omit the “·” when
there is no risk of confusion.
not be well-defined.)
3
Pn
if for any linear combination of vectors in A, v = j=i αi vi , we have v = o ⇒ αi =
0, i = 1, . . . , n.
Pn
Exercise 1.1. Suppose v1 , . . . , vn are distinct vectors. Show that [ i=1 αi vi = o ⇒
αi = 0, i = 1, . . . , n] ⇔ [∄vi ∈ {v1 , . . . , vn } s.t. vi ∈ Span({v1 , . . . , vi−1 , vi+1 , . . . , vn })].
How about V = {o}? Note that {o} is not a linearly independent set. Since
Span(Ø) = V , we conclude that dim V = |Ø| = 0.
combination of {vn : n ∈ N+ }.
4
Theorem 1.4. Every linear space has a Hamel basis.
The proof of the theorem uses Zorn’s Lemma and is rather involved. It only
shows the existence but does not construct the basis explicitly.
Theorem 1.5. Any linearly independent subset of a linear space can be extended to
a Hamel basis.
Theorem 1.6. If A and B are two finite bases of a linear space V , then |A| = |B|.
Example 1.4. (i) T (x) = 3x. T (x1 + x2 ) = 3(x1 + x2 ) = 3x1 + 3x2 = T (x1 ) +
T (x2 ), and we can verify T (αx) = αT (x). T is linear.
(ii) T (x) = 3x + 1. T (0 + 0) ̸= T (0) + T (0), so T is not linear.
5
2 Matrices
a11 · · · a1n
. ... ..
. m×n
Am×n =
. = (aij ) ∈ R
. .
am1 · · · amn
Remark 2.2. (i) Am×n = (aij ) = [b1 , b2 , . . . , bn ] with bj = [a1j , a2j , . . . , amj ]T
called the j-th column vector. A = [cT1 , cT2 , . . . , cTm ]T with ci = [ai1 , ai2 , . . . , ain ]
called the i-th row vector.
6
(ii) Intuition: ∀u ∈ Rn , u = [u1 , . . . , un ]T , u = u1 · e1 + u2 · e2 + · · · + un · en .
T (u) = T (u1 · e1 + u2 · e2 + · · · + un · en )
p
Note that bTj bk = bj · bk . Thus we must have ∥bj ∥ = (bj · bj ) = 1 ∀j, and all
7
column vectors of A are pairwise orthogonal (i.e. bj · bk = 0 for any j ̸= k).
The same applies to A’s row vectors.
T
Theorem 2.2 (Singular Value Decomposition). Any matrix Am×n = Um×m Σm×n Vn×n
for some orthogonal U, V and some diagonal Σ.
Definition 2.5. The determinant of a matrix An×n is det(A) or |A|. (see wiki)
8
1
λ1 0 λ1 0
1
λ2
−1
λ2
Σ= ,Σ = .
... ...
1
0 λn 0 λn
Remark 2.5. Why also maximum number of row vectors? Let {c1 , c2 , . . . , cr } be a
basis of the column space of A, then there exists Vr×n such that [c1 , c2 , · · · , cr ]V = A.
Thus, each row of A is a linear combination of rows of V , which implies that the
maximum number of linearly independent row vectors must be less than or equal to
r. By doing the same for AT one can show that the maximum number of linearly
independent column vectors of A must be less than or equal to the maximum number
of linearly independent row vectors.
9
(a) if m ≥ n, then rank(Am×n Bn×l ) = rank(B),
(b) if m ≤ n, then rank(Cl×m Am×n ) = rank(C),
(v) An×n is invertible if and only if A has a full rank.
Then,
2 0
′
AB =
2
, AB = 2 .
0 0
10
(ii) It follows that we want to find λ’s s.t. A − λIn is not full rank, since otherwise
v = ⃗0 is the only solution. This is equivalent to solve det(A − λIn ) = 0 for λ.
(iii) To find the eigenvectors, for each solution λ to det(A − λIn ) = 0, solve (A −
λIn )v = 0 to find nonzero solutions for v.
We have
2−λ 7
|A − λIn | =
−1 −6 − λ
7 7
v = 0.
−1 −1
11