Operations in Hilbert Space
Operations in Hilbert Space
CSUSB ScholarWorks
2008
Recommended Citation
Gumaer, Dennis Michael, "Operations in Hilbert space" (2008). Theses Digitization Project. 4400.
https://scholarworks.lib.csusb.edu/etd-project/4400
This Thesis is brought to you for free and open access by the John M. Pfau Library at CSUSB ScholarWorks. It has
been accepted for inclusion in Theses Digitization Project by an authorized administrator of CSUSB ScholarWorks.
For more information, please contact [email protected].
Operators in Hilbert Space
A Thesis
Presented to the
Faculty of
San Bernardino
In Partial Fulfillment
Master of Arts
in
Mathematics
by
September 2008
Operators in Hilbert Space
A Thesis
Presented to the
Faculty of
San Bernardino
by
September 2008
Approved by:
_______________
This review of some of the major topics in elementary Hilbert space theory starts with
a study of basic operations on a Hilbert space. The theory of operators is developed by
providing details regarding several types of operators, in particular compact operators.
Compact operators on a Hilbert space are operators which map the closed unit
ball to a set whose closure is compact. This study of compact operators is the start of the
refinement of bounded linear operators to those which are also members of the Schatten
p-class operators - the final goal of this discussion.
iv
Acknowledgements
Table of Contents
Abstract iii
Acknowledgements iv
1 Introduction 1
2 The Basics 4
2.1 Norm and Inner Product...................................................................... 5
2.2 Orthogonality..................................................................................... 6
2.3 Applications........................................................................................ 7
3 Introduction to Operators 10
5 Unitary Operators 17
6 Compact Operators 20
7 The Spectrum 25
8 Self-Adjoint Operators 28
13 Conclusion 46
Bibliography 50
1
Chapter 1
Introduction
The norm is the arbitrary dimensional analogue of the absolute value. In higher
dimensions it is denoted ||-||. There are many ways for the norm to be defined. In the
case of the space the norm is defined as
/ o° \ I
This norm defines a metric on the space allowing further structure as it narrows down
the spaces which can be called a Hilbert Space.
An inner product is the arbitrary dimensional analogue of the dot product. (The
convention used in my primary research resource, Hilbert Space: Compact Operators and
the Trace Theorem by J.R. Retherford[Ret93], is that inner products are denoted by (•, •).
This convention will be followed throughout the thesis.) Many different inner products
are possible. However each must satisfy a list of properties - one property of particular
importance will follow shortly. For our primary example ^2 of the thesis, the inner product
is defined over as
\ 00 _
V(an), (bn) G ^2) f (®n)j (^n) 1 = 5 >
' 71=1
(rc,j;)2 = ||a;||, Vx G H.
Among bounded operators, there are many further designations: unitary, normal, self-
adjoint, compact, Hilbert-Schmidt, Trace and Schatten class operators. Each of these
operators have specific properties and uses and each will be touched on in the remainder
of thesis. Particular attention will be focused on the latter portion of the list.
As an example of what will be covered regarding these operators, a brief look
into trace operators is warranted. The linear transformation A from to Kn can be
described by an n x n matrix (%•). The trace of that n x n matrix is the sum of the
diagonal entries, or
n
Tr(A) = y an.
i=l
This trace is something that is useful for Hilbert Space as well. Yet going from an infinite
dimensional space to another infinite dimensional space would not admit a traditional
matrix to sum the diagonal terms. So, as with the abstraction of norms and inner
products, an abstraction of the trace is necessary to glean the desired information.
The means of the abstraction of operators with finite trace will make judicious
use of the Schmidt Decomposition Theorem. That is, for all x 6 H a compact operator
T can be written as oo
Tx = ^Jan(x,xn)yn
n=l
for some orthonormal sets (xn) and where an is an eigenvalue of [T,*T] 1/2. This
statement ties together a significant portion of the work that will be covered toward
the end of this thesis. The Schmidt Decomposition Theorem gives a form for the infi
nite dimension by infinite dimension matrix that would be impossible to describe in the
traditional matrix form.
The formulation and properties of the operators listed above are the goal of the
text. Each operator will be defined and put into context. Then examples using each
operator will be explored. While practical applications abound, that is not the intent of
the thesis and none will be offered. The intent of the text is to familiarize the reader with
the theoretical background necessary for such applications.
4
Chapter 2
The Basics
Planar geometry was the origin of ideas which mathematicians later extended
from R2 to arbitrary dimensional spaces. The familiar absolute value was generalized to
the norm denoted || • || which gives the length of the vector in the defined space. The dot
product used to find angles between vectors in R2 or in R3 became the more general inner
product denoted (•, ■) in arbitrary complex linear spaces. The concept of perpendicular
lines was extended via one of its planar properties. In higher dimensions orthogonal lines
are defined by the inner product.
In the Euclidean Space Rn, repeated use of the Pythagorean Theorem gives the
formula for the 2-norm:
n i
Vo: = (o?i, rr2, z3,..., xn) € Rn> ||a:||2 = ( ^2 ) *'
i=l
or even
Vx = (x^x2,x3,... ,xn) e X, ll^lloo = marr(|a;i|,..., |zn|).
These last two norms are obviously different from the usual norm in R2, but these norms
are not coming from the usual R2.
2.1 Norm and Inner Product
Definition 2.1. Let X be a complex linear space. A norm is a function from X into the
non-negative reals satisfying:
1. ||z|[ = 0 iff x ~ 0;
3. the triangle inequality ||a; + y\\ < ||e|| + ||y|| holds for all x, y G X.
Definition 2.2. Let (xn) be a sequence in a normed space X. Then {xn} is called a
Cauchy sequence if
Definition 2.3. If every Cauchy sequence (xn) in a normed space X converges in X then
X is called a complete space.
Definition 2.4. Let X be a linear space over C. An inner product, (•, •), is a function
defined on X x X, the set of all pairs of elements in X, satisfying:
(X, (-,-)) or X with an inner product (•, •) is called an inner product space.
A complete normed linear space is called a Banach space, named after the Polish
mathematician Stefan Banach. Hilbert Space - the focus of this paper - is a specific type
of those spaces. Hilbert Space is a Banach Space with the added restriction that it has an
inner product derived from the norm of the space, more specifically Vac G H ||rc||2 = (x, x).
An important and recurring example of Hilbert Space is €2. A complex sequence
(an) is an element of ^2 provided the sum of the magnitude squared of each term is finite.
Mathematically
6
2.2 Orthogonality
n=l
7
While this theorem holds for every complete orthonormal set, it is mostly com
monly used with the canonical orthonormal basis et for £2-
2.3 Applications
After several definitions it is generally a good idea to give some properties and
applications using those definitions. These properties and applications follow.
Proposition 2.6. In an inner product space X, (re, u) = (a?, v) for all x implies that
u — v.
In Hilbert Space a sequence can converge strongly or weakly. As the names im
ply, a strongly convergent sequence meets stricter requirements than a weakly convergent
sequence. So a strongly convergent sequence is automatically weakly convergent. This
leads to the question, when does weak convergence imply strong convergence? But first,
the definitions of these terms are given.
Definition 2.7. For Hilbert Space H, a sequence (xn) C H is said to converge strongly
to the point x 6 H if ||a;n — m|| —> 0 as n oo. This type of convergence is also known
as norm convergence or convergence in the norm.
Definition 2.8. For Hilbert Space If, a sequence (a?n) C H is said to converge weakly to
x 6 H if, for all y 6 H, (xn, y) —> (z, y) as n —► oo.
= o.
Proposition 2.10. For x,y in Hilbert Space H,xly iffVa € C, ||rE + ay|| > ||n?||.
leaving
0 < a(y, x) + a(x, y) + aa(y, y). (2.1)
Let a = i with n > 0. Substituting a into (2.1) above leaves part of the
inequality needed. So
111
0 < ~(y,x) + -(®,y) + -^v)
71 71 7Z
and
< ±(y,z) + ±(x,y). (2-2)
9
Now let a = — £ with n > 0. Substituting a into (2.1) above solves for the remainder of
the required inequality.
o< + -^(^y) +
n n nr
111
(2-3)
and
2(^y)-^(y^) = 0-
□
10
Chapter 3
Introduction to Operators
Hilbert Space has been carefully created to mimic Euclidean Space. As a result,
research in the area doesn’t need to focus on the space itself. Interest is usually concen
trated on the functions - or more precisely operators - that act on the space and the
consequences of that action.
Functions are defined as a mapping from one set to another. In general terms,
a function is simply a set of rules which describes how elements in one set are mapped
to elements in another set. Restrictions are usually placed on these functions depending
on the focus of study. For the purpose of Hilbert Space, the functions are required to be
linear. In any linear space, function f is linear when for a?, y in the domain and scalars
a, b it holds that
f(ax + by) = af(x) + bf(y).
Linear functions from Hilbert Space H to Hilbert Space K are called linear operators.
Theorem 3.4. For Hilbert Spaces H, K and linear operator T from H to K the following
are equivalent:
1. T is bounded,
2. T is continuous at 0,
3. T is continuous over H.
Example 3.5. For the following examples, let the Hilbert Space H = £2 and let the
sequence (e^) be a countable orthonormal basis in £2.
1. Let Tei = ef+i for i > 1. This operator is known as a shift operator. Then ||7j| = 1.
oo oo
Proof Let x = which implies Tx = 2 (z> ei)e»+i-
2=1 2=1
12
1=1
Proof. It is clear that ||ei|| = 1 and ||Tei|| = ||ef + e^i|| — \/2 Vi. For x 6 h,
00 oo
x = 52^’ ei)ei 911(1 Tx = 52ef)(ei + ei+x)-
i=l i=l
Hence
< 4|rf
Thus II^ZYeII < 2||ic|| and ||T|| < 2. Now it must be shown that ||T|| —> 2. Let
fn = ^(ei + e2 + ” -en). Then, ||/n|| = 1 and Tfn = ^(ei + 2e2H---- 2en4-en+i).
So ||T/n|| = ^jj^/l + 4(n-l) + l = y/4 - 2/n. Hence, ||T/n|| -> 2 as n -> oo.
Since the norm is the supremum of all possible values, ||T|| = 2. □
*.
B) Let f € H Then there is a unique y e H such that f(x) = (a:,y) Vie G H.
Moreover, ||/|| = ||y||.
13
Chapter 4
Recall that the inner product has, as part of its definition, a property (x,y) =
(y, x). Calculations concerning inner products of complex terms can quickly become more
involved than the same calculation with only real terms. However, as it will be shown,
the real case may be too simple and important information can be lost. A few examples
are in order.
In a real Hilbert Space 77, (ir,y) = (y,rr) can be written as (a?,y) = (y,x)- This
leads to simplifications that would be impossible if H is a complex Hilbert Space.
Proposition 4.1. In a real inner product space H, if x,y G H and ||x|| = ||y|| then
||aa? + by\\ = ||&xc 4- ay|| Va, b G R.
Finally, taking the square root of both sides leaves ||asc 4- by\\ = ||6a; + ay||. □
The definition of an isosceles triangle is well known. Plainly, it’s a triangle that
has two sides of equal length. This idea is generalized to a real Hilbert Space in the
following theorem.
Theorem 4.2. (The Isosceles Triangle Property) Let H be a real Hilbert Space. If
x,y,z 6 H, x + y 4- z = 0 and ||a?|| = ||y||, then ||a? — z|| = ||y — z||.
Proof It holds that
Then taking the square root of both sides gives the desired equality. □
Having (a;, y) = (y, x) can obviously be beneficial in some cases. But sometimes
this property can allow terms to cancel even when they are necessary for the completion
15
of a proof.
Proposition 4.3. For a real Hilbert Space H, x,y G H and ||z|| = ||y|| imply that
(x + y,x — y) = 0. If H is complex, this assertion is not true.
(V2 + 2
= — yj'ii 7^ 0.
I 2
So for the x and y given above ||z|| = ||y|| but (x + y, x — y) 0. □
Proposition 4.4. In complex Hilbert Space H, T G L(H) and (Tx, x) = 0 Vac G H
implies T = 0. This is false if H is real.
Proof. 1. If H is complex:
(T(ac + y),x + y) = 0 by assumption. Now,
(Tx, y) + (Ty, x) = 0
(Tx, y) - (Ty, x) = 0
2. If H is real:
/ 0 -1
Here is a counterexample. Let T =
U 0
Then
-1 W
0 J ^2
Chapter 5
Unitary Operators
The next notion to be covered is the extremely useful adjoint of operators. The
adjoint of an operator is actually a direct result of the Riesz Representation Theorem.
S(x) = (Tx,y).
The Riesz Representation Theorem (3.7) states Bz G H such that \/x S(x) ~
(x,z). So (Tx,y) = (x,z). Then there is an operator that maps y onto z. This operator
* and is called the adjoint of T. T
is T * is a bounded linear operator and ||T
*|| = ||T||.
A unitary operator is one which is defined by special properties involving its
adjoint. The definition of this operator follows.
Definition 5.2. Unitary Operators. Let T G L(H). If it is given that T-1 G L(H) and
* = T_1 then T is said to be unitary.
if T
+ vif
2 2 2 2
18
= l^x + y,x + y)
1
= + ^y) F^(y,x) + <Xy,y)],
*(M -
= ^>(x-y,x-y)
as desired. □
It is important to note that the inner product is also a Hermitian form, so the
above lemma applies to the following theorem.
IIT^H2 = (Tx,Tx)
19
*Tx,x)
= (T
= fa^Tx^) since T is unitary
= fax)
= IW2
So if T is unitary then T is an isometry and therefore T is an isometric isomorphism.
*Tx,y)
(T (Tx,Ty)
i1 f HTz + Tj/H2 - ||T® - Ty||2 + iHTo: + iTy||2 - i\\Tx - iTytf2
4
(by Lemma 5.3.)
* + y} II2 - ||T(x - y) [|2 + i||T(z + «/) II2 - 2||T(s - «/) I
J [||T( 2
So (T
*Ta?,y) = fay) Va:,y G H => T
* * = T”1 => T is unitary.
= I => T □
20
Chapter 6
Compact Operators
Definition 6.1. Given the unit ball A = {x G H : ||®|| < 1} a compact operator is
an operator T G L(H) such that T(A) is compact. The set of all compact operators is
denoted K(H).
1. T G K(H) <=> T
* G K(H);
3. TeK(H) rTeK(H);
*
4. T G K(H) <=> TT G K(H);
In R2 and C there is a well known result that states every bounded sequence
has a convergent subsequence. For Hilbert space this isn’t the case. However, given a
few more restrictions a similar result can be obtained.
First, a quick definition. A Hilbert space is said to be separable if it contains a
countable, dense subset.
Theorem 6.3. Every bounded sequence (xn) in a separable Hilbert Space H has a weakly
convergent subsequence.
Continuing this process - the underlined parts would be explained later - gives
the following sequences:
A = {(^1,1,2/1), (a?i,2,yi)> (®i,3»2/1), •• •}, (®i,fc,yi)^«i
A new sequence is created using the underlined terms in the above sequences:
Hence ((zm,w)) is convergent. So (xmtm) is weaky Cauchy and is therefore weakly con
vergent. □
Proof. Let T G K(7?). If T were not completely continuous then there exists a weakly
convergent sequence (xn) that converges weakly to x such that
Using the weak convergence of (xn) and Lemma 6.4, (xn) has a bound. Thus (Txnf)
has a convergent subsequence (Txn,2)- Define m such that (Txnf) converges to m. It is
given that (Txnf) converges weakly to Tx so m = Tx. So ||Txni2 — Tx|| approaches 0, a
contradiction. T is therefore completely continuous.
For the completely continuous operator T to also be compact, it must map the
unit ball {x^H: M < 1} into a set whose closure is compact.
Let A = {x e H : ||x|| < 1} and let T(A) = B. If it can be shown that every
sequence in B has a subsequence converging to some member of B, then B is relatively
compact. So T would then map the unit ball to a relatively compact set and T G K(H)
as desired.
Let {i/fc} C B. There exists a sequence {x^} C A such that Txk = Yk. Note
that {xfc} is bounded because ||x|| < 1. So by Theorem 6.3 3{zn) C {xjJ such that
is weakly convergent. Also (Tzn) is norm convergent since T is completely continuous.
So B is relatively compact. Therefore T is compact. □
Compact operators are one of the main focuses throughout the rest of the text
and are worthy of further investigation.
While this next proposition is instructive, it serves another purpose as well. It
offers insight into a topic that will be covered in the next chapter. Keep this proposition
in mind when reading Chapter 7.
Proof. Assume ker(A7 — T) has infinite dimension. Then 3{xn} C ker(Al — T) such
that {xn} is an infinite orthonormal set. (Otherwise ker(A7 — T) would have a finite
orthonormal set which would serve as a finite basis for the space. For the moment, this
is assumed not to be the case.)
24
Theorem 6.7. An operator T G X(H) iff there are finite rank operators An such that
respectively. It can be shown that the required An is given by the composition An = PnT.
Thus
||T-An|| = ||T-P„T||
= sup||(T-P„TM
IM<1
co
sup 22 (Tx,Ui)ui
IM<1 i=n+l
Since ||a;|| < 1, the closure of Tx is compact. The above sum tends to zero for large
enough n when Tx is in a compact set. So limn_>oo ||T — An|| = 0. □
25
Chapter 7
The Spectrum
Tx — Ax = 0,
(T — AI)x = 0, I being the identity matrix.
This has a non-trivial solution when T—Al is not invertible. It is here that the explanation
for the purposes of this paper diverges from the explanation given in linear algebra. In
linear algebra the procedure concluded with writing down the matrix that corresponded
with T and subtracting Al. Then to find out where T—Al is not invertible, this procedure
dictates setting the determinant of the above difference to zero and solving for A.
In the Hilbert Space version, there is no matrix. Since no calculations can be
done, the generalization stops at (Tx — AZ) being not invertible. In the Hilbert Space
version, the result of this process is called the spectrum.
Definition 7.2. Let T G L(7Z). The resolvent of T, or p(T’), is the set of all complex
numbers not in cr(T). That is,
p(T) = C\<r(T).
where (en) is the standard basis of C?. Show that if |A| < 1 then A G a(T). Partition the
spectrum.
Solution. First, find the actual eigenvalues of T. Let (AZ — T)(ac) = Ax — Tx = 0, i.e.,
A G <7p. Then
oo oo oo
So oo
X=
n=l
This must be true for all x G l2. When (x, ei) 0, we have 0 < |A| < 1. This also implies
that if |A| > 1 then A G p(T).
Secondly, if A = 0 and Tx = 0 then 0(®, ei) — (a?, e2) = 0 and (a;, 62) = 0. This
leads to x = (sq, 0,0,...) but Tx = 0. So XI — T is not one to one and A = 0 is an
eigenvalue.
Since Co = {z G C | |z| < 1} C cr(T) and cr(T) is closed, we have that Ai G
Ci = {2 G C | |^| = 1} C ct(T). Then Ai ar(T) because A G o-r(T) implies that XI — T
has a dense range. Also Ai £ crp(T) because A G <JP(T) implies that XI — T is one to one.
Thus Ai 6 ac(T) since crp(T), ar(T) and ac(T) are disjoint and their union is cr(T).
This completes the partition. □
28
Chapter 8
Self-Adjoint Operators
This definition has some major consequences. Given Hilbert Space H, for self-
adjoint operator T and V® G H, (T®, ®) = (®,T®). (It is important to note that, by
definition, T being self-adjoint implies that T G L(ZI).) But part of the definition of
inner product states that (T®,®) = (®,T®). From these equalities, it is obvious that
(Tx, ®) is real for all ® G H - a significant restriction.
One important property that results from working with strictly real terms is
defined next. This definition revolves around the fact that the real line is well ordered.
It follows that some sort of ordering can be imposed on self-adjoint operators.
denoted T > 0, if
V® 6 H, (Tx, x) > 0.
There are some basic properties of self-adjoint operators that will prove useful
in later work.
Proposition 8.3. In Hilbert Space H, for operator T 6 B(H) and for self-adjoint oper
ators A,B,Ce L(H):
*
4. T *
> 0 andTT > 0.
The first three statements in the above proposition are rather trivial. However,
the last deserves a second look as it will be used frequently in later work.
This partial ordering of the self-adjoint operators of L(H) allows some interesting
results with the spectrum from the previous chapter.
Theorem 8.4. IfT is self-adjoint then a(T) has only real elements. Also, ifT>0, then
a(T) C [0,4-oo).
||Tx||2 = (Tx,Tx)
*Tx,x)
= (T
= (TT
*x,x)
= *x
(T),T
= IIT^II2.
so ||Tx|| = ||7”o:||.
*x||
Conversely when ||Tx|| = ||T Yx G H, it follows that
*Tx,x)
(T = (Tx,Tx)
= l|Tx||2
= l|T^||2
= *x
z),T
(T
= (TT
*x,x),
so (T
*Tx,x) = (TT
*x,x). Working with this equality,
T,x)
*x
(T x,x)-(TT = 0
*x
T,x)
(T x-TT = 0
*)((T
Tx,x)
-TT = 0.
Then let S = T
* *.
— TT S is shown to be self-adjoint since
*
Then T = TT
* and T is normal as desired. □
31
Chapter 9
The Polar Decomposition Theorem defines a method for finding two operators
whose composition is equivalent to a given operator. This theorem is one of the last
pieces to the puzzle needed to prove the main classes of operators in this thesis.
Theorem 9.1. IfTfQ is both a compact and self-adjoint operator, then either [|T|| or
— ||T|| is an eigenvalue ofT.
The restricted operator maybe zero, which is how this procedure is completed in finite
dimensional spaces. This algorithm is continued ad finitum if the original space is infinite
dimensional. Beginning with the eigenvalue from the first iteration,
terminating in the finite dimensional case. Also, the sequence (An) converges to zero by
Theorem 6.7.
Theorem 9.2. (Schur’s Theorem.) Let H be a Hilbert Space and let T G L(H) be a
compact and self-adjoint operator. For a sequence {An} of nonzero eigenvalues ofT, the
associated orthonormal sequence of eigenvectors (xn) and fx G H the following equality
holds: oo
Tx = An(o;, xn)xn, x G H.
n=l
This is the Schur Representation of the operator T. Also er(T) — {An} U {0}.
Positive self-adjoint operators have enough structure that they have many analo
gies to real numbers. There is a partial ordering of these operators (as was discussed in
Proposition 8.3), similar to the ordering of the real numbers. Another analogy to real
numbers is the existence of a unique square root. Both existence and uniqueness must
be proven and proof is available in a variety of texts on Hilbert Space. The consequences
of this square root are numerous and useful in the remainder of this work.
Definition 9.3. Let T G £(/7) be positive and self-adjoint operator. If there is a self-
adjoint A G L(H) such that A2 = T then A is called a square root of T. If A is also
positive then A is called the (meaning unique) square root of T and denoted A = T1?2.
(Schur’s Theorem requires that T f 0. If T — 0 the theorem, and hence the above
representation, does not apply. However, in this case A = 0 fills all the requirements
to be the unique positive square root.) Theorem 8.4 states that [AJ are real numbers
(which are known to have unique square roots). Define fa — A^2.
There exists a positive self-adjoint operator A such that
oo
Ax = r € H.
n=l
n=l
= Tx.
There are two more details that need attention before the main topic of this
chapter. The polar decomposition, defined below, provides a connection between the
operators T, T
*, and its square root A. Note: Proposition 8.3 states T
* > 0. The
operator T
* then has a unique positive square root.
Theorem 9.5. (The Polar Decomposition Theorem.) Let T G L(H) and A = (71*? 1)1/2.
There is an operator U € L(H) such that
1. T=UA;
3. Ux = OforxeR(A)±.
Proof. Since A2 = T
* *,
— A for all x e H ||Ar||2 = (Ax, Ax) = (A
*Aa;,x) =
*Tx,x)
(T = ||2Yc||2, and ||Ax|| = ||Tx||. If we. let U : R(A) —> R(T) be given by
U(Ax) = Tx for x G H, then U is a well-defined bounded linear operator on R(A) and
11^ = ||T:r|| for x G Z?(A). Hence, U can be extended onto I?(A) and, moreover, we
can define U = 0 on Then, U G L(H) and it satisfies all the above statements.
□
This operator U has the added property of taking the set of eigenvectors of A
(which are orthonormal) to another set which is orthonormal as well.
Theorem 9.6. Let T G K(H) and let U be the operator defined by Theorem 9.5. Let
(xn) be the orthonormal eigenvectors of A — (PT)1/2. Then the set (Uxn) is also
orthonormal.
Proof. Given (xn) is orthonormal, the second part of Theorem 9.5 has ||t7a?n || = ||a;n|| = 1
for all n. Then for n m,
But
Combining the two equations, —(Uxn,Uxm) = (Uxn,Uxm). This implies that the real
part of (Uxn, Uxm) must be zero.
Chapter 10
There has now been enough background in operators to move onto the major
topics of this paper. Many of the previous topics will be refined and combined into a
much more focused underlying theory.
Schur’s Representation is helpful in calculations involving compact, self-adjoint
operators. A similar result would be useful for operators that don’t meet the strict
restriction of being self-adjoint. Combining topics from the previous chapters gives such
a result.
To ease notation, a brief lemma.
an(T) — inf{||T — B|| : B is a finite rank operator with rank B < n}.
These crn(T) are also called the singular numbers of T. Some examples using
singular numbers will follow shortly.
Finally, the Schmidt Representation Theorem, due to the German mathemati
cian Erhard Schmidt.
Proof. For T G K(H'), A is compact and self-adjoint. A then has a Schur Representation
oo
Ax = ^<rn(T)(x,xn)xn
n=l
for the eigenvalues, an(Tf of A and associated eigenvectors (xn). Then Theorem 9.6 says
that yn = Uxn is an orthonormal set. Thus using the polar decomposition of the operator
T leaves oo oo
Tx = UAx = U^an(T)(x,xn)xn = ^(rn(T)(x,xn)yn
n=l n=l
as desired. □
Calculations using the Schmidt Representation deal with infinite sums and inner
products which are relatively easy to work with. It is the singular numbers crn(T) which
may take some work to gain intuition of their properties. Some examples will assist in
that intuition.
Example 10.3. Show ||T|| = <Ji(T) > ^(T) > ... for T G K(H).
Proof. The definition of an(T) states that ai(T) = inf{||T[|}. (Note that rank A < 1
implies that A = 0.) So crip1) = ||T||.
Since = inf{||T — A|| : rank A < n}, an+i(T) = inf{||T — A|| : rank A <
n 4- 1}, and
{A : A is of finite rank with rank A < n+1} C {A : A is of finite rank with rank A < n}
Lastly, an example establishing a rule for the norm multiple operators and their
corresponding eigenvalues.
Example 10.5. Show ||B|| • ||T||crn(S) > an(RST) for operators R,S,T G K(H).
Chapter 11
This is where the work in the previous chapters comes together. This culmina
tion begins with the work done by Robert Schatten. The Schatten p-class operators or
Sp operators are defined below.
Definition 11.1. The Schatten p-class operators on Hilbert space H are given by
Example 11.2. For Hilbert space H, ifTe Sp(H) thenT can be factored into T = BDA
as follows:
H------- ---- ^H
A B
T
loo
for the complete orthonormal basis xn created in Schur’s Theorem, Theorem 9.2. Let
oo
Ax = ((x,xn)) = (an). Then (an) G ^oo> because 52 W2 < oo- Let D((an)) =
n=l
(trn(T)an). But is (crn(T)an) G €p? Well
oo oo oo
£ |a„(T)a„f < £ U(an)IIEo ' K(T)|’ = 11 (*n)l|go ’ £ MT)IP < oo.
n=l n=l n=l
So (an(T)an) G €p.
oo
Now let B(an(T)an) = X &n(T)(an)yn, where T(xn) = yn for n > 1 and (yn)
n=l
is the orthonormal set found by using Theorem 9.6, on the orthonormal set (xn). Finally,
00
As with any object defined so far, the Schatten class operators must have a norm
associated with it.
Definition 11.3. Given Hilbert space H and T G SP(H) the Schatten p-norm, written
sp(H) is
/ \ i/p
Theorem 11.4. Let T G K(H). If for all U,V G L(f2,H) ((TUe^Ven)) G £P) then
T G SP(H)
for some orthonormal sets (xn), (yn)' Let U, V be operators which change the orthonormal
set (ej) as follows:
Uei — Xi and Vei = yi, i> 1.
The statement of the theorem says that for every, and as such this particular, U, V that
((TUen,Ven)) So ((Tx^yi)) G £p. On the other hand,
(Txi,yi) =
= (^i(^)yi^i)
(u.i)
Chapter 12
To complete this paper two final classes of operators will be discussed. Both
of these operators predate the Schatten p-class operators. It will be shown that the
trace class and Hilbert-Schmidt class operators are actually specific types of the Schatten
p-class operators.
Theorem 12.2. The trace class is identical with the Schatten 1-class. That is,
TC(ff) = Stiff)
and
si{T) = r(T) for T G TC(ff).
42
Proof. If T G TC(H), then T G K(H) and T has a Schmidt representation. Let e > 0 be
given. Since T G TC(H) there is some representation'of T such that for all x G H there
is a (wn) and (zn) where
oo oo
Ta; = ^(x,wn)zn and 52 11'^1111^11 < (1 + c)t(T).
n=l n=l
oo
This gives a representation that admits £ llwn II Ikn || arbitrarily close to - but larger
n=l
than - r(T). For T to be in S/H), ^trn(jr) must be finite. This can be verified by
finding an upper bound for J3crn(T).
Let (xn), (yn) be orthonormal sets. Then, beginning with a calculation similar
to (11.1) in Example 11.4,
oo oo
= y~^(Ta?n, yn)
n=l n=l
00 ✓ oo
“ 5 5 (xni wm)(zm,yn)
n=lm=l
00 / oo \ x/2 ,f oo
< 521 52 52 l(Am, yn)|
m=l \ n=l / <n=l
oo
< 52 Ikmllll^mll
m=l
< (1 + e)r(T) < oo.
So TC(H) C 5i (S') and S1(T) < r(T).
Then let T G 5i(-£T). So T has a Schmidt representation,
oo
Tx =
71=1
00 oo
which means T is also in TC(H). And £ llz/nll||^n|| = 52 ffn(T). So r(T) < si(T).
Therefore Si(H) is identical to TC(H). □
43
Theorem 12.4. The Hilbert-Schmidt class, HS(H), is identical with the Schatten 2-
class, S2(H). Their norms also coincide.
Proof. Let T G HS(H) and let (xn) be a complete orthonormal set in H. Then for e > 0
there is an TV such that
00 \ V2
E
n=Ar+l
iitm2
/
<e-
N
Define the orthogonal projection Px ~ f^(x,xi)xt. Thus for x G H,
i=l
oo
E
n=7V+l
(z, Xi)Txi
°° \ I 00
< E ( E
n=7V+l / \n=7V+l
< ||a:||e.
This implies that \\T — TP\\ < e hence T G K(H). So T has a Schmidt representation
oo
Tx = ^ok(T)(x,Uk)vk
k=l
44
£||T®n||2 = y(Txn,Txn)
n=l n=l
OO OO
= E£^CD2|(
*».^)| 2
n=l fc=l
OO OO
= E£^(r)2i(^,^)i2
fc=l n=l
00
= j>
*
(T) 2-
fc=l
Definition 12.5. For a Banach space X, a sequence (yn) C X is said to be weakly square
summable provided
00
J2l/(?/n)|2 < +00
n=l
Once again, the new operator should be put into context with respect to Sp.
Theorem 12.7. The Schatten 2-class operators are identical to the absolutely 2-summing
operators.
The proof is by double inclusion. First, it will be shown that J]2 c *^2- Note:
Theorem 12.4 states SifH) is equivalent to
Let T 6 f^C^) 311(1 let fen) C H be a complete orthonormal set. Then (yn)
is a weakly square summable sequence. Then T G EM^) imPlies that (Tyn) is square
summable, ie. ||Tyn||2 < oo« Thus T G HS(H) and therefore T e S2(#)- So J]2 C 52-
Next choose T G S2 and let (yn) be weakly square summable. Theorem 11.4
states that T G /C(7T) implies VC7, V G £(^2>77)> ((7Ven, Ven)) G ^2. This is true for all
U, V\ so in particular, the U where Uen = yn and V where V = TU. Then ((Tyn, Tyn)) €
^2 and (||Tyn ||2) G €2, which implies
oo
^IITshH^+oo.
n=l
Chapter 13
Conclusion
Appendix A
Banach Space
A complete normed linear space is called a Banach space. This space was touched
on very briefly in Section 2.1. While the focus of this paper is on Hilbert space, there Eire
undeniable links between these two types of spaces. One such link is studied when dealing
with the Schatten p-class operators, so a short primer on Banach spaces is warranted
[Roy88].
The Banach spaces of interest are called spaces. The familiar space ^2 is one
of these spaces. (A Hilbert space is after all a Banach space with an inner product.) The
spaces are defined
f
tp = * (ttn) 22l®nlP < +°°
n=l
Again, a norm is required and it is defined to be
/ 00 \ Vp
ll(®n)llP= ( '
This space is also defined for p = 00. In that case,
There are some fundamental theorems in Banach space theory, such as Holder’s
Inequality and Minkowski’s Inequality, which are given here in a form conducive for work
in Hilbert space. These inequalities are focused on Banach space which is a secondary
goal. Consequently, only brief outlines of the proofs are given. Complete proofs are
readily available.
48
Theorem A.l. (Holder's Inequality). Let cq,..., an, . ,bn C C and let p, q > 0 be
such that A + 1 = 1. For p> 1 it holds that
n n n P
1.
52^1 -
3=1 .3=1 Z>l’
Ew” .3=1
This inequality is also obtained for p = 1;
n
52
i=i
- .3=1 : 1 < i < n}
Proof. The proof of Holder’s Inequality comes easily using Young’s Inequality, ab <
~ + y, and a clever substitution for a and b. □
Theorem A.2. (Minkowski's Inequality). Let cq,..., an, tq,... ,bn cC and let p, q> 0
be such that | ~ = 1. For p>l,
n
1
P n n1
P
Proof. The proof follows from Holder’s Inequality by factoring out a term of |a + 6| from
the sum notation. Note p = p/q + 1. □
In Section 2.1 was not shown to be complete. Instead, it will be shown here
that lp is complete.
While all the work in this section holds for p = oo, the above results are sufficient
for present purposes.
50
Bibliography
[Ret93] J. R. Retherford. Hilbert Space: Compact Operators and the Trace Theorem,
volume 27 of London Mathematical Society Student Texts. Cambridge University
Press, Cambridge, 1993.
[Roy88] H. L. Royden. Real Analysis. Macmillan Publishing Company, New York, third
edition, 1988.