100% found this document useful (1 vote)
112 views

Quantum Probability

This document is an introduction to quantum probability, quantum mechanics, and quantum computation for mathematically prepared readers. It discusses how quantum probability generalizes classical probability theory by replacing the commutative algebra of classical probability with a non-commutative algebra. Quantum states are described as vectors in a complex vector space or Hilbert space. The key feature of quantum mechanics is quantum superpositions, where quantum systems can exist in multiple states simultaneously.

Uploaded by

algebra
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
100% found this document useful (1 vote)
112 views

Quantum Probability

This document is an introduction to quantum probability, quantum mechanics, and quantum computation for mathematically prepared readers. It discusses how quantum probability generalizes classical probability theory by replacing the commutative algebra of classical probability with a non-commutative algebra. Quantum states are described as vectors in a complex vector space or Hilbert space. The key feature of quantum mechanics is quantum superpositions, where quantum systems can exist in multiple states simultaneously.

Uploaded by

algebra
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 11

An introduction to quantum probability, quantum mechanics, and quantum

computation
Greg Kuperberg
UC Davis
(Dated: October 8, 2007)

Quantum mechanics is one of the most surprising Nonetheless, theoretical results suggest that quan-
sides of modern physics. Its basic precepts require tum computers are possible rather than impossi-
only undergraduate or early graduate mathemat- ble. Entirely apart from technological implications,
ics; but because quantum mechanics is surprising, quantum computation is a beautiful subject that
it is more difficult than these prerequisites suggest. combines mathematics, physics, and computer sci-
Moreover, the rigorous and clear rules of quantum ence.
mechanics are sometimes confused with the more dif- This article is an introduction to quantum prob-
ficult and less rigorous rules of quantum field theory. ability theory, quantum mechanics, and quan-
Many working mathematicians have an excellent tum computation for the mathematically prepared
intuitive grasp of two parent theories of quantum reader. Chapters ?? and ?? depend on Section 1
mechanics, namely classical mechanics and proba- but not on each other, so the reader who is inter-
bility theory. The empirical interpretations of both ested in quantum computation can go directly from
of these theories, above and beyond their mathe- Chapter 1 to Chapter ??.
matical formalism, have been a great source of ideas This article owes a great debt to the textbook on
in mathematics, even for many questions that have quantum computation by Nielsen and Chuang [4],
nothing to do with physics or practical statistics. For and to the Feynman Lectures, Vol. III [2]. An-
example, the probabilistic method of Erd os and oth- other good textbook written for physics students is
ers [? ] is a fundamental method in combinatorics by Sakurai [5].
to show the existence of combinatorial objects. In
principle, the precepts of quantum mechanics could
Exercises
be similarly influential; there could easily be one
or more kind of quantum probabilistic method.
But in practice the precepts of quantum mechan- These exercises are meant to illustrate how empir-
ics are not very familiar to most mathematicians. ical interpretations can lead to solutions of problems
Two subdisciplines of mathematics that have assim- in pure mathematics.
ilated these precepts are mathematical physics and
1. The probabilistic method: The Ramsey num-
operator algebras. However, much of the intention
ber R(n) is defined as the least R such that if
of mathematical physics is the converse of our pur-
a simple graph has R vertices, then either it
pose, to apply mathematics to problems in physics.
or its complement must have a complete sub-
The theory of operator algebras is close to the spirit
graph with n vertices. By considering random
of this article; in this theory what we call quantum
graphs, show that
probability is often called non-commutative proba-
bility. 2(n1)/2
R(n) .
Recently quantum computation has entered as a (2(n!))1/n
new reason for both mathematicians and computer
scientists to learn the precepts of quantum mechan- (The proof can be described as a couting ar-
ics. Just as randomized algorithms can be moder- gument. However, a solution phrased in terms
ately faster than deterministic algorithms for some of probabilitistic existence is more in the spirit
computational problems, quantum algorithms can of these notes.)
be moderately faster or sometimes much faster than 2. Angular momentum: Let S be a smooth sur-
their classical and randomized alternatives. Quan- face of revolution about the z-axis in R3 , and
tum algorithms can only run on a new kind of com- let ~p(t) be a geodesic arc on S, parameterized
puter called a quantum computer. As of this writ- by length, that begins at the point (1, 0, 0) at
ing, convincing quantum computers do not exist. t = 0. Show that ~p(t) never reaches any point
within 1/|py (0)| of the vertical axis.
3. Kirchoffs laws: Suppose that a unit square is
Electronic address: [email protected] tiled by finitely many smaller squares. Show
2

that the edge lengths are uniquely determined evant axioms. One of the restrictions on the alge-
by the combinatorial structure of the tiling, bra is commutativity: If x and y are two real- or
and that they are rational. (Hint: Build the complex-valued random variables, then xy and yx
unit square out of material with unit resistivity are the same random variable. In quantum prob-
with a battery connected to the top and bot- ability, this commutative algebra is replaced by a
tom edges. Cut slits along the vertical edges of non-commutative algebra called a von Neumann al-
the tiles and affix zero-resistance wires to the gebra. The remaining definitions stay as much the
horizontal edges. Each square becomes a unit same as possible.
resistor in an electrical network.) We will mostly consider finite-dimensional quan-
tum systems. These are enough to show most of
the basic ideas of quantum probability, just as finite
1. QUANTUM PROBABILITY or combinatorial probability is enough to show most
of the basic ideas of classical probability. Infinite-
dimensional quantum systems are discussed in Sec-
The precepts of quantum mechanics are neither
tion ??.
a set of physical forces nor a geometric model for
To summarize, quantum probability is the most
physical objects. Rather, they are a generalization
natural non-commutative generalization of classical
of classical probability theory that modifies the ef-
probability. In this authors opinion, this description
fects of physical forces. If you have firmly accepted
does the most to demystify quantum probability and
classical probability, it is tempting to suppose that
quantum mechanics.
quantum mechanics is a set of probabilistic objects,
in effect a special case of probability rather than a
generalization. But this is not true in any reasonable 1.1. Quantum superpositions
sense; quantum probability violates certain inequal-
ities that hold in classical probability (Section ??).
It is also tempting to view quantum mechanics as We will begin by discussing part of the pure-state
a a deterministic dynamical system that produces model of quantum mechanics in order to show the
classical probabilities and is otherwise hidden. This inadequacy of classical probability.
interpretation is not reasonable either. A pure state of a quantum mechanical system can
In physics courses, quantum mechanics is usu- be described as a vector of a complex vector space
ally defined in terms of operators acting on Hilbert H. If the system is finite, then we can say that the
spaces. A state of a system is a vector of its Hilbert vector space is Cn . It will be convenient to label
space, the vector evolves by unitary operators, the the basis of this vector space by an arbitrary finite
vector is measured by Hermitian operators, and the set A rather than by the numbers from 1 to n; we
measured values have probability distributions. can then denote the vector space CA . The general
state space H is not just a vector space but a Hilbert
Although we will discuss the vector-state model,
space, meaning that it has a positive-definite Hermi-
we will emphasize the non-commutative probability
tian inner product h|i. When H is Cn or CA , then
model from operator algebras. In this model, a sys-
it has the standard inner product
tem can be fully quantum, or fully classical, or things
in between. The fully quantum case corresponds to X
the vector-state model, but even in this case, the h|i = a a .
aA
general state is described by an operator rather than
a vector. The states that can be described by vectors In quantum theory, the traditional notation is |i
are called pure; the others are mixed states. (a ket) for a vector and h| (a bra) for the
The vector-state model of quantum mechanics corresponding dual vector
was originally known as matrix mechanics and is
due to Heisenberg. The historical alternative is h| = = h|i.
Schrodingers wave mechanics. Wave mechanics is
best understood as a special case of matrix mechan- This notation is due to Dirac [1] and is called bra-
ics, and we will describe it this way. The probabilis- ket notation. Recall also that a linear map from a
tic interpretation of quantum mechanics is due to Hilbert space to itself is called an operator.
Max Born and is known as the Copenhagen inter- In finite quantum mechanics, as in classical proba-
pretation (Section ??). bility, we can define a physical object by specifying a
Since classical probability is a major analogy for finite set A of independent configurations. In infor-
us, it is reviewed in Section ??. The point is that a mation theory (both quantum and classical), the ob-
classical probabilistic system (or measurable space) ject is often called Alice. Classically, the set of all
is an algebra of random variables that satisfies rel- normalized states of Alice is the simplex A spanned
3

by A in the vector space RA (see Section ??). I.e., The requirement that U is linear is the quantum su-
a general state has the form perposition principle. It appears to contradict the
X classical superposition principle, and it is thus an ap-
= pa [a] parent paradox of quantum probability. (However,
aA the treatment in Section 1.3 reconciles the two sides
of this paradox.) The entries of U are also called
for probabilities pa 0 that sum to 1. (For unnor-
amplitudes, just as the entries of a stochastic map
malized states, the sum need not be 1.) The number
are also probabilities. Since we have posited that
pa is interpreted as the probability that Alice is in
|a |2 is a probability, U conserves total probability
state a. Quantumly, Alices set of pure states is the
if and only if
vector space CA . In other words, a state of Alice is
a vector ||U || = ||||
X
|i = a |ai for all CA ; i.e., if U is a unitary embedding. If
aA A = B or at least |A| = |B|, then U is a unitary
operator.
with complex coefficients a that are called ampli- It will be convenient to consider maps that pre-
tudes. The square norm |a |2 is interpreted as the serve or decrease probability. Such maps are called
probability that Alice is in the configuration |ai. The extinction processes; the model random walks that
total probability is therefore the sum can terminate, experiments that can be scratched,
X etc. A classical map M of this kind is called sub-
h|i = |a |2 . stochastic. The corresponding quantum condition is
aA
||U || ||||
The state |i is normalized if this sum is 1. The
phase of a (i.e., its argument or angle as a complex and such as U is subunitary.
number) has no direct probabilistic interpretation,
but it becomes important when we consider opera-
tors on |i. While the relative phase of two coor- i/2 i/2
dinates a and a is indirectly measurable, it will
i/2
turn out that the global phase of |i is not mea-
i/2
surable, i.e., it is not empirical. Indeed, the global
phase of |i is absent from the operator formalism i/2 i/2
that we will define in Section 1.3.
The state |i is also called a quantum superposi-
tion, an amplitude function, or a wave function. This Figure 1: An idealized two-slit experiment.
last name is motivated by the fact that |i typically
satisfies a wave equation in infinite quantum me- One traditional, idealized setting for the quan-
chanics (Example ?? and Section ??). It also pre- tum superposition principle is a diffraction appara-
dates the Copenhagen interpretation and arguably tus known as the two-slit experiment. Figure 1 shows
distracts from it. the basic idea: A laser emits photons that can travel
If A and B (Alice and Bob) are the configu- through either of two slits in a grating and then may
ration sets of two classical systems, then an empiri- (or may not) reach a detector. The source has a sin-
cally allowed map from Alices state to Bobs state gle state (the state set A has one element), while the
is given by a stochastic linear map grating has two states and there are two detectors
M : RA RB , (B and C each have two elements). The transitions
for each photon, as it passes from A to B to C, are
also called a Markov map. The property that M described by two subunitary matrices
is linear is the classical superposition principle: dis-
joint probabilities add. In addition, in order to be U : CA CB V : CB CC .
stochastic, M must have positive entries (so that We can choose the matrices to be
probabilities remain positive) and its column sums i i i

must be 1 (to conserve probability). U = 2i V = 2i 2 ,
In the quantum case, an empirical transition from 2 2 2i
Alices vector states to Bobs vector states is a linear so that
map  1
2
VU = .
U : CA CB . 0
4

The total amplitude of the photon reaching the top could be tuned to fire only one photon at a time.
detector is 12 and the probability is 14 ; this case Of course, a two-slit experiment is only an ideal-
is called constructive interference. The total ampli- ization of a real experiment; however, it is very sim-
tude reaching the bottom detector is 0, so the photon ilar to many actual experiments and even routine
never reaches it; this case is called destructive in- demonstrations. Note also that diffraction experi-
terference. On the other hand, if one of the slits of ments can portray any operator and therefore any
blocked, then we can discard one of the states in |B|, process in quantum mechanics, just as any classi-
with the result that each detector is reached with cal stochastic map can be modelled by balls falling
1
probability 16 . The classical superposition principle through chutes. The two-slit experiment can be
would dictate a probability of 81 for each detector demonstrated with photons, or electrons or even
with both slits open; thus it is violated. molecules, but it really describes general probabilis-
tic rules. See Sections 1.5 and ?? for more discus-
sion.
i/2 i/2 Examples 1.1.1. A qubit is a two-state quantum
object with configuration set {0, 1}. Two of their
quantum superpositions are:
i/2 i/2
|0i + |1i |0i |1i
|+i = |i =
2 2
Figure 2: An angle-dependent detector in the two-slit Both of these states have probability 12 of being in
experiment. either configuration |0i or |1i, but they are differ-
ent states. This is demonstrated by the effect of a
A natural reaction to the violation of classical su- unitary operator H called the Hadamard gate:
perposition is to try to determine which slit the pho-  
ton went through. For instance, the detector could 1 1
H= .
be sensitive to the angle that the photon comes in, 1 1
as in Figure 2. Or there could be a detector at one of
the slits that notices that the photon passed through It exchanges |0i with |+i and |1i with |i.
it. But in any such circumstance, the two paths then The spin state of a spin- 12 particle is a two-state
results in different final states (of the experiment as system which is important in physics. (Electrons,
a whole) rather than in the same state. Thus the protons, and neutrons are all spin- 21 particles.) The
final state vector is conventional orthonormal basis is |i (spin up) and
 1
4 |i (spin down). The names of the states refer to
|i = the property of the electron spinning (according to
14
the right-hand rule) about a vertical axis in these
and its total probability is two states. Even though a rotated electron is still
1 an electron, neither this configuration set nor any
h|i = ||||2 = , other is preserved by rotations. The resolution of
8
this paradox is that rotated states appear as super-
regardless of the phases of path segments to and
positions. For example, the spin left and spin right
from the slits. The lesson is that amplitudes of dif-
states are analogous to |+i and |i:
ferent trajectories of an object only add when there
is no evidence of which trajectory it took. If the |i + |i |i |i
trajectory is recorded at all, the probabilities add. |i = |i = .
If we want to see quantum superposition, it is not 2 2
enough to wittingly or unwittingly ignores such ev- Although we will soon switch to a more general
idence. Rather, if the two trajectories induce dif- model of quantum probability, we can say a few
ferent states of the universe, so that some observer words about presenting this vector space model in a
could in principle distinguish them, then they obey basis-independent form. As we said, a quantum ob-
classical superposition. Moreover, the effect is not ject can be assigned any Hilbert space H rather than
the result of interaction between photons; photons the standard finite-dimensional vector space CA for
do not interact with each other1 . Indeed, the laser

quire the extreme energies of cosmic rays and particle ac-


1 More precisely, significant photon-photon interactions re- celerators.
5

a configuration set A. Since states evolve by unitary Such an expansion is interpreted as path sum-
operators, we can conjugate the standard basis of mation; it is the same idea as a sum over his-
CA by any unitary operator, to conclude that any tories in classical probability.
orthonormal basis of any Hilbert space H can be For example, let n = 4 and let each
called a configuration set. Likewise, if we accept a
real-valued function f on A as a random variable,
!
1 1 1
then we can model it by a diagonal matrix D who Uk = .
entries are the values of f . We can then conjugate 2 1 1
that too by a unitary operator U , to conclude that
any Hermitian operator H = U DU 1 represents a Find the amplitudes of the 16 paths and group
real-valued random variable. If the eigenvalues of them according to how they sum.
H are 0 and 1, so that H = P is a Hermitian pro- 5. In general for a spin- 21 particle, the state
jection, then this corresponds to a Boolean random
variable. These are the basic rules of vector-state |~v i = |i + |i
quantum probability, with the exception of the cru-
cial tensor product rule for joint states (Section 1.5). spins in the direction

~v = (Re , Im , ||2 ||2 ).


Exercises
Check that this is a unit vector when |~v i is
1. Suppose that the lengths of the entries of a normalized, and that every unit vector in R3 is
complex matrix U are all fixed, but the phases achieved. This formula is therefore a surjective
are all chosen uniformly randomly. (If you like, function from the unit 3-sphere S 3 C2 to the
you can also suppose that for any choice of the 2-sphere S 2 R3 . What is its usual name in
amplitudes, U is subunitary.) Show that on mathematics?
average, each entry of U |i satisfies the clas-
sical superposition principle.
1.2. A classical review
2. If U is a matrix, then the matrix
Mab = |Uab |2 Since our intention is to generalize classical prob-
ability, we will review some of the notions of this
can be called dephasing of U . A dephasing of theory in the finite case.
a unitary matrix is always doubly stochastic, A classical probabilistic system is most commonly
meaning that the entries are non-negative and modelled by a Boolean algebra of random vari-
the rows and columns sum to 1. Find a 3 ables. (In the infinite case, it should be a -algebra;
3 doubly stochastic matrix which is not the see Section ??.) That the algebra is Boolean means
dephasing of any unitary matrix. that it is an algebra over Z/2, and that every ele-
3. Show that every n k subunitary matrix U ment is an idempotent, x2 = x. The elements of
can be extended to an (n + k) (n + k) unitary are called events. They correspond to random
matrix V : variables that take the values 1 and 0, or equiva-
lently true and false, or yes and no. (Multiplication
U
 
V = . is Boolean AND, while adding 1 is Boolean comple-
mentation.) A state or distribution of is then a
Show that V cannot usually have order less function from to [0, ] such that:
than n + k.
(x) 0, and
4. If U1 , U2 , . . . , Un are unitary operators, then P  P
each entry of their product x
j j = j (xj ) when xj xk = 0 for all
U = Un . . . U2 U1 j and k.

can be expressed as a sum of products of en- The value


tries of the factors:
P [x] = (x)
han |Un . . . U2 U1 |a0 i
X represents the probability of the event x (the prob-
= han |Un |an1 i . . . ha2 |U1 |a1 iha1 |U1 |a0 i. ability that x is true). So the axioms say that prob-
a0 ,a1 ,...,an abilities are positive, and probabilities of disjoint
6

events add. The state is normalized if (1) = 1, The state is normalized if (1) = 1. (We
which means that the total probability is 1. write M# instead of M for the dual space
If the algebra is finite, then it is isomorphic to because is already used internally to M.)
(Z/2)A , the (Z/2)-valued functions on some finite
If you know or suppose that M = CA , then it is not
set A or the algebra of subsets of A. The set A is
the set of configurations of . hard to show that the self-adjoint elements are the
The complex-valued random variables over or real-valued random variables RA , the positive ele-
A form an algebra denoted L () or CA = (A). ments are the non-negative random variables RA 0 ,
This algebra is generated (as an algebra over C) by and the Boolean variables are the 0 1-valued vari-
the elements of , with addition in Z/2 forgotten ables {0, 1}A = (Z/2)A = .
and multiplication retained. In other words, if xy = It is also not hard to show that the two definitions
z in , then this is also imposed as a relation in of a state are equivalent. Indeed, the set of normal-
L (). (Technically, L () is only the bounded ized states of M or is the simplex A RA that
random variables and is a Banach-space completion consists of convex sums of elements of A. This sim-
of the algebra so generated, but these concerns are plex is shown for a two-state system (a randomized
only important in the infinite case; see Section ??.) bit) and a three-state system (a randomized trit) in
The state extends to a linear functional on L (), Figure ??, together with an example element in each
so that case. To support this picture, we define [a] to be the
state which is definitely a. For example, if a bit is 1
E[x] = (x) with probability p, then its state is

now represents the expected value of x as a random = (1 p)[0] + p[1].


variable. Also, L () has an involution x 7 x that
conjugates the coefficients and values of x and that The notion of assigning a state or probability dis-
will be crucial later. The element x is called the tribution to a probabilistic system has two differ-
adjoint of x and it is also written x in the physics ent empirical interpretations, and the distinction be-
literature. tween them will be important. 2 In the frequen-
An equivalent formulation is to write axioms for tist interpretation, the state of an object is always
an algebra M which can be recognized as L () for a configuration a A, although you may not know
some Boolean algebra . In this approach, M is a which one; and a distribution is a summary of
commutative, positive-definite -algebra. By defini- which configuration you witness in repeated trials.
tion: In the Bayesian interpretation, the state of an ob-
ject is a probabilistic state A , which however is
M is an associative algebra over the complex observer-dependent; it represents the observers ra-
numbers C. tional belief about which configuration a A will be
witnessed, whether or not repeated trials are possi-
M has an anti-linear, anti-automorphism : ble.
(x) = x (x+y) = x +y (xy) = y x Frequentism and Bayesianism are mathematically
equivalent. They are only different philosophi-
cally, or they may lead to different practical ad-
M is positive-definite, meaning that if x x = vice. However, quantum probability required a de-
0, then x = 0. gree of Bayesianism. Although frequentism will re-
M is commutative; xy = yx for all x and y. main valid in some contexts, strict frequentism is un-
tenable as the fundamental interpretation of quan-
If M is finite-dimensional, then these axioms imply tum probability. So it is good practice to think of
that M is isomorphic to CA for a finite set A, so that a randomized bit, for example, as living in an in-
it is indeed equivalent to the other axiom set. (If M termediate state between 0 and 1, i.e., a classical
is infinite-dimensional, then these axioms should be superposition.
strengthened, as we will discuss in Section ??.) Finally, one fundamental operation on states, es-
Here are some other important definitions related pecially in the Bayesian interpretation, is the notion
to M.
An element z M is self-adjoint if x = x ; it
is positive, or x 0, if x = y y for some y; and 2 Actually there are several variations of both interpretations
it is Boolean if it is self-adjoint and if x = x2 . in this endless debate in statistics. We have chosen a fairly
aggressive flavor of frequentism and a fairly conservative
A state is a dual vector M# which is pos- flavor of Bayesianism. This is not entirely fair, but it serves
itive on positive elements: (x) 0 if x 0. our pedagogical goals.
7

of a conditional state. If p is a Boolean random An element x M is positive, or x 0, if


variable in M and is a state, then as we said, the x = y y for some y. If M is finite-dimensional,
probability of p is P [p] = (p). If p is witnessed by the positive elements form a cone M+ .
an observer who knows or believes the prior state ,
then afterwords M has an updated state p given An element p M is Boolean if it is self-
by the formula adjoint and if p = p2 . Such a p is also called a
self-adjoint projection. The Boolean elements
(px) form a set Mbool .
p (x) = .
(p)
A dual vector M# has an adjoint defined
This is the state conditioned on p, i.e., what by (x) = (x ), and it is self-adjoint if =
becomes given that p was witnessed. The formula . The set of self-adjoint dual vectors is the
is not meaningful if (p) = 0, which is to say, if real vector space Msa .
p is impossible. We also define the unnormalized
conditional state A state is a dual vector M# which is pos-
itive on positive elements: (x) 0 if x 0.
|p (x) = (px), (1) The set of states is a dual cone M+ .
which is well-defined regardless of the probability of The state is normalized if (1) = 1. The set
p. This state is the empirical posterior state if we of normalized states is the state region M .
view the measurement of p as an extinction process,
by declaring extinction if p is false. These definitions yield the following elementary
inclusions:

Exercises Mbool M+ Msa M


M M+ Msa M# .
1.3. Algebras and states
Classically (i.e., if M is commutative), Msa and
In this section we will define quantum probability Mbool are both closed under multiplication, so that
as non-commutative probability. This is the other Msa is a real algebra and Mbool is a Boolean alge-
end from Section 1.1; it makes quantum probability bra. However, quantumly neither one is closed under
seem as similar as possible to classical probability. multiplication, so that at first glance, Msa is only a
We will conclude by showing that the the two de- real vector space and Mbool is only a set. Actually,
scriptions are equivalent. Mbool has somewhat more structure than that; see
We chose the previous sections axioms for an al- Exercise ??. The most important extra structure
gebra M of complex random variables so that they at the moment is that M is partially ordered with
could be made quantum simply by dropping commu- respect to positivity: x y if x y 0. The inher-
tativity. To review, M is a positive-definite -algebra ited partial orderings of Msa and Mbool are both
if important.
If M is finite-dimensional, then we can classify
M is an associative algebra over the complex its structure using the Artin-Schreier theorem, be-
numbers C. cause its positive-definite structure implies that it is
M has an anti-linear, anti-automorphism : semisimple (Exercise ??). Since the complex num-
bers are algebraically closed, the theorem says that
(x) = x (x+y) = x +y (xy) = y x M is isomorphic to a direct sum of matrix algebras:

M
M
M is positive-definite, meaning that if x x = = Mnk .
0, then x = 0. k

If M is finite-dimensional, then these axioms are In particular, if M is a matrix algebra Mn , then it


adequate; they are the main definition of a finite is as non-commutative as possible. We will call such
quantum system. an M and the system that it models fully quan-
We can also repeat these related definitions with- tum. In basis-independent form, a fully quantum
out changes: system M is the algebra B(H) of operators on a
finite-dimensional Hilbert space H. (The B is for
An element x M is self-adjoint if x = x . bounded, although in this context all operators
Such elements form a real vector space Msa . are bounded; see Section ??.)
8

If M is fully quantum, then we can use the matrix


trace to convert a state from a dual vector on M
to an element.

(x) = Tr(x).

(Actually this works in general, using the sum of the


traces of the matrix summands.) Then is positive
as a dual vector if and only if it is positive as an
element of M, if and only if it is a positive-definite
Hermitian matrix (Exercise ??). Also is normal-
ized if and only if Tr() = 1. Because such a is
a matrix and because its diagonal entries are proba- Figure 3: A Josephson junction qubit: superconducting
bilities, physicists also call it a density matrix or (in aluminum on a silicon chip [3].
basis-independent form) a density operator. In this
terminology, densitymeans probability density, as
in a probability distribution. The diagonal entries (This formula is consistent with the fact that all
of a density matrix are in fact probabilities of con- probabilities are linear in .)
figuration (Section ??). But what about quantum superpositions? If M is
Example 1.3.1. The 2 2 matrix algebra M2 , or fully quantum, then they are also present in different
a system that it models, is a second and better def- guise from classical superpositions. To help separate
inition of a qubit. The Pauli spin matrices are a the terminology, classical superpositions are in gen-
convenient basis for (M2 )sa : eral called mixtures, while quantum superpositions
(when they are defined) are often just called super-
! ! positions.
1 0 0 1 In general, if K is a convex set in a real vector
0 = I = 1 = X =
0 1 1 0 space, then a point in K is extremal means that
it is not a convex combination of two other points
! !
0 i 1 0
2 = Y = 3 = Z = . in K. An elementary theorem in convex geometry
i 0 0 1 states that if K is compact and finite-dimensional,
then every point in K is a convex linear combina-
A state of M2 is positive and normalized if and tion of its extremal points. If K = M , then the
only if it is of the form extremal points are those states that are not mix-
tures. These states are called pure and other states
I + aX + bY + cZ
= , are called mixed. By the theorem, every mixed state
2 is a mixture of pure states. However, in a fully quan-
where a, b, and c are three real numbers that satisfy tum system, the representation of a state as a mix-
ture is never unique (Exercise ??).
a2 + b2 + c2 1. If M = Mn is fully quantum, then a state is
pure if and only if it has rank 1 as a matrix (Exer-
The state region M 2 is therefore a geometric sphere cise ??). It then has the form
in the affine space of unit-trace, 2 2 Hermitian
matrices. It is known as the Bloch sphere. = = |ih|

As in the example of a qubit, an important dif- for some vector Cn , since it is also Hermitian.
ference between quantum probability and classical If is normalized, then in addition is normalized,
probability is that the state region M is not a sim- by the relation
plex (except in the commutative case). But it is al-
ways convex, because it is defined by linear equalities Tr = h|i.
and inequalities. This convex structure allows clas-
sical superpositions in a quantum setting. Empiri- In basis-independent form, if M = B(H), and if a
cally, if we have two states 1 and 2 of a quantum state on M is pure, then it is described by a vector
system, and if we prepare a new state by choosing |i H. A configuration set of M is, by definition,
1 with probability p and 2 with probability 1 p, any orthonormal basis of H. Any state |i is a com-
then plex linear combination of the configurations, and
such a linear combination can be called a quantum
= p1 + (1 p)2 . superposition.
9

To summarize, a pure state of a fully quantum M is given by a unitary operator u : HA HB by the


is represented by a vector in a Hilbert space, and it formula
is a quantum superposition of any orthonormal basis
of configurations. However, the transformation from E(x) = uxu1 .
a vector state |i to the corresponding density op- In more concrete terms, the automorphism group of
erator = |ih| is non-linear and erases the global Mn as a -algebra is the unitary group U (n) (Exer-
phase of |i. Therefore empirical probabilities are cise ??). Note that u is conventionally covariant, so
a non-linear function of the vector state, and the that it transfers pure states forward, from Alice to
global phase of a vector state is not directly empiri- Bob.
cal. (But relative phases are empirical, so the global Note also that this motivation for unitary opera-
phase of |i is indirectly empirical, if |i is used as tors does not support the analogy with Markov maps
a summand of another vector.) in Section 1.1. Rather, unitary operators are analo-
gous to permutations of configurations of a classical
Example 1.3.2. Since the state region M 2 of a system, since these are the reversible maps among
qubit is a geometric sphere, every boundary point is
Markov maps. This is another way to say that we
a pure state |ih|. It is not hard to check that any
have resolved the paradox of that section: Classical
two opposite points form a (line) basis of the Hilbert
and quantum superposition do not contradict each
space C2 . In the context of quantum computation,
other because in some ways, they are not analogous.
the standard basis of C2 is called |0i and |1i, and
Section ?? discusses the correct notion of quantum
these states are assigned to the top and bottom of
Markov maps, namely quantum operations. Classi-
the Bloch sphere, as in Figure ??. Another basis is
cal and quantum superposition coexist for quantum
|0i + |1i |0i |1i operations, just as they do for states in the operator
|+i = |i = . formalism.
2 2
Another coherent operation is conditioning a state
These states with a Boolean random variable. If is a state of
In the middle is the uniform state (also called the M and p Mbool is Boolean, then the unnormalized
maximally mixed or maximum entropy state) = conditional state is defined by
I/2. |p (x) = (pxp).
Although probabilities are nonlinear functions of This reduces to equation (??) when M is commuta-
vector states |i, there are several important oper- tive, but it cannot be exactly the same formula as
ations on a quantum system M which are linear on before because (px) is not positive as a dual vector
vector states. We can call such operations coher- in x, nor even self-adjoint. It is easy to check that if
ent ; they serve to justify the quantum superposition M is fully quantum, then the pure state |i condi-
principle in Section 1.1. tions to the pure state p|i. So conditioning a state
The most important coherent operation is an al- is linear on pure states.
gebra isomorphism. Automorphisms and isomor- With a bit of modification, unitary operators and
phisms in general are the model of reverse dynami- projections generate all subunitary maps between
cal systems and reversible physical transformations. any two Hilbert spaces (Exercise ??). These are
The wrinkle is that algebra isomorphisms, and more all of the linear operators, or coherent operations,
generally algebra homomorphisms, are contravari- used in Section ??. Their construction establishes
ant, meaning that they transfer states backward. If the quantum superposition principle as a corollary
Alice and Bob have algebras MA and MB , then a of non-commutativity. In Section ??, quantum su-
homomorphism perpositions appear in another way, as a corollary of
the classical superposition principle.
E : MB MA

transfers states from Alice to Bob by means of its Exercises


transpose:
1.4. Measurements
E # : M# #
A MB .

If Alice and Bob are both fully quantum and have In this section we will look more closely at the
Hilbert spaces HA and HB of the same dimension, measuring quantum random variables. A random
then every algebra isomorphism variable x Msa (or more generally a classical do-
main A M as defined below) is also called a mea-
E : B(HB ) B(HA ) surable or an observable. To measure it is to pass
10

to the conditional state, just as is done in classical random variable is then in general defined by a set
probability. of mutually exclusive Booleans that sum to 1:
We said that if p is a Boolean random variable X
in an algebra M, then the unnormalized conditional pa = 1 a 6= b = pa pb = 0.
state is aA

|p (x) = (pxp). If M = B(H) is fully quantum, then this system of


projections is equivalent to an orthogonal direct sum
The normalized conditional state is thus decomposition of the Hilbert space H:
M
(pxp) H= Ha .
p (x) = ,
(p) aA

or in the vector-state case, It will also be convenient to generalize a classical


realm to allow some of the projections pa to van-
p|i ish. This is equivalent to making the realm a homo-
|p i = p . morphism A M rather than a subalgebra. Sec-
h|p|i
tion ?? discusses a much more significant general-
Conditioning on a measurement is also called state ization known as a POVM.
collapse or wave function collapse, but in the The most important case of a classical realm A is
context of non-commutative probability, this is an one generated by a single self-adjoint element x
overly dramatic term. The concept of a conditional Msa . In this case the structure of A implies that x
state is very natural in classical probability; and it is has a spectral decomposition,
equally natural and not all that different in quantum X
probability. See Section ??. x= p ,
The behavior of non-commuting random variables (x)
...
where the value set of the measurement, A = (x),
Example 1.4.1. is also called the spectrum of x. The probability
formula,
If any set of Boolean variables in M all com-
mute, or indeed if any set of self-adjoint elements P [x = ] = (p ),
in M all commute, then they generate a commuta-
tive, positive-definite -subalgebra A M. We will is then consistent with the expectation interpreta-
call such an A a classical realm in M. Elements in tion of the state ,
A are elements in M, and states on M restrict to
E[x] = (x).
states on A. While we work within A, we are free
to use any notion or result from classical probabil- So Msa is the space of real-valued random variables,
ity without modification. In particular, the elements just as it was classically. (Note that if M is fully
that we chose to generate A have a joint distribu- quantum, then the structure theorem for this A is
tion, and the order that they are measured does not equivalent to the spectral theorem for Hermitian ma-
matter. trices.)
If a classical realm A M is finite-dimensional, Another important type of classical realm A is a
then it is isomorphic to CA for some set A. Thus a maximal commutative -subalgebra of M. (By def-
state on M induces a probability for each outcome inition, A is not contained in any commutative -
a A. Moreover, A has a basis {pa } of minimal subalgebra B.) If M = B(H) is fully quantum, then
projections indexed by the set A. We can then say it is easy to show that A is maximal if and only if
that A is a model of an A-valued random variable A and H have the same dimension (Exercise ??);
with postconditioned states, using the same formulas indeed A consists of the diagonal matrices with re-
as for a single Boolean variable: spect to some orthonormal basis A of H. Each min-
imal projection pa of A has rank 1. It follows that
(pa xpa ) the conditional state a does not depend on ; it is
P [a] = (pa ) a (x) = .
(pa ) always the state
We can also run the construction backwards to a = pa = |aiha|.
build A from its minimal projections. Say that two
Booleans p and q are mutually exclusive if pq = 0 Although the basis A need only be a line basis of H,
(so that they necessarily commute). An A-valued it is often convenient to make it a vector basis, so
11

that H = CA . The set A is a configuration set, in and only if they have no overlap (Exercise ??).
keeping with Section 1.1. If n = |A| = dim H, then Unlike real-valued random variables, there are two
we say that M is an n-state system, even though notions of complex-valued and vector-valued random
technically n is the number of configurations rather variables. We can let z be any element of M, which
than the number of states. For example, a qubit is the complexification of Msa , so that
can also be described as any fully quantum 2-state
system. z + z z z
A maximal classical realm A is also called a com- z = x + iy x= y= .
2 2i
plete measurement. The name evokes the fact that
once any configuration a A is measured, the condi- Then z is a complex random variable in a weak sense,
tional state is pure and determined by a, so there is because x and y are both self-adjoint and are both
no more left to learn from the state of M. (Nonethe- therefore real random variables. This defines a com-
less, M has many different complete measurements; plex random variable in the weak sense. The wrinkle
and as we said, even a pure state is typically still is that x and y may not commute with each other,
a source of perpetual randomness.) Note also that in which case z and a state do not yield a distri-
if M is fully quantum and we have chosen a basis bution on C. If the real and imaginary parts x and
so that A consists of diagonal matrices, then the di- y do commute, or equivalently if z and z commute,
agonal entry aa of a state is just the probability then z is normal. A state and a normal z generate
of the outcome a A. We can view as a classical a classical realm and a classical state on C as usual.
probability distribution on A, plus extra off-diagonal Likewise a vector-valued random variable in the
information. weak sense is any ~v M V for some vector space
If M is not fully quantum, then some of the above V . We can say that ~v is normal when its components
analysis has to be modified. Nonetheless, it is still commute in any basis of V , in which case it generates
true that all of the conditional states of a maximal a classical realm and a classical state on V , given a
classical realm A are pure, that a set A of such out- state on M. An important example of the weak
comes is called a configuration set, and that any two kind of vector-valued random variable is the angular
configuration sets have the same cardinality. If momentum operator (Section ??).
Note the set Mnor of normal elements of M is
M
M
= Mnk , not closed under either addition or multiplication
k
(Exercise ??). The same is true of the set (MV )nor
then the cardinality of vector-valued measurements, or in general the A-
P of A is the total sum of the valued measurements where the set A is an abelian
matrix sizes, n = k nk .
Similar to a complete measurement, if is a pure group. Only the real random variables, Msa , have
state, then there is an associated minimal Boolean the special property that they can be added even if
p with the same matrix as which answers whether they do not commute.
the system is in the state . If p and q are two such
minimal Booleans, then Tr(pq) is both the proba-
bility that the state p will be found in the state q, 1.4.1. Exercises
and vice-versa; it can be called the overlap between
p and q. If M is fully quantum, so that p and q
have state vectors |ai and |bi, then their overlap is 1.5. Joint systems
|ha|bi|2 . Two pure states are mutually exclusive if

[1] Paul A. Dirac, Principles of quantum mechanics, Ox- IEEE Trans. Appl. Superconduct. 13 (2003), no. 2,
ford University Press, 1930. 989993.
[2] Richard P. Feynman, Robert B. Leighton, and [4] Michael A. Nielsen and Isaac L. Chuang, Quantum
Matthew Sands, The Feynman lectures on physics. computation and quantum information, Cambridge
Vol. 3: quantum mechanics, Addison-Wesley, 1965. University Press, Cambridge, 2000.
[3] K. M. Lang, S. Nam, J. Aumentado, C. Urbina, [5] Jun John Sakurai, Modern quantum mechanics, 2nd
and John M. Martinis, Banishing quasiparticles from ed., Benjamin/Cummings, 1985.
josephson-junction qubits: why and how to do it,

You might also like