Instant Download Quantum Stochastic Thermodynamics: Foundations and Selected Applications (Oxford Graduate Texts) Philipp Strasberg PDF All Chapter
Instant Download Quantum Stochastic Thermodynamics: Foundations and Selected Applications (Oxford Graduate Texts) Philipp Strasberg PDF All Chapter
Instant Download Quantum Stochastic Thermodynamics: Foundations and Selected Applications (Oxford Graduate Texts) Philipp Strasberg PDF All Chapter
com
https://ebookmass.com/product/quantum-
stochastic-thermodynamics-foundations-and-
selected-applications-oxford-graduate-texts-
philipp-strasberg/
ebookmass.com
More products digital (pdf, epub, mobi) instant
download maybe you interests ...
https://ebookmass.com/product/discrete-communication-systems-
oxford-graduate-texts-stevan-berber/
https://ebookmass.com/product/thermodynamics-and-applications-in-
hydrocarbon-energy-production-abbas-firoozabadi/
https://ebookmass.com/product/quantum-mechanics-3rd-edition-
nouredine-zettili/
https://ebookmass.com/product/introduction-to-quantum-field-
theory-with-applications-to-quantum-gravity-1st-edition-iosif-l-
buchbinder/
The Historical and Physical Foundations of Quantum
Mechanics Robert Golub
https://ebookmass.com/product/the-historical-and-physical-
foundations-of-quantum-mechanics-robert-golub/
https://ebookmass.com/product/optimizations-and-programming-
linear-non-linear-dynamic-stochastic-and-applications-with-
matlab-abdelkhalak-el-hami/
https://ebookmass.com/product/quantum-materials-devices-and-
applications-mohamed-henini/
https://ebookmass.com/product/chatgpt-for-beginners-features-
foundations-and-applications-1st-edition-eric-sarrion/
https://ebookmass.com/product/quantum-dots-fundamentals-
synthesis-and-applications-rakshit-ameta/
Quantum Stochastic Thermodynamics
Quantum Stochastic Thermodynamics
Foundations and Selected Applications
Philipp Strasberg
Universitat Autònoma de Barcelona
3
This book is dedicated to the memory of my PhD supervisor, Tobias Brandes
Preface
Recent decades have seen much progress in our understanding of thermodynamic pro-
cesses at the nanoscale. But nanoscale systems are very small, in most applications
far from equilibrium, often subject to strong fluctuations and sometimes even char-
acterized by exotic quantum properties—so it seems that these features rule out any
possibility of finding a consistent thermodynamic description for them.
It is the primary objective of this book to show that this is not the case. There is a
thermodynamic framework, characterized by a remarkable internal consistency, that
is able to describe nanoscale systems even under extreme conditions. Moreover, this
framework not only reaffirms common folklore around thermodynamics (there is no
perpetual motion machine, etc.), but also provides a wealth of beautiful results beyond
the traditional scope of thermodynamics—opening up the possibility of understanding
a plethora of different physical situations from a unified perspective.
The main title Quantum Stochastic Thermodynamics suggests that the present
book is about a synthesis of two research fields: classical stochastic thermodynamics
and quantum thermodynamics. Both have pushed the boundaries of the applicability of
the laws of thermodynamics by explaining and supplementing them with microscopic
considerations. For a considerable large class of nanoscale systems and processes, I
believe that most foundational questions have been settled by now. The present book
is intended to fill a gap in the literature by justifying this claim in detail for a large
variety of situations. I am also convinced that its content will prove important in
exploring new territories at the rapidly evolving frontiers of this field.
The subtitle Foundations and Selected Applications emphasizes that the reader can
mainly expect explanations about the basic theoretical pillars. These explanations are
intended to be pedagogically accessible. However, the book is also driven by the desire
to introduce a general and versatile framework characterized by conceptual clarity—
in complete awareness of the fact that this poses additional technical obstacles for
the beginner. To remedy this, a considerable effort has been made to transparently
explain common ‘jargon’ in the community (non-equilibrium entropies, local detailed
balance, Landauer’s principle, entropy production, time-reversal symmetry, the arrow
of time, etc.), which often appears unnecessarily mystifying (a problem that seems to
have a tradition in thermodynamics and statistical mechanics). The reader will often
find the same (or closely related) results derived in different ways in order to generate
confidence and trust in the framework.
The field of quantum stochastic thermodynamics is, however, fascinating, not only
because it allows us to addresss foundational questions about the nature of heat,
entropy or the second law, but also because it might have direct practical applications
in a world with increasing nanotechnological abilities. These applications could come
in the form of efficient thermoelectric devices, powerful energy harvesters, fast cooling
How to Read this Book vii
Structure of the book: To get a complete picture, I believe one should sooner or
later read the entire book and, then, it would perhaps be most beneficial to go through
it in linear order. However, I also believe that the impatient reader should not be afraid
to skip sections or chapters. For instance, Chapter 1 (Quantum Stochastic Processes)
appears to be the most abstract one, in particular its second half. While I believe that
this more abstract point of view helps to view the entire field in a clear and unified
way, it is certainly not necessary to reach an understanding of Chapter 2 (Classical
Stochastic Thermodynamics), which solely requires some basic background knowledge
of the theory of classical stochastic processes. Likewise, readers with some familiarity
with open quantum system theory can directly start reading Chapter 3 (Quantum
Thermodynamics Without Measurements). To understand Chapter 4 (Quantum Fluc-
tuation Theorems), some background information from previous chapters is required.
For some sections of Chapter 5 (Operational Quantum Stochastic Thermodynamics)
it is necessary to have also read and understood the end of Chapter 1. Finally, three
appendices complement this book by providing information about topics that appear
viii Preface
at various points in the main text, but whose detailed exposition requires a longer
detour, which would blur the main narrative.
Structure of the sections: Typically, I have tried to start each section with
a small paragraph motivating its content and to end each section with a small sum-
mary or outlook. To facilitate orientation, some sections (in particular longer ones) are
divided into subsections using unnumbered subtitles. Furthermore, important state-
ments are distinguished by longer italic text and boxed equations highlight important
definitions or results.
Index: I have tried to make a long and informative index list. Words or phrases
appearing in this list are printed in a boldface font in the main text at the point
where they are first introduced or explained. However, as I said above, the book is
characterized by presenting similar concepts in different contexts and from different
perspectives. Thus, the book is not written as an encyclopaedia, but tries to maintain
a narrative that is most beneficial for pedagogical purposes.
Exercises: Various exercises are scattered throughout the text. These exercises, some-
times supplemented by (hopefully) helpful hints on how to solve them, should be rather
simple because I believe there is no benefit in torturing the reader. Exercises fall, how-
ever, into two categories. The first category of exercises are the short ones. They are
supposed to supply simple cross-checks for the reader or, by asking for derivations of
some equations, to acquaint the reader with standard mathematical manipulations in
the field. Then, there are also various longer exercises, which might even require some
simple symbolic programming. These longer exercises are typically meant to introduce
ideas, concepts or results whose detailed exposition would probably bore the more ex-
perienced reader in the field. Thus, the exercises help to keep the book more concise,
while allowing me at the same time to cover a wider range of topics. I remark that all
exercises appear during the text at the point where they best fit the overall narrative.
Basic Notation
I tried to keep abbreviations to a minimum and used only those that are widely
used in the literature. The three most important abbreviations, which are scattered
throughout the text, are CP (completely positive), CPTP (completely positive and
trace-preserving) and BMS (Born–Markov secular). In some equations, I write h.c. to
denote the Hermitian conjugate. Also the abbreviation POVM (positive operator-
valued measure) is used occasionally.
Below, I further provide a non-exhaustive list of the most important notation used
throughout the book.
Quantum dynamics: I use Dirac notation for states |ψi and their conjugate transpose
hψ| that live in a Hilbert space H and its dual, respectively. The scalar product is
written as hϕ|ψi. To avoid unwanted technicalities, I assume Hilbert spaces are (or
can be approximated to be) finite dimensional: d = dim H < ∞. The density matrix
is typically denoted by ρ, whereas most other operators are denoted by capital letters
such as H, P, X, . . . and I is the identity. Exceptions are the familiar bosonic and
fermionic creation and annihilation operators (denoted typically by a(†) , b(†) , c(†) , d(†) )
and the Pauli matrices:
01 0 −i 1 0
σx = , σy = , σz = .
10 i 0 0 −1
The trace of some operator O is denoted tr{O}. Superoperators (also simply called
maps), which map operators onto operators, are always denoted by calligraphic let-
ters such as C, D, L, . . . . The tensor product is denoted ⊗ and additional subscripts
A, B, S . . . are used to indicate on which subspace some operator is acting (e.g. ρS ) or
to denote the partial trace (e.g. trB {. . . }).
Gibbs state) is denoted by the Greek letter π. For instance, πS (β) denotes the Gibbs
state of some system S at inverse temperature β.
Finally, every process starts at the initial time t0 = 0. Moreover, I do not set Boltz-
mann’s and Planck’s constants kB and h̄ to 1. I believe that this makes the physical
content of many equations more insightful. For practical manipulations, this choice is,
of course, not the most convenient one, but since the majority of the literature sets
kB ≡ 1 and h̄ ≡ 1, I thought it would be good to keep them explicit here.
Acknowledgements
This book is dedicated to the memory of Tobias Brandes because—long before I en-
joyed being his PhD student—his inspiring and unprecedented lectures were the reason
why I actually turned towards theoretical physics. In my opinion, Tobias’s approach
to physics was dual, tackling deep and conceptual problems while having at the same
time an eye on experimentally well-grounded approaches and models, which also work
‘in practice’. I view my own research, and in particular this book, in the tradition of
his philosophy, perhaps with some bias more towards conceptual and general ideas. I
hope Tobias would have enjoyed reading it.
Concerning the topics exposed here I further owe much of my detailed knowledge
to Massimiliano Esposito and Gernot Schaller. Neither ever hesitated to share their
insights and ideas with me about various topics in statistical mechanics, stochastic
thermodynamics, non-equilibrium physics and open quantum systems. Much of their
knowledge is reflected here; yet, I believe I succeeded in also adding my own twist.
A special thanks goes to Kavan Modi. A short, but in retrospect important, dis-
cussion at the Kavli Institute in Santa Barbara in 2018 inspired me to look at the
problem from a different angle and part of the material presented in Chapter 5, and
in some sense the motivation to write this book, is a consequence of that discussion.
Many more colleagues have shared their insights and thoughts with me. Those on
whom I could particularly rely concerning the topics presented in this book include
Robert Alicki, Janet Anders, Felipe Barra, Victor Bastidas, Javier Cerrillo, Luis Cor-
rea, María García Díaz, David Gelbwaser-Klimovsky, John Goold, Giacomo Guarnieri,
Géraldine Haack, Christopher Jarzynski, Matteo Lostaglio, Mark Mitchison, Kavan
Modi, Wolfgang Muschik, Juan Parrondo, Martí Perarnau-Llobet, Matteo Polettini,
Andreu Riera-Campeny, Felix Ritort, Àngel Rivas, Dominik S̆afránek, Rafael Sánchez,
Anna Sanpera, Udo Seifert, Michalis Skotiniotis, Christopher Wächtler and Andreas
Winter. It makes me a bit sad to know that I probably forgot to correctly acknowledge
all the people who contributed to my actual understanding of this topic. There were
clearly many more people who influenced my thinking.
Furthermore, since writing a book is quite a ‘mammoth project’, I am grateful to
Javier Cerrillo, Marco Merkli, Kavan Modi, Andreu Riera-Campeny, Àngel Rivas and
Rafael Sánchez, who read (parts of) early drafts of the book and provided valuable
comments. In particular, a special thanks goes to Teresa Reinhard for making sure
that various parts of the book do not sound too cryptic to the outsider.
I also want to thank the team at Oxford University Press. In particular, I much
appreciated the uncomplicated manner of communicating with them, which clearly
contributed to the fact that the writing of this book was an overall very joyful expe-
rience.
xii Acknowledgements
Of course, the constant support I receive from my family and friends, including the
little lion, are invaluable.
Most parts of the work reported here were financially supported by the DFG
(project STR 1505/2-1). For further support I thank the Spanish Agencia Estatal
de Investigación, projects IJC2019-040883-I and PID2019-107609GB-I00, the Span-
ish MINECO FIS2016-80681-P (AEI/FEDER, UE) and the Generalitat de Catalunya
CIRIT 2017-SGR-1127.
Finally and almost needless to say, all mistakes and opinions expressed here are entirely
my own. I am always grateful to receive further comments, questions and feedback (to
get in contact with me I suggest you type my name into your preferred search engine
and look up my up-to-date email address). Errata will be announced on my homepage.
Contents
Summary. This chapter describes the basic features of open quantum systems, i.e.
quantum systems that are affected by noise due to uncontrollable degrees of freedom
of an environment or bath. This noise is responsible for effects such as dissipation,
decoherence and irreversibility. We study the equilibrium states of open quantum
systems and review tools from quantum measurement theory, which describe how
to extract information from an (open) quantum system. We generalize these tools
to multi-time statistics and define the notion of a quantum stochastic process and
a quantum Markov process. Finally, we study in which cases a quantum stochastic
process looks classical.
∂ i
ρ(t) = − [H, ρ(t)]. (1.1)
∂t h̄
Here, H is the Hamiltonian operator characterizing the total energy of the system and
[A, B] ≡ AB −BA is the commutator. Furthermore, i and h̄ are the familiar imaginary
unit and Planck’s constant, respectively. Note that we will also use the notation ∂t ρ(t)
to denote a partial derivative with respect to time.
Isolated quantum systems have some important characteristics:
(i) There exists a unitary time evolution operator U (t) ≡ exp(−iHt/h̄), which means
that U (t)U (t)† = U (t)† U (t) = I, where I denotes the identity matrix. This time
evolution operator propagates the system state according to
(ii) The spectrum of the state ρ(t), i.e. its eigenvalues λk , do not change in time:
X
ρ(t) = λk |ψk (t)ihψk (t)|. (1.3)
k
∂ i
|ψ(t)i = − H|ψ(t)i. (1.5)
∂t h̄
Since the Schrödinger equation can be applied to compute the time evolution of
any of the |ψk (t)i in eqn (1.3), it is equivalent to the Liouville–von Neumann
equation (1.1).
(iv) The von Neumann entropy of the system, which is defined as
is P
constant in time. This follows again from point (ii) above because SvN [ρ(t)] =
− k λk ln λk , which implies that the von Neumann entropy quantifies the clas-
sical uncertainty about the state of the system. Alternatively, the conservation of
von Neumann entropy follows by using
d dρ(t)
SvN [ρ(t)] = −tr ln ρ(t) (1.7)
dt dt
and eqn (1.1). Note that eqn (1.7) holds only if the rank of the density operator
does not change during the evolution, which is guaranteed by point (ii) above. If
the rank changes in time, the von Neumann entropy is not differentiable. Note
that we use a subscript ‘vN’ for the von Neumann entropy throughout the book
because we want to distinguish it from the notion of thermodynamic entropy
introduced later on. Readers unfamiliar with information-theoretic concepts of
entropy can find an overview in Appendix A.
System–Bath Theories and the Origin of Noise 3
Time-dependent case
Quantum systems are often subjected to time-dependent fields in a laboratory, and
these fields can be treated semiclassically (e.g. laser light). In this case, the Hamil-
tonian becomes time dependent and is denoted by H(λt ), where the time-dependent
parameter λt specifies the external fields. We prefer the notation H(λt ) instead of
H(t) to avoid any possible confusion with the Heisenberg picture. In the following, λt
is called a driving or control protocol and the state of such a driven system evolves
in time according to the Liouville–von Neumann equation (1.1) with H replaced by
H(λt ).
The evolution ρ(t) = U (t, 0)ρ(0)U † (t, 0) is still described by a unitary operator
U (t, 0), but its explicit computation is now more complicated. Formally, we have
∞
X n Z t Z t1 Z tn−1
i
U (t, 0) = − dt1 H(λ1 ) dt2 H(λ2 )· · · dtn H(λn )
n=0
h̄ 0 0 0
X∞ n Z t Z t
1 i
= − dt1 · · · dtn [H(λ1 ) . . . H(λn )]+ (1.8)
n=0
n! h̄ 0 0
Z
i t
≡ exp+ − dsH(λs ) ,
h̄ 0
where we abbreviated λtj ≡ λj and the subscript + denotes time ordering. By dividing
the time interval [0, t] into n = t/δt small steps δt, we can also write
Y
n−1
U (t, 0) ≈ e−iH(λj )δt/h̄ , (1.9)
j=0
Fig. 1.1 Open quantum systems. (a) A rough sketch of a system S in contact with a bath B
with which it can exchange energies, particles, entropy, etc. (b) Diagram of the Newcomen at-
mospheric steam engine as an example of an ancient ‘open quantum system’: thermodynamics
had already divided the universe into a system part and reservoirs. (c) (False-coloured) scan-
ning electron microscope image of a modern open quantum system: a quantum dot formed by
a nanowire (thin green line) in contact with electron reservoirs (yellow) and heaters (blue and
red). Such setups will be treated in further detail in Section 3.10. Reprinted by permission
from Springer Nature: Springer Nature, Nature Nanotechnology (A quantum-dot heat engine
operating close to the thermodynamic efficiency limits, M. Josefsson et al.), Copyright by
Springer Nature (2018).
phonons (lattice vibrations) in the surroundings. Finally, the historical origin of the
theory of thermodynamics is rooted in the desire to understand, for example, steam
in a container in contact with hot air produced by burning coal on the one side and
cold water on the other side. Here, the system is defined by the container, which
contains the so-called working medium or working fluid, whereas the outside hot air
and cold water are called a heat bath or reservoir. We will also use these words from
Chapter 2 onwards. In this chapter, however, we use the broader term ‘environment’
or ‘bath’ to refer to any external, uncontrollable part of the world, not necessarily
described by a thermodynamic variable such as temperature. It will become clear
throughout this book that the predictive power of the second law comes from an
efficient description of these uncontrollable degrees of freedom about which we have
only very little information.
Quantum systems which are not isolated but in contact with a bath or environ-
ment are called open quantum systems; see Fig. 1.1 for sketches. Conceptually, many
different approaches exist to describe them theoretically and a very powerful one is
to model the bath itself as another quantum system such that the system and the
bath (which we sometimes also call the universe) constitute one big isolated quantum
system. This is the origin of system–bath theory and we will see below that this is
indeed not an assumption: any open quantum system can be seen as being part of a
larger isolated quantum system.
Mathematically, the system–bath composite is a bipartite quantum system de-
scribed by the tensor product of the system and bath Hilbert space: HS ⊗ HB . The
dimension of that space is dim(HS ⊗ HB ) = dim HS · dim HB . For many applications
System–Bath Theories and the Origin of Noise 5
the dimension of HS is very small, e.g. dim HS = 2 for a single spin, whereas the
dimension of HB is often very large, e.g. dim HB = 2NA , where the Avogadro number
NA is of the order of 1023 . The dynamics of the system and the bath is governed by
the Hamiltonian HSB = HS ⊗ IB + IS ⊗ HB + VSB . Here, HS and HB denote the
Hamiltonian of the isolated system or bath, respectively, and VSB denotes their in-
teraction. They sum up to the total Hamiltonian HSB . Whereas the system and bath
Hamiltonian commute (since they live on different Hilbert spaces) we have in general
6 0 and [VSB , HB ] 6= 0. In the following, we suppress tensor products with
[VSB , HS ] =
the identity for notational simplicity and write the system–bath Hamiltonian as
HSB = HS + HB + VSB . (1.10)
For simplicity, we assumed no driving λt here, but this will change in later chapters.
We remark that the tensor product structure of the system–bath composite assumes
the system and bath to be distinguishable objects and implies a particular choice of
gauge made when identifying what is the ‘system’ and what is the ‘bath’.
Since the system–bath composite is isolated, its global state described by the den-
sity operator ρSB (t) evolves according to the Liouville–von Neumann equation (1.1)
with H replaced by HSB . The system evolution is obtained by taking the partial trace:
Illustrative example
We illustrate the above arguments by considering an assembly of n interacting spins.
Readers unfamiliar with open quantum systems are invited to explicitly follow this
example by doing their own numerics for it. We assume that the spins are described
by the following global Hamiltonian:
h̄Ω X (i) h̄ X X
n n
HSB = σ + gij σx(i) σx(j) , (1.13)
2 i=1 z 2 i=1 j>i
6 Quantum Stochastic Processes
(i)
where σα (α = x, y, z) are the familiar Pauli matrices acting on spin i. The first term
describes n isolated spins with energy gap h̄Ω. The second term describes the interac-
tion between two spins with coupling strength gij . As our system we now choose one of
(1)
the spins, say the first, such that HS = h̄Ωσz /2. The bath Hamiltonian consequently
Pn (i) Pn P (i) (j)
becomes HB = h̄Ω i=2 σz /2 + h̄ i=2 j>i gij σx σx /2 and the remaining part
defines the interaction Hamiltonian VSB . The initial state of the system and bath is
assumed to be decorrelated:
e−βHB
ρB (0) = πB ≡ , ZB ≡ trB {e−βHB }, (1.15)
ZB
where ZB is the partition function of the bath. Therefore, we assume the following
situation: previous to the initial time the first spin is decoupled from the others and
prepared in a superposition of energy eigenstates. Then, at time t = 0 we suddenly
switch on the interaction with the bath, which is assumed to be thermalized.
We aim at a numerical exact simulation of the system–bath dynamics based on the
Liouville–von Neumann equation (1.1). For n spins the total density matrix as well
as the unitary time evolution operator are 2n × 2n matrices. This exponential growth
in size necessarily limits us to consider only a few spins, precisely we consider seven
spins in total (i.e. the bath consists of six spins). Note that one is often interested in a
bath that is much larger in size. Computing the exact dynamics of an open quantum
system then quickly becomes impossible, which motivates the need for efficient and
reliable approximation schemes. For the moment, however, a small bath suffices to
illustrate our points above. To continue with the discussion of our model, let us choose
the spin–spin interactions gij random from a uniform distribution over [0, 1]. We do
not explicitly write down the values for gij here because the dynamics are qualitatively
similar for most choices.
Numerical results for an initial (dimensionless) bath temperature of βh̄Ω = 10 are
shown in Fig. 1.2. Note that this bath temperature is relatively cold, i.e. the energy
gap h̄Ω of each single spin is 10 times larger than the typical energy kB T = β −1 of
a thermal excitation. Figure 1.2a shows the time evolution of the expectation value
(1) (1)
hσx i(t) = tr1 {σx ρS (t)}. For better comparison we also plot its time evolution in
the case that the spin was isolated, i.e. for VSB = 0 (thin grey line). Their difference
is quite striking; in particular, it is difficult to recognize any structure or pattern for
the open quantum system case. Figure 1.2b shows the purity, which decreases as
expected. Note that the minimal value of the purity for a two-level system is 1/2.
Figure 1.2c shows the evolution of the von Neumann entropy, which is no longer
conserved. Note that the maximum value for the von Neumann entropy of a two-level
System–Bath Theories and the Origin of Noise 7
Fig. 1.2 Exemplary time evolution of an open quantum system for a ‘cold’ bath. The thin
grey line in (a) shows the time evolution of an isolated system with an identical system Hamil-
tonian. The thin grey line in (d) shows the time evolution of the quantum relative entropy
with respect to a reference state, which describes a refined equilibrium state introduced in
Section 1.3.
system is ln 2 ≈ 0.7. In Fig. 1.2d we show the time evolution of a quantity that
plays an important role throughout the book. It is known as the quantum relative
entropy, which is defined in general as
for two arbitrary density matrices ρ and σ. The relative entropy is non-negative and
only zero if ρ = σ. It can be regarded as a measure of statistical ‘distance’ between ρ
and σ; further information is provided in Appendix A. In particular, we plot
e−βHS
D ρS (t) , (1.17)
ZS
i.e. the distance between the reduced system state and its associated Gibbs state at the
inverse temperature of the bath. One could naively expect that this difference should
become very small for long times in accordance with equilibrium statistical mechanics.
This is not the case for two important reasons. First, a bath of only six spins is still
too small to induce equilibration of a small quantum system. Second, even if the bath
were larger, equilibrium statistical mechanics in fact predicts a state different from
the Gibbs state, as we will explain in detail in the next section. The time evolution of
the relative entropy with respect to this different state is shown by the thin grey line,
which is much closer to zero than eqn (1.17).
Finally, Fig. 1.3 shows the same plots as Fig. 1.2, but for a different initial tempera-
ture of the bath. This time we chose βh̄Ω = 1, such that the thermal excitation energies
8 Quantum Stochastic Processes
Fig. 1.3 Exemplary time evolution of an open quantum system for a ‘hot’ bath.
are comparable with the energy gap of each single spin. Equivalently, one could say
that we have more ‘noise’ in the bath. The difference compared with the cold case
βh̄Ω = 10 is quite striking. Oscillations are much less pronounced and eqn (1.17) be-
comes quite small after some transient time. This means that the standard equilibrium
Gibbs ensemble describes the system state quite well. In fact, the system is well de-
scribed by a completely mixed state ρS (t) ≈ IS /2, which follows by considering the
time evolution of the purity.
To conclude, an open quantum system behaves very differently from an isolated
quantum system. Even for a very small bath of only six spins effects such as equilibra-
tion and thermalization start to become visible. Next, we consider equilibrium states
of open quantum systems in the case that the bath is much larger.
e−βH
π≡ , Z ≡ tr{e−βH }. (1.19)
Z
Here, β = (kB T )−1 is the inverse temperature and Z is the partition function. If the
internal energy U = tr{Hπ} is fixed, the Gibbs state is characterized by the fact that it
maximizes the von Neumann entropy with the inverse temperature implicitly defined
through the relation U = −∂β ln Z. Vice versa, for a fixed von Neumann entropy
SvN (π), the Gibbs state minimizes the energy with the inverse temperature implicitly
defined through the relation SvN (π) = −β 2 ∂β (β −1 ln Z).
Now, suppose that the system–bath composite is well described by a global Gibbs
state πSB = e−βHSB /ZSB with ZSB ≡ trSB {e−βHSB }. What does the reduced system
state trB {πSB } look like? Perhaps surprisingly, it turns out that in general trB {πSB } 6=
πS = e−βHS /ZS . Only in the case of very weak coupling between the system and the
bath, i.e. if VSB is negligible compared with HS and HB , does the reduced system state
equal the conventional canonical ensemble. For non-negligible coupling VSB , however,
we define πS∗ ≡ trB {πSB } and we have πS∗ 6= πS .
Nevertheless, it is still possible to write πS∗ in an apparent Gibbs form with respect
to an effective Hamiltonian H̃S :
e−β H̃S
πS∗ = trB {πSB } = . (1.20)
Z̃S
In fact, this is possible for every density matrix ρS by defining H̃S ≡ −β −1 ln(Z̃S ρS ),
and not only for πS∗ . Notice that H̃S is fixed only up to an arbitrary choice of the
positive normalization constant Z̃S . For the reduced state πS∗ of a canonical equilibrium
ensemble there is one convenient choice for the effective partition function, which we
denote by ZS∗ = Z̃S and which is obtained by setting
That is, the effective partition function ZS∗ is the ratio of the partition functions of the
system–bath composite and of the bath alone. The corresponding effective Hamiltonian
is known as the Hamiltonian of mean force and reads explicitly
1 1 trB {e−βHSB }
HS∗ = − ln(ZS∗ πS∗ ) = − ln . (1.22)
β β ZB
∗
One quickly verifies that trB {πSB } = e−βHS /ZS∗ . The Hamiltonian of mean force pro-
vides a neat concept used various times in the following chapters. Physically speaking,
it can be seen as an effective free energy landscape for the system, a claim that becomes
clearer in Chapter 2. Notice that the Hamiltonian of mean force depends explicitly on
the inverse temperature β. For weak coupling VSB , HS∗ reduces to HS . Thus, the
familiar canonical ensemble only emerges in the weak coupling regime.
It is natural to ask: Does the effective Gibbs state (1.20) represent a good approx-
imation of the equilibrated open quantum system state, i.e. is πS∗ ≈ limt→∞ ρS (t)?
Equilibrium States of Open Quantum Systems 11
If that is the case, we say that the system thermalizes, which is a stronger require-
ment than equilibration alone, which only assumes the system state to become time
independent for long times. Paralleling the objections raised by Poincaré’s recurrence
theorem above that equilibration can never strictly happen, similar doubts may rise for
the case of thermalization. In particular, no initial system–bath state ρSB (0) different
from πSB will ever reach the latter state during the evolution.
Exercise 1.1 Show that, if ρSB (0) 6= πSB and if HSB is time independent, ρSB (t) 6= πSB
for all t.
Exercise 1.2 Consider a harmonic oscillator with frequency ω and Hamiltonian (in mass-
weighted coordinates) HS = 21 (p2S +ω 2 x2S ) coupled to a ‘bath’ of one other harmonic oscillator
with the same frequency ω. The global Hamiltonian is
2
1 2 c
HSB = HS + pB + ω 2 xB − 2 xS . (1.23)
2 ω
Here, the coupling strength is c and we have written the Hamiltonian in a manifestly positive
form such that HSB ≥ 0 for any choice of ω and c. Now, show first that, if treated as a classical
system, there is no correction to the thermal state of the system: πS∗ = πS = e−βHS /ZS . Hint:
12 Quantum Stochastic Processes
Remember that the trace over the bath degrees of freedom becomes classically an integral
over the phase-space coordinates (xB , pB ) of the bath.
Then, show that quantum mechanically this is no longer true. Hint: You are not asked
to directly compute HS∗ , but only to show that πS∗ 6= πS . This can be done in various ways.
One way is to compute the equilibrium variance of the system coordinate tr{x2S πSB } by
changing
√ to normal modes. Denote the (squared) eigenfrequencies of HSB by Ω2± ≡ [c2 +
2ω ± c + 4c ω ]/2ω and show that
4 4 2 4 2
eβh̄(Ω+ +Ω− ) − 1
tr{x2S πSB } = h̄ . (1.24)
(Ω+ + Ω− )(eβh̄Ω+ − 1)(eβh̄Ω− − 1)
Confirm that in the classical limit (h̄ → 0) this result is independent of the coupling strength
c. Also confirm that for c → 0 you recover the result trS {x2S πS } = h̄ coth(βh̄ω/2)/2ω, showing
that even at zero temperature quantum fluctuations give rise to a non-zero variance.
The above result can be generalized to a system–bath Hamiltonian of the form
" 2 #
1X 2 ck
HSB = HS + pk + ωk xk − 2 S
2
. (1.25)
2 ωk
k
This describes a system with arbitrary Hamiltonian HS coupled via an arbitrary system
coupling operator S to a bunch of harmonic oscillators. Such system–bath models, which are
well justified whenever the statistics of the bath are (approximately) Gaussian, are known as
Caldeira–Leggett models and they play an important role in the theory of open quantum
systems. Convince yourself of the fact that for a classical system the Hamiltonian of mean
force is identical to the system Hamiltonian: HS∗ = HS . Obviously, this no longer holds
for quantum systems. Computing the exact reduced equilibrium state of the open quantum
system is possible, but it is no longer a triviality.
Projective measurements
In quantum mechanics the measurement Pn of a system observable XS is described by
the projection postulate. Let XS = x=1 λ(x)ΠS (x) be the spectral decomposition
of the observable with n different eigenvalues λ(x), which are labelled by the index
x. Furthermore,P{ΠS (x)} is a set of projection operators satisfying ΠS (x)ΠS (x′ ) =
n
δx,x′ ΠS (x) and x=1 ΠS (x) = IS . If the state of the system is ρS , then the probability
of observing result x is p(x) = trS {ΠS (x)ρS }. The post-measurement state ρ′S (x)
conditional on observing x becomes
ΠS (x)ρS ΠS (x)
ρ′S (x) = . (1.26)
p(x)
X
n X
n
ρ′S = p(x)ρ′S (x) = ΠS (x)ρS ΠS (x). (1.27)
x=1 x=1
Notice that this state is in general different from the pre-measurement state, ρ′S 6=
ρS , which reflects the well-known fact that quantum measurements are disturbing.
This measurement backaction is absent only if the initial state commutes with the
observable: [ρS , XS ] = 0. Even in this case, however, the conditional state (1.26) is
in general different from the pre-measurement state, i.e. ρ′S (x) 6= ρS . This is not a
quantum effect but a consequence of the fact that the observer updates its state of
knowledge about the system. Readers who feel unfamiliar with this description are
advised to do the following exercise.
Exercise 1.3 Consider the case where all projectors ΠS (x) = |xihx|S are of rank 1 and the
system state is pure: ρS = |ψihψ|S . Convince yourself of the fact that the above framework
then reduces to the conventional picture known from introductory textbooks in quantum
mechanics. What corresponds to the Born rule? Which equation describes the ‘collapse’ of
the wave function? When does the measurement reveal no information, i.e. when do we have
ρ′S (x) = ρS ? Can you think about physical examples where the projectors are not of rank 1?
Remarkably, it is possible to derive eqn (1.27), which describes the average effect
of a quantum measurement, using only unitary dynamics and no projection postulate.
This works as follows. Consider a second n-dimensional ‘auxiliary’ quantum system
A, called the ancilla in the following, which interacts with our system S of interest.
Imagine that the initial system–ancilla state is decorrelated, ρSA = ρS ⊗ ρA , and the
ancilla is prepared in some fixed pure state ρA = |1ih1|A . Furthermore, assume that
the overall system–ancilla interaction is described by the following unitary operator:
X
n X
n
V = ΠS (x) ⊗ |r + x − 1ihr|A , (1.28)
x=1 r=1
ρ′SA = V ρS ⊗ ρA V † . (1.29)
The reduced state of the system is obtained by tracing out the ancilla,
X
n
ρ′S = trA {V ρS ⊗ ρA V } = †
ΠS (x)ρS ΠS (x), (1.30)
x=1
which is identical to eqn (1.27). Thus, the average effect of a system measurement arises
in this picture from interactions of the system with an outside ancilla. By tracing out
the ancilla, we lose information and the reduced dynamics is no longer unitary.
To derive the conditional post-measurement state (1.26) in this picture, assume
that we subject the ancilla to a projective measurement of the observable RA =
14 Quantum Stochastic Processes
Fig. 1.4 Circuit representation of an ancilla-assisted measurement with time running from
left to right. The input states to the process (left, triangles) are a system and ancilla state ρS
and ρA . Both interact via a unitary transformation, denoted here with a calligraphic symbol
V. The joint state ρ′SA after the interaction is in general correlated. To finally infer something
about the system, the experimentalist measures the state of the ancilla, here with projector
|rihr|A . The conditional post-measurement state of the system is denoted ρ′S (r).
Pn
r=1 r|rihr|A . Suppose this projective measurement of RA reveals outcome r, then the
conditional post-measurement state reads ρ′SA (r) = |rihr|A V ρS ⊗ ρA V † |rihr|A . Keep-
ing the information about r but tracing out the ancilla degrees of freedom reveals
Generalized measurements
We generalize the picture abovePby starting with an arbitary initial ancilla state,
written in its eigenbasis as ρA = j pj |jihj|A , an arbitrary unitary V , not necessarily
Quantum Measurement Theory 15
the one specified P in eqn (1.28), and a final projective measurement of some ancilla
n
observable RA = r=1 r|rihr|A . As above, we now take a look at the conditional state
of the system given the measurement result r of RA . Tracing out the ancilla, we get
the non-normalized system state
X
ρ̃′S (r) = trA {|rihr|A V ρS ⊗ ρA V † } = pj hr|A V |jiA ρS hj|A V † |riA . (1.32)
j
Notice that the Kj (r) are still operators acting on the system Hilbert space. Using
them, we can express eqn (1.32) more compactly as
X
ρ̃′S (r) = Kj (r)ρS Kj† (r). (1.34)
j
This equation makes it more transparent how the system state actually gets trans-
formed by computing the operators Kj (r) via eqn (1.33). In terms of them, we can
also express the probability of obtaining measurement result r as
X n o
p(r) = trS {ρ̃′S (r)} = trS Kj† (r)Kj (r)ρS . (1.35)
j
which are always positive, M (r) ≥ 0, which follows from the fact that any operator of
the form A† A is positive. Furthermore, they satisfy the completeness relation
X
M (r) = IS . (1.37)
r
Moreover, the set of operators {M (r)} completely fixes the measurement statistics
and therefore plays an important role in quantum measurement theory, where they
are known as a positive operator-valued measure or, in short, a POVM.
Exercise 1.4 Show that any set of operators {M (r)}r satisfying the completeness relation
and M (r) ≥ 0 for all r gives rise to a set of well-defined probabilities p(r) = trS {M (r)ρS }
for any state ρS . Hint: By ‘well defined’ we mean probabilities that are positive and sum up
to one.
We now study two examples to illustrate how the projection postulate can be
generalized. The first
Pnexample describes projective measurement of the initially studied
observable XS = x=1 λ(x)ΠS (x), but it includes classical measurement errors. For
16 Quantum Stochastic Processes
which coincides with eqn (1.38). Thus, they give rise to the same measurement statis-
tics. However, the post-measurement state looks very different:
Xp Xp
ρ̃′S (r) = K(r)ρS K(r) = p(r|x)ΠS (x)ρS p(r|x′ )ΠS (x′ ). (1.41)
x x′
This teaches us an important lesson, namely that the probablity operators M (r) do not
uniquely fix how the state changes as a result of the measurement. The next exercise
elucidates the difference between eqns (1.39) and (1.41) further.
Exercise 1.5 Consider an initially pure state ρS = |ψihψ|S . Then, show that the post-
measurement state given by eqn (1.41) is always pure, i.e. [ρ′S (r)]2 = ρ′S (r), whereas this is
in general not the case for eqn (1.39). Thus, eqn (1.41) preserves the ‘quantum character’ of
the state, whereas eqn (1.39) inevitably introduces classical noise.
Summary
We summarize what we have found out in this chapter. By shifting the description of
the projective measurement from the system itself to an external ancilla, with which the
system interacts in a unitary way, we obtained a more general description of quantum
measurements expressed by system state changes of the form (1.34). It includes the
Operations, Interventions and Instruments 17
case of projective measurements, but as eqns (1.39) and (1.41) have shown also more
general transformations. The generalization of the average unconditional system state
after a projective measurement, eqn (1.27), follows from eqn (1.34) as
X X XX
ρ′S = p(r)ρ′S (r) = ρ̃′S (r) = Kj (r)ρS Kj† (r). (1.42)
r r r j
Owing to trace conservation, we know that the operators Kα must satisfy the com-
pleteness relation X
Kα† Kα = 1. (1.44)
α
Apart from this relation, it turns out that the operators Kα are indeed arbitrary. We
will learn more about them from an abstract perspective, which is not necessarily
related to quantum measurements, in the next section.
It is instructive to point out the similarities and differences between the system–
ancilla picture and the system–bath picture of Section 1.2. In both cases the system is
open because of its coupling to some external degrees of freedom. However, we did not
call the ancilla a ‘bath’ here because its interpretation is quite different in the present
context. Whereas the bath was introduced in Section 1.2 as an uncontrollable and
typically large object, the small and controllable ancilla of this section was introduced
to probe the state of a quantum system. The physical interpretation of this situation
is different, but—as the next sections will show in greater detail—the mathematical
description is essentially the same. Before proceeding, we conclude this section with a
final exercise to elucidate the difference between classical and quantum measurements.
Exercise 1.6 Construct the classical counterpart of the theory above. To this end, replace
the notion of the density matrix ρS by a vector p with elements p(x) denoting the probability
of finding the classical system in state x. Let p(r|x) be the conditional probability of obtaining
the measurement result r given that the system is in state x and define the matrix M (r) with
elements Mxx′ (r) = δx,x′ p(r|x). What do {M (r)} and a POVM have in common? What is
the
P quantum counterpart of the classical expression M (r)p? Relate the probability p(r) =
x p(r|x)p(x) to obtain outcome r to the expression M (r)p. Show that the (normalized) post-
measurement state of the system given result r obeys Bayes’ rule p′ (x|r) = p(r|x)p(x)/p(r).
Finally, verify that the average post-measurement state does not change: p′ = p. Thus, Bayes’
rule describes the most general non-disturbing classical measurement.
of freedom to manipulate a quantum system and the intention might not always be
to ‘measure’ the system in a literal sense. We further abbreviate eqn (1.43) as
X
ρ′S = CρS ≡ Kα ρS Kα† . (1.45)
α
The map C, which takes an operator and maps it to another operator, is also known as
a superoperator to distinguish it from the notion of a ‘usual’ operator, which ‘only’
maps vectors onto vectors. Mathematically speaking, this distinction is superfluous:
the space of matrices itself forms a vector space and, therefore, one can represent
any superoperator C also by a (large) matrix. For physics applications it is, however,
advantageous to use a terminology and notation that distinguishes whether an operator
acts on states |ψi or density matrices ρ. From now on, we frequently use superoperators
as an efficient bookkeeping tool. By convention, superoperators are always written
with a calligraphic letter such as C and they act on all operators to their right, i.e.
C2 C1 ρ = C2 [C1 (ρ)]. Note that the action of different superoperators does not commute
in general, i.e. C2 C1 ρ 6= C1 C2 ρ. Readers who are exposed to this for the first time are
advised to consult Appendix B before proceeding.
We also introduce the superoperator C(r) corresponding to eqn (1.34):
X
ρ̃′S (r) = C(r)ρS ≡ Kαr ρS Kα† r , (1.46)
αr
which is obtained from eqn (1.45) by using only a subset of the operators Kα . The
subsets of Pindices {αr } are obtained from the total set {α} by a disjoint decomposition
such that r C(r) = C. In the following we P call a set of maps {C(r)} that decomposes
an operation C of the form (1.45) such that r C(r) = C an instrument. Notice that,
given only eqn (1.45), there are many instruments that decompose C.
Now, our claim is that eqn (1.45) describes the most general control operation
or intervention we can implement in a laboratory on a quantum system at a single
time. To verify this, let us find the minimial conditions we would like to have satisfied
by an instrument {C(r)} such that it describes a physically allowed control operation
in a laboratory. Clearly, if the state of the system was previously described by a
valid density matrix ρS , the minimal requirement is that also the final set of states
{ρ̃′S (r) = C(r)ρS } can be interpreted in a legitimate way as the state of a physical
system. Recall that any density matrix ρ is positive, has unit trace and if ρ1 and ρ2
are two valid density matrices, then their convex combination λρ1 + (1 − λ)ρ2 for any
λ ∈ [0, 1] is another valid density matrix. Therefore, we postulate that any physically
legitimate instrument {C(r)}r should at least satisfy the following:
(i) C(r) is positive: if ρS ≥ 0, then also C(r)ρS ≥ 0.
(ii) C(r) is trace non-increasing: if tr{ρS } = 1, then p(r) = tr{C(r)ρS } is the prob-
ability of applying the map C(r) (in view of a quantum measurement, it is the
probability of obtaining result r). If p(r) = 1, C(r) = C is called trace-preserving.
P (i)
(iii) C(r) is convex linear:
P if i λi ρS is a statistical mixture of quantum states with
λi ≥ 0 satisfying i λi = 1, then
Operations, Interventions and Instruments 19
X (i)
X (i)
C(r) λ i ρS = λi C(r)ρS . (1.47)
i i
Now, to make things even a little more complicated, it turns out that desideratum
(i) is not enough. Remember that C(r) could describe, for instance, a measurement of
a system S coupled to a bath B. Prior to the measurement the system–bath state ρSB
can be arbitrary and we would like to ensure that the post-measurement state also
describes a legitimate quantum state. This means that we actually want the global
state ρ̃′SB (r) = [C(r) ⊗ IB ]ρSB ≥ 0 to be positive, where IB denotes the identity
operation acting on the bath defined via IB ρB ≡ ρB for any state ρB . This is not
necessarily guaranteed by the notion of a positive map. Therefore, we replace (i) by
the stronger condition:
(i′ ) C(r) is completely positive: consider the composite system HS ⊗HB with HB an
arbitrary Hilbert space. If ρSB ≥ 0 is an arbitrary positive state of the composite
system, then [C(r) ⊗ IB ]ρSB ≥ 0 is also positive.
Exercise 1.7 An example for a positive but not completely positive map is the transpose
operation T . Let ρ = ρ00 |0ih0| + ρ01 |0ih1| + ρ10 |1ih0| + ρ11 |1ih1| be an arbitrary density
matrix of a qubit, then the transpose operation with respect to the basis {|0i, |1i} is defined
via T ρ ≡ ρ00 |0ih0| + ρ10 |0ih1| + ρ01 |1ih0| + ρ11 |1ih1|. Show that this map is positive, but not
completely positive. Hint: As a counterexample consider the transpose operation√acting on
the first qubit of the pure and maximally entangled bipartite state (|00i + |11i)/ 2.
Exercise 1.8 Let C be a convex linear map acting on density matrices ρ of a Hilbert space
with dimension dim H = d. Show that every complex d × d matrix A can be written P as a
linear combination of density matrices ρi with complex coefficients ci ∈ C, i.e. A = i ci ρi .
Hint: To do so, you can use the two fundamental results that, first, every A can be written
as A = A1 + iA2 with A1 = A†1 and A2 = A†2 being Hermitian and, second, every Hermitian
A can be decomposed as A = A+ − A− with A± ≥ 0 being positive.
˜ ≡ P ci Cρi and in the following, tacitly
The extension C˜ of C is then defined via CA i
assuming this extension, we identify C ≡ C.
˜
P P
Kα† r Kαr ≤ IS . If αr Kα† r Kαr = IS , then C(r) = C is trace-preserving. Equa-
αr
tions (1.45) or (1.46) are called the operator–sum representation of a CP(TP) map.
P
It is not too hard to show that a map of the form (1.46) with αr Kα† r Kαr ≤ IS
satisfies points (i′ ), (ii) and (iii). The other direction is harder to show and not done
here. Instead, we only note that the operator–sum representation (1.46) is not unique,
as explicitly shown by the next exercise.
Pd † P
Exercise 1.9 Consider the map Cρ = α=1 Kα ρK and define Kα ≡ α Uαβ K̃β for an
Pαd †
arbitrary d × d unitary matrix U . Show that Cρ = α=1 K̃α ρK̃α .
It turns out that there is another important representation theorem for an instru-
ment, which brings us back to our ancilla construction used in the previous section.
Unitary dilation theorem. A set of maps {C(r)} acting on a d-dimensional system
Hilbert space HS forms an instrument if and only if there exists a d2 -dimensional
‘ancilla’ Hilbert space HA , a unitary USA acting on HS ⊗ HA , a pure ancilla state
|ϕiA ∈ HS and a set of projectors {ΠA (r)} acting on the ancilla space such that
†
C(r)ρS = trA {ΠA (r)USA (ρS ⊗ |ϕihϕ|A )USA } (1.48)
†
CρS = trA {USA (ρS ⊗ |ϕihϕ|A )USA }. (1.49)
We are not going to prove the unitary dilation theorem here, but only discuss its
consequences. For instance, the next exercise shows that the number of operators Kα
appearing in the operator–sum representation is at most d2 with d = dim HS .
Exercise 1.10 Use the unitary dilation theorem to derive eqns (1.45) and (1.46) by giving
explicit expressions for the operators Kαr and Kα in terms of USA , |ϕiA and ΠA (r). Confirm
that the maximum number of operators Kα is d2 . Hint: Remember eqn (1.33).
dilation theorem is an abstract statement and does not tell us for a given transformation
of the system what is the actual environment causing this transformation. This is
related to the non-uniqueness of the operator–sum representation: there are infinitely
many Hilbert spaces HA , unitary matrices USA , states |ϕiA (possibly also mixed states)
and projectors ΠA (r) satisfying eqns (1.48) and (1.49).
Before concluding this section, we point out a subtle but important observation.
What the unitary dilation theorem also teaches us is that satisfying desiderata (i′ ),
(ii) and (iii) is equivalent to representing the interaction of the system with some en-
vironment that is initially decorrelated from the system, i.e. of the form ρS ⊗ |ϕihϕ|A .
However, the evolution of an open quantum system (1.11) could also result from a
system initially correlated with the environment. In this case, the operator–sum repre-
sentation and the unitary dilation theorem break down. The reason is that desideratum
(iii) (linearity) is then no longer satisfied. This is exemplified in the next exercise and
a way out of this ‘dilemma’ is presented in Section 1.7.
Exercise 1.11 Consider the interaction of two qubits. The first qubit is called the system
and the second√ the ancilla. We further introduce the maximally entangled states |±i =
(|00i+|11i)/ 2, where by convention we set |iji ≡ |iiS ⊗|jiA for i, j ∈ {0, 1}. Let the system–
ancilla interaction be modelled by the unitary USA = |+ih+| + |−ih10| + |10ih−| + |01ih01|
(check that this is a unitary matrix). The evolution of the composite system is therefore
†
modelled via ρ′SA = USA ρSA USA , which is clearly linear with respect to the joint input state
ρSA and CPTP.
Next, consider the reduced dynamics of S assuming that the initial system–ancilla state
ρSA = |+ih+| is entangled. Verify the following results: (1) the initial reduced system state is
ρS = (|0ih0| + |1ih1|)/2; (2) the final system state is identical to the initial system state, ρ′S =
†
trA {USA |+ih+|USA } = ρS ; (3) if we first perform a measurement of the initial system in its
eigenbasis, the initial system state does not change on average: |0ih0|ρS |0ih0|+|1ih1|ρS |1ih1| =
ρS ; but (4) the final system state after such an initial measurement
† 1 3
trA {USA (|0ih0|ρSA |0ih0| + |1ih1|ρSA |1ih1|)USA }= |0ih0| + |1ih1| (1.50)
4 4
is different from ρS . Hence, we have a ‘paradox’ because the same reduced initial system state
gives rise to two different final states. Thus, we cannot associate any map C, which only acts
on the system part, with this input–output relation.
How can this be resolved? Clearly, from a global point of view there is no paradox: the
two initial states ρSA = |+ih+| and |0ih0|S ρSA |0ih0|S + |1ih1|S ρSA |1ih1|S are different (albeit
they give rise to the same reduced system state), and hence there can be two different output
states. With respect to our construction above, the key insight is to realize that it is no longer
meaningful to speak about different initial system states if the system is initially entangled with
its environment (here, the ancilla). In particular, we assumed that it is possible to mix (or
convex combine) different system states without influencing the dynamics, i.e. the definition
of the map. This assumption is incompatible with having an initial system state entangled
with its environment.
We summarize the content of the last two sections, which play an important role
in the following. First, quantum systems can undergo more general state transfor-
mations than unitary evolutions and projective measurements. However, even general
state transformations can be constructed using only unitary evolutions and projective
measurements on a bigger system–ancilla (or system–bath) space. If the desiderata
(i′ ), (ii) and (iii) are satisfied, where point (iii) assumes that the preparation of the
22 Quantum Stochastic Processes
initial system state can be disentangled from the effect of the state transformation,
then we can use either the operator–sum representation or the unitary dilation the-
orem. If there is no post selection (i.e. conditioning on a measurement result r), the
state transformation is described by a CPTP map represented by either eqn (1.45) or
eqn (1.49). If there is conditioning, the state transformation is described by a CP map
represented by either eqn (1.46) or eqn (1.48). On average, all CP maps have to add
up to a CPTP map. Vice versa, every CPTP map can be decomposed into a set of
CP maps. Such a set is called an instrument.
Finally and for completeness, we mention one extension of the framework intro-
duced here. So far, we have assumed that the dimension d of the system space does
not change during the control operation, but for some applications it makes sense to
relax this requirement. These applications play a minor role in quantum stochastic
thermodynamics and the mathematical modifications we have to add only amount
to correct bookkeeping of the different input and output Hilbert spaces in the nota-
tion. Nevertheless, the final exercise shows a couple of neat examples of such control
operations, which are useful to keep in mind.
Exercise 1.12 Verify that the following maps are CPTP by finding an operator–sum repre-
sentation for them. Example 1: The ‘trace map’ is defined for any input state ρ by Cρ ≡ tr{ρ}.
This map is also often associated with ‘discarding a system’ as it destroys all the information
contained in a system and replaces it by a ‘trivial’ system living in the Hilbert space H = C
with the only possible density matrix ρC = 1. Example 2: Consider a bipartite system with
Hilbert space H1 ⊗H2 . Show that the ‘partial trace map’ is CPTP: Cρ12 ≡ tr2 {ρ12 } = ρ1 . Ex-
ample 3: In contrast to Example 1, one can also ‘create’ a system using a CPTP map, which
takes input states from the Hilbert space H = C, by defining Cρ c ≡ ρ for any c ∈ C and some
fixed density matrix ρ. Note that, according to this example, ‘states’ in quantum mechanics
are CPTP maps. Example 4: An extension of Example 3 and in some sense the opposite of
Example 2 is the map which ‘adds’ a fixed state ρ2 to an input state ρ1 : Cρ2 ρ1 ≡ ρ1 ⊗ ρ2 .
time evolution of the system, the experimenter measures the value of R at an ar-
bitrary set of times {tℓ }nℓ=0 , where here and in the following we assume the order
0 = t0 < t1 < · · · < tn . The results of the measurement at time tℓ are denoted by rℓ . By
repeating the experiment many times, the experimenter can gather enough statistics to
construct (approximately) the joint probability distribution p(rn , tn ; . . . ; r1 , t1 ; r0 , t0 )
to observe r0 at time t0 , r1 at time t1 , and so on and so forth up to the last measure-
ment giving result rn at time tn . Since the subscript at the measurement result rℓ is in
one-to-one correspondence with the time tℓ of the measurement, we write for brevity
To save further space in the notation, we write the sequence of measurement results
in boldface as rn P ≡ (rn , . . . , r1 , r0 ). In this notation the content of eqn (1.52) reduces
to p(rn ) ≥ 0 and rn p(rn ) = 1. Note that the sequence rn has n + 1 entries (and not
n) since our first measurement happens by convention at time t0 (and not at t1 ).
The conditions (1.52) of positivity and normalization have to be satisfied by any
probability distribution. Therefore, p(rn ) does not yet describe a stochastic process
that has one important additional structure. To uncover this additional structure, we
need to look at the joint probability distribution p(rn , . . . , rℓ+1 , rℓ−1 , . . . , r0 ) to obtain
the results rn , . . . , rℓ+1 , rℓ−1 , . . . , r0 by measuring the system at times tn , . . . , tℓ+1 ,
tℓ−1 , . . . , t0 , i.e. at all times except for time tℓ , where we perform no measurement.
To emphasize this fact, we denote this probability by p(rn , . . . , rℓ , . . . , r0 ). Then, we
define a classical stochastic process by the requirement that
X
p(rn , . . . ,
rℓ , . . . , r0 ) = p(rn , . . . , rℓ , . . . , r0 ) (1.53)
rℓ
for all ℓ ∈ {0, 1, . . . , n}. This is known as the Kolmogorov consistency condition
and it tells us that not measuring the observable R at some point in time is equivalent
to measuring it followed by marginalizing about the measurement result. Therefore,
the joint probabilities of a classical stochastic process form a hierarchy, where the
(n + 1)-time joint probability p(rn , . . . , r1 , r0 ) contains all the information about the
k-time probabilities (with k < n+1) obtained from measuring the system at any subset
of the times {tn , . . . , t1 , t0 }. Furthermore, an important mathematical result known as
the Daniell–Kolmogorov extension theorem says that we can also go the reverse way:
whenever we have a hierarchy of n-time joint probabilities satisfying the Kolmogorov
consistency condition, then they can be constructed as marginals of N -time joint
probabilities with N > n, which also satisfy the consistency condition. The important
point here is that this holds even in the limit of n → ∞. The Daniell–Kolmogorov
extension theorem therefore provides a bridge between experimental reality (where any
measurement statistics are always finite and described by n < ∞) and its theoretical
description (which often uses continuous-time dynamics in the form of, for example,
stochastic differential equations, which assume n → ∞).
24 Quantum Stochastic Processes
It is clear that the Kolmogorov consistency condition is in general not satisfied for
quantum systems. It is, however, noteworthy that even for classical systems eqn (1.53)
can be broken. This can happen simply for the reason that the measurement of a
classical system can be disturbing as well. Another reason is an experimenter choosing
to actively break the consistency condition. Examples of the latter kind include the use
of feedback control, where an external agent manipulates the dynamics of a system
based on the so far available information (say, based on rk up to some time tk ).
The future probabilities of observing rn (n > k) are then in general different from
the situation where the external agent had not observed the system previously (and,
consequently, had not applied any feedback control). Another example is a clinical trial
where the health of patients is monitored at regular intervals and where the process
can be actively changed by giving drugs. The health of the patients in the future then
depends on the question of whether their health was monitored previously or not (and,
consequently, whether they received drugs or not).
These examples show that the theory of classical stochastic processes breaks down if
an external agent intervenes in the process. The mathematical language appropriate for
such situations is the theory of classical causal models. Without going into its details,
we point out that causal models can distinguish between causation and correlation,
whereas classical stochastic processes cannot. Indeed, classical stochastic processes
allow correlations to be quantified between two random variables, say X and Y , but
the consistency condition forbids us to infer whether X is a cause of Y , Y is a cause
of X, or whether there is no causal relation between X and Y and their correlation
is the result of another common cause Z. In order to infer causal relationships, it is
important that we can change the process and thus violate the consistency condition,
for instance by looking at the behaviour of Y after forcing X to take on a certain
value. The following example makes this qualitative reasoning quantitative.
Exercise 1.13 We consider three binary random variables S, B and C with values s, b
and c. On a given day S describes whether the sun is shining (s = 1) or not (s = 0), B
describes whether the number of sunburns is high (b = 1) or low (b = 0) and C describes
whether the number of ice cream sales is high (c = 1) or low (c = 0). We assume the
notion of ‘high’/‘low’ to be chosen according to some reasonable threshold. We further set
the conditional probabilities to be p(b = 1|s = 1) = p(c = 1|s = 1) = p(b = 0|s = 0) = p(c =
0|s = 0) = λ, with λ ∈ [1/2, 1]; for example, λ = 1 implies that the number of sunburns is
always high if the sun is shining. By conservation of probability, p(b = 0|s = 1) = p(c = 0|s =
1) = p(b = 1|s = 0) = p(c = 1|s = 0) = 1 − λ. Furthermore, we assume the probability for
a sunny day to be p(s = 1) = 1/2, which implies that the sun is not shining with the same
probability p(s = 0) = 1/2. Now, first confirm that B and C are correlated unless λ = 1/2.
This can be done in several ways, for instance by computing the mutual information, which
we introduce in Appendix A in detail, between B and C:
X p(b, c)
IB:C ≡ p(b, c) ln = ln 2 − SSh (2λ − 2λ2 ). (1.54)
p(b)p(c)
b,c
Next, we turn to the correlations between S and B. Confirm that IB:S = ln 2 − SSh (λ)
and find the values of λ for which S and B are maximally correlated/uncorrelated.
Finally, we strongly believe that S is the cause of B (and also of C), i.e. the shining sun
triggers sunburn (and ice cream sales). But how can we make this intuition rigorous?
Assume that we have an external mechanism that can change whether the sun is shining
or not (which seems unrealistic at first sight, but we come back to it below). We therefore
introduce an additional intervention variable IS (not to be mixed up with the mutual in-
formation), which labels the following three actions i: do nothing, i.e. leave the sun as it
is (i = idle), make the sun shining (i = 1) or block sun shine (i = 0). Now, consider the
conditional probability p(b|s, i) for sunburns given sunshine s and intervention i. For i = idle
we set p(b|s, idle) = p(b|s) as defined above. Furthermore, we assume p(b|s, i = 1) = λ and
p(b|s, i = 0) = 1 − λ independent of s because i ∈ {0, 1} overwrites the natural value of s to
be identical to i. Next, confirm that the mutual information between B and IS is
X p(b, i)
IB:IS = p(b, i) ln = SSh [p(b)] − [1 − p(i = idle)]SSh (λ) − p(i = idle) ln 2, (1.55)
p(b)p(i)
b,i
where the marginal probability p(i) that we perform a certain intervention is assumed to be
controllable in an experiment. We now define that S is a cause of B if there are correlations
between IS and B. Confirm that there are no correlations between B and IS , i.e. S is not
the cause of B, if one of the following two cases happen: either p(i = idle) = 1, which
corresponds to the case that we do not perform any intervention and, hence, cannot test for
causality, or λ = 1/2, which implies that there are no correlations
P between B and S in the first
place. Furthermore, confirm that in general p(b, s) 6= i p(b, s, i), where p(b, s) = p(b|s)p(s)
is the joint probability from the beginning obtained without interventions and p(b, s, i) =
p(b|s, i)p(s)p(i) is the joint probability with interventions. Thus, the Kolmogorov consistency
condition (1.53) is broken in general. Show that the consistency condition is obeyed if and
only if p(i = idle) = 1 or p(i = 0) = p(i = 1). Can you see why? Of course, we could also
replace B by C above and find that sunshine causes a high number of ice cream sales.
Finally, let us return to our assumption that we can change the sunlight by an external
intervention. Indeed, such a mechanism is not easy to construct for a human. But to distin-
guish causation and correlation, it is not necessary that humans perform the intervention: it
could be also done by nature, for instance, due to a solar eclipse. Important is only that we
can fix the intervention variable independent of the other variables in the model.
¹ Hebrew his
heart.
(6.) Set (that is, the ruler does this, but, as usual, this is not
expressed when the proposition is intended to have a general
bearing) the perverse fool (generic――‘perverse folly’ then will be a
good rendering) in high places many a one, and the rich (but the
hiphil form is worthy of remark, ‘persons that make rich’) in a low
place ( ֵש ֶפלoccurs so punctuated at Psalms cxxxvi. 23 only, rendered
‘low estate’) sit.
7 I have seen I have seen serfs on
servants upon horses, horseback, and princes
and princes walking as walking like serfs afoot.
servants upon the earth.
¹ Hebrew
grace.
(15.) The toil (i.e. ‘anxious care,’ which is the meaning of this
word) of the foolish ones wearies him (another distributive plural;
the result of these various fools’ labour is weariness to each of them.
It is also to be noticed that the verb is feminine, and yet עמלis usually
masculine. Several nouns are, Stuart observes, masculine or
feminine ad libitum scriptoris. There is however, we suspect, a
perceptible difference in the meaning in these cases. The stricter
agreement denotes closer union between the verb and its
nominative; and if this be so, the idea of the passage may be
rendered by ‘the toil of the fools is self-weariness’), which (full
relative, equivalent therefore to ‘because’ he does) not know (or is
instructed) to (in order to) go towards (אל, LXX. εἰς) a city (not the
city, as is usually rendered.) The obvious meaning would surely be,
that the fool had lost his way, and hence as he is going wrong he has
simply his trouble for his pains.
¹ Hebrew
maketh
glad the
life.
XI. (1.) Cast thy bread upon the face of the waters, for in the
multitude of the days thou shalt find it. (This passage is usually
taken as an exhortation to liberality. Hengstenberg however
understands it to refer to ships and their cargo of grain. ♦ Zöckler
refers to Proverbs xi. 24 for a similar sentiment, and Luke xvi. 9; the
idea is clearly that of an unexpected return).
(2.) Give a portion to seven, and also to eight (see Job v. 19,
Micah v. 4 (5), for similar idioms; it is equivalent to our ‘everybody,
and some one else’), for not dost thou know what shall be
mischief upon the earth.
(3.) If they are full the clouds ( עבis the thick vapour that
appears and disappears) rain ( גשםis the storm rain which does
mischief or good according to circumstances, see chapter xii. 2) they
cause to empty (clouds do not always prognosticate rain; and even
if they should, a storm may do mischief rather than good); and if is
falling a tree in the south, or if either in the north (‘if’ is hence
emphatic) the place where may fall (contracted relative) the tree
(now with the article, for it is the falling tree spoken of above) there it
will be (the unusual form יהואhas troubled the commentators much:
Moses Stuart pronounces the אto be otiose, which is not explaining
the form at all. But may not the following be a sufficient explanation?
―― הואin this book is used in the sense of the existence of an
object: might not Koheleth coin a verb by adding the יof the present
tense, with the idea, ‘makes itself be’?――compare also Joshua
x. 24, Isaiah xxiii. 12, where this otiose אoccurs; the rendering of the
LXX. by ἔσται shows how they understood it, and so also the Syriac
and Vulgate. The whole sentence is ironical, when the tree has really
fallen, then we know which way it fell. The Masoretic accentuation of
this passage is peculiar――we should naturally have expected them
to have divided the verse into two clauses, at יריקו, ‘they empty,’
instead of which the greatest pause occurs at ‘north’ ַּב ָּצ ֑פ ֹון, but this
method of reading renders the irony of the passage; the verse will
then stand thus:――‘If the clouds are full of rain they will empty
themselves upon the earth, and so if the tree should incline to the
south, or if it should incline to the north――the place where it falls is
where it really will be.’ The accentuation is rhetorical rather than
logical, and the Masorets have shown great taste in their pointing).