Quantum Mechanics - Special Chapters PDF
Quantum Mechanics - Special Chapters PDF
QUANTUM MECHANICS
SPECIAL CHAPTERS
Greiner Greiner
Quantum Mechanics Mechanics II
Special Chapters (in preparation)
Greiner· Reinhardt
Field Quantization
Greiner· Reinhardt
Quantum Electrodynamics
2nd Edition
Greiner· Schafer
Quantum Chromodynamics
Greiner· Maruhn
Nuclear Models
Greiner· Milller
Gauge Theory of Weak Interactions
2nd Edition
Walter Greiner
QUANTUM
MECHANICS
SPECIAL CHAPTERS
With a Foreword by
D. A. Bromley
Springer
Professor Dr. Walter Greiner
Institut fiir Theoretische Physik der
Johann Wolfgang Goethe-Universităt Frankfurt
Postfach Il 19 32
D-60054 Frankfurt am Main
Germany
Street address:
Robert-Mayer-Strasse 8-10
D-60325 Frankfurt am Main
Germany
email: [email protected]
Title of the original German edition: Theoretische Physik, Ein Lehr- und Obungsbuch,
Band 4a: Quantentheorie, Spezielle Kapitel, 3. Aufl., © Verlag Ham Deutsch, Thun 1989
ISBN 978-3-540-60073-2
Library of Congress Cataloging-in-Publication Data.
Greiner, Walter, 1935 - [Quantenmechanik, English] Quantum mechanics. Special ehapters / Walter Greiner; with a
foreword by D. A. Bromley, p. cm. Includes bibliographical referenees and index
ISBN 978-3-540-60073-2 ISBN 978-3-642-58847-1 (eBook)
DOI 10.1007/978-3-642-58847-1
1. Quantum theory, 2. Electrodynamies, 3. Quantum field theory, 4. Mathematical physics. 1. Greiner, Walter, 1935 -
Theoretische Physik, English, Band 4a. Il. Title. QCI74.12.G74513 1998 530.12-dc21 97-24126
This work is subject ta copyright. AII rights are reserved, whether the whole or part of the material is concerned,
specifically the rights of translation, reprinting, reuse of illustrations, recitation, broadcasting, reproduction an
microfilm Of in any other way, and storage in data banks. Duplication of this publicatian or parts thereof is pennitted
only underthe provisions of the German Copyright Law of September 9, 1965, in its current vers ion, and permission
for use must always be obtained from Springer-Verlag. Viol.tions are liable for prosecution under the German
Copyright Law.
The use of general descriptive names, registered names, trademarks, etc. in this publication does not imply, even in the
absence of a specific statement, that such names are exempt from the relevant protective laws and regulations and
therefore free for general use.
modern physics teaching and the bridges that Greiner builds to the pioneering
figures of our science upon whose work we build are welcome ones.
Greiner's lectures, which underlie these volumes, are internationally noted
for their clarity, their completeness and for the effort that he has devoted to
making physics an integral whole; his enthusiasm for his science is contagious
and shines through almost every page.
These volumes represent only a part of a unique and Herculean effort
to make all of theoretical physics accessible to the interested student. Beyond
that, they are of enormous value to the professional physicist and to all others
working with quantum phenomena. Again and again the reader will find that,
after dipping into a particular volume to review a specific topic, he will end
up browsing, caught up by often fascinating new insights and developments
with which he had not previously been familiar.
Having used a number of Greiner's volumes in their original German in
my teaching and research at Yale, I welcome these new and revised English
translations and would recommend them enthusiastically to anyone searching
for a coherent overview of physics.
Theoretical physics has become a many-faceted science. For the young stu-
dent it is difficult enough to cope with the overwhelming amount of new
scientific material that has to be learned, let alone obtain an overview of the
entire field, which ranges from mechanics through electrodynamics, quantum
mechanics, field theory, nuclear and heavy-ion science, statistical mechanics,
thermodynamics, and solid-state theory to elementary-particle physics. And
this knowledge should be acquired in just 8-10 semesters, during which, in
addition, a Diploma (Masters) thesis has to be worked on and examinations
prepared for. All this can be achieved only if the academic teachers help to
introduce the student to the new disciplines as early on as possible, in order
to create interest and excitement that in turn set free essential new energy.
At the Johann Wolfgang Goethe University in Frankfurt am Main we
therefore confront the student with theoretical physics immediately, in the
first semester. Theoretical Mechanics I and II, Electrodynamics, and Quantum
Mechanics I - An Introduction are the basic courses during the first two years.
These lectures are supplemented with many mathematical explanations and
much support material. After the fourth semester of studies, graduate work
begins, and Quantum Mechanics II - Symmetries, Statistical Mechanics and
Thermodynamics, Relativistic Quantum Mechanics, Quantum Electrodynam-
ics, the Gauge Theory of Weak Interactions, and Quantum Chromo dynamics
are obligatory. Apart from these, a number of supplementary courses on spe-
cial topics are offered, such as Hydrodynamics, Classical Field Theory, Special
and General Relativity, Many-Body Theories, Nuclear Models, Models of El-
ementary Particles, and Solid-State Theory.
This volume of lectures provides an important supplement on the subject
of Quantum Mechanics. These Special Chapters are in the form of overviews
on various subjects in modern theoretical physics. The book is devised for
students in their fifth semester who are still trying to decide on an area of
research to follow, whether they would like to focus on experiments or on
theory later on.
The observation by Planck and Einstein that a classical field theory -
electrodynamics - had to be augmented by corpuscular and nondeterministic
aspects stood at the cradle of quantum theory. At around 1930 it was recog-
nized that not only the radiation field with photons but also matter fields,
e.g. electrons, can be described by the same procedure of second quantization.
VIII Preface
hear about in colloquia and seminars. The lectures may help to furnish better
orientation in the vast field of interesting modern physics.
We have profitted a lot from excellent text books, such as
3. Noninteracting Fields . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 81
3.1 Spin-Statistics Theorem. . . . . . . . . . . . . . . . . . . . . . . . .. 98
3.2 Relationship Between Second Quantization
and Elementary Quantum Mechanics .. . . . . . . . . . . . . .. 99
7. Superfluidity..................................... 193
7.1 Basics of a Microscopic Theory of Superfluidity . . . . . . . .. 194
7.2 Landau's Theory of Superfluidity . . . . . . . . . . . . . . . . . .. 205
for a gauge transformation (1.4) once the electromagnetic potentials are in-
troduced via the minimal coupling
e
p ----+ p - - A (1.6)
c
into the Schrodinger equation. The form of the wave equation, which besides
the Schrodinger equation could also be the Pauli equation or the Dirac equa-
tion or any other, remains unaltered. Sometimes the transformation (1.5) is
called a gauge transformation of first degree, whereas the transformation (1.4)
is denoted as a gauge transformation of second degree.
Inserting (1.3) into the second and third of Maxwell's equations (1.1), we
arrive at
411' .
-),
c
(1.7)
-47rQ.
V . A' + ~ acp'
= O. (1.9)
c at
Here the gauge function X is determined from
V2X_~~X
2
= _(V'A+~acp) (1.10)
2 c at c at
Equations (1. 7) finally read
1 a 2A' 411' .
V2A' --- --),
c2 at 2 c
(1.11)
1 a 2 cp'
V 2 cp' - -47rQ.
c2 at 2
1.2 Electromagnetic Plane Waves 3
V2A _ ~ fJ2A
c2 fJt2 0,
...."
v . A 0, (1.13)
ip o.
A typical plane-wave solution of these equations is charaterized by a real
vector potential A governed by the wave-number vector k, called the wave
vector for short, and the real polarization vector e:
A(r, t) 2elAoi cos (k . r - w t + 0:)
-- A oe i(k·r-wt) + A*oe -i(k·r-wt)
= Ao ei(k.r-wt) + C.c .. (1.14)
Here Ao = lAo Ie eia is the amplitude and "c.c." stands for the "complex
conjugate" of the first term. It is easy to see that the ansatz (1.14) is a
solution of the first equation of (1.13) if
w = kc = Iklc, (1.15)
and of the second equation of (1.13) if the polarization vector Ao is perpen-
dicular to the wave vector k,
Ao -L k. (1.16)
We say that Ao is transverse as it fulfills the last relation, i.e. Ao' k = O. The
corresponding electric and magnetic fields follow from (1.3) and (1.14):
E -2klAole sin (k . r - wt + 0:) ,
(1.17)
B -2lAolk x e sin (k· r - wt + 0:)
The Poynting vector is defined as
S = ~ExH (1.18)
47r
and, obviously, is parallel to k; remember that H = B for a charge-free and
current-free space. The time average of the Poynting vector, S, over one period
T = 27r /w of oscillation is given by
2
S = ~ IAol2 (1.19)
27rc
and represents the intensity of the electromagnetic plane wave.
4 1. Quantum Theory of Free Electromagnetic Fields
EXERCISE
Problem. Show that the general solution of the Maxwell equations can be
expressed by potentials AI and <pI with V . AI = 0 and <pI = 0 if V . j = 0
and (} = O.
V. (V x H - ~ aD) = V. (V x H) _ ~ a(v· D)
cat c at
= 47T (V. j) . (5)
c
Since the first term on the left-hand side is always equal to zero, i.e.
dive curl H) = 0, and the second term now vanishes because V D = 47T (} = 0,
we conclude that
V·j = o. (6)
The explicit solution for the gauge function X is obtained from a time inte-
gration of (1):
V·B 0,
V·E 0,
1 aB (1.20)
VxE --- ,
c at
1 aE
VxB = +--.
c at
The first three of these equations are automatically fulfilled if the Coulomb
gauge (1.12)
'P 0,
(1.21 )
V·A
°
is considered and (1.3) is used. The last of (1.20) then leads to the wave
equation
(1.22)
A(£,y,z,t) A(O,y,z,t) ,
A(x,£,z,t) A (x, 0, z, t) , (1.23) Fig. 1.1. The electromag-
netic field is confined to a
A (x, y, £, t) A(x,y,O,t) . box; a normal wave mode
with two full wavelengths
Remembering that for normal modes all degrees of freedom vibrate with the along the x axis is indi-
same frequency, we are lead to the ansatz cated
A (x, y, z, t) = A (x, y, z) e iwt (1.24)
Inserting it into the wave equation (1.22) we arrive at
(1.27a)
or
(1.27b)
The two polarization vectors E:kl and E:k2 are perpendicular to the wave vector
k, which determines the direction of propagation:
E:k<1' k = o. (1.28)
This is a result of (1.21) and (1.26). We choose the two polarization vectors,
Fig. 1.2. The polarization
vectors cku are perpendic- which now both lie in the plane perpendicular to k, as two linearly indepen-
ular to the wave vector dent vectors orthogonal to each other (see Fig. 1.2):
k. This reflects the trans-
verse character of electro- (1.29)
magnetic waves The normalization factor Nk appearing in (1.26) will be determined later. In
order to fulfill the boundary conditions (1.23), the wave vector k has to take
on the form
211"
k = (kx, ky, kz) = L {nl' n2, n3} , nl, n2, n3 E Z, (1.30)
We have made sure that the vector potential, which determines the real
electromagnetic fields, stays real by adding the complex conjugate to each
term of the sum; in order to avoid double counting the summation has to
be restricted to those vectors k = {k x , ky, kz} with kz > O. Each term in
the sum (1.31) contains a term with eik . x and also one with e- ikx , so that
the summation has to be limited; see also Fig. 1.4. The factors eiwkt have
Fig. 1.4. The half space been absorbed into the time-dependent Fourier coefficients ak<1' Because of
kz > 0 is shown with a the normalization factor N k , the Fourier coefficients do not represent the
hatching. Every vector k
amplitudes of the A field alone, but nevertheless reflect a measure for the
of this half space can be
transformed by a reflec- latter. Later on aku and at]" will be interpreted as creation and annihilation
tion into -k belonging to operators for photons. Because aku and a ku will not commute anymore as
the lower half space operators, we have to pay attention to their precise ordering from now on.
1.3 Quantization of Free Electromagnetic Fields 7
Inserting the expansion (1.31) into the wave equation (1.22) yields
"L "
L Nk eka [(_k2 aka (t)_~d2aka(t))
2 d2 e ik·x
k a = 1,2
c t
o (1.32)
(1.36)
H = 1 { d3 (E2 + B2)
81T J£3 X
(1.41)
We exploit the orthogonality relations
{d3xeik,xe-ik'.x = L 3 8kk , (1.42)
1£3
and
(1.43)
_(_1)17 81717 , ,
rd3x
JL3
_l_aA . aA*
87fc 2 at at
=
+(-It[akaa_ka+akaa~ka]} . (1.44)
Analogously the computation of the second line of (1.40) becomes
Nk = V27ffic 2 . (1.47)
£3 Wk
Then the energy of the electromagnetic field becomes equal to a sum of ener-
gies corresponding to "harmonic field oscillators":
EXERCISE
I = rd3x~(V
JL3 87r
x A)(V x A*),
Solution. With
A(x, t) = L Nk ek17 [ak17 eik .x + ak17 e- ik .x ]
k,17
and (1.37) we calculate
V x A(x,t) = i LNkk x ek17 [ak17eik.x -ak17e-ik.X]
k,17
and, analogously,
VxA*(x,t) = i LNklk'xekl171 [akl17leikl'X-akl17le-ikl.X].
k', (7'
x fL3
* ik X] [akl17 eik'·x - akl17,e
d3 x [ ak17e ik .x - ak17e-' ' * -ikl.X]
JL3
rd3xei(k-k ' )'''' = L3 bkk'
we deduce further that
I = - 8~ L L L3 NkNk' [(k . k') (ek17 . ek'17 / ) - (ek17 . k') (ek l17 1 . k)]
k,a k',o'
1= L
k,CT
L~~ N'f£3 [(akCTakCT + akCTakCT) - (-It (akua-kCT + akCTa':.kCT)] ,
Now we introduce the quantization. The expression (1.48) for the energy
of an electromagnetic field motivates us to proceed in an analogous manner
as with the quantization of phonons of the harmonic oscillator. 2 From now on
we designate aka as ala and interpret ala as the Hermitian adjoint of aka. If
the operators ala and ak' a' possess different quantum numbers, i.e. belong to
different oscillators, it is reasonable to assume that they commute with each
other. We require that
(1.50)
In addition
iI = 'L-2-
"' nwk (atka ak a + ak a at)
ka
k,a
(1.52)
L~k
k,a
is infinite because of the existence of infinitely many field oscillators. However,
we will discover that this infinite contribution plays no role in most of the
physical problems. Only in special cases is it necessary to consider the zero-
point energy, for example, in the change in the zero-point energy resulting from
a change in the volume or the boundary conditions of the physical system. As
an explicit example we will discuss the Casimir effect.
Heisenberg's equations of motion for the operators aka read
or (1.53)
The Hamiltonian (1.52) splits into separate contributions resulting from the
various normal modes of the electromagnetic field. As a consequence the
eigenstate has to factorize with respect to different normal modes. The eigen-
state I ... , nkcr, ... , nk' cr', ... ) of the electromagnetic field is a direct product
of eigenstates Inkcr) belonging to separate field oscillators. We write
I .. ·,nkcr, ... ,nk'a""') = .. ·Inka) .. ·Ink'a') .... (1.54)
The relation
t
akcrakcr
A AInka )
nka Inka) (nka = 0,1,2, ... ) (1.55)
represents the eigenvalue equation for the number operator f.ha of the normal
mode kcr; in a moment we will realize which particles are to be counted here.
This and the two following relations have been adopted directly from our
introductory course in quantum mechanics and need no further specification.
As a direct consequence of the commutation relations (1.50) we note
aka I· .. , nka, ... ) = Vnka I· .. , nka - 1, ... ) (1.56)
and
at.,. I· .. , nka, ... ) = Vnka + 1 I· .. , nkcr + 1, ... ) . (1.57)
Using (1.54) and (1.55) together with (1.52), we observe that the states
I ... , nka, ... ) are indeed eigenstates of the Hamiltonian H:
p = r
iL3
d 3 x _l_E x B ;
41fc
(1.60)
(l.62)
k,rr k,rr
Because of (l.55), the states (l.54) are also eigenstates of the momentum
operator:
pI· .. , nkrr, ... ) L nknkrrl··· ,nkrr,"')
k,rr
L nk nkrr I ... , nkrr, ... ) (l.63)
k,rr
with eigenvalues
(1.64)
EXERCISE
P = LnkilLilkrr .
k,rr
Start with the classical expression for p and use the Fourier expansion (l.37)
in the Coulomb gauge.
Solution. The expressions for the classical fields read E = -~ ~1- Vtp and
B = V x A. In the Coulomb gauge (without any sources) the potentials
become 'jJ = 0 and V . A = 0; the Fourier expansion of the vector potential
yields
14 1. Quantum Theory of Free Electromagnetic Fields
Exercise 1.3.
p = r
lL3
d3xE x B
47rc
.
E, 1 BA
= -~8t = i
~
L N kWkCkl> (akl>e
'k A l·r t
A
- akl>e
'k)
-1 ·r
. (1)
k,1>
(2)
k',er'
Observe that the electromagnetic fields have become operators. Then the mo-
mentum operator p results:
p= _1_
47rC 1£3
rd 3 xE x B
lL3
rd 3xe i (k-k').r = L30kk' ,
transform
Ckl> X (k' X ck'I>') = k' (ckl>' ck'I>') - Ck'l>' (ckl>' k') ,
and arrive at
(3)
Ps = 4L32
7rC
LL NkNk'Wk [k' (ckl> . ck'I>') - ck'l>' (ckl> . k')]
k,cr k',u'
1.4 Eigenstates of Electromagnetic Fields 15
The term rv E:ku ·k' vanishes for k = k' since E:ku ..l k. Furthermore E:ku 'E:ku' = Exercise 1.3.
buu" so that the double sum in Ps breaks down:
With
i.e.
The sum Ek, u ~k over all momenta is equal to zero, because positive and
negative contributions cancel pairwise. Thus
(4)
Pas =
£3
-41rc2 L NkN-kWk[-k(E:ku·Cku,)+Cku(E:ku· k )]
k,cr,cr'
Again E:ka' k = 0 vanishes and E:ku' E:-ku' = -(-1)"" baa'; see Fig. 1.5. We
deduce
- £3 2
41rC
Lk
[NkN_kWk (-k) (o'klo'-kl + o'llo'~kl)
Pas = O.
Then the total momentum of the electromagnetic field finally reads
16 1. Quantum Theory of Free Electromagnetic Fields
(1.65)
where (1.3), (1.12), (1.37), and (1.39) have been used. It is tiresome to carry
along the sum over k and rJ all the time; therefore we will suppress this sum
in the following and only track one representative term out of (1.65):
E = +i j27flkv
----V-C [a (t)
·k e lX - a* (t) e- l·k .X] . (1.66)
This term describes one normal mode of the electromagnetic field. We are
tempted to change the classical electric field of E in (1.66) straight forwardly
into an operator E by writing
,
E(x, t) = +i j27flkv
----v-e [a(t) e ·k
l ·X - at (t) e- l·k .X] . (1.67)
However, some care has to be taken. We start again with the original classical
equation E = - ~ ~1 and define the operator for the electric field by the
operator equation
(1.68)
For the computation of the commutator we need to consider only the terms
in the expression for iI and A. with the same a and at; they characterize one
normal mode. The operators belonging to different normal modes commute
because of (1.50). We get
-- IV L3wk
c
27fhc2 " ..
--liWk ( i) [,
--
h
c ae ik.x+,t
a e -ik.x,t,]
a a
' -
_ .j27flkv
- I L3
k [, ik·x _ ,t
e ae a e
-ik.x]
, (1.69)
which is identical with the straightforward proposition (1.67). The last ap-
proach makes use of the definition of the vector potential as a quantum-
mechanical operator and of Heisenberg's equations of motion. It represents
the correct quantum-mechanical approach.
Generally the electromagnetic field contains many photons. It is charac-
terized by an N-photon state like the one of (1.54), which is a direct product
of states Ink,,) describing nk" photons of mode krJ. With In) we simply denote
the state corresponding to the mode characterized by the operators a, at. In
1.5 Coherent States (Glauber States) of Electromagnetic Fields 17
(nIEln) = i J27rnw
A 'k
--v-c(nlae"x - ate-'
'k
,xln) = 0, (1. 70)
where
(1.71)
enters because of relations (1.56) and (1.57). On the other hand the expecta-
tion value of the square of the electric field, which apart from a factor of 87r
describes part of the energy density, is equal to
27rnw
- - c ' c (nl (ae''k ,X - ate-'
'k
'X)
87r£3
x (ate- ik .x - ae ik .x ) In)
-- naa +a an
27rnw (IAAt
87rL3
AtAI)
This result is surprising. The expectation value of the E field in the n-photon
state vanishes, whereas, except for the zero-point energy, the energy density
is equal to the half of n photons times nw per volume L3. The other half of
the expected energy density results from the term i1 . i1 t/87r. This puzzle is
resolved once we realize that statistical phases are assigned to the n photons
in the field. After we average over the phases, the expectation value of E
vanishes. On the other hand the phases drop out for the energy density; the
energy density is a real positive number for every space-time point.
It was Glauber,3 who introduced a field state, for which the expectation
value of the E operator resembles the classical E field. It allows for an un-
certainty in the photon number in order to define the phase of the field more
precisely. The Glauber state Ie), often also called the coherent state, reads
L
00
f: Ibnl
n=O
2 = e- 1c12 f: Ic~~n =
n=O
e- 1c12 e+ 1c12 = 1, (1. 76)
Since the Glauber state has no definite photon number, the expectation value
of the annihilation operator is not zero; this is in contrast to the incoherent
state (1.54) with definite photon number. We calculate
00 00
m=On=O
m,n=O
m=O
c, (1. 77)
(cIElc) =
A
+i
J21f1iW
----ve(clae1X -
'k t
a e- 1
'k
'Xlc)
+i J21fhw
----ve[ce ok oX_c*e- 1ok ,x]
1
-21cl J21fhw
----v- e sin(k ' x + 6c ) , (1. 79)
This expression has the form of a classical electromagnetic wave; see (1.17),
Its amplitude is given by Icl and its phase is determined from the phase of
c = lei ei8c , It is worth mentioning that the second line of (1. 79) is formally
identical to the expression (1.67); the creation and annihilation operators a, at
have been substituted by the complex amplitudes c, c*,
EXERCISE
The Glauber state has no definite photon number n. We therefore ask for
the uncertainty ,1n of the photon number within the coherent state Ie), which
is defined as
EXERCISE
Problem. Determine the mean quadratic deviation of the electric field within
a coherent state and show that it vanishes in the classical limit.
The expectation value of E within the Glauber state has already been calcu-
lated in (1. 79). We recapitulate:
(eIEle) (eIEle)*
2nru.u 2 2
----v- c . c (lei + le* I
-e2 ei2k-x - (*)2
e e -i2k.x) . (2)
Substraction of (2) from (1) yields the mean quadratic deviation
t 2 2n ru.u
(elE . E Ie) - I(eIEle) I = ----v-. (3)
A A A
In the classical limit, i.e. n ---> 0, the mean quadratic deviation goes to zero.
1.5 Coherent States (Glauber States) of Electromagnetic Fields 21
EXAMPLE
Here we will discuss an effect that gives interesting insight into the nature
of the electromagnetic field and its role for quantum mechanics. It is a typi-
cal interference phenomenon that can be understood at an elementary level;
strangely enough, its importance was recognized only 30 years after the de-
velopment of quantum mechanics. 4
Let us recapitulate how the interaction between charged particles is de-
scribed in the classical picture. Following the conception by Faraday and
Maxwell this interaction is not given via a remote action, but as a local
interaction with an electric field E(x, t) and a magnetic field B(x, t). It is
sufficient to know the field strengths E and B at each space-time point of
the particle's trajectory to describe the motion of a particle with charge e.
Newton's equation of motion holds with the Lorentz force
F= e ( E+ ~x B) . x (1)
AIo 1 aX
Ao - ~7ii' (3)
AI A+ Y'x
with an arbitrary function X(x, t) leave the electromagnetic fields and, conse-
quently, also the Lorentz force invariant.
In quantum mechanics the notion of a force is not of much use. One starts
instead with a Hamiltonian. The Hamiltonian of a nonrelativistic charged
particle moving in an electromagnetic field contains the potentials Ao and A,
e)2
H = - 1 ( p - - A + eAo , (4)
2m c
and leads to the Schrodinger equation
The wavefunction tJ! now directly depends on the potentials Ao and A and only
indirectly on the fields E and B. Even when fields vanish, the wavefunction
tJ! is still influenced by the potentials.
4 Y. Aharonov, D. Bohm: Phys. Rev. 115 (1959) 485.
22 1. Quantum Theory of Free Electromagnetic Fields
Example 1.6. To begin with, we study the special case in which A leads to a stationary
and curlfree field B = V x A = o. If the wavefunction!lio represents a solution
of the Schrodinger equation (5) with A = 0 but otherwise arbitrary Ao(x, t),
then we can easily show that the wavefunction
Some care has to be taken when performing this step; this transformation
only holds when the curl of A is vanishing. (We convince ourselves with a
simple counter example. A homogeneous magnetic field in the z direction
B = B· e z = V x A is achieved A = ~Brecp; we choose the integration path
in the radial direction, so that e r . ecp = 0 and the path integral vanishes.
Then (7) cannot be fulfilled.) Only if the curl of the vector potential is equal
to zero is the path integral in depend end of the chosen path. With the help of
(7) we find
and
which are integer multiples of 7r. Assuming a large distance between the screen
and slits, L » d, we approximate the angle of diffraction by tan j3 = YI L ;::j
5sId (see Fig. 1.6), so that
58
0_
O'P = 'P2 - 'PI = "hp 5s = "hp LYd = X Yd
= XL ; (13)
compare with Fig. 1.6. For the interference maxima we then find
L
Yn = n27rX d . (14)
24 1. Quantum Theory of Free Electromagnetic Fields
Example 1.6. Because of the finite width of the slits a small destructive interference leads to
a decrease in intensity for the interference maxima; the interference pattern
is shown in Fig. 1.6.
What is going to change if a vector potential A (x) is introduced into
the diffraction experiment? According to (6) new phase factors come into
play. It seems plausible to identify the path from the source through the slits
to the point x on the screen as the integration path r (x). Different paths
n r
(x) and 2 (x) belong to the two slits and correspond to different partial
wavefunctions !Ii! and !Ii2 ; see Fig. 1.7. Then the new phases result in
Dip = Dip° +~
fie
(J, -J, )
r2 r1
dx' . A (x') (16)
Now we realize that the difference between two path integrals can be written
as an integral over the closed path r. Furthermore, applying Stoke's theorem,
we get
i dx' . A (x') = J dF . (V x A)
This is just the magnetic flux going through the surface surrounded by the
curve r r r
= 2 - l . The phase difference
(21)
A detailed analysis shows that the envelope of the interference pattern does
not change. This shows that on average the electron beam is not deflected
Fig. 1. 9. Illustration of
by the localized B field; only quantum-mechanical interference effects are the vector potential A =
influenced. Nevertheless the result seems to be stunning. Although the elec- {O, 0, A",} of an ideal sole-
tron beam encounters no magnetic field at any place, its presence results in a noid
26 1. Quantum Theory of Free Electromagnetic Fields
Example 1.6. displacement of the interference pattern. Referring to the concept of a local
interaction, which we have good reasons to believe in, we come to the con-
clusion that the vector potential A(x, t) plays a more fundamental role than
the magnetic field B(x, t). On the other side because, of the gauge freedom,
A(x, t) contains too much information. The only true and decisive quantity
remains the phase integral exp [i ;c Ir
dx l . A (Xl) ].
.Q; .QJ
I 2
The experiment sketched in Fig. 1.8 was performed in a slightly modi-
\ fied form soon after the proposal by Aharonov and Bohm. 5 Instead of two
\ I
\ I slits a biprism was used to achieve larger intensities. Such a biprism consists
\ I of a thin electrically charged wire between two grounded plates. It deflects
\ J
\ I the beam as if it came from two spatially separated sources Qi and Q~; see
Fig. 1.10. Chambers 6 added a thin ferromagnetic metal fragment, a so-called
whisker, behind the biprism and was then able to detect the change in the in-
terference pattern. Somewhat more elegant was the experiment of Mollenstedt
and Bayh. 7 With a clever electron optical device they succeeded in separat-
ing the two partial beams by a distance of about 60 11m. A freely suspended
air solenoid of 5 mm length and 7 11m radius (!) was introduced into the
Fig. 1.10. Electrostatic gap. Varying the current through the solenoid, they were able to displace the
biprism interference pattern continuously; the envelope remained unaltered. The dis-
placement was in full accordance with the expected result (21). Although the
Aharonov-Bohm effect follows naturally from quantum mechanics, its exis-
tence had been questioned repeatedly. New experiments 8 use small toroidal
magnets instead of solenoids; one of the two beams goes through their open-
ings. In this manner the magnetic scattering field can be eliminated. In addi-
tion the torus was covered with a superconductor to exclude any penetration
of the electrons into the magnetic region. None of these precautions changes
the experimental results.
The Aharonov-Bohm effect (the influence of the potential on the phase
of a wavefunction) does not only hold for magnetic fields. In fact, a relation
analogous to (6) can be derived for the case of electric fields. The presence
of a spatially constant but time-dependent electrostatic potential Ao(t) alters
the phase of a wavefunction according to
5 In fact the experiment was proposed 10 years earlier by W. Ehrenberg and R.E.
Siday, Proc. Roy. Soc. 62B (1949) 8, but at that time it did not raise much
interest. Independently, Aharonov and Bohm developed the idea and analyzed it
on more profound theoretical grounds.
6 R.G. Chambers: Phys. Rev. Lett. 5 (1960) 3.
7 G. Mollenstedt, W. Bayh: Naturwiss. 49 (1962) 81.
8 Osakabe et al.: Phys. Rev. A 43 (1986) 815.
1.5 Coherent States (Glauber States) of Electromagnetic Fields 27
shorter than the length of the boxes. As soon as the wave is inside the box, a
voltage U is applied for a short time. The phases then become
so that
(24)
is obtained for the relative phase. In order to have as much similarity as pos-
sible to (18) for the magnetic field case, the relation E = -V Ao has been
employed and an integral over a spatiotemporal region has been introduced.
The electric flux defined by the double integral of (24) is gauge invariant as in
the case for the magnetic flux of (17). The electric flux also leads to a trans-
lation (shift) of the interference pattern. This holds although the electrons
are inside the Faraday cages and thus do not encounter any E field as the
potential is switched on. On the other side the electron wavefunction is spread
all over space with, of course, different strengths.
From these considerations about the Aharanov-Bohm effect we conclude
that in quantum mechanics the interaction of a charged particle with an elec-
tromagnetic field results in a phase factor
This contains the potentials Ao and A as fundamental quantities and not the
field strengths E and B. The factor Rl2 specifies the probability amplitude
for a particle to propagate from space-time point (Xl, h) to point (X2' t2)'
The phase factor depends on the trajectory X (t). In principle, all possible
trajectories have to be considered, each of them with its own phase factor.
28 1. Quantum Theory of Free Electromagnetic Fields
Example 1.6. In fact, this conception provides the basis for an alternative, yet equiva-
lent, formulation of quantum mechanics; i.e. the "path integral" approach 9 to
quantum mechanics.
Finally we now show that the expression (25) also incorporates "macro-
scopic effects" of the electromagnetic field as the deflection of a charged par-
ticle due to the Lorentz force. Again we concentrate on the case of the simple
two-slits experiment and on small deflection angles. The considerations leading
to (19) and (21) are repeated with the additional assumption that the elec-
trons run through a sector of width D, which is penetrated by a homogenous
magnetic field B; see Fig. 1.12. The magnetic flux determines the deflection;
it is given by
<P = f dF . B ~ dD B . (26)
LlY = L Py = DL eB . (28)
P pc
This proves (in first order in (3) that the phase factor (25) includes the ac-
tion of the Lorentz force. A more detailed analysis would show that the total
probability distribution is shifted as sketched in the Fig. 1.12; this is in con-
trast to the Aharanov~Bohm effect, for which the envelope of the interference
9 See for example R.P. Feynman, A.R. Hibbs: Quantum Mechanics and Path Inte-
grals (MacGraw-Hill, NY 1965) and Chap. 13 of this book.
1.6 Biographical Notes 29
patter remains unchanged, indicating that on average no action of a force has Example 1.6.
occured. 10
,
Hmp " Pi 2
= '~--+V. (2.2)
i 2mi
The index 'mp' stands for 'many particles'.
How does this many-particle system interact with electromagnetic fields?
From the lectures on quantum mechanics' we remember that it is necessary
for a quantum theory to remain gauge invariant. The interaction between ra-
diation and matter has to be in such a way that gauge transformations of
the form (1.4) do not change the observable quantities predicted from theory
(eigenvalues, expectation values, transition probabilities, and so on). This im-
portant requirement for gauge invariance is fulfilled with the minimal coupling
Pi
,
---> Pi - -ei A' ( Xi·
A ) ( 2.3 )
c
Together with the Hamiltonian for the radiation field, (1.40) and (1.48), the
Hamiltonian for the many-particle system (2.2) then becomes
J
i
E . E* + 11 . i1*
+ 8~ d3 x (2.4)
----
Hmp + Hrad + Hint.
I W. Greiner: Quantum Mechanics - An Introduction, 3rd ed. (Springer, Berlin,
Heidelberg 1994).
This is the Hamiltonian for the total system "matter + radiation". The in-
teraction operator Hint for interaction between the many-particle system and
the electromagnetic field follows from calculating the square of the expression
inside the sum of (2.4),
,
Hint = """ [ei"
--Pi' A(Xi) + -e;,
- A2 (Xi) ] . (2.5)
L i m·c 'l,
2mc 2 1.
Here, the so-called Coulomb gauge V . A = 0 has been employed, which allows
us to write p . A + A . P as 2p . A. In the following, we suppress the sum and
the corresponding index i for simplicity, so that
, e,' e2 , 2
Hint = - mcP' A(X) + 2mc2A (X) (2.6)
represents the interaction of one particle with the radiation field. We will come
back to the explicit sum (2.5) when necessary. Now we separate the interaction
(2.6) into a term proportional to A and another term proportional to A2,
+ Hint
, , I ' "
Hint = Hint , (2.7)
and deduce further with (1.31) that
(2.8)
and
H' /I
(a) ~nhw' int
hw ~
(b)K
~ jk'
i (2.9)
n
hw The interaction Hint between the electromagnetic field and the many-particle
~
for photons linearly, HInt induces transitions for which one photon is either
created or annhilated. On the other side HI~t induces transitions where two
photons are involved. For example the processes (at a, aa t ) describe absorption
and reemission of one photon or vice versa (see Fig. 2.1). We will demonstrate
this point further with various examples.
An atom is in the initial state lai) and decays into the final state laf) by
emitting a photon with wavevector k and polarization 0". Subscripts i and f
stand for "initial" and "final", respectively. The initial and final states of the
total system "atom + radiation field" are denoted by Ii) and If), respectively,
and, more explicitly, are given by
where I and II describe a complete set of states. Here, Mfi represents the
transition matrix element. In our case the contribution of first order to Mfi
results merely from HInt and we get with (2.8)
(2.14)
In the second step we have taken into account that only the term with the
creation operator at,
contributes within the infinite sum. Equation (1.57)
has been employed for the last step. The 8 function appearing in (2.13a)
guarantees energy conservation:
Ef - Ei = Ear + nwk - Eai Ear - Eai + nwk = O. (2.15)
~ '-v-'
=E f =Ei
The photon with energy nwk takes away the energy Eai - Ear from the atom.
With (2.13a) and (2.14) we obtain the transition probability per unit time
for the transition of the atom from lai) to laf) by emission of a photon in
first-order perturbation theory:
tIme emission It
27l' ( e ) 2 (27l' he 2 )
Ii: me L3 Wk (nka + 1)
x l(aflp'E:kae-ik,xlai)12
x8(Eaf - Eai + nwk) . (2.16)
The factor nka + 1 in the radiation matrix element has deep and far-reaching
consequences. The probability for emission is not solely proportional to the
number of photons nka already contained in the initial state; this would be
stimulated emission alone. Emission also exists if initially no photon is present
at all (nka = 0); this is known as spontaneous emission. Once again, the in-
duced emission is proportional to the number of photons nka already present
in the radiation field. In some way they shake the excited state and stimulate
the transition. It is the zero-point oscillations of the radiation field that effect
the transition because of their "shaking". These zero-point oscillations are of
pure quantum-mechanical origin and are always present. The 1 appearing in
nka + 1 results from the commutation relations (1.50); this is for the same
reasons as the term ~ appears in (1.52) for the zero-point energy of the radia-
tion field. It is really remarkable and very satisfying indeed that the theory we
have developed so far automatically includes spontaneous emission. We will
2.2 Lifetime of an Excited State 35
n: L 1UIH[ntl
211" """"'
k,rr
' i ) 12 J(Eaf - E a, + Wk)
(2.17)
We investigate the limiting case L3 -+ (X) for this equation. The volume of the
box to which the electromagnetic field is confined goes to 00; the electromag-
netic field then occupies the whole space. For this limiting case we will show
f
that
Lk
->
L3
(211")3 d 3k (2.18)
becomes valid. For this we consider the three-dimensional lattice space with
axes nx,ny,n z ; ni E Z. Every lattice point {nx,ny,n z } represents a normal
mode of the electromagnetic field with a certain polarization.
Every vector n = {nx, ny, n z } represents such a normal mode; each vector
begins at the origin. Into a lattice volume t:mx 6.ny 6.n z , which reaches from
nx to nx +6.n x , ny to ny +6.ny and n z to n z +6.n z , fall exactly 6.n x 6.ny 6.n z
arrow heads (lattice points). Therefore, exactly 6.n x 6.ny 6.n z electromagnetic
normal modes exist in this volume. According to (1.30)
Lki = 211"ni (i = 1,2,3 for x,y,z) (2.19)
Fig. 2.3. All normal
and thus
modes n are counted as
L 6.k i 211" 6.ni , (2.20) the number of points in-
side the box with volume
from which llnx llny llnz
6.n x 6.ny 6.n z
L3
(211")3 6.k x 6.k y 6.k z (2.21 )
follows for the number of normal modes falling into the interval [ki' k i + 6.kiJ.
In the limiting case L -+ (X) and 6.k i -+ 0 (2.21) goes over to
L3
(211")3 d 3k (2.22)
from which (2.18) follows.
36 2. Interaction of Electromagnetic Fields with Matter
We return now to the determination of the lifetime (2.17) and choose the
two polarization vectors eka as depicted in Fig. 2.4. It follows that
L l(aflp'ekae-ik'''lai)12 I(af Ip. ek
A
2 e -ik'''1 ai )1 2
a=I,2
(2.23)
Light emitted from atoms typically has energies of Iiw ~ 10 eV. We conclude
that
Fig. 2.4. The polarization
vector ekl points into the 27l" Iiw
k·x ~ TaBohr lie aBohr
plane of the page; ek2 falls
into the plane given by p 10eV ~ x 1O- 8 cm
and k and is of course per-
pendicular to k. The kz 1.97 x 10- 5 eV cm
coordinate axis is chosen 2.7 x 10- 3 « 1 (2.24)
to be parallel to the vec-
tor (afljllai); hence, {) rep- and expand the exponential appearing in (2.23) into a Taylor series:
resents the polar angle in
k space
e-ik-x = 1 - ik . x - ~(k . X)2 + .... (2.25)
Because of (2.24) we restrict the expansion to the first term only; thus (2.23)
becomes
L l(aflp'ekae-ik-xlai)12 ~ !ek2·(aflpl a i)1 2 . (2.26)
0"=1,2
For reasons that will become clear in a moment this approximation is called
the dipole approximation. The next terms in expansion (2.25) would lead
to magnetic dipole radiation, electric quadrupole radiation, and so on. The
quantum-mechanical theory of multipole radiation represents an important
concept in nuclear physics; see the specialist literature. 3 Looking again at
Fig. 2.4 we can further simplify expression (2.26) into
Using also (2.23) and (2.22), we find that the lifetime (2.17) now becomes
( ~)
T i
-;
f
= ~
27l"m
J d3 k ~ I(aflp lai)1 2 sin2 {) 8(Ear -
Wk
Eai + liw k ) .
(2.28)
We introduce spherical coordinates in k space and set the k z axis along the
direction of the vector (aflplai) (see Fig. 2.5). Then it follows that
d3 k k 2 dk sin {) d{) d<p
(2.29)
Fig. 2.5. Spherical coor- In the last step, the integration J d<p = 27l" has been performed because ex-
dinates in k space pression (2.28) does not dependent on <po Thus we deduce
3 J.M. Eisenberg, W. Greiner: Nuclear Theory, Vo!' 2. Excitation Mechanism of the
Nucleus, 3rd ed. (North-Holland, Amsterdam 1975).
2.2 Lifetime of an Excited State 37
e2 27r
27rm 2 c3
J dWk Wk !(af!p !ai)!
"2
with Wfi = (Ea. - E a,) / Ii as the transition frequency. The matrix element
(af!fJ !ai) can be cast in another form, i.e.
dx
(af!fJ !ai) (af!mill!ai)
i A A
-fim(af!xHmp - Hmpx!ai)
( ~)
T i ..... f
= 4e:~!i !(af!x!ai)!2
3nc
. (2.32)
-fi[X' HmpJ- .
Applying the trick (2.31), we recognize that the momentum matrix element
e(af! fJ !ai) can be expressed as a matrix element of the dipole operator ex. By
now the designations dipole approximation and dipole radiation introduced
earlier should have become clear.
Yet another form can be derived for the expression (2.32) of the lifetime
once Heisenberg's equations of motion are applied twice,
d2 x dfJ
(af!m dt 2 !ai) (af!ill!ai)
i A A
-fi(af!fJHmp - HmpfJ!ai)
-k(Ea. - Ea,)(af!fJ!ai)
i dx
-fi(Ea• - Ea,)m(af! ill !ai)
- ; (Ea. - EaY(arlx!ai) ,
(2.33)
38 2. Interaction of Electromagnetic Fields with Matter
( fiwfi)
7 .I-+f
= 4e21{afld2Xlai)12
3c3 dt 2
(2.34)
energy _ 2e21 d2 x 12
(2.35)
time - 3c3 dt 2
We illustrate the theory developed so far with a couple of examples.
EXERCISE
Problem. Show that the selection rules for dipole transitions are given by
1:11 = ±l and I:1m = ±1,0, where I and m represent the angular momentum
quantum numbers of the electron.
M rv Jd3r1P~'l'm,(r)r1Pnlm(r).
To begin with we express the components of the position vector in terms of
If ·
spherical harmonics. We need
- - sin {) el'P
If .
Yl1 CI9, rp)
81T '
- sin {) e -1'1'
81T '
sin {) cos rp
sin {) sin rp
2.2 Lifetime of an Excited State 39
ax -b x -~fi
2 3'
ay by -;ifi,
az ~.
We deduce further that
x l' sin 'l9 cos cp r(a xYl1 + bxY1- 1 )
y l' sin 'l9 sin cp r(a Y Yl1 + byY1- 1 )
z l' cos 'l9
Referring to the matrix element M above, we multiply these spherical har-
monics further with Yim('l9, cp). The following integrals appear:
and
with
dS? sin'l9 d'l9 dcp .
Generally
= ((2 h +( 1?(21 2
47r 23 +1
r1)) 1
2 (1112 13Imlm2m3)(1112131000)
4 See e.g. W. Greiner: Quantum Mechanics - Symmetries, 2nd ed. (Springer, Berlin,
Heidelberg 1994).
40 2. Interaction of Electromagnetic Fields with Matter
For all other cases the angular integrals vanish, so that the matrix element is
zero.
EXERCISE
nwl 2
s p
= EI -
s
E2
P
= me 4
21i,2
(1 _~)
4
= ~8 ea2
has been used. In spherical coordinates the unit vectors are given by e±1 =
=f ~(ex±iey) and eo = e z . As a consequence, for m = 0 only the z component
of the matrix element of r does not vanish. Thus, it follows that
1
T
4(3)3 :3 (2)4
3 8" ~ 1(2p,m = 01zlls)1 2
This matrix element becomes
J d 3 r1j;*(2p)z1j;(ls)
_1_
87ra 3
J~ a
e- r / 2a v'2 cos '/J(r cos '/J)e- r / a r2 dr sin'/J d'/J dcp .
2 V21°O
- - err d -3r/2a 4
34a4 0
4v'2 (~r a,
so that
1
T
or
T = (3)8
2
a (hc)4
c e 2
(3)8
2
a
ca 4
Inserting numbers yields
T ::::! 1.6 X 10- 9 s .
EXERCISE
Problem. Show that the 2s state of the hydrogen atom cannot decay via the
p . A interaction into the Is state by emitting a photon, i.e.
Since
_ 1 -ria
1j;18 -.J7ra 3 e ,
we deduce that
42 2. Interaction of Electromagnetic Fields with Matter
Excercise 2.3.
and derive
h
(2Slc· pe- 1'k 'Tlls) = -:-
la
J c· r e- 1'k ·z
d 3 r 1/J2s'l/Jis--
r
.
EXERCISE
AA -_ "
~
NkCku [Aaku (t) e ik·r + aAtku (t) e -ik.r]
k,u
H"' becomes
H"' = - ieh "N
2mc ~ k
ir. (k xc
ku
) [a ku eik . r + atku e- ik .r ]
k,u
H"'
A ieh
= - 2mc
- - L NO'· (k xc) [a
k ku ku
e1'k · r - t e- 'k]
aku r1 ·
k,u
Here k is no longer an operator; the minus sign in the second term results
from differentiation of the exponential.
2.2 Lifetime of an Excited State 43
EXERCISE
Problem. The magnetic interaction between the spin of the electron and the
nuclear spin splits the ground state of the hydrogen atom into two levels: one
with total spin 1 and one with total spin O. The photon, which is emitted
from the transition between these two states, comes with a wavelength of
>. : : : 21cm. Use the results of Exercise 2.4 to calculate the lifetime of this
transition. For the calculation of the lifetime you may choose one particular
state of the degenerate spin-l triplet. Why?
Solution. The spin of the electron 1p'I)e and that of the proton 1P,2)p couple
to the total spin wavefunction
This final state carries angular momentum a for the electron-nucleus system.
The one photon carries the total angular momentum of the state, i.e. 1. The
lifetime is given by the expression
1
T
l: 2
2; iUIif'''li)i 8(DoE - hck).
final states
The 8 function guarantees energy conservation; here DoE = nwk = hcko, with
ko = 21l'j21 em-I, represents the energy difference between the two states,
which is equal to the energy nwk = hck of the emitted photon. Furthermore
(see Exercise 2.4),
H'" = ien
2me £
)"2 ""'
3 L
2
At e -ik·r .
rJ:U
k
,a
vck
Now k has become a continuous variable; compare again with (2.18). It follows
that
Furthermore
aLii) = 11s)el i)e IT)nucleusl···, lka,·· .)rad ,
X 0-. (k X cka)llls)elT)elT)nucleusI1ka)).
The scalar product is
u· (k x cka) = &x{k X cka)x + &y{k X cka)y + &z{k X cka)z.
We use the Pauli matrices
(Jx =
(0 1)
1 0 '
A
(Jy =
(0 -i)
i 0 '
A
(Jz =
(1 0)
0 -1
and deduce
2.2 Lifetime of an Excited State 45
Because of the orthogonality of If) and Ii) only those configurations contribute Exercise 2.5.
for which the electron switches from the initial state Ii)e to the state Il)e due
to 0-. Hence, only (Jx and (Jy contribute:
1 .
M = - V2 [(k x ck(J")x + l(k x ck(J")Y] .
Initially we have chosen the spin to be along the z axis. In such a system z
it is not convenient to evaluate the cross products appearing in M. Thus, we
rotate the coordinate system in such a way, that the z' axis coincides with
the direction of k (see Fig. 2.6):
x x' cos fJ - z' sin fJ ,
y y' ,
z x' sin fJ + z' cos fJ . x
becomes
+ [(k x Ck(J")y,]2}
in the rotated coordinate system. The latter has been chosen such that
k = ke z "
so that
(k x ck(J");'
(k x CkG" )z'
(k x Ck(J" )~,
We get
(J"
fJ represents the angle between the spin vector and k. Finally we obtain
~T = ~~
4m 2 c2 211"
(J d k ~k2(1
3
k 2
+ cos 2 fJ)c5(ko - k)) ,
~~~ J
T
k 3c5(ko - k) dk
4m 2 c2 2 3
~ e 2 11, k 3
3 m2c2 o'
46 2. Interaction of Electromagnetic Fields with Matter
Exercise 2.5. Inserting the value ko = 21r /21 em -1, we calculate a lifetime of
T ~ 3.4 X 10 14 s ~ 10 7 years.
Now it has become obvious that these kinds of transitions are not observable
in a laboratory. However, such photons are well known in radioastronomy.
EXERCISE
Problem. Use the results of Exercise 2.5 to calculate the lifetime of the 2s
state of the hydrogen atom. The spins of the electron and proton couple as in
Exercise 2.5, i.e. to total spin 1. Assume that, because of the spin-dependent
interaction, the 2s state decays into the ground state by emitting one photon.
Later on we will demonstrate, in Exercise 2.10, that the two-photon decay
process is much faster.
~ e2 1i, k31(lsle-ik.rI2s)12
3 m2c2 0
We expand:
e -ik·r = l '- 1 1 .2(k ·r )2 + ...
k ·r+2"1
Since (lsI2s) = 0 the first term of the expansion does not contribute. The
second term leads to the matrix element
M = (lsi k . r 12s) .
The Is and 2s wavefunctions depend only on r. The scalar product yields
k . r = kr cos 19 as the z axis is chosen parallel to k. Thus the matrix element
is proportional to
2.2 Lifetime of an Excited State 47
as a consequence, the second term of the expansion does not contribute. There-
fore we start all over with
(lsle- ik .r I2s) ~ -~(lsl(k.r)212s).
The corresponding wavefunctions are given by
1 e -ria
--
1/Jls
va 3 7r
and
1/J2s = 1 ( 2 - - e -ria , r)
4v27ra 3 a
so that the matrix element now becomes
M (lsi e- ik .r I2s)
~ -~Jr2dr-l-e-rla(k.r)2
2 va 3 7r
x ~
4 27ra 3
(2 - .c)
a
e- rla sin'!? d'!? d<p.
1 2
T
e2 h
~ -3 ~ko
m c
3(1 rc;(koa) 2) 2
16v2
or
From
3 e2
lieko = -- a
8 a ' lie
it follows that ko = ~ ~ and
e
--
mc
J21f hc
-
L3 Wk
2
- ( afP'ekue
1 ik .XI)
A
ai
- -e
mc
J21fhc2 ( af 1P . eku e ik·x 1ai ) Vr;;:;-:-
-L3
Wk
A
nku ; (2.38)
compare this with (2.14). Here we have assumed again that only H{nt con-
tributes to this process in first order and that (1.56) determines the photon-
matrix element. According to (2.13a) the transition probability per unit time
for the absorption process is given by
(
trans. Prob.) 21f (~)2 (21flic 2 ) nku
time absorPtion- Ii mc L3 Wk
( trans. Prob.)
time absorption
)2 (211"nc I( aip·eku
I' . af )1 2
2
-211" ( - e - -) nku e- ikxl
n mc L3 Wk
X 8(Ea, + Wku - E af ) . (2.41)
r
A comparison with (2.16) for the emission process shows that the transition
probability for stimulated emission, which is the term in (2.16) proportional to
nku, is identical to the absorption probability per unit time (2.41). Observe that
lai),laf) in (2.16) are the same states as laf) and lai) in (2.41), respectively.
The final state (ground state of the atom) for emission is identical to the
initial state (again the ground state of the atom) for absorption.
~
We will calculate the cross section ai ..... f(ka) for the absorption of a photon ~
with momentum nk and polarization a. It is defined as the transition prob- ~
atom
ability per unit time for the absorption of a photon divided by the incoming Jphoton
photon flux jphoton: Fig. 2.8. Illustration of
nku the absorption process
jphoton = V c. (2.42)
This yields
(2.43)
The incoming radiation will possess a certain frequency spectrum. If we in-
tegrate over this spectrum, the 8 function appearing in (2.43) will disappear
and, instead, the intensity of the absorbed photons from the radiation will
appear in (2.43). Soon we will recognize that the atomic levels (states of the
particle system) are not sharp either; so far we have assumed the opposite,
which expresses itself in terms of the 8 function appearing in (2.43). The
atomic levels have a naturallinewidth; see Sect. 2.7.
50 2. Interaction of Electromagnetic Fields with Matter
EXERCISE
Solution. According to (2.41) the transition probability per unit time is given
by
h27r e )2 (27rlie)
( me L3 nkcr
I(q IAP . ckcr eikrl 2 2m
. Is )1 u, (li2q2 ~ E Is " .. ) ,
~ flJ..Uk
Wk
a(ka) =
L
k
-+ (2:)3 J q2 dqdS2e ,
da Exercise 2. '1.
dD (k, a)
x8 (Ii;~ - E 1s - ~k) .
Now
a a
oz cos 1J or '
e2
re
mc2 '
Wk ck,
!
and
£3 li2 1
- r e -k-2 q2 dq I(ql cos1J11s)1 2
21l' m a
li2q2 )
X 8 ( 2m - E 1s - ~ .
8 2;
21i": (q2 _ (E 1s +~))
2m 2 2
ti,28(q - qo)
for the 8 function, where q5 stands for q5 = (2m/li2) (Els + ~). Furthermore,
8(q 2 - qo)
2 1 [8(q - qo) + 8(q + qo) 1 .
= -2
qo
Because q, qo > 0, it is impossible for their sum, q + qo, to become zero; as a
consequence, 8(q + qo) = 0 and
li2q2 ) 2m 1
6 ( 2m - E 1s - ~ --6(q-qo) .
li2 2qo
It follows that
52 2. Interaction of Electromagnetic Fields with Matter
Exercise 2.7. da L3 2
ds:? = 21rreka2
J 2
q dq l(qlcos'l911s)1 2q0c5(q-qo)
2 1
or
da L3 qo 2
d He
n = - re -k 21(qOlcos'l911s)1
21r a
We expand the plane waves into spherical harmonics 7
+1
L L
00
cos'19, = ( 41r
3 )! Y (")
'19 , !P .
lO
d3 x
Y('I9', !p')
1
J
or
41r 00 +1
M ~ r 2 e- r / a dr ~m~I(-i)ljl(qOr)Yi::n('I9,!p)
J )
1
Y(4~)!
lO
1
we get Exercise 2. 7.
f(qoa) 1 00
x 2 dxh(qoax)e- X
2qoa
2 .
(1 + q3a 2 )
With the abbreviations re = e2/mc 2 and a = fi? Ime 2 the scattering cross
section finally takes on the form
dCT L3 qo 167r 2 cos 2 {) a3 f2( )
dst e 27r r e ka2 7r L3 qoa
EXAMPLE
u(w)dw = 1{),<p
1 w~dwk .
2nwknka- (
27r
)3--3-smfJdfJdcp
c
87rhw~ dWk
(27r)3cJ (exp (k:wT ) - 1)
hw 3 1
--z1
7rC
(exp (k~wT ) - 1) dWk . (1)
(2.45)
(2.46)
Transitions between these two states can only be caused by H{nt from (2.8).
We deduce that
UIH{ntl i ) = (qrl(···, nkO" - 1,···1 ( - ~J p. €kO"J ~:~ Fig. 2.10. Absorption and
emission of a photon by
x (akO" ei b: + akO" e- ik .X) Iqi)I···, nkO", ... )
a free electron is prohib-
ited. The photon is indi-
e J21rfic(
-- I'
- - qfP·€kO"e ikxl) ~
. qi ynkO" cated by a wavy line, the
me L3 wk
electron by a straight line.
_~J21rfic
me L3 wk
(-qf· €kO"h/nkO"
The interaction point is
called vertex
X
JL3
e -iQf·X.tk-x eiqi · X d3
L3/2 e L3/2 x. (2.49)
JmL3
e-iqf·X ei(qi+k).X
--
m d 3 x = 6qf ,qi+k . (2.50)
Pe +)
( P'"( = Pe' , ( Pe + P'"( ) 2 = 2
Pe' = 2
Pe .
Since p~ = m; and P; = 0 it follows that 0 + 2P'"(Pe + = m; m;, so that
P,"(Pe = O. In the rest frame of the electron the four-vectors become
Pe = (me,O),
Hence, it follows that
E'"( = 0;
which means that no photon is present. Therefore it is impossible for a free
electron to absorb a photon. We have to draw the conclusion that processes
of first order caused by H{nt do not exist. As a consequence we investigate
now the processes of first order with H{~t; see (2.9). This part of the interac-
tion contains terms of the form aLak' 0"" Obviously, they describe processes
in which a photon of the kind k' a' is absorbed and another photon kO' is
emitted. Such photon scattering is best illustrated with Feynman diagrams as
in Fig. 2.11.
The full lines describe the electron, whereas the wave-like lines represent
the photon. The graph is to be read from bottom to top in accordance with
the direction of time. The incoming photon kiO'i is absorbed (annihilated) by
the incoming electron. Another photon kWf is produced. In this annihilation-
creation process the electron scatters into the state qf. The initial and final
direction states for this process are
of time
Fig. 2.11. Sketch of a Ii)
(2.53)
process, where an electron If) Iqr) I· .. ,nkjO"j - 1, ... ,nkf(J"f + 1, ... ) .
in state Iqi) absorbs a pho-
ton kjO"j and emits another
photon kWf, so that the The transition from Ii) to If) is caused by the two terms with iL!Wfak;O"; in
electron scatters into state (2.9). According to (2.9) and (2.13a) the transition probability per unit time
Iqf) becomes
trans. prob.)
(
time
2)2 lekfO"f . ek;0";1 2
photon scattering
A At
x ( akiO"iakfO"fe
i(ki-kr)·x + akfO"fakiO"ie
At A i(ki-kr)'X)
( trans. Prob.)
time
2
photon scattering
X 1(qfle i(ki-kr)·x
Iqi)
12 nkiO"i (nkfO"f + 1)
X <5 (hwk + (hqi)Z _ hwk _ (hqr)2) (2.56)
'2m {2m
.
(q le,(ki-kr),xlq') =
J
The matrix element between the electron states is calculated to be
e-i(qf+kr)'X ei(qi+k.).X
d3 x---==-------::=--
f ' JV JV
= <5qf +k{,ki+qi' (2.57)
It expresses momentum conservation:
h(qf + kr) = h(qi + ki) . (2.58)
Energy conservation is contained in the <5 function of (2.56):
hw . + (hqi)2 = hw + (hqr)2 . (2.59)
k, 2m k{ 2m
The occurence of the factor nkfO"f + 1 in (2.56) is interpreted as stimulated
emission of photons kWf in the final state; apparently the scattering is en-
hanced if photons kflJ"f of the final state are already present. Often this is not
the case, so we set nk{O"f = 0.
2.5 Calculation
of the Total Photon Scattering Cross Section
To calculate the total photon scattering cross section IJ"totai, we sum (2.56) over
all the final states of the photon as well as the final states of the electrons; this
result is then divided by the incoming photon flux nkiO"c/ L3. With nkfO"f = 0,
this yields
58 2. Interaction of Electromagnetic Fields with Matter
O"total
x8 (nw . + (liqi)2
k,2m
_ nw _ (liqr)2)
2m k, . (2.60)
~ L~
""'3
-->oo
L3
(271")3
J d3kr ,
here cos'l9 = ki . kc / kikr represents the cosine of the angle between the incom-
ing and the scattered photon. Evidently this small shift is of pure quantum-
mechanical origin; it vanishes for the limiting case li -+ O. Hence, we pursue
the classical approximation and set Ai = Ar. As a consequence the 8 function
in (2.61) becomes
1
8(lic(ki - kr ))) = hc8(ki - kr). (2.63)
Here k = k i - kr ;::::: 0 has been employed for the energy of the electron:
li 2 li 2 li 2 li2 li2
-(qi + k)2 = _q2 + _ k 2 + -qi' k ;::::: -q~. (2.64)
2m 2m 1 2m m 2m 1
With (2.63) the total photon scattering cross section (2.61) becomes
O"total = ( ne4
me2
) ""'
6
0",
J k f2 dkf d Jtk,
n ICk""i'
Cif
k 12 ~
2kckw, '(k f - k·)
/00 U
ftC
1
(2.66)
If the photons are unpolarized, expression (2.65) has to be averaged over the
initial polarizations and has to be summed over the final polarizations to
obtain the total cross section. We get (see Fig. 2.13)
"21""""
L L ICkiO'i' ckfO'fl 2 "21[ ICkil' ckf11 2 + ICkil' Ckf 2 12
€kf 2
+ ICki2 . ckfll2 + ICki2' Ckf212] Fig. 2.13. Wavenumber
1 and polarization vectors of
"2 [1 +0+0 + cos 2 '!9] the absorbed (i) and emit-
1 ted (f) photons
"2 (1 + cos2 '!9) (2.67)
and perform the integration over dstf = sin '!9 d'!9 drp in (2.65). The outcome
is
EXERCISE
Problem. Show that for photon scattering a (small) shift in wavelength b.A =
Af - Ai = h(l- cos '!9) I mc occurs as a result of the nonrelativisitic conservation
laws for momentum and energy, (2.58) and (2.59). Assume that the electron
is initially at rest.
Solution. Since the electron is at rest initially we have qi = O. With lick = I1w
we derive from (2.58) and (2.59) that
n} 2
lick i lickf + ~ (1)
2m
and
ki = kf + qf· (2)
From the second equation we directly get
q; = (k i - kf)2 = k~ - 2ki . kf + k; .
Inserting this into the first equation gives
+ -1i( k i2 -
2k i k f cos '!9 + kf2) ,
2
lick i = lickf
2m
where '!9 represents the angle between k i and k f . With lik = hi A it follows
that
60 2. Interaction of Electromagnetic Fields with Matter
Exercise 2.9. h
c-
A
Multiplication with AA' results in
eh(A' _ A)
'---v---"
~
2m
(A'A +~)
N
_hm cos{) '
2
=~A
~A = l::...-
mc
[! (A' +~) -
2 A N
cos{)]
If we assume that the shift in wavelength stays small, i.e. A' ::::: A, then
A' A A,2 + A2 2A2
'I + N = AN ::::: --:\2 = 2
and
h
- (1 - cos{)) .
me
EXERCISE
Problem. Determine the lifetime of the 2s state of the hydrogen atom as-
suming a decay with the emission of two photons. It is necessary to combine
the matrix element of second order, iI' rv p. A, with the matrix element of
first order, iI" rv A2. Do not solve the problem exactly, only estimate the
order of magnitude.
where nek 1 and nek 2 represent the energies of the emitted photons. The sum
over the final states goes with a summation over the polarizations and an
integration over the momenta of the two photons,
L
final states
--+ (2:)3 L Jd3k1 (2£:)3 L Jd k
0"1 0"2
3 2
L (27r!ic ~ PA . E:k" (
e 2) A ..k·x At -.k·x Exercise 2.10.
fI' - mc L3 Wk ak" e + ak" e . )
k,"
and
fIll
X
A A
( ak"ak',,' e
i(k+k')·x + ak"ak,,,,e
A At i(k-k')·x
+ ak"ak'
At, i(-k+k')·x + At At i(-k-k').X)
,,' e ak"a k , ,,' e
and
fIll
Mr(.2) "
~
(f1iI'ln) (nIH'li)
Ei _ En
( d d )
secon or er .
In the latter term the (time) order of the creation of the two photons is
important:
11 s) lIs)
1
or
Fig. 2.14. Feynman dia-
12s) 12s) grams of second order
M -
-
~
2mc 2
(27r!ic
P
2
) 1
CVklk2
M'
.
62 2. Interaction of Electromagnetic Fields with Matter
Here el and e2 are the polarization vectors of the first and second photons,
respectively. We then get
where re = e2 /me 2
represents the classical electron radius. Since we are inter-
ested only in an order of magnitude estimation, we approximate the dimen-
sionless matrix element M' as
As a consequence we obtain
~
7
= r;e
4(27r)3
J J d 3k l
kl
d 3 k 2 8(k o - kl - k 2) .
k2
Because of spherical symmetry [f(k) = f(lkl)J the relation
holds, so that
1 =
-;. r;e
4(27rp167r 2 J J kl dk l k2 dk2 5 [(ko - kl ) - k2J .
1
7
2
r_e_
27r
e l0
ko
k1(ko-kddk 1
ko = ~Z2~
8 a
with a = e2 / lie ~ l~7' It then follows that
2.6 Cherenkov Radiation of a Schrodinger Electron 63
1 Exercise 2.10.
T
where the Bohr radius a = fj? /me 2 has been used. We obtain T = 0.114 Z-6 s.
Hence, we deduce for the hydrogen atom TH = 0.114s. Indeed, the two-photon
decay is much faster than the one-photon decay with T = 9.5 years (compare
with Exercise 2.6).
With an exact treatment of the matrix element M' we would derive
.!. = 8.226 Z6 s-1 .
T
The variable c:(w) is known as the dielectric constant and depends on the
frequency w of the electromagnetic wave. Ii- ~ 1 is called the magnetic per-
meability of the medium. n(w) ~ y'c:(w) represents the refractive index. The
frequency (w) dependence of these quantities indicates the dispersion of elec-
tromagnetic waves in the medium. In general the speed of light c' in the
medium is smaller than the speed of light c in vacuum, i.e. c' < c. For this
reason a particle can move with a velocity v > c' in the medium, which is be-
yond the speed of light (in a medium). We will convince ourselves that under
these circumstances a charged particle is able to emit and absorb photons.
The medium acts as an additional particle that absorbs the recoil and in this
way permits the conservation laws to be fulfilled.
The energy of the electromagnetic field is given by expression (1.40):
H rad = - Ii,
81l' L3
d 3 x(E 2 +B 2 ). (2.70)
64 2. Interaction of Electromagnetic Fields with Matter
EXAMPLE
~
47r
[S n
+ S(1)]
t'
(6)
where we have distinguished between tangential and normal components, i.e.
S(l) E(l) x H(l) (7)
2 n t t'
St(l) E(l) x H(l)
t n
+ E(l)
n
x H(l)
t·
(8)
The term
E(l) x H(l)
n n
= 0 (9)
vanishes. Similarly we arrive at the result
Fig. 2.15. At the inter-
face H?) H?) and S(2) = 4~ [Sn + si 2)] (10)
E~l) = E?) hold, i.e. the
tangential components of for side 2 ofthe interface. Obviously, because of the continuity of the tangential
E and H are continuous components, we have
10 We follow here arguments given by L.D. Landau, E.M. Lifshitz: Electrodynamics
of Continuous Media (Pergamon, Oxford 1960).
2.6 Cherenkov Radiation of a Schrodinger Electron 65
the normal component of the energy flux is continuous at the interface. This is
what one expects. We would not have obtained this result if we had used Ex B
or D x H or D x B in (1). We realize that (1) represents the correct general
expression for the energy flux density; moreover, in vacuum it coincides with
the already known result (1.18).
The change of energy per second in a normalized volume of the medium
is described by div S. Applying Maxwell's equations in a medium,
laD
rotH - - -
47r .
-3,
at c c
laB
rotE + - - 0,
c at (12)
divB 0,
divD 47rp,
Such a simple interpretation is not possible for media with dispersion. More-
over, dispersion present in a medium in general causes dissipation of energy; a
medium showing dispersion is absorbing. In order to calculate this dissipation
we consider a monochromatic electromagnetic wave and determine the aver-
age of expression (13) over time. Because (13) is quadratic in the real field
quantities E and H we have to be careful with a transition to complex-valued
fields and have to rewrite all expressions with the real forms. To avoid a mis-
understanding we now call the monochromatic fields E and H and denote
them in complex-valued form:
Example 2.11. Products like E . E and E* . E* vanish with time averaging, because they
contain factors e'F2iwt. We deduce that
~
47r
(E .
-
aD
at + -H . aB)
at = iw [(e* -
167r
e)E . E* + (11* -
r'
II.)H . H*]
r'
~
87r
(e"IEI 2 + Jl"IHI 2 )
Q. (17)
The quantity Q can be understood as a mean amount of heat produced per
normalized time and normalized volume; according to (13) it describes the
disappearing energy flux averaged over time and per unit time and volume
element. Furthermore, we can write Q also in the form
Q (18)
with
e" ~(e* - e) ,
Again, E and H stand for the real field quantities and the bar indicates a
time average over a period T = 27r / w.
Equation (18) is very important as it demonstrates that energy dissipation
(absorption) is governed by the imaginary parts of e and Jl. The two terms in
(18) are interpreted as electric and magnetic losses. The sign of Q must always
be positive because the second law of thermodynamics means the total entropy
of a closed system, which is a measure for the equipartitioning of energy, must
increase. The dissipation of energy results in production of energy. Therefore,
Q > 0 has to hold for all times. From this and from (18) we conclude that the
imaginary parts of e and Jl must always be positive:
e" > 0, Jl" > o. (19)
This is a very general statement. It is a result of the second law of thermody-
namics and is valid for all materials and frequencies. The real parts e' and Jl'
of e and Jl are given by
e +e*
e' =-- (20)
2
No restrictions exist for them. Depending on the particular physical circum-
stances, they could be either positive or negative.
For a real physical material, nonstationary processes are always thermo-
dynamically irreversible. Thus a variable electromagnetic field always suffers
from electric and magnetic losses in a real material. Often they remain small,
but they are always present. As a conclusion, the imaginary parts, e"(W) and
Jl"(w), do not become rigorously zero in any frequency domain. This finding
is of fundamental importance, as we will recognize in the following. Obviously
this does not exclude the existance of frequency domains for which e"(W) and
Jl"(w) take on very small numbers, so that only minor losses occur. Such fre-
quency domains are called domains of transparency of the material. For them
it makes sense to introduce the notion of the intrinsic energy U of a material
2.6 Cherenkov Radiation of a Schrodinger Electron 67
E Eo(t) e- iwot ,
(21)
H Ho(t) e- iwot ,
where Eo(t) and Ho(t) are slowly varying functions in time compared to
e- iwot .
To determine the intrinsic energy we have to calculate the right-hand side
of (13) for transparent materials. We insert the real parts of (21) into (13)
and obtain
Averaging this expression over time, i.e. with respect to the period T = 27r /wo,
. . *. .*
leads to the products E· D, E*· D ,H· B, and H* . B vanishing and results energy
in density
Compared to e- iwot , the amplitude Eo(t) changes only slowly with time; there-
fore we expect only those components with
a« Wo (28)
to appear in the Fourier integral. Then we can write for each single component
68 2. Interaction of Electromagnetic Fields with Matter
1 jEo(a) e-i(wo+<»t da
(33)
2.6 Cherenkov Radiation of a Schri:idinger Electron 69
We return to (2.70) for the energy of the free electromagnetic field. This
expression has to be modified for media with dispersion according to (37) of
the last exercise:
Hrad = J d 3 xu
J d3x ~
8~
(d (WE(W))
dw
E. E + d(WJl(w)) H· H)
dw
(2.71)
H rad -- Jd 3 X 81l'
1 (d(WE(W))E
dw . EBB)
+ . (2.72)
For the case of media with no frequency dependent dielectric constant E(W) =
E= const we deduce the expression
H rad = J 3
d x-
1
81l'
(EE. E + B· B) , (2.73)
J -E
L381l'
1
. Ed 3x = "'""
~ 81l'c2 k
k,a
2
Wk N2L3 [(Aaka aka
At At aka
+ aka A )
J 3 1
d x-EE·E
81l' '
becomes
(2.74)
Similarily we deduce from (1.45) that
(2.75)
Hrad =
(2.76)
We realize that terms not conserving the photon number [second sum in (2.76)]
vanish for a constant dielectric constant. With the refractive index, n 2 = E,
the remaining term becomes
A "'"" W~ 2N 2L3 [A At At A ]
H rad = ~ 41l'c 2 n k aka aka + a ka akl7 (2.77)
k.17
2.6 Cherenkov Radiation of a Schrodinger Electron 71
(2.78)
so that
(2.79)
This change in the normalization becomes noticeable for the expression for
the vector potential. From (1.37) we deduce with (2.78) that
A(x,t) = L (2.80)
k,a
iI!mt -- -~ ~
me~
(2.81 )
k,a
With this result we now determine the transition amplitude per unit time for
a free electron with momentum liq emitting a photon with momentum lik.
After the emission of the photon the electron has momentum Iiq' = Ii(q - k).
We find
,
x [ak'a' e ik'·x + ak'a'
,t e -ik'.X] I'f/;q ())
x I ... ,Oka,··· )
e
(2.83)
me
Obviously,
('f/;q'(X)lp·ckae-ik.xl'f/;q(x)) = liq·cka 6q',q-k, (2.84)
and we obtain the following preliminary result for (2.82):
( tran~. prob.) =
tIme q-->q-k
(2.85)
72 2. Interaction of Electromagnetic Fields with Matter
1 8( rJ nfu,,;k fu,,;k m )
vlik cos - 2mvc - mvli~n
x8 ( cos rJ - - c - -
nfu,,;k)
- (2.86)
nv 2mcv
The photon is emitted at an angle rJ with respect to the electron trajectory.
This Mach angle is given by
cos rJ = ~
nv
[1 + fu,,;n 22 ]
2mc
. (2.87)
If the photon energy is much smaller than the rest energy of the electron,
fu,,;« mc2 (mc 2 ~ O.5MeV), we obtain approximately
c c'
cosrJ = - = -. (2.88)
nv v
This is the classical Mach angle (see Fig. 2.17), which in this context is also
named after its Russian discoverer as the Cherenkov angle. Equation (2.88)
can be fulfilled only if the particle velocity v is larger than the wave velocity
c' = cln, i.e. v > cln. For a vacuum, n = 1 and v can never become larger
than c. Therefore a photon emission from a free particle is not allowed in
vacuum; see again the discussion about photon absorption after (2.52).
In the following we are interested in the loss of energy of the electron per
unit length along its trajectory (see Fig. 2.18). It is given by
We calculate
a a
q2 sin 2 fJ
q2(1 - cos 2 fJ)
m 2v 2
Ji2 (1 - cos 2 fJ)
and use (2.18), which can be expressed in spherical coordinates in k space: Fig. 2.18. Illustration for
L
k
----> (£3)3
271'
roo k
io
2 dk 11
-1
d( cos fJ) r
io
27r
drp. (2.90)
the energy loss of a parti-
cle in a medium along the
trajectory x
We then derive
dW
dx
~/k
(2.91 ) ek2~ ~
\...ek 1
In the last step we have not been careful enough; as it is, the integral would
Fig. 2.19. Illustration of
diverge. But this is not the case because, as a result of (2.87), cos fJ can only the polarization vectors
range between -1 and 1 and thus limits the photon energy to relative to the particle mo-
nw < (nv/c - 1) 2mc 2 (2.92)
mentum hq = mv
- n2
The integration domain in (2.91) does not reach out to infinity, only to the
limiting energy (2.92).
EXERCISE
cosfJ = -c
nv
[nw
1 + __ (n 2 -1)
2mc
2
g2]
1--
c2
if the relativistic instead of the nonrelativistic expression for the particle en-
ergy is applied.
74 2. Interaction of Electromagnetic Fields with Matter
cos'I'J = -
vn
c[
1 + -n-w1-
2mc2
g2_(n 2 -1)
c2
1
r*IIEui
a (natural) linewidth because the initial state has only a finite lifetime as a
result of its decay. According to Heisenberg's uncertainty relation the energy
uncertainty flE is related to the lifetime T by flE ~ n/T. Up to now we
have considered only sharp spectral lines. This is reflected in the 6 function
6(Ea, - E a, -nwk) appearing in the expression for the transition probability
(2.16). Evidently something must be wrong in our considerations employed
E Uf so far. In fact the mistake comes from perturbation theory as we have taken
it over from the Introduction to Quantum Mechanics. l l Now we will modify
Fig. 2.20. A finite life- this formalism and include a finite linewidth. 12
time means a finite level Hence, we again consider the emission of a photon from an atom. Initially,
width r and a finite spec-
tralline width result from the atom is in state lai) and, to keep things simple, is only allowed to decay
the uncertainty relation into one final state laf). As previously no photon should be present in the
~t~E ~ n initial state and only one in the final state. We write
(2.94)
(2.95)
Here I¢n) represent the solutions of the unperturbed system, which consists
of the atomic many particle system and the radiation field Ho = Hmp + Hrad
[compare with (2.4)]:
(2.96)
Inserting (2.94) into (2.95) and considering (2.96) leads to a system of coupled
differential equations:
dcm(t)
dt
_
-
_ ~" ()(r!. IH'. Ir!.)
h L..t Cn t 'I'm tnt 'l'n exp
(i(Em -h En)t) . (2.97)
n
We denote the amplitude of state Ii) with Cio and those of states If) with
Cfka. The indices indicate that Ii) has no photon and If) has one photon of
the kind ka. Then (2.97) read
Be aware that we consider only the two atomic states Ii) and If). On the
right-hand side of the first equation all states laf)lka) are considered, into
which the initial state lai)IO) can decay. On the right-hand side of the second
equation only the term proportional to Cio is taken into account; in principle
also matrix elements with two-photon states la f) I ... , 2ka, ... ) appear, but the
amplitudes of these states stay negligibly small. For the amplitudes CiO of the
decaying state lai) we make the ansatz
The energy shift ~Ei remains unknown for the moment and still has to be
determined. ~Ei can be a real, imaginary, or complex-valued quantity. We
insert (2.99) into the second equation of (2.98) and deduce
So far we have not used the first equation of (2.98) at all. Also, we did not de-
termine !:I.Ei . We choose !:I.Ei such that the first equation of (2.98) is fulfilled.
We insert (2.99) into the left side ofthe first equation of (2.98), substitute the
amplitudes (2.100) into the right side, and get
- ~!:I.g
It 1
exp (-~!:I.gt)
It 1
- ~ L(iOIHintlfkO") (fkO"IHintliO)
ku
i '"
- ~ L.J
I( fkO" IHint I'20 )1 21 -
A exp [*(Ei + !:I.Ei - Er - nwk)t]
E!:I.E E nw
n ku i + i - r- k
(2.101)
Hence, the energy shift !:lEi is complex valued. The real and imaginary parts
can be read off right away from (2.103):
" 1UkaIH{ntl iO ) 12
(2.104a)
~ Ei - Er - nwk '
k,IT
-1r L 1 UkalH{nt liO) 126(Ei - Er - nwk)' (2.104b)
k,IT
The sum L:kIT is over all photons with all possible momenta nk and polar-
izations a. It is straightforward to see that a consideration of several atomic
states lar) leads to an additional summation L:f for the right-hand side of
the first equation of (2.98); hence, the result (2.104) modifies to
or
(2.108)
The state lai) decays with lifetime T. On the same footing the state obtains a
level shift ~(~E), which results from the emission and reabsorption of (vir-
tual) photons (see Fig. 2.21).
Another consequence from the decay of state lai) becomes clear by looking
at (2.100). Inserting
i
~Ei = ~(~Ei) - 2'ifi
yields
lim
t-+oo
ICfk,,12 = l(Jk(JIH{ntliO)12
(2.109)
Of course this reflects the intensity distribution of the emitted line: The spec-
tralline has a Breit- Wigner distribution with center fiw k = Ei + ~(~Ei) - E f
and width fi'ii see Fig. 2.23. Because of the self-energy of the electron, emis-
lIw
sion and reabsorption of photons results in the spectral line being shifted by
Fig. 2.23. Breit-Wigner ~(~Ei)' In Sect. 5 we will come back to the determination and the necessary
form of the spectral line renormalization of the self-energy in more detail.
EXERCISE
iot e
ixt
lim (1 - e ) = lim (-i ixt ' dtl) (2)
t---+= X t---+=
holds because
t
i
-i 0 eixt'dt ' = -i - elxt
1 . , [t' =t
ix t'=O
1
= - - ~.
x
ixt
x
(3)
lim lim (i
t---+CXJ c---+O+ iorot ei(x+io:)t' dt') = - lim lim (i
0: ..... 0+ t ..... oo
t ei(x+io:)t' dtl)
io
_ lim (i roo ei(x+io:)t' dtl) . (5)
0: ..... 0+ io
The last step is possible because the integral is defined as long as c: > 0 and
also remains finite for t ...... 00.
This integration can be performed:
r
io
00
ei(x+io)t'dt ' =
i(x
1
+ ic:)
ei(x+io:)t'
[tl=OO
t'=o
(6)
Since this integral also converges at the upper bound because of the conver-
gence factor exp( -c:t'), i.e. exp(i{x + ic:)t')lt'== = 0, we deduce that
- lim (i
0:---+0+ io
r= ei(x+io:)t' dt') =
o:~rg+ (x ~ ic: )
o:~rg+ ex + 7c:)(~C:- iC:))
o:~rg+ (x2: c: 2 - i x2 : c: 2) (7)
The imaginary part represents a well-known representation of the r5 function:
lim
0---+0+
(~)
x + c:
= 7l'r5(x). (8)
lim (11 -
00
- -c:- d x)
x 2 + c: 2
(1 1
0---+0+ 7l' -00
r 00
d(x/c:) )
1 + (X/c:)2
(11
o:.:...rg+ ;;: -00
00
lim - -dy- )
0---+0+ 7l' -00 1 + y2
1, (9)
80 2. Interaction of Electromagnetic Fields with Matter
Exercise 2.13.
I: 6(x)f(x) dx = lim
0->0+
[~1°°
1r -00 x2
C
+ c2
x (f(O) + xl' (0) + ~2 !" (0) + ... ) dX]
1 +
£ ...... 0+ 1r -(X) X
lim
£ ...... 0+
(- -+x)
-
c
:Z;2 2
- P
-
(1)
-
X
(11)
Hence, we obtain
(13)
3. Noninteracting Fields
So far we have learnt that the classical radiation field takes on particle prop-
erties (quanta with energy flw and momentum lik) as soon as it has been
quantized. Thus, we might suspect that all wave fields show particle char-
acter as soon as they have been quantized and that, on the other side, all
particles appearing in nature could be understood as quanta of a field. With
certain particles, such as electrons, the question arises, what is the wave field
that contains these particles as quanta? We presume that it might be the
wavefunction 'lj;(x, t) and begin with the nonrelativistic Schrodinger equation
fJ'lj;(x t) 1i2
fJ ' = _ _ V2'lj;(X, t) + V(x)'lj;(x, t) = H'lj;(x, t) (3.1)
A
iii
t 2m
for a particle with mass m in a potential V(x). Equation (3.1) is the fun-
damental equation of elementary quantum mechanics. As with the Maxwell
equations representing the field equations for the electromagnetic fields (com-
bined in the vector potential A and Coulomb potential ¢), we consider the
Schrodinger equation to be a field equation for the wave field 'lj;(x, t). Now we
quantize Schrodinger's wave field 'lj;(x, t) in complete analogy to the quantiza-
tion (1.50) of the vector potential (and with that of the electric and magnetic
fields) in the el~ctromagnetic case. We assume {'lj;n(x)} to be a complete set
of solutions of H,
(3.2)
so that we can write the most general solution of (3.1) as
(3.3)
n
This represents the equations of motion for the normal coordinates. It is called
first (i.e. the usual) quantization. In contrast to that formulation we will now
learn how to introduce creation and annihilation operators for field quanta
(which may be particles, photons, and so on). The formulation of quantum
mechanics in terms of such creation and annihilation operators is called sec-
ond quantization. To achieve this, our goal is now to find a Hamiltonian depen-
ding only on the normal coordinates bn , introduced above, and yielding the
equations of motion (3.4). It could be that H is given by
H ! d3 x1/;*(x,t)H1jJ(x,t)
(3.6)
n
This resembles a Hamiltonian consisting of an infinite sum of harmonic oscil-
lators with energies En and frequencies Wn = En/n. Hence, we interpret
At
b~ -> bn ,
bn -> bn (3.7)
as creation and annhilation operators of these oscillator quanta and require
the commutation relations
(3.8)
(3.10a)
or
."dbn
In-
dt
n'
(3.10b)
3. Noninteracting Fields 83
1~
- - b b ···b 10)
VNm! m m m
_l_(b t )Nm 10) (3.11)
VNm! m ,
in which N particles of the kind m appear with the same wavefunction 'lj;m(x).
The number operator fin for particles of the kind "n" is
A At A
N n = bnbn , (3.12)
and the general state vector
I ... , nn, ... , nn', ... ) (3.13)
of the 'Ij; field is, as previously [compare with (1.54)], a direct product of the
state vectors of the field oscillators Inn)
I···,nn,···,nn"···) = ···,lnn)···lnn')···. (3.14)
Here
Nnl···,nn,o .. ,nn"···) == nnl···,nn,···,nn"···) (3.15)
represents the eigenvalue equation for the particle-number operator.
The formalism developed so far holds for bosons, because, as we have
realized, it allows several particles to have identical wavefunctions. Now, we
attempt to modify the formalism, so that it can also describe fermions. We
keep
(3.16)
[A,BL = AB+BA.
1 Those go back to P. Jordan, E. Wigner: Z. Physik 47 (1928) 631.
84 3. Noninteracting Fields
(3.17)
and recognize that we have derived the same equations of motions as previ-
ously in (3.1Ob). Notice, however, that the anticommutation relations (3.16)
do not possess the same algebraic properties as the commutation relations do.
The commutation relations (3.8) possess the same algebra as the classical
Poisson brackets
~(BABB _ BBBA)
A B
{ ,}q,p
=
8. Bqi BPi Bqi BPi .
(3.18)
{A,B} -{B,A}
{A,e} 0, if c is a number,
{(AI + A 2 ), B} (3.20)
{AIA2' B} + Al {A2' B} ,
{AI, B}A2
{A, {B, e}} + {B, {e, A}} + {e, {A, B}} = 0 (Jakobi identity) .
Here we have paid attention to the ordering of the various quantities to in-
clude possibly noncommuting quantities. We will demonstrate in Exercise 3.1
that the commutators (3.8) do indeed fulfill the equations analogous to (3.20),
whereas the anticommutators (3.16) do not fulfill these equations. Hence,
we conclude that the anticommutators do not have a classical analogy. This
should not lead to a misunderstanding: the number operator N = L:n Nn
and the Hamiltonian iI = L:n EnNn have classical limits because both are
bilinear in b~ and bn and commute with each other. This shows that the anti-
commutation relations represent something typically new, which appears only
in quantum mechanics. We will see soon that they incorporate the Pauli prin-
ciple, which does not exist on a classical level. These conclusions are supported
by the following physical observation. To become measurable classically a field
amplitude has to be very strong; it must be possible to have a large number
of particles in the same state so that their fields can add up coherently (see
Sect. 1.5 on coherent states). This implies that particles that give rise to clas-
sical fields have to obey Bose-Einstein statistics. We conclude, for example,
that light quanta have to be Bose particles because strong electromagnetic
fields can be produced and measured classically. For the case of electrons in
a metal, which obey Fermi~Dirac statistics, only quantities like the energy,
charge, or current density are measurable classically; these quantities are bilin-
ear combinations of the field amplitudes (operators b+ and b). The amplitude
of the electron field alone is linear only in b+ or b and cannot be measured
classically.
EXERCISE
Problem. Show that the commutation relations (3.8) but not the anticom-
mutation relations (3.16) fulfill the algebra formulated in (3.20).
(3.21 )
The term b~b~bnbn vanishes because (3.16) means it is antisymmetric with
respect to the exchange of the first two (or the last two) operators. From
(3.16) it follows that
~t ~t_ _ ~
bnbn - 0 - bnb n .
A
Nnln n )
3. Noninteracting Fields 87
EXERCISE
where s determines the degree of the parastatistics. Here the operators &i(a)
obey the standard commutation relations
2 H.S. Green: Phys. Rev. 90 (1953) 270.
3 O.W. Greenberg, A.M.L. Messiah: Phys. Rev. 138 (1965) B1155.
4 Y. Ohnuki, S. Kamefuchi: Phys. Rev. 170 (1968) 1279; Ann. Phys. NY 51 (1969)
337; Ann. Phys. NY 78 (1973) 64; Prog. Theor. Phys. 50 (1973) 258.
5 See for example H.J. Fink, W. Scheid, W. Greiner: J. Phys. G 3 (1977) 1119.
88 3. Noninteracting Fields
tt
Solution. First we show for the para-Bose case that
0,(3
s
28'j Lada)
a=1
and consequently
[iii,{iij,iik}] = 28 ij iik.
An analogous proof holds for the para-Fermi statistics
3. Noninteracting Fields 89
EXERCISE
•
SOjk - L (aj(a)al(a) - al(a)aj(a)
0<=1
•
SOjk - L {{aj(a), al(a)} - 2al(a)aj(a)}
0<=1
• •
SOjk - L Ojk +2 L al(a)aj(a) ,
0<=1 0<=1
'-v---'
.liik
i.e.
L al(a)aj(a)
8
ejk 2
0<=1
for para-Fermi statistics.
The result for para-Bose statistics is the same. From
aj(a)IO) = 0
it follows that
ejklO) = O.
Furthermore,
(olal(a) (ak(a)IO))t
0,
3. Noninteracting Fields 91
EXERCISE
Problem. With the results derived in Exercises 3.2 and 3.3 discuss the oc-
cupation numbers of the para-Fermi states:
XN = (al)NIO);
the degree of the para-Fermi statistics is given by s. Hint: Examine the norm
of the state!
Solution. We suppress the quantum number k because it does not turn out
to be necessary. The norm of the state XN is given by
i.e.
IlxN11 = (Ola N(at)NIO) .
First of all, we consider the simple cases with N = 1,2, .... Furthermore, it is
convenient to pull a factor out of the matrix element:
IlxN11 =
s
~ (Ola N(at)NIO) .
For N = 1 we get
IlxI11 = ~(OlaatIO).
s
We employ the operator G:
[a,a t ] = -G+s,
i.e.
IlxI11 = ~(Olata
s
- G + slO) .
Since 0,10) = GIO) = 0 and (010) = 1 it follows that
1
IlxI11 = -(OlsIO) = (010) = 1.
s
Hence, we deduce
IlxI11 = 1.
The N = 2 case yields
IIx211 ~(OlaaatatIO)
s s
12 (Ola(ata - G + s)atIO)
12 (OlaataatIO) - 12 (0 IaGa t 10) + ~ (OlaatIO)
s s s~
=1
92 3. Noninteracting Fields
For N = 3 we find
IIx311 13 (OlaaaatatatIO) = 13 (Olaa lata - C + s] atatlO)
8 8
~(OlaaataatatIO)
83
- ~(OlaaCatatIO)
S3
+ ~(OlaaatatIO)
2 8
~
= IIx211
~(OlaaatataaIO)
S3
- ~(OlaaatcatIO)
s3
+ 12 (OlaaatatIO) - 13 (OlaCaatatIO)
8 s
~
= IIx211
2
-3(0IaaatatI0) + IIx211
8
~(Olaaatat
3
lata - C + 8] 10) - ~(OlaaatCatIO)
3
8 8
~(OlaaatatataIO)
83
- ~(OlaaatatcIO)
s3
311x211- 13 (Olaaa t
s
[a t6 + 2a t] 10) - 2~llx211
s
Exercise 3.4.
(3.26)
where nn = 0 or 1 because of (3.24). We examine the vector
At
bnlnn)
and apply the operator Nn = btbn to it. This gives
(3.27)
This reveals that btlnn) represents an eigenvector of btbn with eigenvalue
1 - nn. Then we note
(3.28)
94 3. Noninteracting Fields
bnlnn ) - bnNnlnn )
(1 - nn)b n Inn) . (3.31)
Obviously bnln n ) is also an eigenvector of N n with eigenvalue 1 - nn, which
means
bnlnn ) = Dnl1 - nn) (3.32)
with
Thus
D n -- e io~ ynn
~ =
- 8'nynn,
~
(a) bosons
EXERCISE
Problem. Derive the boson commutation relation (3.8) from relations (3.36)
for the boson creation and annhilation operators.
Solution.
(a)
(b)
At At At At
( bn bn' - bn' bn ) I· .. n n ... n n , ... )
For the case of fermions the choice of phase factor is more complicated.
We select en = eia " in such a way that fermion commutation relations
't't , + bn,b
bnb 't 't 0,
n n
(3.39)
bnbn, + bn,bn o
follow from (3.37) with an arbitrary state I··· nn'" nn' ... ). Here it is an ex-
cellent excercise to convince ourselves that relations (3.39) do not follow from
(3.37) for an arbitrary but fixed choice of phase en = eia ". We immediately
calculate that
(bnbn, + bn,bn)I'" nn'" nn"")
=; enen, (Fnyfn;; + yfn;;Fn) I··· nn'" nn' ... )
i=- o. (3.40)
Here it was assumed that en depends only on nn and not on the remaining
occupation numbers. Evidently this strategy leads to a contradiction with
(3.39). In order to remove it we proceed in the following way: We order the
states Inn) of the system in an arbitrary but then fixed manner; for example,
(3.41 )
The operation of bk or bl
on these states is then given by (3.37), where we
choose the phase ek = + 1 or -1 depending on the number of preceding
occupied states to be even or odd. In other words: If an even number of states
nv = 1 is to be found within the states reaching from InI) to Ink-I), then we
set
for even L:~:~ nv. For the case when the number of occupied states within
InI) to Ink) is odd, we fix the phase ek = -1; hence,
3. Noninteracting Fields 97
bklnl' .. nk"')
(3.42)
At
bklnl ... nk"')
The difference between the phase choice (3.42) and the choice (3.40) attempted
before is that now the phase fh depends on the occupation numbers of the
preceding states Ink)' In (3.40) 8 k depended on the index k only. This choice of
phase builds on the construction of the state Inl ... nk ... ) out of the creation
operators bt:
The operation
bk Inl ... nk ... ) = bk (bitt (b~t2 ... (bttk ... 10)
and explains naturally the choice made for the phase factors.
EXERCISE
Problem. Show that the phase choice (3.42) is consistent with the commu-
tation relations (3.39) for an arbitrary state Inl ... nk ... ).
Solution. We compute
b1bklnl ... nk'" nl ... ) (1)
and
(2)
where Inl ... nk ... nl ... ) represents an arbitrary state. With no loss of gener-
ality we assume 1 > k for the ordering of the operators. So that the operations
(1) and (2) are not identical to zero from the very beginning, nk and nl must
98 3. Noninteracting Fields
Exercise 3.6. both be equal to 1 for the state Inl ... nk··· nl ... ). The operation (1) then
yields
hkhzinl ... nk··· nl···) = 8l8kfokfollnl··· nk - 1··· nl - 1···). (3)
It empties first state I and then state k. The operation
hlhklnl···nk···nl··-) = 8;8~fokfollnl···nk-1···nl-1···) (4)
empties first state k and then state I. Thus
~k-l
8~ = 8k = (-l)L..v nv
remains unaltered. The phase 8f = -8l differs exactly by the factor -1 from
8 l because state k has been emptied by hk before, so n~ nk - 1 = o.
Obviously the result is
(5)
or
(6)
This is exactly the second relation in (3.39). The first relation in (3.39) can
be deduced in a completely analogous manner. The same holds for
A At T
bkbl + blbk = 81k ·
Since the ket vectors Inl ... nk···) represent all possible states of the many-
particle problem, they form a complete set; consequently, the anticommuta-
tion relations (3.16) follow as operator equations from (3.42).
We have become acquainted with two possibilities for the quantization, the
one with commutators (3.8) and the other with anticommutators (3.16). With
a Hamiltonian of the form (3.9) both possibilities lead to equations of motion
[(3.lOb) and (3.17)] for the operators, which are formally identical to the
"classical equations of motion" (3.4). This reflects a criterion to accept both
quantization procedures as possible quantization schemes. Immediately the
question has to be raised as to when to quantize with commutators and when
to quantize with anticommutators. Put in another way, when does Bose-
Einstein statistics and when does Fermi-Dirac statistics apply? When should
the Pauli principle hold and when not?
An answer exists in the general form of a theorem from Pauli: 6 Parti-
cles with integer spin have to obey Bose-Einstein statistics and particles with
half-integer spin have to obey Fermi-Dirac statistics. We presume that the
spin quantum number is already contained in the quantum number n of the
wavefunction 1/Jn for fermions.
6 W. Pauli: Phys. Rev. 58 (1940) 716.
3.2 Relationship Between Second Quantization and Quantum Mechanics 99
Later in this series on theoretical physics we will discuss and explain this
theorem in much more detail. 7 For the moment we only indicate the deeper
roots: the energy spectra of Hamiltonians have to be bounded from below (i.e.
have to be essentially positive) and the validity of microcausality has to be
claimed.
Remark. We assume the existence of spin-1/2 particles to be fundamental;
all other particles with different spins (0,1,3/2, ... ) can be thought of as being
built up of spin-1/2 particles. Under this aspect it seems to be plausible that
spin-1/2 particles have to obey the "more fundamental" Fermi-Dirac quanti-
zation. Then all particles with integer spin have to be Bose particles because
they are build up of an even number of Fermi particles. Hence, for example
the exchange of two spin-1 particles would correspond to an exchange of two
pair's of spin-l/2 particles. If the Bose quantization held for the elementary
spin-1/2 particles themselves, then consequently all particles would be Bose
particles. Thus, we have to demonstrate that in any case spin-1/2 particles
have to obey Fermi-Dirac statistics.
the field 1jJ(x, t) of (3.3) becomes a field operator ~(x, t). We write for the
field operator
(3.43a)
n n
for the Hermitean conjugate field operator we get
'¢t(x,t) = L,)~7jJ~(x) = L,)~(nlx). (3.43b)
n n
Here also the bracket notation for the states 7jJn(x) = (xln), 7jJ~(x) = (nix)
has been indicated. We shall use it from time to time when it seems suitable.
Obviously, the field operator '¢(x, t) annihilates a particle at position x and
at time t; it is a linear combination of annihilation operators bn which de-
pends on position (x) and time (t). Similarily '¢ t (x, t) has to be interpreted
as an operator creating a particle at position x and time t. The commutation
relations for the field operators '¢( x, t) and '¢t (x, t) can be derived from those
A At
between bn and bn ,:
n n'
n
5(x - x'). (3.44)
Here (3.8) and (3.16) and, for the last step, the completeness relation of the
functions 7jJn(x) have been used. Analogously we find
J ~ b~1P~(x)
d3 x [- : : V2 + V] ~ bn,7jJn' (x)
9 W. Greiner, J. Reinhardt: Field Quantization (Springer, Berlin, Heidelberg 1996).
3.2 Relationship Between Second Quantization and Quantum Mechanics 101
L Jd3x1jJ~(x)En,1jJn'(x)
b~bn'
L En,b~bn' Jd3x1jJ~(X)1jJn'(X)
n,n'
n,n'
(3.46b)
n,n' n
- T at 1jJ(x, t) = ['
no, 1jJ(x, t), H'] _ ' (3.47)
[~(x,t),iIL = J
[~(X,t), d3x'~t(x',t) [-2n: Vt2 + V(X')] ~(X"t)L
= J d3 x' {~(x, t)~t (x', t) [- 2n: V,2 + V(x')] ~(x', t)
-~t(x',t) [-;: V'2+V(X')] ~(X"t)~(x,t)}.
(3.48)
Employing the commutation relation (3.44),
~(x, t)~t (x', t) = o(x - x') =t= ~t (x', t)~(x, t) ,
we transform the second to last term in (3.48) into
We realize that in any case the second term is positive (and independent of the
statistics) and cancels the last term in (3.48). Thus, independent of the choice
of commutators or anticommutators for the quantization (Bose-Einstein or
Fermi-Dirac statistics), we get for the commutator
f
n n'
L: b~bn' d3x1/J~(x)1/Jn'(x)
n,n'
"" At A -_ "
~bnbn " A
~Nn. (3.52)
n n
dN
ill = -Iii [AN, HA] _ = 0; (3.53)
see the following exercise. The operator of the total particle number is a
constant in time; this is a very satisfying result.
EXERCISE
[N,iT] = 0 Exercise 3. 7.
with
iT = J x;j;t
d3 (x, t) [- ; : V2 + v(X)] ;j;(x, t) .
It holds
where we have left out the argument t. With the help of the commutation
relations
;j;(x');j;t(x) = o(x - x') ± ;j;t(x);j;(x')
NiI is transformed into
X
h2
[ - 2m V2 + V(x) ] ;j;(x)
it follows that
NfI = J d3x~t(x')
d 3 x' [- :~ V2 + V(X)] ~(x)8(x - x')
-J d3x~t(x)
d 3 x' [- :~ V2 + V(X)] ~(x')8(x - x')
+ J d3x~t(x)
d 3 x' [- :~ V2 + v(x)] ~(x)~t(x')~(x')
fIN,
i.e.
The ket vector 10) designates the vacuum state; it shall represent the state
without any real particles,lO so that
bnlO) = 0
for any bn . Because of (3.43) we immediately realize that
(3.54)
n
This result confirms that the vacuum does not contain particles. Thus, it
becomes clear that the vacuum expectation value of the field operator vanishes:
(Ol~(x, t)IO) = o. (3.55)
Hence, we expect that
(3.56)
describes a state where a particle stays at position x. To convince ourselves
we first calculate the action of the particle density operator n(x, t) of (3.50)
on this state and derive
n(x', t)~t (x, t)IO) ~t (x', t)~(x', t)~t (x, t) 10)
~t(x',t) [8(x-x')±~t(x,t)~(x',t)] 10)
8(x - x')1jJt (x, t)IO) . (3.57)
The last term of the second to last step vanishes because of (3.54). We realize
that ~t (x, t)IO) represents an eigenvector of the particle-number density oper-
ator with eigenvalue 8(x' - x). The eigenvalue is zero everywhere except at
x' = x. At this point (x' = x) the particle density becomes so large that an
integration over the vicinity of this position yields 1. Then the validity of the
following relation becomes clear:
J d3x' n(x', t)~t (x, t)IO) = J d3x' c5(x' - x)~t (x, t)IO)
= ~t(x, t)IO) ,
which shows that also ~t (x, t) 10) is an eigenvector to the total particle-number
operator with eigenvalue 1. The interpretation of ~t(x, t)IO) as a one-particle
state has been justified! Similarily,
~t(XI' t)~t (X2' t)IO) (3.59)
represents a two-particle state with a particle at Xl and one at X2. Many-
particle states can be constructed in an analogous fashion. We come back to
the one-particle state (3.56) and construct a superposition of such states,
(3.60)
where the function Xl (x) is an ordinary function and should not be interpreted
as an operator! According to the rules of quantum mechanics we interpret
(3.61 )
as the probability of finding a particle within the volume d 3 x in state
~t(x, t)IO). Because of (3.57) this probability is identical to the probability
of finding a particle in d 3 X at all. Evidently IX I (x, t Wd 3 x plays the role of
the probability density, which in conventional quantum mechanics is given by
11jJ(x, tWo Now we will construct XI(X) in such a way that lXI, t) becomes an
eigenvector to fI with eigenvalue E. This leads to the eigenvalue equation
(3.62)
or
J x [c5(x-x')±~t(x',t)~(x,t)]
d3 x'XI(x') 10)
J J ~t(x,
d3 x d 3 x' +
t) [- : : V2 V(X)] 8(x - x')xI(x')IO)
f d3Xd3x~···d3x~~t(x,t) h2 V2+V(X) ]
[ -2m ~(x,t)
x Xn(X~,···,x~)~t(x~,t).··~t(x~,t)IO)
f d3Xd3x~ ···d3x~~t(x,t)[···JXn(X~'···'X~)
x [O(x - xD ± ~t (X~, t)~(x, t)] ~t (x;, t) ... ~t (X~, t)IO)
3.2 Relationship Between Second Quantization and Quantum Mechanics 107
±,(f;t(X~, t) [O(X - x;) ± ,(f;t(X~, t),(f;(X, t)] ,(f;t(X~, t)··· ,(f;t(X~, t)IO)}
Jd3xd3x~ ... d3x~,(f;t(X, t) [... j Xn(X~"" ,X~)
X { l)±l)i-lO(X - xD,(f;t (X~, t)
,
X ... ,(f; t (x~_l , t),(f; t (X~+ 1, t) ... ,(f; t (X~, t) 10) }
This has to hold for arbitrary states ~t (x~, t) ... ~t (x n , t) 10); hence, we deduce
We examine now the Hamiltonian of two (or more) free particle fields and
their mutual interaction. It appears to be obvious to simply add the Hamil-
tonians of the free fields and to introduce their interaction suitably. We have
already come across the interaction between charged particles and photons.
In the following we elaborate on how this well-known system is completely
described from the field-theoretical point of view, i.e. in the language of sec-
ond quantization. With the introduction of creation and annhilation operators
aL., aka for photons the radiation field itself has already been treated field-
theoretically. But we now also want to express the particle processes with
particle creation and annihilation operators b~ and bn . The noninteracting
particle field will be described with Hamiltonian (3.46) and the photon fields
through Hamiltonians (1.40) or (1.52). The interaction (2.6) between the two
fields is introduced via minimal coupling:
, ,e ,
p ---7 p - - A .
c
We insert this replacement directly into (3.46) and obtain the total Hamilto-
nian by addition of the mentioned terms:
3 8~ (E2 + 13 2) =
H rad = J d x L( Wk + ~) ata ka . (4.3)
k,cr
The Hamiltonian for the interaction between the particle field and the radia-
, J
tion field is given by
and for the particle field operator the expansion (3.43) in terms of particle
creation and annihilation operators,
¢(x, t)
n
n
we obtain the result
Hint H'int + HI!int' (4.5)
where
and
H[~t L L L L b~ bn,€k 1 ,<71 • €k2,<72
k1,Ul k 1 ,a2 n n'
x [M(kl,k2,n,n')akl,<71ak2,<72
with eigenvalues
L EnNn
n
+Lnwk (nkCT +~)
k,cr
(4.11)
The interaction (4.5), more explicitly (4.6) and (4.7), causes transitions be-
Fig. 4.1. Scattering of a
tween states (4.10). For example, the term proportional to b~bn,akCT annihi- particle from In') to In) by
lates a photon with momentum nk and polarization (1, annihilates a particle absorption of a photon kO"
in state In'), and at the same time creates a particle in state In). We say that
through the annihilation of a photon of the kind k(1, a particle is scattered
from state In') to state In). The matrix element EkCT' M(k,(1,n,n') in (4.6)
represents the amplitude for this process. Analogously, the term proportional
to b~bn,at describes the emission of photon k(1 with simultaneous scattering
of the particle from In') to In). In the same spirit the terms contained in H{~t
can be interpreted. We illustrate these processes in Figs. 4.1-4.5.
Often the interactions (4.6) and (4.7) are written in abbreviated form:
HA ,
This notion is indeed correct. For instance we consider the Hermitean conju-
gated (H.c.) part of the first expression (4.13) for H{nt explicitly:
LLLEk(T' M*(k,(1,n,n')b~,bnat·
k,a n n'
( partial )
integration
J
Fig. 4.3. Scattering of a
27rnc
L3 wk
2
dV 7f;*,(x)
n
(-.;!!-
Imc
e- ik .x . v) 7f;n(x)
particle from In') to In)
by absorption (annihila-
tion) of two photons k 2 0"2
Ek(T . M( -k, n', n) . and klO"I
112 4. Quantum Fields with Interaction
k,a n n'
We have exchanged the summation indices in the last step. Result (4.15)
is identical to the second term of (4.6). On the same footing it is easy to
Fig. 4.4. Scattering of a show the identity of (4.14) and (4.7). The field-theoretical formalism (second
particle from In') to In) quantization) presented here clearly expresses the quantum character of the
with simultaneous emis- processes. However, it does not modify the physical content of the theory; it is
sion of two photons k 2 0'2
and kjO'j only the particle feature that is explicitly illustrated by second quantization.
EXAMPLE
This example can be solved rigorously with the methods developed and pre-
sented during the course of the first three sections. We will now approach it
with the tools of second quantization in order to demonstrate the advantages
of this formalism.
We know from classical electrodynamics that an accelerated charged parti-
cle radiates. In particular such a Bremsstrahlung process happens in the colli-
sion of two charged particles; during the collision the particles first slow down
and then accelerate again or vice versa. Bremsstrahlung occurs in various colli-
sion processes: for example, heavy-ion scattering (see Fig. 4.6), proton~proton
scattering; and electron~electron scattering.
Here we discuss electron~nucleus scattering, i.e. the collision of an electron
and an atomic nucleus and their bremsstrahlung. Because the mass of the
atomic nucleus with A nucleons is much larger than the electron mass,
mnucleon A :::::i 2000 A ,
melectron
4. Quantum Fields with Interaction 113
recoil effects can be neglected and the nuclei can be considered as being at rest.
This approximation is not valid for nucleus-nucleus bremsstrahlung. Without
any loss of generality we place the nucleus at the origin x = 0 of our coor-
dinate system and denote the position vector for the electron as x (compare
with Fig. 4.7). The interaction potential between the electron and nucleus is
described by the Coulomb potential
Ze Ze
-TxT
2 2
V(x) = = --;:- . (1)
Hv = Jd3x~t(x)V(x)~(x). (2)
of atomic nuclei in heavy-
ion collisions. Bremsstrah-
lung represents an im-
The states In) of the free particle are now given by plane electron waves portant background for
quasimolecular radiation
1 . from molecules temporar-
1j;q(x) = (xlq) = ,fL3 e,q·", . (3) ily formed during the col-
lision. This process has
nq represents the electron momentum. According to (3.43) and (3) the field been calculated by Rein-
hardt et al. 1 and experi-
operator ~(x, t) becomes mentally detected by J.8.
Greenberg et al. 2
~(x, t)
and
~t(x,t) (4)
q
L (nq')2
~b~bq'
A A
dV 1j;~(x)1j;q'(x)
q,q'
Here, m represents the electron mass. Expression (5) is plausible; the total
kinetic energy is given as a sum of one-particle energies belonging to the
various states 1j;q(x) multiplied by the number operator bkbq. With (4) the
(Coulomb) interaction (2) explicitly reads
1 J. Reinhardt, G. 8off, W. Greiner: Z. Phys. A 276 (1976) 285.
2 H.P. Trautvetter, J.8. Greenberg, P. Vincent: Phys. Rev. Lett. 37 (1967) 202.
114 4. Quantum Fields with Interaction
Example 4.1.
L b~bq' JdV e; ; V(X):;;
q,q'
L b~bq,v(q' - q) , (6)
q,q'
where
represents the Fourier transform of the interaction potential V(x). For the
case of Coulomb interaction this expression can be computed in closed form:
J21fhc
L3 wk
2
(-~)
me q' . ' 8q,q'+k'
€Ok a (10)
The sum over q' breaks down because of the Kronecker delta in (10). From
our previous investigations into the emission and absorption of photons from
free particles (see Chap. 2, (2.48) and following) we already know that these
4. Quantum Fields with Interaction 115
processes do not exist for free particles. They are only possible in a surround- Example 4.1.
ing medium with an index of refraction n > 1 (Cherenkov radiation, Chap. 2).
Therefore bremsstrahlung must occur first as a scattering of the charged par-
ticle at the nucleus from an initial to an intermediate state and a successive
photon emission with simultaneous scattering of the particle into the final
state. It is the presence of the atomic nucleus as a force center which makes
the emission of radiation (bremsstrahlung) feasible. In other words: brems-
strahlung is a second-order process, for which H{nt' (11), as well as H v , (6),
contribute as a perturbation. We list the transition matrix element up to
second order:
remain for the bremsstrahlung process. There are only two possibilities for
the intermediate states 11):
Ih) lone electron with Q2)P Ino photon)R
IQ2)pIO)R = bq2 10)pIO)R (16)
and
Ih) lone electron with q;)plone photon krY)R
Iq~)paLIO)R = b~2,IO)paLIO)R' (17)
We illustrate these two processes in Fig. 4.8.
From our previous investigations we know that momentum conservation
holds at a vertex (from the Latin expression "convertere", which means "to
turn around") of the form shown in Fig.4.9 [compare, for example, with
(2.51)]. Hence we can immediately write down momentum conservation for
the vertices illustrated in Fig. 4.8 for which a photon is emitted:
116 4. Quantum Fields with Interaction
q]
(btIO)pIO)RI Lb~bqIV(q' - q) Ib~110)pI0)R)
q,q'
Fig. 4.9. Electron-photon
vertex
q,q'
q,q'
(20)
4. Quantum Fields with Interaction 117
(21)
where we have used (18). With (11) the radiation matrix elements are calcu-
lated as follows:
I
+ (hl, 10)pi h,b,+" &l,lO) p) '( al.lo~~·I~;::' 10)R),}
en
me
en (22)
me
Similarly it follows that
118 4. Quantum Fields with Interaction
Example 4.1.
(23)
As we have already realized earlier in the context of (18) the two 6 functions
appearing in (22) and (23) express momentum conservation at the photon
vertices. Obviously, it also follows naturally from the formalism of second
quantization. We still have to calculate the energy denominators of (19). Since
we deal with real particles, this is an easy task:
qr
/'i 2
2m '
(24)
/'i 2 q2
Ef = __ 3 + nwk .
2m
Energy conservation for the total process is deduced from the 6 function ap-
pearing in the expression for the transition probability per unit time,
(26)
With this result the energy denominators of (19) can be cast into the following
form:
nqi nq3
Vi = and V3 = - (29) Example 4.1.
m m
have been introduced. With (20), (23), and (28) we can now write down the
bremsstrahlung matrix element, (19) explicitly; we get
Mfi = -41lZe 2
V(q3 + k - qi)2
(-en)
mc
J21f!ic 2
VWk
XEkcr· (nw k
(1-~-~) - nw (1-~+~))
kc 2mc k kc 2mc
X Ekcr . (1 _k.~3 kc
_ ~ -l-_--;-~-.:-l-+-"""'nk,-)
2mc
-
kc 2mc
(30)
All the time [see (24)] we have assumed nonrelativistic electrons, i.e. vlc« l.
Moreover, the photon momentum nk will remain small with respect to the
particle momentum in general, so that
nk« nq = mv.
Consequently, the denominator of (30) can be replaced by 1, and the denom-
inator of the first factor of (30) can be approximated by
(q3 +k - qd 2 ~ (q3 - qi)2 .
Then, (30) simplifies to
41fZe 2 n2
Mfi (31)
Vm 21Lv 12
where
Lv = V3 - Vi (32)
represents the difference between the electron velocities in the initial and final
state.
The total cross section IJ for bremsstrahlung is defined as the sum over
all transition probabilities per unit time for transition into all possible final
states and divided by the incoming electron current
(33)
(34)
A summation over electron states on the one hand and over photon states on
the other hand has been performed already; also replacement (2.18) has been
applied:
120 4. Quantum Fields with Interaction
Example 4.1.
L
£3--+ 00
------+
L3
(27r)3
j d3k,
k
L3 j 3
(35)
L
L 3 --->00
------+
(27r)3 d q3.
q
dDe is the solid angle element in which the electrons scatter and dDk the one
for photons (see Fig. 4.10). Inserting (31) into (34) yields
k,(f Since
Fig. 4.10. Solid angle ele-
ments into which the elec-
tron and photon are scat-
tered
(only positive q3 have to be considered in the integral J ... dq3) and
. rJ
16vl = 2VI sm"2 (37)
d3 a ( da ) dW (40)
df2e dDk dw = df2e Rutherford df2k dw .
4. Quantum Fields with Interaction 121
From (39) we unveil that the energy spectrum of the photon behaves like Example 4.1.
dw / w; as a consequence the probability of a photon being emitted with en-
ergy zero appears to increase to infinity. This behavior also shows up in the
relativistic theory of bremsstrahlung; it is known as the infrared catastrophe.
In order to better understand and remove this unreasonable result a careful
analysis of the actual experimental conditions is required for the observation
of bremsstrahlung. The crucial point is that every counting device possesses -----~ Ze
only a finite energy resolution. If it detects an electron scattered inelastically
into a finite energy interval around w = 0, then it will also detect an electron -----~ Ze
scattered elastically. Therefore we have to consider the elastic as well as the
inelastic scattering cross section for a comparison with experiment; both pro-
cesses, of course, up to the same order in e 2 . Since the bremsstrahlung part of
(39) is of second order in e 2 relative to elastic scattering the so-called radia-
tion corrections up to order e 2 have to be taken into account for the electron
scattering cross section
(d~J elastic .
Two different contributions exist. On the one hand, graphs of the form given
in Fig. 4.12 describe the second-order scattering of electrons in a Coulomb
potential (confer Exercises 4.2 and 4.4). Fig. 4.12. Second-order
On the other hand, a second-order interaction of the electron with itself Coulomb scattering
exists via the radiation field; the graphs depicted in Fig. 4.13 give an illus-
tration. They depict a virtual photon, which is first created at the electron
site and then falls back again to the electron like a boomerang. This is in
contrast to the graphs depicted earlier for which the second photon also runs
to the source (Ze); consult again the discussion about self-energy at the end
of Chap. 1. A systematic calculation of these graphs within relativistic theory
shows that the contributions from radiation corrections contain a divergent ---~Ze
term, too, which exactly cancels the divergence (39) of bremsstrahlung at
tu.J = O. Hence, the infrared catastrophe does not exist in reality!
EXERCISE
2m r
As discussed earlier, in second quantization the interaction takes on the form
Fig. 4.13. Radiation cor-
A _
Hv -
' " ' " At A
- ~~bqbq'
q q'
471' Ze 2
Plq_q'1 2
_
- ----v- ~~bq-kbqk2'
471' Ze 2
q k
'" '" At A 1 rections to Coulomb scat-
tering
122 4. Quantum Fields with Interaction
Exercise 4.2. We consider a transition from a particle state Ii) = bklO) to the final state
If) = bt,IO). tip and tip' denote the particle momenta before and after the
collision, respectively. The transition matrix element becomes
Mfi
A
= UIHvl z) -
. _
--v
47rZe 1 A At A At
2 ,",
L k (Olbp,bq_kbqbpIO)
q,k
2
--v
47rZe 1
2 ,", At A A At
L k2 (01 (5 p'q-k - bq_kbp' )bqbpIO)
q,k
41l'Ze 2 ,", 1
- -V L 5p 'q_k 5pq k 2
q,k
47rZe 2 1
--V(p_p,)2 .
To obtain the cross section one has to sum over all final momenta p' and
!
divide by the electron current np/mL3;
L3 m L3 12 I 27r 167r 2 (Ze 2)2 1
(J = np (27r )3 p dp dS?P'1i L6 (pI _ p)4
!
2m 2m
4mZ 2e4 2m p/2 dp' dS? , 5(p2 - p/2)
! !
ti2p ti2 P (pI _ p)4
2!
ti4 p (1 - cos 7'J)2
Z2 e dS?
4E2 4sin 4 7'J/2 .
Thus, we arrive at the well-known result
d(J) Z2 e4 1
(
dS? Rutherford = 16E2 sin4 7'J /2
for the differential cross section. Rutherford's scattering cross section diverges
for small scattering angles. Performing an integration over the solid angle we
realize that the total scattering cross section (J = f (d(J / dS?)dS? diverges, too.
This stems from the strong singularity'" 1jt'!4 at 7'J = 0; this singularity has
the same origin as the infrared catastrophe we have already experienced in a
calculation of bremsstrahlung.
4. Quantum Fields with Interaction 123
EXERCISE
Problem. Calculate the lifetime of the 2s state of the hydrogen atom with
respect to two-photon decay (compare with Exercise 2.10) with the methods
of second quantization.
k,u n,n'
and (1)
iI"
(2)
H' produces two photons in second order, whereas H" directly produces two
photons. No other term in (4.7) contributes to the two-photon process. In the
initial state the electron occupies the 2s state and no photon (radiation) is
present:
(3)
In the final state the electron occupies the Is state and two photons exist with
quantum numbers (k r , crr) and (k~, crD:
If) = btsiil f!
II
f
iikt ,
f'O"(
,10)eI0)rad' (4)
With the initial and final states given we can calculate the matrix elements
(2) of the Hamiltonians (1)
MN) UliI"li)
rad (01 e(Oliik;,II; iikf'"fblsH"b~s IO)e 10)rad
L L LM(-kl,crl,-k2,cr2,n,n')
124 4. Quantum Fields with Interaction
for the matrix element of operators. After commutation the photon part be-
comes:
(Olak'pUr,(atk},Ul ak f,Ue + 8k k 8O"l,O"f )
1, f at
k2,U2 10)
i.e. the electron has to occupy the 2s state as its initial state and the Is state
as its final state. Then we get
M( -k;, 0';' -k[, 0'[, Is, 2s) + M( -k[, O"f, -k;, 0";, Is, 2s) . (5)
In order to calculate the matrix element of second order we need the interme-
diate states
(6)
for which only one photon is present (H' can produce only one photon in first
order). The matrix element reads
(7)
4. Quantum Fields with Interaction 125
k,u n,n'
Furthermore,
and
(Olak;,a;ak"a,a!,aaLa z 10)
(Olak;,a; (aLak"a, + c5k,kAj,a,) aLaz 10)
(olak;,a;aLak"a,aLaz 10)
+ c5k,k,c5a,a, (Olak;,a;aLaz 10)
(Olak;,a;aL (aLaz ak"a, + c5k.,k,c5a.,a,) 10)
c5k,k,c5a,a, (Olc5k.,k; c5a.,a; + aLaz ak;,a~ 10)
Z)
, 'atk,a 10) + 15k , kf 15
15 k kf 15 O'z,CTr (Ola kflat (T,fff
15 k k'c5
%1 f aZ)a r,
f a,a r' + at
15k kf 15O'z,O'r (Olc5 k, k'c5
Z) k,O' ak'f,u t,10)
X (c5k.,kfc5a.,afc5k,k;c5a,a; + I5k,kfI5a,a,I5kZlk;I5a"a;)
LL M( -k, 0", n, n')c5n , ,zc5n ,ls
k,a n,n'
(8)
126 4. Quantum Fields with Interaction
k',a'm,m'
k',(j' m,m'
It holds
Hence,
X [bkzok,b""a,bk,k;ba,a; + bk,k,ba,a,bk"k;b"",,;]}
x [L L
k',a'm,m'
M(-k',a',m,m')bk"k,baz,a,bm',2sbm,z]
L
z,kz ,(J z
E-
1
~E
Z
[M( -k;, aL Is, z)bkz,k,bazoa'
Two terms arise as a result of the different time ordering of the photon emis-
sion as can be read from the arguments of the matrix elements; see Fig. 4.14. lIs)
Still the sum over electron states z remains in MA2). The total matrix
element Mfi = Mc~1) + MA2) becomes
Mfi = M( -kL O'L -kc, O'c, Is, 2s) + M( -kc, O'c, -k;, O'L Is, 2s) 12s)
+ ~ [E _ ~ or
-Iiw M( -kL O'L Is, z)M( -k f , O'c, z, 2s)
Z 28 Z f lIs)
+ E28_L_liwcM(-kc,O'f,IS,Z)M(-kr,O'Lz,2S)]. (12)
J
The expression for the lifetime is given by (compare with Exercise 2.10):
12s)
-:;1 L6 n2c
= (27l')6 27l' 'L,
" d 3k f d 3'1
kc Mfi 12-'( ')
u ko - kc - kc (13)
Fig. 4.14. Different time
CTf a f
ordering of two-photon de-
with ncko = E 2s - E 1s . From (2) we pull out those factors carrying dimensions: cay
and
M(k,O',n,n') = (27l'nc 2 )
L3 Wk
1/2 (_~)
lmc
Jd 3 x 1Pn* (eik'XEka' V) 1Pn"
Since M(k, 0', n, n') appears in the form M(k, 0', n, n')M(k', 0", m, m') within
the matrix element Mfi we can pull out of the total matrix element all quan-
tities carrying dimensions:
and
EXERCISE
J q2 dq dnq
J
h2 L6 (2n)3 (p2 _ q2)(pl _ q)2(p _ q)2
8m(Ze 2)2 q2 dqdnq
nh2L3 (p2 _ q2)(pI2 + q2 _ 2p" q)(p2 + q2 - 2p. q) .
The determination of this integral is far from easy. We follow a method pre-
sented by Dalitz. 3
It = J d3q
[(PI - q)2 + A2](p§ - q2)
lim 27T 1 +1
d(cos'!9)
100 q2 dq
(P1 2 +q2-2qPlcos'!9+A2)(p§_q2 +iE)
1 100-00
E->O+ -1 0
+1 q2 dq
lim 7T dt .
E->O+ -1 (PI 2 + q2 - 2qPIt + A2)(p§ - q2 + if)
1/ (p~ + ,\ 2 ) represents the Fourier transform of 1/ 47Tr exp( -r /'\), which is
the exponentially screened Coulomb potential. As we will realize later on, the
introduction of a finite ,\ is equivalent to giving the photon a finite mass ,\.
The integrand becomes singular at q = ±(p2 + if/2) and q = PIt ±
iJp~(1 - t 2 ) + A2. We evaluate the integral with the help of the residue theo-
rem as we close the integration path in the upper half of the complex q plane.
Here the two poles with positive imaginary part are enclosed by the integra-
tion path. Since the poles of the integrand are of first order the residues are
determined by
Res(J(q) , qi) = lim (q - qi)f(q) .
q-+qi
It =
where
r VPi(1-t 2 )+A2.
7T 2i
It = -
PI
1+
- 1
1
dt T(
1
p2t
1
- ip r
1 T)
P2 - PI t + 1
=
7T 2i
-
PI
1+
-1
1 d
d t In(p2 - PIt
dt- + iT)
2·
~ [In(p2 - PI + iA) -In(p2 + PI + iA)]
PI
Thus, we have
lim
E->O+
J [(PI - q)2 +
d 3q
A2](p§ - q2 + if)
= 27T 2 In P2 - PI
PI P2 + PI
+ iA .
+ iA
Via partial differentiation with respect to the parameter A we obtain
- ·
11m
{->O+
J ((q - Pl)2
2Ad 3q
+ A2)2(p§ - q2 + if)
i7T 2 ( i i)
PI P2 - PI + iA P2 + PI + iA
from which follows
130 4. Quantum Fields with Interaction
and with a small trick it can be transformed into the form of 12 : First, we
observe that a 0 function appears in the formula for the cross section, which
guarantees energy conservation. Hence, we only need to calculate the matrix
element for the case p~2 = p~ and we write P2 . p~ = p~ cos {). We make use of
the identity
~=
ab
1+-1
1 dz (a(l
2
+ z) + b(l -
2
Z))-2
12 +1 dz [2
-1 [).. + (P2 -I 2 1+
q) ]-2-
z + [).. 2 + (P2 - 2 1-
q) ]-2-
z] -2
1 +1
-1
dz
2
2 2 -2
- [(PI - q) + A ]
with
1
PI 2[(1+z)P2+(1-z)p],
2212212
A ).. + 2(1 - z )P2 + 2(z - 1)p2 . P2
I
1
)..2 + (1 - z2)p 2 2 (1 - cos{J)
In the limit A ---+ 0 the first term inside the bracket vanishes, so we finally Exercise 4.4.
arrive at
M(2) ~ 41l'm(Ze 2 )2 i 1 2p~sint9/2 A-+O
fi ~ - fj,2L3 p3 sin2 t9/2 n A ----400.
The matrix element diverges as the introduced photon mass A goes to zero.
Here the long range of the Coulomb potential leads to a divergence which
questions the result of the previous problem: How can we trust the result
from first-order perturbation theory if the second order already diverges? On
the other hand the experiments are in very good agreement with Ruther-
ford's result. Later in this volume we will recognize how these and other
divergences can be avoided within the framework of renormalization theory,
which is widely discussed in relativistic quantum electrodynamics.
5. Infinities in Quantum Electrodynamics:
Renormalization Problems
We begin with the infinite zero-point energy of the radiation field in the vac-
uum, which we have already encountered in (1.52):
(5.1)
L nwkaLak" + ~ L nwk
k,u k,O'
(5.3a)
and
iI = L nwkal"ak" , (5.3b)
k,"
respectively. Evidently both Hamiltonians differ by the presence of the infinite
zero-point energy. Now we could argue that we have to pay careful attention
to the ordering of a, a* in the classical expression (before quantization). Then
(5.3a) would be the correct Hamiltonian for the free electromagnetic field.
We could also follow the reverse direction and ask what classical expression
for the radiation energy would result from (5.3b) if we take into account the
precise ordering. Indeed, a lengthy backward calculation l starting from (5.3b)
yields
H rad 2-
87l'
Jd 3 x
X {E2 + ii + y'~V2 [E. (V x B) - (V x B) . E]} , (5.4)
1 E.G. Harris: A Pedestrian Approach to Quantum Theory (Wiley, New York 1972).
5.1 Attraction of Parallel, Conducting Plates 135
Now the question arises whether somehow the zero-point energy (or parts
of it) can be observed in an experiment. Casimir2 pointed out such a possibility
and Lifschitz and Fierz 3 have both investigated his argument in more detail. 4
The line of thought is as follows. The zero-point energy of the electromag-
netic field is given by
1
L '2 fkvn ,
n
Em Elv
cJQr-1J
where n stands for k. The frequencies Wn of the electromagnetic field depend
on the geometry of the volume in which the field is confined. If the geometrical
form changes the frequency of the normal modes, the zero-point energy will
also change. We consider a rectangular box of lenght R and basal area A =
L2. This box, with conducting walls, represents our volume (quantization
~~.~
volume), which determines the frequencies of the electromagnetic field via its I. RI1J.I~ ~/d
geometry. We introduce a second plate at a distance d from the first one and
are interested in the energy of the system depending on the position of one Fig. 5.1. Modification of
a rectangular box leads to
of the plates. Hence we subtract the energy of a reference configuration for a modification of the zero-
which the mobile plate is fixed at a certain distance (for instance 1/rJ with point energy
1/7] = 1/2; see Fig. 5.1). The difference in the energy is
U(d,R,A) = (EI+En)-(Em+Elv), (5.5)
where Ei (i = I, II, III, IV) represents the zero-point energy of the free elec-
tromagnetic field in the corresponding region depicted in the figure. In the
following, we will move the walls of our quantization volume to infinity, i.e.
U(d, A) = lim U(d, R, A) .
R->oo
where I, m, and n are positive integers. Because there are two possible polar-
izations, the potential energy becomes
(5.7)
where, for example, the bracket (d ----> R/7]) is a shorthand notation of the first
expression with the substitution of Rh instead of d.
We will present the further calculation in Exercise 5.1. After performing
the limiting cases R --> 00 and ).. --> 0 we obtain
This result is finite and does not depend on the cut-off procedure. From here
we calculate the force per unit area:
F = _~ (U(d,A)) = _ fiC7r 2 •
(5.9)
ad A 240d4 '
evidently this is not only a finite result again but, more astonishing, the force
depends only on the universal constants fi and c. Obviously it is indepen-
dent of e, the coupling between the electromagnetic field and matter. This
force depends only on the "zero-point pressure" of the zero-point oscillations
of the photon vacuum. Therefore we are allowed to say that this attracting
force between the conducting plates is of pure quantum-mechanical nature. It
vanishes for the limiting case fi --> O.
We have realized that two parallel plates alter the allowed normal frequen-
cies and thus the zero-point energy of the quantization volume (for the case
R ----> 00 of the "universe"). From this alteration results a force that pushes
the plates together.
An extension of this theory onto dielectric substances at finite temper-
atures was performed by Lifschitz and Fierz. An experimental test of the
theory started in 1957 with studies by Abrikosova and Deriagin,5 whereas the
first real evidence was provided by Sparnaay. 6 Details of this experiment are
presented in Exercise 5.2. From then on this experiment has been repeated
several times with different methods. The theoretical predictions have been
confirmed every time. 7
5 1.1. Abrikosova, B.V. Deriagin: Sov. Phys. JETP 3 (1957) 819; 4 (1957) 2.
6 M.J. Sparnaay: Physica 24 (1958) 751.
7 T.H. Boyer: Ann. of Physics 56 (1970) 474.
5.1 Attraction of Parallel, Conducting Plates 137
EXERCISE
Problem. Under the assumption that the length L of the plates is large com-
pared to the distance d, derive the relation given in the text for the potential
energy of the system of plates depicted in Fig. 5.1.
Solution. To begin with we calculate the first expression in (5.7) for the
potential energy; afterwards we replace d with the corresponding other quan-
tities. We get
U he L klmn(d, L, L) exp[->'k1mn(d, L, L)] ,
l,m)n
where
U = he ~
00 100 100
m=O dm n=O dnklmn(d,L,L)exp[->.klmn(d,L,L)]
or
U 00
he ~ 100 1
m=O
00
U = he 2
L2
L00 100 dxdy
7r 1=1 0
The substitution
rdr d¢ = ~ (~r dz = ~ (J f l2 dz .
138 5. Infinities in Quantum Electrodynamics: Renormalization Problems
r
Exercise 5.1. The integration over ¢ goes up to 7r /2 (only positive z)! We get
U = ~nc ~ ~ l n/2
d¢ 1 00
dz v'zTI (± exp ( -A±v'zTI)
so that
U = -~ncA7r2 ~
4 L..t
1=1
ioroo ~
z+ 1 ~
d0: 3
{exp (_ o:l rz+l)} .
d
1
U = --hcA7r
4
2 -d
3
d0: 3
10
00
-dz- Lexp
Z
00
+ 1 1=1
(O:l
--rz+l )
d
Since
~e--YI _ 1
L..t - 1- e--Y ,
1=0
it follows that
:l
~exp (- v'zTI) 1 - exp (-(o:/d)v'zTI) 1 ,
1
----;--.,-------;::==_ -
u=
5.1 Attraction of Parallel, Conducting Plates 139
U = ~ncA-7l"2~
2 do;2
[.!.1°O (x dx- 1)2 ]
0; e"Y
With z = x - I we get
u= ~ncA-7l"2
2
d [.!.1°O d;]
2
do;
2
0; e"Y-l Z
~ncA'71"2
2
d22
do;
[.!. (_~)OO ]
0; Z e"Y-l
1n A
"2 C
2 d2
'71" do;2
[1a 1 1]
eO'./d -
-
y 00
- "'"'-y
Bn n
eY -1 - L n! '
n=O
where Bn represent the Bernoulli numbers, which are given by Bo = 1, Bl =
-1/2, B2 = 1/6, B3 = 0, B4 = -1/30, .... Hence 8
Here (d --t R - d) means that d inside the first bracket has been replaced by
(R - d) and so forth.
8 See, for example: G. Artken, H.S. Weber: Mathematical Methods for Physicists
(Academic Press, New York 1995).
140 5. Infinities in Quantum Electrodynamics: Renormalization Problems
U(d,A) =
As we sum over all four contributions, the terms with a- 2 and a- 1 cancel.
The next term vanishes once differentiated with respect to a. Therefore we
are left with
U(d,A)
Hence, the potential stays finite and does not depend on the cut-off parameter
.\ nor on the coupling constant a; it depends only on the dimensions of the
plates. The resulting force
[j 1[2 A
F = - 8d U (d,A) = - 240 nc d4
is attractive; both plates attract each other!
EXAMPLE
e b
mmm
distance (d) between plates is chosen which is large enough so that the force
acting between the plates is smaller than the measurement resolution. With
the help of weight Wand spring S the scale is brought into equilibrium. Now
the distance between the plates is changed with the micrometer screws (m)
and the corresponding force is measured via the condenser (c).
In this way the dependence of the force on the distance d is obtained
which is depicted in Fig. 5.3. The width of the measured points goes back
to the micrometer screws. In the region F « 0.01 dynjcm 2 the quality of
the measurement of the force becomes uncertain. The broken line represents
the theoretical prediction as derived above. We see that the agreement be-
tween theory and experiment is remarkably good within the experimental
uncertainty. The experimental results obtained by Abrikosova show the same
agreement with the theoretical predictions.
The measured dependence of the force on the distance excludes expla-
nations (such as van der Waals forces) other than the Casimir effect. The
0.15 I---,.t-r--,,.-t---+---I---l
0.1
1\
\ theory
,Ill I
0.05 1----+--;\.-,--+1----1---11---1
0.01
,
x~x
Example 5.2. experiment described here has been presented in an oversimplified way. In
real life, precision work is necessary: The plates have to be entirely free of
dust, which is achieved only by very complicated procedures. Furthermore,
all electrostatic charge must be removed since it would lead to additional
forces. In addition, rigorous requirements have to be demanded for the plates
to be really flat. These are only some points that have to be taken care of. l l
EXAMPLE
The outcome of the zero-point energy for the plate condenser inspired
Casimir12 to develop a model for the electron. He assumed that the elec-
tron can be viewed as a spherical shell with homogenous surface charge e (see
Fig. 5.4). The electrostatic energy of a spherical shell with radius a is given
by
e2
Ee = -2a
The corresponding pressure is Pe = -aE/aV. With V = 47Ta 3/3 we obtain
p. _ _ _
1_ aEe
e - 47Ta2 aa
Fig. 5.4. Spherical shell
model of the electron i.e. the electrostatic pressure puffs up the spherical shell.
The presence of the spherical shell alters the normal modes of the electro-
magnetic field. The typical wave-number is k n rv l/a, so that
EN = - 720d3
7r 2 ncA
= -
7r 3
720a
nc 7r 3
= - 360
(ftc)
2a ;
see Fig. 5.5. For the limit of a point charge (R --+ 0) the electric field energy
of this sphere
(5.10)
becomes infinite.
This is the longitudinal (Coulomb) energy of the field. In quantum field
theory an additional energy exists as a result of the transverse electromagnetic
field. In the following, we will investigate this quantity within the framework
of nonrelativistic quantum electrodynamics, which we have pursued so far. Fig. 5.5. Electric field
In our lectures on quantum electrodynamics 14 we will become acquainted of a charged, conducting
with the relativistic theory, which is more exact and correct. For our purposes sphere
We designate the state with one free electron of momentum p and no photon
as
(5.14)
Then according to first-order perturbation calculation for the free electron we
calculate the contribution
(5.15)
EXAMPLE
According to classical electrodynamics the electron with its charge (-e) pro-
duces an electrostatic field where the energy is given by
e2
W = aR . (1)
The coefficient a depends on how the charge (-e) is distributed over the
small sphere with radius R. The order of magnitude for a is 1: for example,
a = 1/2 for a charge distribution confined to the spherical surface (compare
with result (5.10) for a conducting sphere) or a = 3/5 for a uniform charge
distribution over the whole sphere. The eigenmass of the electron consists of
two parts:
1) the mechanical mass m', which is not coupled to the field energy and
2) the electromagnetic mass mel, which is coupled to the field energy.
Hence, we have
m = m'+mel. (2)
5.2 Renormalization of the Electron Mass 145
According to (5.10) the mass coupled to the field is determined by Example 5.4.
W e2
mel = 2 = a Rc2 . (3)
Strictly speaking, classical electrodynamics provides the following expression
for the mass coupled to the electrostatic field:
4W
mel = '3 c2 . (4)
The factor 4/3 instead of 1 results from the instability of the charge distri-
bution as identical charge elements inside the sphere repell each other. Thus,
additional forces of nonelectromagnetic origin have to be introduced in order
to produce equilibrium. If we consider nonlinear electrodynamics these diffi-
culties do not occur because they lead to stable charge distributions inside
the electron. For this case the correct relation (3) between the field-coupled
mass and the field energy is obtained. Lorentz succeeded in showing that the
electromagnetic mass depends on the velocity v of the charge according to the
relation
(5)
VI - v2 /c 2
This relation is easy to understand because Maxwell's equations have to be
Lorentz covariant. Hence, the electrostatic energy has to transform like the
fourth component of a four-vector, from which (5) follows (consult the lectures
on classical mechanics and classical electrodynamics of this series) .15
According to Lorentz the mechanical mass m' should be a constant; he
assumed the validity of Newtonian mechanics. However, experimental expe-
rience showed that the total electron mass changes according to the law (5).
It appeared that the total electron mass possesses field character. Nowadays
a value of R « 10- 16 cm for the electron radius, deduced from scattering
experiments, is used. Lorentz used a value R ~ 10- 13 cm and calculated with
(3) a reasonable mass of m ~ 10- 27 9 = melectron. Then came the theory
of relativity and required that the mechanical mass m' had to obey the law
(5), toO.15~16 The apparent consistency of these considerations achieved so
far broke down and the nature of the electron mass again faded away into
darkness.
In first order the term p.A from (5.11) does not contribute because it does
not contain terms of the form at a or aa t, which describe the emission and
A 2
reabsorption of photons. On the other hand, the term A describes such simul-
taneous creation and annihilation processes, as shown graphically in Fig. 5.6.
With the help of (2.9) and (5.14) we deduce immediately that
(5.16)
Obviously this contribution to the energy of the free electron is infinite but
independent of the electron momentum p. It is equal for all electrons and, as
soon as energy differences are taken into account, it drops out. Therefore we
will not consider this contribution further.
Next we turn to the second-order contribution of the term p . A appear-
ing in the interaction (5.11), i.e. [(,t. This contribution will be much more
interesting; in second order it yields the contribution
to the energy. Since A is linear in the photon creation and annihilation op-
erators a, at the intermediate states II) have to contain one photon. Hence,
they are of the form
't t
II) bqIO)pak.,-IO)rad,
(5.18)
't
Ip) bplO)p ,
and
(hp)2
Ep ,
2m
(5.20)
(5.22)
Here {j represents the angle between the photon and electron (see Fig. 5.8).
Starting with
E(2) = _e 2_ (27rnc
___ 2 ) _L3_
p m 2 c2 £3 (27r)3
X J d3 k -
1
Wk 1i2p2 _
n2p2(1 - _
_ _---:,.---=----'-_
(/i
cos 2 {j)
2p2 _ 1i2 p ·k
_-'---c,---_-.,-
(/ik)2
+
nkc)
+
J
2m 2m m 2m
--e 2- --27T
(np?
m 2 e2 (27T)2
1 1+ 00
dk
1 1 - cos 2 {)
d(cos{))---------;:,,--
1 - ~ cos{) + 2'::,~2
1+ 1
0 -1
2
(np)2 ( - - -e2 1 00
dk )
-2- - d(cos{))(1 - cos 2 {)) v hkc'
m me 7T -1 0 1 - C cos {) + 2mc 2
Obviously the last integral over k diverges logarithmically because for large k
it behaves like J dkjk. With other words: The self-energy E~2) is divergent.
If we combine this contribution with the energy of zeroth order E~O) the total
energy of the free particle becomes
Ep E(O)
p
+ E(2)
p
(np)2
1+ 1
2m
(np)2 ( 2 1
- -e2 - dk )
00
- - - d( cos {))(1 - cos 2 {)) v hkc
2m me 7T _ 1 0 1 - C cos {) + 2mc 2
'- v "
=D
(np)2 (1 _ D). (5.24)
2m
D stands for the divergent part. If the extreme nonrelativistic approximation
vje« 1, fuvjme 2 « 1 is applied to D, which, of course, cannot be fulfilled in
the total integration domain, then we arrive at the result
D = 41+1
me 7T -1
d(cos{))(I-cos 2 {))
io
roo dk
~ (~)
io
roo dk
1
me
2 7T 3
-4 - e2 00
dk
37T me2 0 '
E
p
= (np)2
2m
(1- ~~ ioroo 37T mc2
dk) . (5.25)
We will pursue the following point of view: The mass m in the energy of zeroth
order,
has to hold (in lowest order). In other words: The unknown fictitious mass m
of the electron has to be conditioned (divergent) such that
m
m exp = 1- D
always has a finite value, i.e. the value for the electron mass deduced from ex-
periment. This procedure of interpreting the divergent self-energy as a change
from the fictitious mass m of the electron to the real mass m exp is known as
the renormalization of the mass.
m
Llv
+3/2
+1I2
2P3/2
-112
Fig. 5.10. Sketch of the
splitting of the hydro- -3/2
gen energy levels 2P3/2,
2Pl/2, 2S 1 / 2 in a magnetic +1/2
field and of the excitation 2S1/2
+1/2
frequencies of Lamb and 2Pl/2 -1/2
Retherford -1/2
IBI
EXAMPLE
As the hydrogen atoms have to pass through a resonator which contains Example 5.5.
electromagnetic radiation with the transition frequency 2S 1 / 2 --> 2P3/2 the
current diminishes since the induced gamma emission causes the number
of metastable atoms to be reduced. The experimental set-up is sketched in
Fig.5.1l.
magnetic
fieldB
10- 8 of all atoms are now
in the metastable state
beam
collector with Fig. 5.11. Lamb and
resonator gal vanometer Retherford's experimental
2.4-18.5 cm set-up
The splitting due to the Lamb shift is of the order of 10- 6 eV, which lies
in the microwave region. Technically it is more convenient to observe this
energy difference directly with a high-frequency method instead of optically
investigating the splitting of the Ha line. In particular, the latter procedure
is difficult to perform because of the Doppler shift of the atoms moving with
respect to the source; the resulting frequency shift,
/:)./.1 = 7 X 10- 7 VTjA/.I,
is proportional to /.I (T in Kelvin; A atomic weight).
As soon as 2s --> 2p transitions occur in the field region, the current is
reduced because metastable atoms have been removed.
In)
The self-energy of a state In) of the hydrogen atom is represented by the
diagram shown in Fig. 5.12. This diagram is analogous to the one for a free
electron, sketched in Fig. 5.7. The only difference is that now the electron
states In) are bound and not free. The correction to the energy is given by
(5.29)
Since
it follows that
E(2) = ~ e2 li L roo kdk l(n'lplnW (5.30)
n 37l' m 2 e
n'
loo En - En' - like'
This energy is as divergent as the self-energy for a free particle obtained pre-
viously in (5.25). At this point Bethe argued as follows: For a free electron the
matrix elements (n'lpln) yield only diagonal contributions and (5.30) becomes
the formerly derived second term of (5.25)
E(2)
p
= _~ ~
37l'me2
((liP )2)
2m
roo dk.
lo
(5.31)
(5.31) becomes
E(2)
n(mass)
(5.33)
x [ 1 1- ]
+like (5.34)
En - En' - Me .
5.3 The Splitting of the Hydrogen States 2S 1 / 2-2p3/2: The Lamb Shift 153
Because of
1 1 k--+oo
------+- --4
En - En' - like like like(En - En' - like)
(5.34) is logarithmically divergent. Here Bethe assumed that the integral anal-
ogous to (5.34) has to converge in a relativistic theory. This turned out to be
correct for the following reason. In a relativistic theory, states exist with pos-
itive and negative energy, which, according to the equation
(5.35)
+moc2
have to lie either above +moe2 or beneath -moc 2 . According to Dirac the --+---
lower states have to be occupied with electrons; otherwise an electron would
decay into even lower states and emit radiation. Thus lower states occupied
with electrons prevent a "radiation catastrophe" because the Pauli principle
prohibits such transitions.
If we attempt to construct a wave packet out of relativistic wavefunctions
that describes an electron with positive energy, states of negative energy do
not contribute as they are already occupied according to the Pauli princi-
Fig. 5.13. Typical energy
ple. Together the states with positive and negative energy are complete; but
spectrum of a relativistic
no subset is complete! In other words: With states of positive energy alone theory
we cannot construct a 83 (r) function. Hence, a wave packet consisting of
wavefunctions with positive energy cannot be compressed to arbitrary small
volumes; this can only be achieved up to the Compton wavelength of the
electron,
h
ACompton
me
In a way, the electron has acquired a spatial dimension; it is no longer localized
as a point. For this reason a relativistic theory automatically contains this
necessary restriction of the integration over k [compare with (5.34)]. This leads
to the convergence of the correponding integral. For a nonrelativistic theory
this convergence can be achieved by introducing a cut-off of the integral (5.34)
at the photon energy
me 1
like ~ me 2 ==} kCompton
ACompton .
This yields
1 ] kCompton
X
[ - lie In(En - En' - like) 0
154 5. Infinities in Quantum Electrodynamics: Renormalization Problems
In the last step (En' - En) has been neglected with respect to mc 2 in the
nominator of the logarithm. In order to simplify the sum over n' even further,
(En' - En) is replaced within the argument by a suitable mean value
(En' - En) ---> (En' - En)av. (av. = average) . (5.37)
This approximation is not as crude as it appears. The argument of the loga-
rithm
mc2
~ 105
En' -En
is a large number and the logarithm function is a slowly varying function.
Even a deviation in (En' - En)av. by about a factor of 10 results in only a 20%
incorrect value for the expression (5.36). With the approximation (5.37) the
logarithm in (5.36) can be written in front of the summation; the remaining
summation can be calculated as follows:
n' n'
n'
;m +
A2
Ho Vex)
represents the Hamiltonian of the hydrogen atom. The commutator results in
A A fj,
Hop-pHo = --:-VV(x). (5.39)
I
Since
Vex)
and
.6.V(x)
hold, we get for (5.38)
(nip, (Hop - pHo)ln)
(nl~v, (-~VV(x)) In)
fj,2 J'Ij.;~V . (VV)',pn d3 x
5.3 The Splitting of the Hydrogen States 2S 1 / 2-2p3/2: The Lamb Shift 155
f).E~2) = ~Z e:h311jJn(0)12In I mc
2
I (5.41 )
3 m c (En' - En)av.
We know from the theory of the hydrogen atom that states with nonzero
angular momentum (l i- 0) vanish at the origin:
1jJni (0) = 0 for l i- o.
Only s states have a nonvanishing probability within the nucleus:
(5.42)
;:=2
f).E(2)
~ 1040 MHz. (5.43)
of expression (5.34) and could achieve an even better agreement with the
experimental result; see the volume on quantum electrodynamics. 2o
In the lectures on quantum electrodynamics we realize that no infinity (in-
cluding vacuum polarization, which we will not touch here) is observable; they
are included in the finite values of the observed mass and charge (mass and
charge renormalization). Not all field theories are renormalizable. However,
quantum electrodynamics is renormalizable and because of that it is possibly
the most successful theory in physics. Nevertheless, the removal of infinites
by renormalization is not satisfying aestheticly; we return to this point in our
lectures on QED.
The largest radius of the electron charge distribution useful for our calcula-
tions is given by the Compton wavelength a = h/me. We write the interaction
with the radiation field as
HI = -~
me
JF(x - x')p· A(x') d 3 x' ,
F(x) where F(x) represents the charge distribution of the electron. Hence, the
electron interacts with only that part of the radiation field for which A < a,
i.e. lik < me, holds. The maximum photon momentum is given by k max =
me/Ii. On the other hand, the electron is localized within the Bohr radius
ao = li2 /me 2 . In the course of the calculation, the dipole approximation has
x been applied, i.e. exp(ikx) ~ 1; however,
me li2 lie
Fig. 5.14. Possible form-
factor for the electron
kx ~ kmaxao = Ii me2 e2
and
(~) -1 = ~ ~ 137,
Weisskopf, and Feynman 21 and by Kroll and Lamb. Also the so-called vac-
uum polarization is an important contribution to the Lamb shift. We discuss
it in the volume on quantum electrodynamics, where the correct relativistic
calculation due to Wichmann and Kro1l 22 is presented. In recent years it has
become possible to ionize heavy atoms completely and the investigation of, for
example, lead (82 protons) with only a single electron present allows the direct
measurement of the Lamb shift. This opens an interesting field of research,
particularly for testing quantum electrodynamics in strong central fields. 23
EXERCISE
Solution. At first we solve the classical equation of motion for the displace-
ment llx of a particle in an oscillating electric field E = Eo exp(iwt):
d2 = llx = __
~E ~Eoeiwt ==} e_ E - -e- E 0 eiwt
dt 211x m m mw 2 mw 2 '
so that
We now assume that E contains the modes of the (quantized) radiation field,
so that we can substitute E ---> 2.:ka Eka and obtain the expectation value
Since
Ekrr =
. (2----v-
1
~
"~2)1/2
7rnc
wk
1/2
Exercise 5.6.
and
27rn e2 "'" 1
£3 m 2 ~w3
J
k,a k
27rn e2 £3 1
£3 m2 (27r)3 2 d3 k(ek)3
-2- - ne-2
(27r)2 m 2 e3
1 J 00
k 2 dk
-
k3
- dD
2(~)2 roo
0
.: e dk ,
me 7r nc k io
where the factor 2 keeps track of the polarizations (J. This integral diverges
logarithmically; hence we introduce a cut-off for both integration boundaries:
(I~xl
2
) =
2 e2
- -
(
-
n ) 21krnax -dk .
7r nc me km'n k
The displacement ~x means the change in the potential given by (Taylor
series expansion):
V(x + ~x) = [1 + ~x· V + ~(~x· V)2 + ... J V(x).
We average the potential over all directions. Here we have to consider that
the spherical symmetric potential V(x) depends only on the modulus of x:
so that
X·oV
(a· V)(a· V)V(r) aao-.2-
, J ' r or
a.a.
'J
[(8rij _ Xi Xj ) oV
r3 or
+ XjXi
r2
o2V]
or2
2
( a _ (a.r)(a.r)) av +(a.:')202V
r r3 ar r or2
a2 ( 1 - cos 2 'l9 ) -oV + a 2 cos 2 'l9--
-
02V
r or ar2
5.4 Is There an Inconsistency in Bethe's Approach? 159
a2 2 OV 21 02V
---+a ---
r 3 or 3 or2
a2 [~OV + 02V]
3 r or or2 .
The expression within the last square brackets is identical to V 2 V (because
of the spherical symmetry!); hence,
(v(x + ~x)) = (1 + i(lL\xI2)V2) V(x) .
With
v r
and
V2V = _e 2 V 2 (~)
it follows that
Vpert = ~(I~xI2)V2V
27re 2 (l~xI2)<53(x)
3
~E (1PlVpertl1P)
e (~)2 In (k max )
~3 tie
4
me kmin
3
(1P1<5 (x)I1P)
e4 (~)2In (k max )
~3 lie 11P(O)12,
me k min
where 1P is the unperturbed ground-state wavefunction.
This expression has exactly the form of (5.41), from which we have calcu-
lated the Lamb shift. However, the cut-off momenta tik min and tik max remain
to be determined. The infrared divergency is not physical because a bound
and localized electron cannot be put into oscillations by photons with long
wavelenghts. Thus, kmin ~ l/aB = a/A. should hold. Here aB = ti2/(me 2 )
denotes the Bohr radius and A. = ACompton/(27r) = h/(27rme) = ti/(me) the
Compton wavelength divided by 27r. As the upper boundary we choose again
160 5. Infinities in Quantum Electrodynamics: Renormalization Problems
Exercise 5.6. the Compton wavelength kmax = 1/A. In the framework of the classical theory
we cannot give a satisfactory derivation for the latter. With (5.42) we arrive
at
b.E = - 4 a5
-me
371" n 3
(1)
2 Z 3 In -
Za
Onl.
With this result the Lamb shift of the hydrogen atom (Z = 1) becomes b.E ~
660 MHz, which in view of the many crude approximations is in surprisingly
good agreement with the experimental result. This illustrative explanation
of the Lamb shift as a consequence of the vacuum fluctuations was given by
Welton in 1948. 24
fI = Jd3x~t(x,t)fI~~(x,t)
+~Jd3x Jd3x/~t(x,t)~t(xl,t)V(x,xl)~(x,t)~(xl,t), (6.1)
where
ll,2v 2
fI~ = - - - + V(x) (6.2)
2m
represents the unperturbed one-particle Hamiltonian for a particle in an ex-
ternal potential V (x). ~(x, t) represents the field operators (3.43). fI itself is
the Hamiltonian for the many-particle system in second quantization or, more
informatively, the field-theoretical Hamiltonian. We have to find its solutions,
i.e. its eigenstates. In order to see the connection between the field-theoretical
problem (6.1) and the "classical" many-particle problem of elementary quan-
tum mechanics we introduce the n-particle state
-J
IXn, t) - d 3 Xl .. ' J
d 3 XnXn (Xl, ... , Xn) tJi'At (Xl, t) ... tJi'At (Xn' t) 10)
(6.3)
in analogy to our previous considerations (3.66). Xn (Xl"'" Xn) represents
an ordinary function of the n-particle coordinates Xl,"" x n . According to
the conventional rules of quantum mechanics, we interpret
1Xn (Xl, ... , Xn) 12 d3Xl ... d3Xn (6.4)
as the probability of finding particle 1 in d3xl, particle 2 in d3x2,"" and
particle n in d3xn. We determine the amplitude function Xn (Xl, ... ,xn ) in
such a way that IXn, t) is an eigenstate of the field-theoretical Hamiltonian
(6.1):
(6.5)
Indeed, it will be shown in Exercise 6.1 that (6.5) provides the following
defining equation for the amplitude function Xn (Xl, ... ,Xn):
[ t;n ( /i2
-2m V;+V(Xi)
) 1 n n ]
+2t;t;V(Xi,Xj) Xn(XI, ... ,Xn)
EXERCISE
where
and
1
2'LV(x i ,Xj)Xn(X1, ... ,xn) . Exercise 6.1.
i,j
We write
V IXn, t)
~/ d 3x / d3x'~t(X,t)~t (XI,t)V(X,XI)~(X,t)~(X',t)
x/ d3x~ ... / d3x~Xn(x~, ... ,x~)~t(x~,t) ... ~t(x~,t)IO)
~/ d 3x d 3x' d3x~ ... d3x~ ~t (x, t)~t (x', t) V (x, x')
x ~(x, t)~(X', t)Xn(x~, ... ,x~)~t(x~, t) ... ~t(x~, t) 10).
We note the commutation relation
~(X)~t(X') = ±~t(X')~(X) + 6(x - x'),
V IXn, t)
~/ d 3x d 3x ' d3x~ ... d3x~ ~t (x, t)~t (x', t)V(x, x')
I I A A I At I At I
xXn(x1, .. ·,xn)!li(x,t)!li(x ,t)!li (x1,t) ... !li (xn,t) 10)
~/ d 3x d 3x ' d3x~ ... d3x~ ~t (x, t)~t (x', t)V(x, x')
x Xn(X~, ... , x~)~(x, t) [6(x' - x~) ± ~t(x~, t)~(X', t)]
x ~t (x~, t) ... ~t (x~, t) 10) .
The successive commutations are performed analogously to the method for
(3.69) and the equations following it:
Vlxn,t)
~ / d3xd3x~ ... d3x~ ~t(x, t) L v (x, x~) Xn (x~, ... , x~)
,
x ~(x, t)~t (x~, t) ... ~t (x~, t) 10) .
In the same manner the second annihilation operator goes through the com-
mutations; it produces a second sum
For the case when the external potential V(x) vanishes the n particles
move only as a result of their mutual interactions. Then the field operator -tP
can be expanded into plane waves:
-tP(x, t) = bk (t)L
k
ik·x
&. (6.7)
A '"' '"' At A
H = L...J L...J bk1 bk2 2m
!i2k~ JV
d3 x
exp [i (k2 - k 1 ) . x]
k, k2
+ '"'
L...J '"'
k,
L...J '"'
L...J '"'
k2 k3 k4
At bk2
L...J bk1 At bk3
A bk4
A d x
V J J 3 3 '
x
dL3
A '"' !i2 k 2 At A
H = L...J 2m bkbk + '"' '"' '"' - At At A A
L...JL...JL...J V(q)bkl+qbk2_qbk2bkl· (6.11)
k k, k2 q
The last term becomes more plausible once we calculate in more detail:
= V(k3-kl)c5k3+k4,k,+k2 (6.12)
with the substitution z = x - x'. We set q = k3 - kl' so that
k3 + k4 = kl + k2
or
follows from the Kronecker delta symbol. Therefore the last term of (6.9)
becomes
6.1 Quantum Gases 165
L L L bL bl,bk,+qbk2-q V (q)
k, k2 q
LLLbl~_qbl~+qbk~bk~ V(q)
k~ k~ q
(6.13)
which is identical to the last term in (6.11). The interaction term arising in
(6.11) is illustrated in Fig. 6.1.
The operators bk 1 and bk2 annhilate two particles with corresponding mo-
menta Ilki and Ilk2 at the vertex. In the same interaction (at the same vertex)
the operators bL +q and bl, _q create two new particles with momenta Il( ki +q)
and ll(k2 - q). For this process, momentum conservation is guaranteed since
Il(k i + q) + ll(k2 - q) Ilk I + Ilk2, (6.14)
(momentum after collision) (momentum before collision) . k2
Fig. 6.1. Momentum con-
This momentum conservation stems from the assumption (6.8) that the inter- servation at a vertex for
action between particles depends only on x - x', which reflects translational two- particle collisions
invariance. We can rephrase this result: During the scattering the two parti-
cles exchange the momentum Ilq (at the vertex); the corresponding amplitude
is determined by v (q).
XX
k k' k'-q k+q
fJN(k)
(6.15)
fJt
k' q
particle with momentum lik; they decrease the number N(k). In order to
translate the symbolic relation (6.15) into a quantitative equation we have
to introduce the transition probability per unit time for each diagram. We
calculate the latter in first-order perturbation theory, where the interaction
term is treated as a perturbation potential. We use
( trans. prob.) k
time X k
I = 211" 1~
h 12
V(q) N(k + q)N(k' - q)
k+q k'-q
x [1 ± N(k)][1 ± N(k')]
(6.17)
The plus or minus sign within the factors (1 ± N) has to be chosen for bosons
or fermions, respectively. Since the matrix element enters quadratically into
Fermi's golden rule the square-root factors appearing in (6.16) are carried into
Nand (N ± 1). The argument of the 0 function expresses energy conservation
for the scattering process; note that because of elastic collisions only kinetic
energy is considered. Analogously to the process for (6.17), the transition
probability per unit time for the second scattering process appearing in (6.15)
can be calculated. We then arrive at the following master equation for (6.15):
8N(k)
at
LL
k' q
2; IV (q) 120 ( : : (Ik + q 12 + Ik' - q 12 - k 2 - k'2))
this goes back to the fact that outside the interaction region the particles
here move freely without interactions, so the matrix element (6.10) is calcu-
lated with plane waves (Born approximation). In order to find an equilibrium
solution of the Boltzmann equation (6.18) we have to require that
aN(k) = 0
at . (6.19)
N(k) = 1 (6.21)
Aexp [E(k)/kBT] 'f 1 .
Here, A is a normalization constant,
E (k) = h2 k 2 (6.22)
2m
is the kinetic energy, and kBT represents an energy related to temperature.
The plus and minus signs in (6.21) hold for fermions and bosons, respectively;
only with this assignment (6.21) is a solution of (6.20) possible.
EXERCISE
N p = LN(k) =
k
L3 J d3 kN(k)
L Jd kAexp(h k /2mk
3 3 1
BT) ± 1 .2 2
(6.23)
The distribution functions calculated with (6.21) are called the Fermi-Dirac
(for the plus sign) and the Bose-Einstein (for the minus sign) distributions. l In
order to proceed further we need the notion of the entropy of a quantum gas,
which characterizes the disorder of the gas. We will explain some basics about
entropy in Exercise 6.6, where we will also derive the following expression:
S = ±kB L [(1 ± N(k)) In(l ± N(k)) =t= N(k) lnN(k)] . (6.24)
k
1 Historically it is interesting to know that Einstein was the referee of the manu-
script of the Indian physicist Bose, unknown at that time, which had been sub-
mitted to Zeitschrift fUr Physik. He realized its deep implications and improved
the paper by suggesting some valuable comments to Bose.
6.1 Quantum Gases 169
ks represents the Boltzmann constant. Again, the upper sign holds for bosons
and the lower sign for fermions. We will now show that entropy will increase
permanently, i.e.
dS
-dt >
-
o. (6.25)
This follows from the Boltzmann equation (6.18). For the proof we restrict
ourselves to the case of bosons; for fermions the arguments are the same. We
start from (6.24), which we differentiate with respect to time; then we employ
again the Boltzmann equation (6.18). This gives
dS
dt
kB ~ {O~~k) {In[l + N(k)]-lnN(k)}
oN(k) 1 oN(k) 1 }
+ [1 + N(k)]----at [1 + N(k)] - N(k)----at N(k)
ks L L L ~ IV(q)1
k k' q
2
4 dS
dt
LN(k)
k
===} £3 J d3 vf(v) (6.32)
J
classical momentum transfer mu, respectively. Then
~
=? £3 J d3
_q_
(211")3
= £3 m 3
li3 (211")3
d3 u (6.34)
V(q) =
(211" )3/2
-£3
J 3 eiq . x
d xV(x)~
J
(211")
(211"1i/m)3/2 3 eimu,x/fi _ (211"h) 3/2 V (u)
d xV(x) - - --
£3 (211"1i/m)3/2 - m £3
(6.36)
and setting the transition probability as
EXERCISE
Solution. We denote
h f (v + u) h f (v' - u)
(1)
f{ f (v) , and f~ f (v') .
We define the temperature T = 1/ (kBfJ) , the chemical potential f.t(r, t) Exercise 6.3.
a(r, t)/fJ(r, t) and the average flow velocity u = )../fJ(r, t) and obtain
J d3 vf(v,r,t) = p.
EXERCISE
6.4 From the Entropy Formula for the Bose (Fermi) Gas
to the Classical Entropy Formula
Problem. Show that expression (6.24) for the entropy of a quantum gas
becomes the classical formula for the entropy
where the upper sign holds for bosons and the lower sign for fermions. As we
have already stressed a classical ideal gas is characterized such that the num-
ber of possible states for the particles is large and that the average occupation
number N(k) of each single state remains small, i.e. the particles occupy a
large number of states independently. The effects of quantum statistics, which
for the case of fermions allows up to only one fermion per state but for bosons
allows the average occupation number to become arbitrary large, do not play
a role in the classical limit. We denote gases for which the average occupa-
tion number N(k) « 1 remains small as ''far from degeneracy". Since then
N (k) « 1 , we get 1 ± N (k) ~ 1 , and the expression for the entropy of the
Bose (Fermi) gas becomes
EXERCISE
Problem. Use the classical expression for the entropy derived in the previous
problem to prove the H theorem (pronounced 'eta theorem', because H is
meant to represent the greak letter eta).
)]
Boltzmann distribution
1 1
No = A-I (6.39a)
e XO - 1 '
where the normalization constant A has been written as
(6.39b)
Rearranging (6.39a) we get:
e
~xu
= -No
-. (6.40)
No + 1
Since No cannot become negative and since (6.40) approaches 1 for large No,
we deduce that
(6.41)
The value for the normalization constant xo is determined from the require-
ment to conserve the particle number:
N - '" 1 (6.42)
- ~ eo exp (Ek/kBT) - 1.
The ratio between the total number of particles and the number of particles
No in the state with E = 0 is given by
No 1 ( 1 ) (6.43)
N = N e XO -1 .
If e Xu > 1 it holds that No/N < 1; if, on the other hand, exa ~ 1 we get
(No/N) --7 1. Hence, the state with E = 0 is strongly populated only for
e xa ~ 1. Particles in such a state are called a condensate. In order to analyse
the expression for N further, we separate the state with E = 0 from (6.42)
and change the sum over k into an integral:
N -
- eo
1
---
-1
+ -1 (sm)3/2
4
-2
!i
7r £ 3
1
0
00
El/2
dE -----::-=-:-:---=:---
exoexp(E/kBT) -1 '
(6.44)
with
F' (6.46)
and
x = E/kBT.
We expand the integral:
F' =
(6.47)
176 6. Nonrelativistic Quantum Field Theory of Interacting Particles
(6.48)
then
F'
(6.49)
Hence, we obtain
(6.50)
or
Nh3
e-xo + 2- 3 / 2 e-2xo + 3- 3 / 2 e- 3xo + ... = (6.51)
(27rmkBT)3/2 £3 '
if e Xo is not close to 1. We can rewrite (6.51) as
T_~[_l
- 27rmkB L3F
(N __
l-1 )]2/3 .
e XO
(6.52)
(6.54)
N (6.57)
or
6.2 Nearly Ideal, Degenerate Bose~Einstein Gases 177
1
(6.58)
Xo
Since Xo > 1 this region corresponds to T < To. We arrive at the result that
a critical temperature
To - - -
2
/i - - - - (P) 2/3
(6.59)
- 27rmkB 2.612
exists, which depends on the density p = N / L3 of the Bose~Einstein gas;
beneath this critical temperature nearly all particles are in the state with
energy E = O. This phenomenon is known as Bose~Einstein condensation. It
should be clear that for T = 0 all particles occupy the state with E = O.
Only one fluid occurs in nature for which this quantum effect becomes
observable macroscopically: liquid helium, 4He. We insert its density into re-
lation (6.59) and obtain the critical temperature To = 3.13 K. Below this
temperature 4He represents a macroscopic Bose-Einstein condensate. 2
After these introductory statistical considerations we now focus on the
quantum features of the Bose~Einstein gas. First we consider the noninter-
acting ground state of a Bose gas with N particles
l<po(N)) = IN, 0, 0, .. ·) , (6.60)
i.e. all N particles occupy the state with E = O. In the following, we will
consider a box of volume V = L3 and periodic boundary conditions. The
creation and annihilation operators for the state with E = k = 0 are denoted
as o'b and ao. The usual relations
(N)
V
1/2
Ipo(N -1)) ,
(6.64)
( ----v-
N+l)1/2
Ipo(N + 1)) .
We are interested in the behavior of a gas, i.e. in large particle numbers and
volumes. Hence, we consider the so-called thermodynamic limit: N --+ 00,
V --+ 00, but constant density p = N jV --+ const. Then bo and bb multiply
the ground state with a constant whereas the commutator becomes
A At
[b o, bol --+ 0 (6.65)
in the thermodynamic limit, i.e. we obtain the classical limit for which bo and
bb are ordinary C numbers. 3 For the case of photons (see Chap. 1) we have
already seen that a field for which the quanta are particles behaves classically
once a large number of particles occupy the same state.
So far we have considered a non interacting ground state; now we inves-
tigate a system of bosons at T = 0, which is slightly nonideal. We keep the
interaction term from (6.11) but, for simplicity, we replace v(q) by the factor
gj2V:
H = ~ to. At A 9 ~ At At A A ;:
~ItWkakak + 2V (6.66)
A
~ aklak2ak3ak4ukl+k2,k3+k4'
k klk2k3k4
+ ~'
~
(4 At A At A + AtAt
akakaoaO
AA
aka_kaOao
AtAt A A ) ]
+ aoaoaka-k ; (6.67)
k
here 2:' signals that terms with k = 0 have to be omitted from the summation.
Also, we have kept only terms up to second order in ak and and have al
neglected fourth-order terms; momentum conservation means that terms with
an odd number of these operators do not exist.
As a further approximation we replace ao and ab in (6.61) and (6.67) by
C numbers:
(6.68)
where No represents the number of particles in the state with k = O. It follows
that
~
9
2V (J\T2 +
1Vo
27\T
1'0 L;:
~'(AtA At A )
akak + a_ka_k
"T~'(AtAt
+ 1'0 AA))
~ aka_ k + aka-k . (6.69)
k
3 The phrase C numbers means "commuting" numbers, i.e. ordinary real or complex
numbers.
6.2 Nearly Ideal, Degenerate Bose-Einstein Gases 179
fI = ~Vgl+~L'[(EZ+pg)(atak+a~ka_k)
k
(6.71)
(6.73)
H, =
1
2Vgp 2
+"
~ (Ek + pg)Vk + pgukVk
,'[ 0 2 ]
where Uk and Vk are related via (6.74); however, the corresponding ratio may
be chosen freely. In order to make fI diagonal in At, Ak we use this freedom
to eliminate the last line in (6.75); we require that
pg(uk + v~) + 2UkVk(EZ + pg) = o. (6.76)
4 For the following, see also Sect. 7.1, where we follow the same steps in a slightly
different way.
180 6. Nonrelativistic Quantum Field Theory of Interacting Particles
We choose
Uk = cosh 'Pk , Vk = - sinh 'Pk (6.77)
such that (6.74) is fulfilled; then (6.76) becomes
2UkVk _ pg
(6.78)
u~ + v~ - EZ + pg
or
2 sinh 'Pk cosh 'Pk sinh 2'Pk pg
(6.79)
sinh2 'Pk + cosh 'Pk
2 cosh 2'Pk + pg . EZ
Use of the known trigonometric relations then yields
pg
tanh 2'Pk = EO . (6.80)
k + pg
Since -1 ::; tanh 2'Pk ::; 1 this last equation can be fulfilled for all k only once
9 > 0, i.e. the pseudopotential has to be repulsive! It follows that
v~ = u% - 1 = ~ [E;1(E2 + pg) - 1] , (6.81)
where we have defined
Ek = [(E2 + pg)2 + (pg)2] 1/2 . (6.82)
With the use of (6.74) to (6.82) we are able to write the Hamiltonian (6.75) in
its final form:
fl = ~Vgp2+~~'[Ek-(E2+pg)]
1~' At' At'
+"2 L Ek(Ak Ak + A_kA_k) . (6.83)
k
Here E2 stands for the free particle energies, which are obtained from (6.82)
if the interaction 9 = 0, Le. E2 = (hk)2/2m. The operator AlAk resembles
a particle-number operator and has eigenvalues 0,1,2, .... Hence, the ground
state of fl is determined by the requirement that
for all k i- o. (6.84)
Be aware that 10) represents a complicated combination of unperturbed eigen-
functions since neither ak nor al
annihilate the groundstate 10). Because of
(6.84) we obtain for the ground-state energy
E =
,
(OIHIO) = "2 Vgp
1 2
'7' (Ek -
+"21~' 0
Ek - pg). (6.85)
We realize that (6.86a) and (6.86b) describe a system of quanta with mo-
mentum fik. The operators ih, At
annihilate and create those quanta we have
called quasiparticles. The quantity Cs in (6.86b) represents a velocity, which we
can interprete as the speed of sound in the nearly ideal degenerate Bose gas.
The long-wave excitations (quasiparticles) corresponding thereto are called
phonons. In the limit of large momenta (6.86a) becomes lik
fi 2 k 2
Ek ~-- (6.87) Fig. 6.2. Quasiparticle
2m ' energies. For low momen-
i.e. for large momenta the quasiparticles do not interact with each other. tum the energy grows lin-
Hence, we obtain the qualitative result presented in Fig. 6.2. early with k, for large
momentum quadratically.
In this section we have established that particles of an interacting gas
These two domains cor-
can be treated as a gas consisting of noninteracting quasiparticles. Phonons respond to the phonon
represent a first example for quasiparticles. Later on we will encounter further and particle sectors, re-
examples. spectively
EXAMPLE
J W(E) dE = 1. (3)
Example 6.6. The number of quantum states that correspond to the energy interval ilE is
given by
-1
2n1i
f pdx = n+-1
2
for a particle with one degree of freedom. The integral f p dx represents the
area confined by the closed classical phase trajectory of the particle, i.e. the
trajectory in the p-x plane (phase space) of the particle. By dividing this area
into cells of area 27rn each we obtain n cells in total. On the other hand, n
represents the number of quantum-mechanical states with energies that are
not larger than the corresponding value of the phase trajectory considered.
We realize that in the quasiclassical limit every quantum-mechanical state
corresponds to a phase-space cell with area 27rn. Put into other words, the
number of states in the volume element ilp ilx of phase space is given by
ilpilx
27rn .
If we consider s degrees of freedom this number turns out to be
ilPl ilXl ilP2 ilX2 ilps ilxs
27rn 27rn 27rn
i.e.
ilps ... ilps ilXl ... ilxs
(27rn)s
in total. We introduce the abbreviations ilp ilPl ... ilps and ilq
ilXl ... ilxs and obtain
ilpilq
(27rn)s .
This relation yields well known results; for example, for one particle
d3 p
(27rn)3
represents the number of states within the momentum region d3 p and the unit
volume of the configuration space.
6.2 Nearly Ideal, Degenerate Bose-Einstein Gases 183
Since the number of states fj.T always has to be fj.T » 1 we conclude S ::::: O!
In statisitical mechanics it is customary to set
S = kB Infj.T (10)
instead of the dimensionless expression S = In fj.T. Since entropy has to fulfill
the thermodynamic relation T dS = dE the proportionality constant kB has
to have the dimension
[ te;;::!~ure] .
kB is called the Boltzmann constant. With this relation we are able to derive
the entropy for an ideal, a Bose-Einstein, and a Fermi-Dirac gas.
(a) Ideal Gas. We consider a group of N j particles as an independent system.
Its statistical weight is given by fj.Tj , and the statistical weight of all particle
groups is given by
184 6. Nonrelativistic Quantum Field Theory of Interacting Particles
Within the classical limit the average occupation numbers are small compared
to unity, i.e. the number of particles N j is small compared to the number Gj
of states (Nj « G j ); however, N j should still be a large number. Therefore
we assume that the particles are distributed completely independently of each
other over the various states. Once we assign to each of the N j particles one
of the G j states we arrive at (Gj)N; possible distributions. Some of them are
identical because N j ! permutations between the N j undistinguishable particles
are possible without changing the distribution. Hence, the statistical weight
is given by
(12)
and we get
S kB In 1l.T = kB L In 1l.Tj =?
j
Since N j is large we can use Stirling's formula (see Exercise 6.8) and write
InNj!::::O N j In Nj/e, i.e.
""'
S = kB~Njln eGo
NJ (14)
j J
This constant is often dropped. Then the entropy of the ideal gas reads
Combinatorics can be illustrated with playing cards. For most of the hands Example 6.6.
of cards it does not matter in which order the cards are handed out since
afterwards it is possible to rearrange them arbitrarily. Instead of the possible
k-tupel only the subsets consisting of k elements of the given set are of interest.
They are called combinations of order k of n elements without repetition. For
an ideal gas we have seen that the number of permutations of n elements is
given by n!. If only k elements are picked from the n elements then n elements
are available to occupy the first place of the k-tupel; for the second place only
n - 1 elements are available and for place k only n - (k - 1) elements remain.
Hence, the number of variations of order k consisting of n elements without
repetition is given by
n!
n· (n - 1) . (n - 2) ..... [n - (k -1)] = (n _ k)! . (18)
(Example: For six people to take a seat from ten chairs 10!/4! = 151200 pos-
sibilities exist.)
If we further allow the possibility of rearranging k elements, i.e. k permu-
tations, then we obtain exactly
(20)
Again we introduce the average number of particles per state Tij = NjlG j
and it follows that
SF = kB LG j [Tij In Tij + (1 - Tij) In(1 - Tij)] . (22)
j
For the case that the G j is independent of j we can take them out of the
summation and arrive at the entropy relation for the Fermi gas already men-
tioned.
(c) Bose Gas. For this case an arbitrary number of particles can occupy
each quantum state. As a consequence the statistical weight 6..rj is given as
the number of ways to distribute N j particles over G j states; all particles are
allowed to occupy the same state. Using arguments analogously to the case
of fermions we obtain
(Gj + N j - I)! .
6..r, = (23)
< (G j - 1)!Nj ! '
186 6. Nonrelativistic Quantum Field Theory of Interacting Particles
Example 6.6. see Exercise 6.7. We neglect the -1 because it is small compared to the G j
and N j , take the logarithm, and again use Stirling's formula. This results in
or with nj
EXERCISE
···11····1·1··
We numerate the G cells and separate them by G - 1 vertical lines. In the
sketch shown we have: 3 balls in the first cell, 0 in the second, 4 in the third,
1 in the fourth, and 2 in the fifth. The total number of sites that are either
occupied by points or lines is G + N - 1. Hence the number of possibilities is
identical to the number of ways of choosing G - 1 sites for the vertical lines,
i.e. the number of combinations of G - 1 elements out of N + G - 1; thus,
(N + G -I)! (N+G-1)!
M=
(G -I)! [N + G - 1 - (G - l)J! (G -l)!N! .
EXERCISE
EXAMPLE
In this example we will study briefly the connection between entropy and the
information about a system.
First, we have to find a statistical definition of information. We consider
a situation for which Po different successive situations may arise, all with the
same probability. We denote the information as I. For the initial situation
10 = 0 since all processes are equally probable; we have no information about
the system.
We consider two independent processes, which come with probabilities POI
and P02 , respectively; the total probability is then given by
Po = POI P02 · (1)
The information should be additive, i.e.
I = It + 12 . (2)
This property leads to the following, plausible definition of information:
I = KlnP (3)
188 6. Nonrelativistic Quantum Field Theory of Interacting Particles
However, this result is not conclusive; we know that different letters do not
appear with the same frequency within a sentence. For example, the frequency
of the letter "E" is approximately p :::::; 0.105, whereas "Q" appears with only
p:::::; O.OOL
In order to find the correct relation we consider an "alphabet" consisting
of only two "letters": 0 and 1. In total we have G sites: No sites have a "0"
and NI sites have a "1".
It holds that No + NI = G and the probabilities that one of the G sites
contains a "0" or "1" are
No
Po = G and PI (7)
6.2 Nearly Ideal, Degenerate Bose-Einstein Gases 189
I~-KG
NoI nNo
(- NlI nNl)
-+- - .
G G G G
With (7) we obtain the information per "letter" of the message:
I
G ~ -K(po lnpo + pdnpI).
This generalized to M different symbols 0,1, ... is called Shannon's formula:
I = -KGl.:>jlnpj. (12)
j=1
Example 6.9. It is this bound information which is connected to the physically relevant
situations and which we take to be related to entropy. We identify the possible
cases with the complexions; then it holds that
(3) The friend is partly deaf and does not understand some words; thus, the Example 6.9.
bound information gets lost.
(4) After some time the original person forgets the information and loses the
free information since it was only present within thoughts (brain).
EXAMPLE
Example 6.10. Since the transformation process is at best complete, the demon will also
allow some "wrong" molecules to pass from time to time and, thus, it holds
again that
6..8 :::: 0, because 6..N ::; 0,
Le. negentropy N gets lost and entropy is produced; as a consequence the
second law of thermodynamics is not violated!
7. Superfiuidity
P/atm
50
30
He II
10 Fi~. 7.1. Phase diagram
of He at low temperature
o 6 8 9 10 TIK
To
Pn/P the flow is curl free, Le. curl Vs = 0, where Vs represents the velocity of the
superfluid component. Its density is Ps. The second component with density
Pn and velocity Vn is represented by a "normal" fluid, which has viscosity.
This picture from Landau has been confirmed experimentally. 3
The presence of two components in He II allows two different excitation
T modes, which differ in the relative orientation between Vs and Vn: Once Vs
and Vn are parallel a perturbation in He II leads to a change in density and
Fig. 7.2. The ratio of nor- pressure (for constant T and constant entropy); it is denoted as the first (or
mal fluid density pn to to- ordinary) sound.
tal density P = pn + p. If Vn and Vs point to opposite directions a wave is obtained, leading to
as a function of tempera- periodic changes in Psi Pn. This ratio depends on temperature and P = Ps + Pn
ture (qualitatively). P. in-
is constant (also the pressure is constant), so we obtain a temperature wave,
dicates the density of the
superfluid phase the so-called second sound. A detailed investigation of this problem shows that
the corresponding speed of sound is proportional to (PsIPn)1/2. The second
sound was observed for the first time by Peshkov. 4
H(ri) represents the operator for the kinetic energy of the ith He atom and
V(lri - rjl) describes the interaction between the ith and jth atoms. For
a complete and orthogonal set of eigenfunctions we choose the solutions of
H(ri); these are given by plane waves
'Pk(r) = me'
1 'k
·r (7.2)
which are normalized with the conventional periodic boundary conditions for
a large volume £3 [compare for example with (1.23)]. Hence, it holds that
3 E.L. Andronikashvili: J. Phys. (Moscow) 10 (1946) 20l.
4 V.P. Peshkov: J. Phys. (Moscow) 10 (1946) 389.
5 P.L. Kapitza: Nature 141 (1938) 74.
6 N.N. Bogoliubov: J. Phys. (Moscow) 9 (1947) 23.
7.1 Basics of a Microscopic Theory of Superfiuidity 195
(7.3)
2m
According to Chaps. 3 and 6 [(6.7) to (6.13) in particular]' the total Hamilto-
nian can be rewritten in second quantization:
with
V(q) J d 3 r V(lrl) e+ iq . r
J
-271' drr 2 V(r) 1- 1
dxe- iqrx
J
271' dr r2 V(r) eiqr ~ e- iqr
lqr
IT
ak
no k '
yInObob
(7.14)
At 1 At A
ak yInObkb o
no
for k '" o. They represent boson-boson-hole pairs. Equation (7.10) then be-
comes
_1_N2V(0) + ",' (lik)2 ata
2£3 L...J 2m k k
k
k
+ 1 At A no + 1
' " At A no
L...Jbkbk-- - bobo- -
k no no
IV (no + 1_
no (no + 1))
no no
A) no
(N -no - -
A
+ 1
no
~ N - no -> 0 for L -> 00 , (7.16)
where (7.14) has been used. It follows that a~ak '" I1N 2 so
",'At A ",' 1 N 1
L...J akak '" L...J N2 '" N2 = N -> 0.
k k
Thus, in the limit N -> 00 we are allowed to neglect the term iI' in (7.15),
which contains four operators of a and at. We focus on the interesting part,
Le.
(7.17)
198 7. Superfiuidity
No doubt the interaction matrix elements V(k) are small, but perturbation
theory is not allowed to run the show. Infinitely many terms exist in the
interaction sum of (7.17) for pairs k and -k once Ikl ---. O. In other words,
scattering out of the condensate with one-particle states k = 0 into infinitely
many states k, -k might take place which are energetically all close to Ikl = 0;
a perturbation calculation then becomes impractical.
Bogoliubov suggested the following approach: Take the Hamiltonian fIo
and diagonalize it with a canonical transformation in the boson operators:
u(k)Ak + v(k)A~k'
(7.18)
u(k)At + v(k)A_k .
This is exactly the same as in (6.72). We will repeat now some of the steps
presented in Sect. 6 in more detail and with a different stress. u(k) and v(k)
represent real functions with the property
u( -k) u(k) ,
(7.19)
v( -k) v(k) .
This guarantees that no asymmetry occurs between the bosons at and a~k'
or lh and A~k' respectively. Also, (7.18) can now easily be transformed into
U(k)ak - V(k)a~k
u 2 (k) - v 2 (k)
(7.20)
A'tk --
u(k)at - v(k)Lk
u 2 (k) - v 2 (k)
This suggests the normalization
u 2 (k) - v 2 (k) = 1. (7.21)
The latter follows from the requirement that the Ak should again fulfill the
boson commutation relations
[Ak,A
, 't]k, _ 8kk , ,
['t
Ak,A't]
k, _ O.
Of course, these relations must also hold for ak, at. This implies
For the moment the values of the coefficients Dk remain unknown. Inserting
(7.24) into (7.18) we deduce
1 ' 't
~(Ak+DkA_k) ,
(7.25)
1 't '
~(Ak+DkA_k) .
1 ~ Dk
Using this result for ih in (7.17), we finally express iII in terms of the new
boson operators Ak , At. This calculation will be presented in detail in Exercise
7.1. Here, we state the outcome:
where
~: V(O) + ~ L k
[E(k) ~ C~~2 + ~~V(k))] (7.27)
and
EXERCISE
The prime (') at the sum indicates that the terms with k = 0 have to be
omitted from the summation. We introduce new boson operators rh, A! [see
(7.18) and (7.24)]:
u(k)Ak + v(k)A~k ,
(2)
aAtk u(k)A! + v(k)A_k ,
where
1
u(k) v(k) (3)
)1- D~ , )1- D~ .
Inserting (2) into (1) gives
N2
iI = 2L3 V(O)
+ 2~ V(k)u(k)v(k) }AlAk
with
N2
+ 22:
l' ( (fik)2 no ) (10)
2£3 V(O) £(k) - 2m - £3 V(k)
k
and
E(k) (11)
Ho =
(14)
or
N2
Ho 2£3 V(O) (15)
D~,
+ "7:
"'"" [( (fik')2 no V(k')) no V(k') Dk' ]
2m + £3 1- D~, + £3 1 - D~,
202 7. Superfluidity
Exercise 7.1. From the requirement that the variation of Ho with respect to Dk should
vanish, i.e.
8Ho
0, (16)
8Dk
the condition for Dk is obtained:
o. (17)
The operators A1 , jh from (7.26) and (7.22) describe new boson oper-
ators. According to (7.26) the excitations of the system can be described as
single-particle excitations
't (7.29)
Akl¢o) ,
where I¢o) characterizes the ground state with
jhl¢o) = o. (7.30)
These new one-particle states (7.29) are called quasiparticles. According to
(7.20) they consist of a linear combination of creation and annihilation op-
erators a~k and ak of the customary boson operators. The latter themselves
represent particle-hole pairs, as is clearly expressed in the defining equations
(7.14). As a result of the transition to quasiparticles, an important part of
the interaction between the original boson particles (described by bl,h) has
already been taken care of, i.e. "diagonalized away". Hence, quasi particles can
be viewed as freely moving structures consisting of a superposition of ordi-
nary particles and holes. An illustrative example of a quasiparticle is given,
for example, by the motion of a charged particle in a strong electrolyte, which
consists of positive and negative charge carriers.
The particle ED, which is here positively charged, attracts negative charge
carriers around it (Debye cloud) and is thus screened (see Fig. 7.4). Out of
the originally charged particle a neutral quasiparticle has developed: A part of
+ + the Coulomb interaction with the electrolytic medium has been "transformed
away" with the Debye cloud. It is evident that a quasiparticle, evolved in this
+ + =
~ + + + --
- --+ Q +=--
+-- + way, possesses a different mass as compared to the original particle, which
in general also depends on the velocity. This is in complete analology with
-_ + \::J + - the nontrivial k dependence of E(k) from (7.28a). We refer to and already
+ --=--+ + + --==-+ recommend Exercise 9.1, where the screening potential of the Debye cloud is
+ + calculated.
The part flo is constant and independent of the quasiparticle operators
Fig. 7.4. Illustration of a
quasiparticle by a Debye
Al,Ak [see (7.27)]. Obviously, it represents the energy of the ground state of
cloud around a charged the system; we have shown this already in Exercise 7.1.
particle ED in a strong elec- The operator P of the total momentum of the system can be expressed in
trolyte terms of quasiparticle operators, too. With (7.14) and (7.18) we obtain
7.1 Basics of a Microscopic Theory of Superfluidity 203
P Lhkblbk
k
Lhkala k
k
The second term inside the curly bracket vanishes as it is summed over k,
because positive and negative k contributions cancel pairwise. With the nor-
malization (7.21) we get
P = LhkAlAk. (7.32)
k
Thus, it now becomes clear that the low-lying excitations of the system con-
taining Helium atoms at low temperatures are given by elementary (quasipar-
ticle) excitations of the energy Ek (which we now write also E(k) to indicate
its k dependence more clearly) and the momentum (hk). For small excitations
no ~ N, E(k) from (7.28a) can be rewritten as
Ek = E(k) =
(hk)4 + N(nk)2 V(k) . (7.33)
4m 2 mL3
For small momenta this becomes
E(k) =
~
- 3 V(O) nlkl (1 + ... ) .
The veloci yrghhe quasiparticles is given by
(7.34)
Cs
= (fJE)
fJnk k=O
= J NV(O) =
mL3
J V(O)
Pm'
(7.35)
V(O) = lim
Iql-.O
J d 3 r V(lrl) e iq .r
where (7.6) has been used. This implies that the interaction energy V(lr I) has
to be essentially positive, i.e. repulsive.
Now we will look at the quasiparticle energies E(k) from (7.33) for large
momenta. For this purpose we write (lik)2/2m in front of the square root and
arrive at
N4m
E(k) 1 + (lik)2£3 V(k)
(lik)2 ( 1 N4m )
:;::j 2m 1 + "2 (lik)2£3 V(k) + ...
(lik)2 NV(k)
:;::j 2m +-0. (7.39)
As k increases we get
lim V(k) -+ 0 (7.40)
k->oo
according to (7.6). Then the quasiparticle energy (7.39) for larger momenta
becomes
E(hk)
at zero temperature this is not the case. The initial momentum of the cluster
shall be tiq, the momentum of the excitations tik. In a scattering event of the
cluster with the fluid, energy and momentum are conserved.
tiq' + tik,
(7.42)
ti2(q')2 + E(k) .
2m
Here tiq' denotes the momentum of the cluster after the scattering. The ele-
mentary excitations of the fluid (quasiparticles) E(k) are given by (7.39). The
equations (7.42) cannot be satisfied. To see this let us follow up on energy
conservation, which reads
ti 2q2 ti 2
2m = 2m (q - k)2 + E(k) (7.43)
or
ti2 ti 2
0= -mq·k+2mk+E(k), (7.44)
and it follows for the angle 0: between q and k that
coso: = Ek/tik + tik/2m = S: + tik/2m , (7.45)
v v v v
where v = tiq/m is the initial velocity of the cluster. We learned that the
low-energy excitations (quasi particles) of the fluid are phonons, for which
Ek/tik > es . Hence for the excitation (emission) of a phonon the cluster
velocity v must be larger than es , i.e. v > es . This clearly follows from (7.45)
for k -+ 0, otherwise the angle 0: becomes imaginary. We can state that a
cluster moving with velocity v < Vcrit (where Vcrit = es in this case) cannot
lose energy to the fluid; thus there is no friction and one has superfluidity.
It does not matter, of course, in which coordinate system the analysis is
carried out. Here we have choosen the fluid at rest and the cluster moving in
the fluid. The same holds if the cluster is at rest and the fluid moves along the
cluster. Our arguments also hold if the cluster is a dislocation of the surface
surrounding the fluid (rough surface).
For liquid Helium the critical velocity is vcr it « eg • The relation between
momentum and energy is believed to appear as indicated in Fig. 7.5, where the
critical velocity is given by the slope of the dashed line. Also indicated are the
rotons and curls of high momentum; the latter are believed to be responsible
for the critical velocity.
Only if this condition is fulfilled can elementary excitations take place, i.e.
quasiparticles can be excited, and the liquid slows down. Then it possesses
viscosity because energy is lost for the quasiparticle excitations.
Condition (7.48) has to be fulfilled at least for some values of p. As already
outlined before, this implies that elementary excitations can occur only once
v is larger than the minimum of E(p)/p. We are thus led to a critical velocity:
v > (E(P))
p
. mm
= Vcrit (7.49)
3 4
Po
otherwise superfluidity would be destroyed because the fluid would lose en-
ergy as a result of the excitation of quasiparticles. dE / dp is the velocity of an
elementary excitation. For every Bose liquid this required condition is fulfilled
for one point, i.e. for p = O. Near p = 0 the elementary excitations move with
the sound velocity. These are the phonons with E(p) = csp! The condition for
superfluidity is not fulfilled for sure once the velocity of the liquid is v > cs .
For the case of Helium, however, more than one point exists that fulfills this
condition. For small p the experimental spectrum E(p) (see Fig. 7.6) shows
the expected behavior for phonons, i.e. E(p) = CsP = csnk. However, another
minimum exists at Po. In thermal equilibrium most of the elementary exci-
tations have energies very close to the minimum of E(p), i.e. for p = 0 and
p =Po!
In such a case, we may expand around Po (approximation with a parabola)
,
and obtain
where .1 = E(po) and 1-£ are constants characterizing the minimum. Quasi-
particles that obey this dispersion relation are called Tatons. For 4He these
parameters are
L1 To;::, 8.5K,
ko = Po/n (7.52)
1-£ 0.16m(4He) .
The second point [i.e. second solution of (7.50)] we were looking for lies right
of the minimum for rotons at Po as can be identified from Fig. 7.6. With (7.51)
we calculate the critical velocity Vcrit from (7.49). It follows from condition
(7.50), which now reads
p - Po kBL1 + (p - po)2/21-£
1-£ L1
It has the solution
p = V2I-£kBL1 + P5
and, hence,
208 7. Superfluidity
or, once we consider the experimental values (7.52) for 4He (they show that
P5 » 2J-lL1),
kBL1
v < --.
Po
We can express this in another way: There is no superfluid flow in 4He if the
flow velocity is v > k BL1/po = kBTo/Po! Remember that L1 = To is measured
as a temperature - see (7.52). Consider now the Bose liquid for T « To but
T =f O. In this case excitations may be present. Think of a "quasiparticle gas"
that moves with v relative to the liquid. This quasiparticle gas can be at rest
relative to the wall, the latter moving with -v with respect to the liquid: it
crawls, so to speak, out of the vessel walls because of the temperature, which
causes the quasiparticle excitations. The distribution function of the gas at
rest is denoted as p(E); p(E - p. v) represents the distribution of the moving
gas. Hence, the total momentum of the system is given by
For small velocities we expand with respect to p. v. The first term is linear
in p. Because of the isotropic orientation of p, this term vanishes with an
integration over all directions. The second term is
p = -(211")3
1 J 3
d PP(P,v)fJE'
fJp (7.55)
p = -~411" t
dpp 4dp (E) . JO
(7.56)
3 dE 10
For phonons the relation E = CsP holds. Via partial integration we obtain
3C-s 0
dpp 3 p(p) . (7.57)
The integral
411" 1 00
dpcs pp(p)p2 = Jd 3 pE p(E)
gives the energy Eph of a unit volume of the phonon gas. We arrive at
P = v4Eph
-- = vm*. (7.58 )
3c~
7.2 Landau's Theory of Superfiuidity 209
Obviously we can interpret this in the following way. The phonon gas has
an effective mass m *. Like all the liquid, this part of it also represents mass
transport. The excitations are allowed to collide with the walls and to ex-
change momentum; this leads to viscosity. Evidently, viscous flow can occur
in a Bose gas that contains elementary excitations and does not violate the
condition for the existence of superfluidity. The total transported mass m* is
not identical to the total mass of the liquid. We obtain the following picture.
For T = 0 no excitations exist, i.e. the motion is superfluid. For T 1= 0 ele-
mentary excitations exist in the system, which cause a part of the gas (mass
m*) to be viscous and converts it into a "normal" liquid. A superfluid rest
remains, so that
P = Pn + Ps·
This is the two-fluid picture we mentioned earlier.
The portions Pn and Ps depend on temperature: For T = 0 we have Pn = 0
and Ps = P, for 0 < T < To we expect Ps 1= 0 and Pn 1= 0, and for T > To
we expect Ps = 0 and Pn = P (compare again with Fig. 7.2). Notice that two
different kinds of elementary excitations "contribute" to the normal fluid:
phonons with E ~ p and rotons with E ~ p2. Their relevance depends on
temperature. For T ;::;:; 0.6 K the contributions from phonons and rotons are
comparable whereas for larger temperatures the rotons dominate. However,
except for the dispersion relation, no difference exists between phonons and
rotons.
EXERCISE
Solution. The critical velocity Vcrit (;::;:; 23 cm/ s) represents the velocity of
surface waves on the water. If v > Vcrit the object produces waves in such a
way that the latter run away from the object, interfere according to Huygen's
principle, and form a cone (with a fixed opening angle;::;:; 20°). In contrast
to the Mach cone phenomenon appearing at supersonic flight velocities, this
angle does not depend on the shape of the object, the velocity or any other
exterior factor. It is a consequence only of the dispersion relation for water
waves. In the case of deep water, where the depth of the water is much greater
than the wavelength>. of the surface wave, the dispersion relation of the wave
reads
9>' (1)
2n
210 7. Superfluidity
Exercise 7.2. c denotes the speed of propagation and 9 = 9.8 m/s 2, the gravitational accel-
eration. Thus, the angular velocity w = 27fc/ A is
w = g/c. (2)
Now, consider a radially symmetric surface wave centered around r = 0,
¢(r, t) =
27fr
T - wt 27fr
= T - t T'
)27f9 (4)
Employing r' = Vr2 + v2t2 - 2rvt cos 1j;, the condition d¢/dt = 0 (stationary
phase) leads to
t = ~;(cos1j;±Vcos21j;-8/9). (8)
For cos 2 1j; < 8/9, i.e. 11j;1 > 19.5°, no real solution exists. In this case the phase
has no stationary point, i.e. all phases contribute equally, and destructive in-
terference leads to a vanishing amplitude outside this cone. If we consider this
problem quantum mechanically, we could say that the moving object emitts
quasiparticles (we could call them "hydrons" f and in this way could obtain
7 J.L. Synge: Science 138 (1962) 13.
7.2 Landau's Theory of Superfluidity 211
momentum and energy as far as v > vcrit. We have the same situation as in the v < Vcrit
The validity of this relation becomes instantly clear once the representation
PF
of the ground state is inserted. The Fermi momentum is fixed by the condition
that the average particle-number density n of the system is constant:
2
VLl. (8.3)
s,P Ipl~PF
n = 2
j PF d3p
(27r)3 =
p~
37r 2 . (8.4)
¢s(r) j (27r)3
d3p
e
ip.r jj ( ).
s P , (8.7)
we obtain
ps(r, r') (<I>olps(r, r')I<I>o)
j (27r)3 (27rp
d 3P d3 PI
e
-ip.r ip' .r'
e
At A I
(<I>olbs(p)bs(p )1<1>0) . (8.8)
ps(r, r') =
(8.10)
II '(Ir-rl) =
, {O for Ir - r'l --+ 0
(8.12b)
ss 1 lor
C Ir - r 'I --+ 00 ' s=s'.
This represents the so-called Pauli blocking for small distances; for large dis-
tances the correlations vanish, too. We will now determine the probability
amplitude of finding a particle at place r' given another one at place r at
the same time. A mathematical formulation of this problem is provided in the
following way: With the help of the field operators -0s(r) and -0,,(r') a particle
with spin s at r and another one with spin s' at r' are annihilated from the
(N-particle) ground state I¢o). The amplitude of transition into an (N-2)-
particle state 1¢;;-2) is given via (¢;;-21-0s,(r')-0s(r)I¢0). After a summation
over all possible final states we arrive at the total probability amplitude for
the annihilation of two particles:
(¢o 1-01 (r)-0!, (r')-0s' (r')-0s (r) I¢o) (~) 2 IIss' (Ir - r'l) ,
(8.13a)
216 8. Pair Correlations Among Fermions and Bosons
where use has been made of the completeness relation for two-particle states.
From the form of the probability amplitude (8.13a) it becomes obvious that
an equivalent formulation of the problem is provided by first annihilating a
particle with spin 8 at position r and then calculating the expectation value
with the remaining (N-1)-particle state I4>N-1) = -0.(r) 14>0) to find a particle
with spin 8' at position r':
(¢F -11-0!, (r')-0s' (r') I4>N -1)
= (4)01-0! (r)-0!,(r')-0s' (r')-0s(r)l4>o) . (8.13b)
For the further manipulation of the amplitude (8.13b) we insert an expansion
into momentum states (8.7) for each field operator and obtain
[h 8 (p),h 8 '(P')L = 0,
(8.15)
(2Jr)3 888 , 8(p - p') .
The factor (2Jr)3 on the right-hand side of the last equation is a normaliza-
tion factor which corresponds to the dp3/ (2Jr)3 volume element in momentum
space. Indeed, taking the vacuum expectation value and integrating over mo-
menta yields
(8.16)
(8.17)
(8.18)
(8.19)
(8.20)
(8.21)
(8.22)
8.1 Pair-Correlation Function for Fermions 217
We transform further:
(¢o Ih! (p )h!, (q )h8' (q')b 8(p') I¢o)
-(27r)3588 ,5(p' - q)(¢0Ih!(p)h 8,(q')1¢0)
+ (¢olh! (p )h8 (p')h!, (q)h8' (q') I¢o)
(27r)65(p - p')5(q - q')
x (¢olb! (p )b8(p') I¢o) (¢olh!, (q)b 8, (q') I¢o)
- (27r)65 88 ,5(p' - q)5(q' - p) (¢olh!(p)bs(p)l¢o) .
Inside the integration domain the remaining matrix elements are all equal to
unity. Inserting (S.14) provides
2 jPF d 3p jqF d3q
(
~) 1I88 ,(lr-r'l) = (27r)3 1 (27r)3 1
IIss
1
We realize that the probability of finding two fermions with parallel spins
for relative distances smaller than the inverse Fermi momentum is drastically
reduced. The Pauli principle induces long-range correlations for the motion of
fermions with equal spin; they imply that these fermions cannot approach each
other too closely. This effective repulsion results explicitly from the exchange
symmetry of the many-particle wavefunction and not from the real, external
interactions. This observation is formally expressed in the following rewriting
of the amplitude (S.13b):
218 8. Pair Correlations Among Fermions and Bosons
The first term represents the direct term, whereas the second one stands for
the exchange term. The relative minus sign reflects the antisymmetry.
Again, we introduce an expansion into free momentum states for the field
operators, this time for the boson operators:
'( )- J
'P r -
3
d p e ip·r'a ( p ) .
(27rp (8.28)
The latter is only nonvanishing if p = p' and q = q', or if p = q' and q = p'.
Both cases are identical once p = q. Hence, we can write the following:
8.2 Pair-Correlation Function for Bosons 219
(8.31 )
n2 + If d3p
(21l')3
eip.(r-r') n(p)1
2
- f d3p n(p) (n(p)
(21l')3
+ 1) .
Let's inspect this result closer. In two ways it is different from the result
obtained for fermions. First, the sign of the second term is positive which is a
direct consequence of the exchange symmetry of bosons. Second, an additional
term shows up which is a result of the fact that many bosons are allowed
to occupy the same state. Furthermore we notice that this latter term is
independent of the positions and, according to dimensions, is smaller than
the first two terms by an order l/V; this will be rectified in Exercise 8.2.
For a homogenous Bose gas that is extended to infinity, the pair correlation
function II (r - r') is defined via
220 8. Pair Correlations Among Fermions and Bosons
n2 + If d 3p eip.(r-r') n(p)1
2
(27r)3
The first term appearing in this equation emerges from the second one for the
case r = r'. In other words: the exchange symmetry causes bosons to cluster.
According to (8.32) the probability for finding two particles in the same space
point at the same time doubles. 1 The following example provides a further
illustration of this point.
EXAMPLE
(2)
J(R) =
(3)
with
F(R) = LX> dk ke-{3k /2 sin (kR) .
2
(4)
1 Such effects are of some importance in e.g. high-energy heavy-ion collisions, where
in one encounter up to 10000 pions (bosons!) are produced.
2 1.M. Ryzhik, A. Seffrey, 1.S. Gradshteyn: Table of Integrals, Series, and Products,
5th ed. (Academic Press, New York 1994).
8.2 Pair-Correlation Function for Bosons 221
in = 2{3
we find the solution via a separation of variables:
rH(R)
iH(O)
dH'
H'
_~
(3
r dR' R' '
io
R
H(R) ~ -R 2/2{3
= V (9)
2{33 e .
An insertion into (6) yields F(R) and according to (3) we get the result for
the wanted Fourier integral:
In total we obtain
n 2 II(r - r') = n 2 (1 + e- lr - r '1 2 /(3) . (13)
A qualitative sketch of the pair-correlation function is illustrated in Fig. 8.2.
n
2
bosons
Ir- r'l
222 8. Pair Correlations Among Fermions and Bosons
Example S.l. Obviously, a significant increase can be observed for the probability of
finding two bosons at small relative distances at the same time. In particular,
the probability of finding a pair at the same position is twice as large as the
asymptotic value for large relative distances.
EXERCISE
Solution. We indicate again the derivation of (8.31), but this time we use an
expansion of the field operators into discrete momentum states, i.e.
L evV
ip·r
<j;( r) = ITT o'k . (1)
p
n2 + 1~
V 'L...
" e-ip.(r-r')nP 12 - ~
V2 'L...
" n
p
(n
p
+ 1) . (4)
p p
The light intensities, which are a measure for the average photon numbers,
are measured: J 1(t) in detector D1 at time t and J 2(t + T) in detector D2 at a
later time t + T. The difference in time T defines the relative distance r = cT
between two points in the primary beam. The product of the two intensities is
averaged with respect to t over the observation window T. This measurement
prescription is equivalent to the measurement of two photons at two different
points of the beam with a relative distance cT. For fixed T the observed,
correlated average value
shows the same characteristic trend as a function of T as has been derived for
the pair-correlation function II(r) in the previous example.
This experiment appears to reveal a typical quantum effect in Bose gases.
It is quite remarkable that it can also be explained completely within the
tools of classical electrodynamics. This should not be too surprising; we re-
member that a light beam (macroscopic electromagnetic field) contains very
many photons and, thus, also contains many photons with the same momen-
tum. An analysis of correlation functions in terms of Glauber states, which
we have briefly introduced in Chap. 1 in order to calculate, for example, the
classical value of the E field, would lead to classical quantities such as the
3 R. Hanbury-Brown, R.Q. Twiss: Nature, 177 (1956) 27; 178 (1956) 1447.
224 8. Pair Correlations Among Fermions and Bosons
Q
Fig. 8.4. Oversimplified
setup of the Hanbury- D2
Brown and Twiss experi- r'2
ment
For simplicity we assume that the source of the light beam consists of two
closely adjacent emitters Ql and Q2; see Fig. 8.4. Furthermore, Ql is required
to emit coherent light with a wave-number vector k1 and an amplitude a1; Q2
emits coherent light with k2 and a2. The light of the two emitters comes with
the same polarization; the relative phases of the amplitudes vary randomly
between 0 and 1r. From Ql, light with the amplitude a1 e ik1 ·rl reaches the
detector D 1; here r 1 represents the distance vector from Ql to D 1. In the
same manner the light originating from Q2 and falling into the detector D1
positioned at r~ has the amplitude a2 eik2·r~. According to the superposition
principle the total field amplitude reaching D1 is given by
(8.33a)
where we did not write down the polarization vector. The intensity becomes
(8.34b)
R denotes the real part of the corresponding quantity. As stated earlier the
relative phases of al and a2 can vary randomly. A more suited measure
for the corresponding total intensities is given by an average over the rela-
tive phases of the partial amplitudes; this average is equivalent to the time
average performed in the Hanbury-Brown and Twiss experiment. Suppose
a1,2 = Q1,2 ei!h.2, then the averaging implies
4 R.G. Newton: Scattering Theory of Waves and Particles (Springer, Berlin, Hei-
delberg 1982).
8.3 The Hanbury-Brown and Twiss Effect 225
1
(aiaz) = aiaz-
27r
l 0
Z7J"
d~e-l€ = 0,
•
(8.35)
As a consequence, terms containing products alaZ drop out after the average
is taken. Hence, the averaged intensities become
(8.36)
The product of the averaged intensities (h)(Jz ) is independent ofthe distance
between the detectors. This does not hold for the average of the product of
intensities; we obtain for the product hJz
h Jz = IA1Azlz = lar eik,.(r l +r2) + a~ eikdr; +r;)
(8.37)
Multiplying out and averaging over the relative phases eliminates terms of the
form alazlall z , alazlazl z , ... , and we obtain
+ lazl 4 + lallzlazlzlek,.r, eik2·r; + ek2·r; eikl.r212
lall 4
(J1)(JZ) + 21 a ll z lazl z
x cos [k2 . (T~ - T~) - kl . (Tl - T2)] . (8.38)
If both emitters Ql and Q2 are very close together, which should be the case
for a well-collimated light beam, we can replace Tl - TZ with T~ - T~ in the
above calculation.
Hence, we deduce for the correlated intensity as a function of the relative
distance between the detectors:
(8.39)
with p = k z - k 1. This function takes on a maximum for Tl = TZ. In a realistic
light beam, p has a distribution; a Gaussian, for example. Thus, an additional
averaging over different p yields again a function of the form (13) of Example
8.l.
The quantum-mechanical interpretation of the separate terms in (8.37)
would be as follows: The ai and the a~ terms represent the probability ampli-
tudes of observing a photon pair from Ql and Qz, respectively. They lead to
the position-independent terms of (8.39). The mixed term ala2 represents the
probability amplitude of measuring a photon from Ql and another one from
Q2' The two probabilities of registering one photon from Ql at Tl and the
other one from Q2 at T2 or vice versa cannot be distinguished. The interfer-
ence of the two amplitudes leads to the cosine term in (8.39). Therefore, the
tendency of photons to clump together can be understood as a consequence
of the superposition principle.
The HBT effect has been used in astrophysics for the analysis of the spacial
extension of cosmic light sources (stars, galaxies, and so on). It has also become
of importance for pions emitted from hot compressed nuclear matter regions
as they occur in high-energy heavy-ion encounters where nuclear shock waves
226 8. Pair Correlations Among Fermions and Bosons
cause the heating and compression of the nuclear matter. It serves there as a
tool to measure the spatial extend of the source. 5
nv are integers with -00 ::; nv ::; +00. Every single state (8.40) can be filled
with two electrons (spin degeneracy); as a consequence the state with the
lowest energy for the total system is characterized by
N= 2. (S.42)
nwithk,,<k F
N represents the total number of electrons within the box and kF is known
as the Fermi momentum. From (S.4la) the Fermi momentum is related to the
Fermi energy:
EF
h2 kF2
= __
. (8.43)
Fig. 8.5. The Fermi sea. 2m
All states with Ikl ::; kF In k space all states within the sphere of radius IkFI, i.e. Ikl ::;; IkFI, are
are occupied with elec- occupied; see Fig. 8.5.
trons. Two noninteracting
electrons with momenta These states constitute the Fermi sea. For the case that Nand L become
tikI and tik2 outside of the very large, the vectors k n are very close together according to (8.4lb) and the
Fermi sphere are indicated summation over n in (8.42) can be replaced:
Ln, ---->
2~ Jdk x or
Ln ----> L3
(2'llV
J V
d 3k = (27r)3
J d 3k. (8.44)
from which
kF = (37r N)
V 2
1/3
(8.47)
The interaction leads to the electrons scattering from each other; as a conse-
quence (8.48) no longer represents an energy eigenstate, We have to take a
superposition of unperturbed (free) states (8.48) of the form
(8.49)
with
ak,k2(t) = e-iEt/liak,k2' (8.50)
Here E stands for the total energy of the electron pair. The correlated wave-
function (8.49) is a double Fourier series in T1 and T2, and ak,k2(t) represents
the probability amplitude of finding particle 1 with momentum fik1 and par-
ticle 2 with momentum fik 2 . Since states inside the Fermi sphere are occupied
it has to hold that
(8.51)
228 8. Pair Correlations Among Fermions and Bosons
This is a pure phase factor, which is already known from the uncorrelated
wavefunction (8.48). Once the particles interact they will constantly change
their momentum because of the ever recuring scattering from each other: In
one scattering event kl' k2 will become k~, k~. The Hamiltonian of the two-
electron pair with interaction V( TI, T2) reads
h2 2 h2 2
H =- - V I - -V2 + V(TI,T2) (8.54)
A
2m 2m
and we deduce from (8.49) and the time-dependent Schrodinger equation that
. a
Ihatak,k2(t) (ck, +Ck2)aT,T2(t)
(8.55)
Evidently, the matrix element (k i k21 V (T I, T2) Ik~ k~) describes the elementary
scattering process k I k2 - 4 k~ k~ mentioned above. In detail, it becomes
(8.56)
In general, the Fourier transform of a matrix element (kIk2IVlk~k;) is given
by
(8.58)
Hence, a change in the wavefunction P(TI' T2, t) at positions TI, T2 also de-
pends on the wavefunction at the distant positions T~, T~. Only if
(8.59)
8.4 Cooper Pairs 229
does (8.58) turn again into a local Schrodinger equation. In this case
J
(klk2IV( Tl, T2, T~, T~) Ik~ k~)
2..
V2 d 3r 1 d 3r 2 d 3r'1 d 3r'2 eikl·Tl eik2·r2
which can be simplified further if the potential V depends only on the relative
distance (for two-body forces this is always the case), i.e.
V(Tl' r2) = V(TI - r2) .
Then, it holds that
(klk2IV( T}, T2, r~, T~) Ik~ k~)
Since the total momentum is conserved during the scattering process, the
matrix element of a nonlocal interaction in general will only connect states
with identical total momentum K = kl + k2 = k~ + k~. Thus,
and the nonlocal potential takes on the following form in coordinate space:
(rlr2IVIT~r;) = J d3 K d3keiK·Reik.r
R R'
r~ + r~
2 (8.65)
r r' r~ - r~
represent the center-of-mass and relative coordinates, respectively. Let us con-
sider an eigenstate with energy E and sharp (conserved) total momentum. The
amplitude (8.50) can be transformed into
a k,k2 (t) = a k(K) e-iEtjli (8.66)
and (8.55) becomes
Although the total momentum fiK only appears as a parameter, these equa-
tions are very difficult to solve with a general force. Electrons in a metal
experience two kinds of forces: electrons repel each other as a result of the
Coulomb interaction and they are attracted by the ions in the crystal lat-
tice. A moving electron pulls the ions a little bit in its direction and away
from their corresponding crystal lattice sites. Since the ions are very heavy,
the shifts remain very small. But the moving ions pull along other electrons
which in this manner effectively follow the first electrons. In other words, two
electrons that usually repel each other are able to attract each other in a
crystal lattice. Of course, this depends on the structure of the crystal lattice
and the kind of ions. The interaction resulting from these repulsions and at-
tractions are in some metals attractive for electrons near the Fermi surface.
This can be approximated as
Vo
Vkk,(K) = { 6' for kF < kr, k2' ki, k~ < ka ,
otherwise,
(8.68)
(E - Ck, - ck 2)ak(K) = - Vo
V "'"
L.J ak,(K) . (8.69)
k'
The prime (') for the summation sign indicates that the k' summation only
runs over those k' that lie inside the shaded spherical shell sketched in Fig. 8.6:
From the relation between k' and K given after (8.61) it is clear that
K
Fig. 8.6. Electrons in- 2+ k (8.71)
side a small spherical shell K -k.
kF < k < ka and at the 2
same time outside the
Fermi sphere kF attract Condition (8.70) corresponds exactly to the spherical shell for the involved
each other particle momenta noted in (8.68).
8.4 Cooper Pairs 231
The sum 2:~, in (8.69) cannot vanish. This can be seen as follows: The
amplitude ak(K) has to be different from zero for some k; otherwise the
correlated wavefunction (8.49) would vanish identically. If 2:~, in (8.69) were
equal to zero then it would follow for the corresponding k on the left side of
(8.69) that
E _ _ (nk)2 (nK)2
- Ek, + Ek2 -
m
+ 4
m
. (8.72)
For a given K and E the last equation can only be fulfilled for one k; con-
sequently, only one ak (K) can be different from zero. As a consequence the
sum would not vanish, i.e. 2:~, ak' (K) f= 0, which contradicts the original
assumption. Thus, we have to deduce 2:~, ak,(K) f= O.
In order to solve (8.69), both sides are divided by (E - Ek, - Ek2) and are
then summed over k. This leads to
~' vo~' 1 ~' (8.73)
~ ak(K) = -V ~ (E-E -E ) ~ adK).
k k k, k2 k'
Since 2:~ak(K) = 2:~, ak,(K) we get
vo~' 1
(8.74)
1 = -V ~ (E-Ekl -Ek 2)'
Graphically this equation can be solved easily: Let us depict the function
l' 1
X( E) = - ~ -:-=-----;- (8.75)
V ~ (E - Ek, - Ek 2)
and look for intersections with the line -1 I vo, the latter being parallel to the
E axis (see Fig. 8.7).
X(E)
Evidently, the function X(E) possesses poles at all possible energy values
E = Ekl + Ek2 of a noninteracting electron pair outside the Fermi sphere.
These electron pairs need to have a total momentum nK and single mo-
menta nk 1 , nk2 in the spherical shell between nkF and nk a . Hence, the
lowest pole position has to lie on the Fermi surface itself, i.e. at energy
E = 2EF = 2(nkF)2 12m. The function X(E) has many intersections with
the straight line -l/vo above 2EF. Essentially these energies correspond to
the energies of unperturbed (noninteracting) pairs; qualitatively, the corre-
sponding states are similar to these uncorrelated pairs. Since Vo is positive
232 8. Pair Correlations Among Fermions and Bosons
X(E) = r kad3 k 1
JkF (27f)3 E - 2Ek
r kad3k 1
lea
JkF (27f)3E-(hk)2/m
m 1
(8.77)
27f 2 h2 eF k dEk E - 2Ek '
where Ea = (hk a)2/2m. Since ka is close to kF, the k in the integrand can be
replaced by kF. For E < 2EF it follows that
(8.79)
simply represents the density of states near the Fermi surface. The last
identity can easily be verified by using the well-known relation o(j(x)) =
(1/1df Idxl)o(x - a) for 0 functions, where a is the position x for which
f(x) = O. Once according to (8.76) E = Ebound = 2EF - 2.:1 is inserted
into (8.78), so that the group of equations
1
X(E)
Vo
_ N(O) In 12E a - 2EF + 2.:11 1
2 2.:1 Vo
2.:1 e- 2/ N (O)vo (8.80)
2(Ea - EF) + 2.:1
following from (8.74) is automatically solved. We get
e 2/ N (O)vo - 1.
(8.81 )
Now
(8.82)
8.4 Cooper Pairs 233
is of the same order of magnitude as the Debye energy nwD. The Debye fre-
quency WD characterizes the maximum frequency of lattice vibrations in met-
als; see Exercise 8.3.
EXERCISE
Exercise 8.3. where ni represent integer numbers. This implies that the level density in k
space for a volume L is given by
L3
Pk = (27r)3· (6)
The maximum value for k is determined in such a way that we consider N
particles in the solid-state system which have exactly 3N vibrational degrees
of freedom. We divide the solid-state excitations into longitudinal and trans-
verse oscillations. kmax is determined by requiring that the maximum number
of longitudinal oscillation modes is equal to Nand 2N for the transverse
oscillation modes because of two polarization degrees of freedom. Hence, for
longitudinal oscillations
l o
lkmax'
d 3 kpk = N
or
N. (7)
The maximum wave number
k max
3 6.".2N (8)
= "V
is essentially given by the number of lattice atoms per volume. Making use of
the dispersion relation
(9)
with the propagation velocity VI (speed of sound) of the longitudinal oscila-
tions in the solid and a similar calculation for the transverse waves, we obtain
for the Debye frequency
_ 3 67r2N/V.
WD = W max = VIVt
vr + vr ' (10)
Typical values are nwD/cF ;: : ; 1/100 and N(O)vo ;::::; 1/4. Thus, with the
help of (8.82) the relation (8.81) can be simplified to
.:1 ;::::; nwD e- 2 / N (O)v o . (8.83)
This is the binding energy per electron in the bound (electron-pair) state. The
extremely nonlinear dependence of this pair energy .:1 on the force parameter
Vo is remarkable. This mechanism of pairing of electrons was discovered by
L.N. Cooper,6 hence the name Cooper pairs.
6 L.N. Cooper: Phys. Rev. 104 (1956) 1189. This idea was most essential for under-
standing superconductivity. Together with J.R. Schrieffer and J. Bardeen, L.N.
Cooper received in 1972 the Nobel Prize for physics.
8.4 Cooper Pairs 235
Our analytical studies make use of the assumption that the total momen-
tum of the Cooper pair is fiK = O. If this is not the case, i.e. for fiK =1= 0, the
number of possible k values in (8.75) decreases rapidly. As already stated ear-
lier, the poles of x( E) characterize the energies of the noninteracting (or only
weakly interacting) pairs. Since now fewer k values are available (allowed), the
value of X(E) for E < 2cp (see Fig. 8.7) decreases. Hence Ebound = 2cp - 2.1
approaches closer to 2cp and the binding energy 2.1 of the Cooper pairs be-
comes smaller. This decrease is strong for an increasing total momentum fiK.
Fig. 8.8. Electrons that
It is important to note that the smaller the number of states is, which are
are in opposite directions
connected via the attractive interaction (klk2IVlk~k;), the smaller the bind- outside the Fermi sphere,
ing energy of the Cooper pairs becomes. This also follows from (8.83) for i.e. (k 1 = -k 2 ), have total
N(O) --+ O. We obtain the largest binding energy for electrons that possess the momentum fiK = 0 and,
total momentum fiK = 0; for them kl = -k2' Those single electron states lie hence, the largest level
density. This implies that
outside the Fermi sphere in opposite directions (see Fig. 8.8).
that their pair energy be-
In order to determine the wavefunction of a Cooper pair we use (8.69), comes largest
according to which
1
aK(K) = E const. (8.84)
- Ck, - ck2
VJ (rl,r2) =
1
N 1eiK '(T,-T2)/2_
V
L ' Ee-
k
ik .(r,-r2)
Ck, - Ck 2
(8.85)
= rl
J'
Hence, with r - r2 the relative wavefunction of the Cooper pair reads
d3k e ik .r
tp(r) = N2 -- . (8.86)
(21T)3 E - Ck, - ck2
where (8.72) has been used; this will be calculated in detail in Exercise 8.4.
This relative wavefunction has its first extremum at
from which
236 8. Pair Correlations Among Fermions and Bosons
1 h,zk
d = rmax = - ~ ~ 1O-4 cm (8.88)
a 2mLl
follows. The correlation length d approximately characterizes the extension of
a Cooper pair. It is remarkable that cp(r) in (8.87) represents an s wavefunc-
tion; it does not depend on the angle and is thus proportional to Yoo CO, cp).
Hence, the angular momentum of a Cooper pair is l = O.
Superconductivity is a result of the simultaneous interaction of all elec-
trons. All electrons at the Fermi surface build up correlated pairs with the
same total momentum. Their correlation can be such that a current is pro-
duced, which is then called a super-current. In order to stop it, all electron
pairs have to be stopped at the same time. This is not the case for a con-
ventional conductor: electrons contributing to the current are stopped one
after the other. In a superconductor the correlated electron current has to
be stopped in total and instantly. Of course, this is very difficult; this is the
reason why the super-current flows for ever. 7
EXERCISE
Problem. Determine the relative wavefunction cp( r) of a Cooper pair for the
total momentum K = 0 [see (8.87)]. Estimate the extension of a Cooper pair.
cp(r) = N2 Jd3k
-- -----
e ik .r
(2np E - 10k, - ck 2
(1)
= N2
J d3k
(2np (h 2 jm) (k~
eik ' T
- k2 - 2mL1jh2)
(3)
L inp'n + l)jn(kr)Pn(cosrJ).
00
<p(r) = N2 ~
J 00 k 2 dk (2n + l)jn(kr)
(21f)3 (h2 /2m) (k~ - k 2 - 2m6./h2)
where (5) has been used. With jo(x) = sin x/x we obtain
<p(r) =
41fm
N2---,;:2
J k 2 dk sin kr
(21f)3 kr (k~ _ k2 - 2m6./h2) . (8)
<p (r )
~ - N 241fm
~ --
1L1k d X
sin [(kF + x)r]
. (11)
h2r 0 x + 2m6./!i2kF
Mainly small x values contribute to the integral. Hence, we are allowed to
shift the upper integration bound to approximately 00; remember that a =
2m6./!i2k F. This gives
<p(r) ~
41fm ( sin kFr
- Nr--;:z
nr
10
00
dx -
x+a
xr
cos - + cos kFr 10
00
dx -
x+a
xr) (12)
sin -
ci(x) j00
x cost dt.
t
(15)
The series expansion of these functions reveals their asymptotic behavior for
large r:
238 8. Pair Correlations Among Fermions and Bosons
EXERCISE
J dX~2
l+x
=x - arctan x .
The result is
!.-
Vo
= ~
27r 2 h 2
(ka - J 2mLl arctan
h2
ka
J2mLl/h2
) (3)
We want to find out the conditions under which a bound pair is able to exist;
hence, we consider the limiting case ..1 -> 0 and obtain
(4)
(9.1)
The index i distinguishes the particle species, i.e. electrons or ions. We expand
~t and ~ into plane waves,
, , e iq .",
Wi = Li
biq fTj
vV
(9.2)
(9.3)
0(q2 - qd = JVd3x .
e,(q2-qd''''<p(x) (9.4)
the interaction Hamiltonian can be written as [see (6.7) and the equations
following it]
(9.6)
Here [A, BJ- = AB - BA denotes the minus commutator. With the help of
(9.3) and (9.5) we can calculate the commutator. The result does not depend
on the choice of boson or fermion commutation relations for the operators biq .
We obtain
a At A
at biq,biq
(9.7)
The first part of (9.7) is a consequence of Ho: For the case q = q' the operator
b!q,biq represents the particle-number operator for particles of species i with
momentum q; for this case the first term in (9.7) vanishes, i.e. Ho does not
effect any change in the particle number. Only the potential changes the
particle number. The first term within the square brackets ~ b!q,b sp annhilates
a particle with momentum p and creates another one with momentum q';
it describes the increase in the particle number with momentum q' due to
scattering from other momentum states. The second term within the brackets
describes the decrease of the particle number with q' due to scattering to the
momentum p.
In order to find the relationship to classical quantities it is convenient to
introduce the distribution function of the particle species i:
(9.8)
where 10:) represent the states of the system and Pa is the probability of
finding the system in state 10:). Equation (9.7) becomes
A
(Ni(x,t)) =
" '" '" At A eiq .x
'L.,..L.,..L.,..P",(o:lbipbiP+qlo:)V
q p '"
iq·x
' " ' " At Aip
L.,.. L.,.. (b ip b +q I
e
V
q P
iq·x
L L Fi(p,p + q, t) e V . (9.11)
q p
On the other hand we derive for the momentum distribution for the particle
species i
(9.14)
\7 2 tp _- - 'L.,..L.,..L.,..
" '" '" 471' e i ( p,p+q,t ) e iq·x .
VFi (9.15)
. q p
From these two equations, (9.9) and (9.15), the distribution function Fi(p,p+
q, t) and the averaged potential cp have to be determined by successive it-
eration. Note, however, that (9.15) already represents a certain approxima-
tion because the true charge density has been replaced by the averaged one.
By starting from an initial distribution function Fi(O), an averaged potential
cp(O) can be calculated. With this potential cp(O) another new approximation
for Fi can be determined, and so forth. In atomic physics this approxima-
tion is known as the Hartree approximation. Equation (9.15) and (9.9) are
the quantum-mechanical analogy to the so-called Vlasov equations in classical
plasma physics.
To gain more insight into this system of equations let us consider the
equilibrium case for which the charge density and the potential cp vanish. In
equilibrium (cp = cp(O) = 0 --> <prO) = 0), (9.9) takes on the form
Fi(O)(q',q,t) = F?\q)8q'q.
We now consider the case in which Fi deviates only slightly from equilibrium
and oscillates around it. Hence, we set
Fi (q ' , q, t) = F(O)()J:
i q Uqq' + Fi ( l )q(,,q) e -iwt , (9.16a)
where FP) should stay small compared to Fi(O). Also for the potential cp we
set
cp = cp(O) + cp(l) e -iwt . (9.16b)
We insert this into the equation of motion (9.9) and deduce
once the terms on the right-hand side proportional to <p(1) F(l) are neglected.
<p(l) and F(l) are treated as small quantities, so quadratically small terms,
such as the term <p(l) F(l) just mentioned, can be neglected. This yields
9. Quasiparticles in Plasmas and Metals: Selected Topics 245
(9.17)
Again we insert the solution (9.17) into the Poisson equation (9.15) and arrive
at
(9.19)
and we have
V2rp(X) = ~ Lq2cp(1)(~q)eiq.X. (9.21 )
q
LE(q,w)q2cp(1)(~q) = 0, (9.22)
q
where
(9.23)
Lp
----> V J d 3 v;
we get
In order to find the classical limit of (9.25), we expand the nominator of the
integrand into a Taylor series and let Ii ----> 0:
246 9. Quasiparticies in Plasmas and Metals: Selected Topics
(9.26)
. +1
hm --.- = P
1)--+0+ X IT}
(1).l7TO(x) ,
-
X
- (9.27)
EI(q,W) = 1 + "'""'
2
47re i PJd3v f(O)(
i v -
) f(O)(
i
ti~/
V -'''1
+ n2q2/2mi
mi
)
(9.29a)
, miq2
~ w - q. v
:::1 J
and the imaginary part
After these introductory remarks we will now study in more detail the impor-
tance of the quasiparticle concept for a plasma. We neglect quantum correc-
tions to the frequency and employ the classical dielectric function (9.26).
Furthermore, for the equilibrium distribution functions of the electrons
and ions we assume Fermi functions:
9.1 Plasmons and Phonons 247
for V> VF i Fi
where vF; represents the Fermi velocity of the particle species i. It is deter-
mined from v
471" k 3 _ 471" P~i
P = 3 F, - 3"""iiJ Fig. 9.1. Fermi distribu-
tion function of the elec-
trons or the ions
= !!... (3P)1/3 ,
(9.31 )
mi 471"
where p represents the particle density, which is identical for electrons and
ions because of homogeneity. Since we can rewrite FlO)(v) according to (9.30),
li(O) = (3P/471"v~J8(v - VF,), where 8 denotes the step function. The deriva-
(47::;r/ 2
, (9.33)
{ I for z < 1 }
u(z) = = 8(z - 1) .
o for z> 1
Here Wpi (i = electron or ion) are characteristic frequencies, which, as we
shall see, characterize the plasma oscillations to zero order. This enables us
to perform the integration (9.26) and we obtain
f(q,W) = 1 +"".
~
471"e; Jd3v q. /o~
W- q .v +
Oli(O)
, miq2 17]
In order to search for the zeros we assume Zel » 1, i.e. W » qVFd for the
electrons; for the ions this holds all the more because VF iou = vFel(meI/mion),
i.e. VF iou «VFel'
Furthermore,
2 1 2 1
2_ZlnI1+ZI ;:::;
3 z2 5 Z4
for Z» 1 . (9.36)
1-z
248 9. Quasiparticles in Plasmas and Metals: Selected Topics
w2 ~ w2
Pel
(1 _ mel)
mion
+ ~v2
5 F 01
q2. (9.38)
2 _ Zion In I~ + Zion I ~ 2
- Zion 3z;on '
(9.39)
2 - Zel In 111 +- Zel
Zel I ~ 2
We set E(q,W) = 0, solve it, taking into account the outlined approximations,
and get
(9.41)
where
W= (9.42)
_ _ VFel
w -- (w- -
Pion )
q_
-VFcl
- (mel) 1/2
-- q
v'3 w pel v'3 mion
and hence
1/2
_ VFel ( mel
W~W=- - -) q. (9.43)
v'3 mion
This is the typical relation for sound waves (proportionality between frequency
and momentum), i.e.
9.1 Plasmons and Phonons 249
The experimental results for longitudinal sound waves in metals are in good
agreement with (9.44); this holds particularly for alkali metals where a 20%
agreement is found. The oscillations associated with these quanta are denoted
as phonons. For large wavelenghts we obtain from (9.41)
W ~ W (1 __ (~) i
3y'3 mien
1/2) , (9.45)
i.e. a small negative imaginary part. Here the phonons are weakly damped.
In general the weak damping of waves in plasmas is called Landau damping. 2
EXERCISE
Solution. Without a plasma the potential would be Q/r and the correspond-
ing Fourier transform would then be
41l"Q
tp(q) = Vq2 (V = L3 = volume) .
with
2 See e.g. A.L. Fetter, J.D. Walecka: Quantum Theory of Many Particle Systems
(McGraw Hill, New York 1971); U.C. Kittel: Quantum Theory of Solids (Wiley,
New York 1966).
250 9. Quasiparticles in Plasmas and Metals: Selected Topics
<p(r) = ~ jd3q<p(q)eiq.x
(27r )3
(2~)3 10 00
q2 dq 107r sin 19 q d19 q 1027r d<pq<p(q) eiqrcos t9 q .
Since the integrand is independent of <pq the integration over <pq can be per-
formed:
<p(r) =
V
(27r)3 ioroo q2 dq ior sin19 q
.
d19q <p(q)elqrCost9q .
- ; j q2 dq <p(q) sinqr.
27r qr
Insertion of <p(q) gives
<p(r) = 2. e- r /). .
r
-=--
Within the distance A the charge is screened. If instead of the Fermi distri-
bution a Maxwell-Boltzmann distribution is used for the derivation of (9.35)
~~~ .Q_~~ we arrive at
-
A = (~)
47rpe 2 .
Fig. 9.2. Illustration of This quantity is denoted as the Debye screening length. It characterizes the
Debye screening of a charge screening of a charge by a plasma charge cloud which is built up around the
Q in a plasma charge Q (see Fig. 9.2 and Exercise 9.2).
EXERCISE
Problem. Use (9.26) to determine the classical dielectric function for particles
obeying a Maxwell distribution.
9.1 Plasmons and Phonons 251
Eelass -
-
1+~
~ 41l"e;
i
--2
miq
J d 3 q. (of?) /av)
V
w - q. v
.
E(q,W) = 1-
81l"e 2p
23/25
mq 1l" 0;
J 3 q·ve- v2 / a2
dv-=------
w-q·v
The choice of cylindrical coordinates leads us to
E(q,w)=l-
81l"e 2p
23/25
1271" 1
dv<p
00
vl!dvl!
mq1l" 0; 0
1
0
00 qvz e- V 2/ a 2 e- V •2/ a 2
%
X dv z - - - - - - - - - - - - - - -
1
-00 w - q. V z
1-
161l"e p
2 1/2 5
2
00
VI!
d
VI! e
/a21°O d
_v 2
• Vz
qvz e- Vz
-=--"-------
2/
a
2
mq 1l" 0; 0 -00 W - q . Vz
The elementary integral over vI! is of the type
1o
00
x
2n+l
c
_px 2
=--
2pn+l
n!
(p > 0) ,
1
and therefore
81l"e 2p 00 qvze-v~/a2
E(q,W) = 1 - / dv z .
2
mq 1l"1 20;3 -00 W - qv z
l°°
Adding a zero in the form of adding and subtracting an identical term yields
1
mq 2 1l"1/20;3 -00 z
2 2/ 2
1- 81l"e p 00
dv (
- e_ 2/ 2
Vz a
we-
+- - - a- )
V
%
mq21l"1/20;3 -00 z W - qv z
1_ 81l"e 2p
mq 2 1l"1/20;3
(1 00
-00
w e-v~/a2 dv z
W - qv z
_ 0;,;7r)
where
A = (m0;2 )1/2
81l"pe 2
( ~)1/2
41l"pe 2
252 9. Quasiparticles in Plasmas and Metals: Selected Topics
Z(z) = 100
-00
dx--
z - x
e-x2
EXERCISE
Problem. Use expression (9.34) to verify the result (9.35) for E(q,W).
E (q , w) -_ 1 - L. 47re-
- - - ) j d3v -
; ( -3p
miq2 47rv~
q·v O(V-VFJ
- ----'---...::..:....-
v w - q . v + i1]
~ t , v .f
=1
We now determine I:
I = r2"" dcp r sin {} d{) roo v2 dv qv cos {} __0.. :.(v_-_V_F..,:.i.:....)_
1"
io io io v w - qv cos {} + i1]
cos {}
2
27r sin {} d {} qVF {} . .
o 'w - qVFi cos + 11]
The substitution U = cos {}, du = - sin {} d{} yields
r+ 1 udu
I = 27rqv~i LI W - qVFiU + i1]
Making use of (9.27) for the solution of the integral leads to
I = 27rqv~i (p
i-I
r+ 1
udu
w - qVFiU
-i7rjUdUO(w- qVFiu ))
r+l w -udu
27rqv~., {p i-I qVFiU
- i7rjUduo [(u -~)
qVF,
qVFi]}
27rqv 2 (p
Fi
r+l
udu
i-I W - qVFiU
_ i7r_w__ )
(qvFJ 2
With the integral
j xdx
a + bx
= ~ _ ~ lnx
b b2
it follows that
3 B.D. Fried, C. Conte: The Plasma Dispersion Function (Academic Press, New
York 1961).
9.1 Plasmons and Phonons 253
I
2
27rQVF i {[ (_
U
qVFi
) -
w
- (
qVF i
)2 In Iw - qVFi ul ]+1}-1
.
- l 7qVFi
r -)2
(
W Exercise 9.3.
Zi,
mi
3 wp Zi +1 .
1 + " - --;- 2- Zi In 1- - 1 + l7rZiU(Zi) ) .
~i
2 q2 v2Fi ( z·-l
t
10. Basics of Quantum Statistics
state, and the projector Plpn) is called the density matrix of this pure state.
However, as stated above, one usually cannot determine exactly in which
state n the system is. For example, if we allow an uncertainty of b.E for the
measurement of the energy E of the system there will be very many state
vectors l!li~(n») which fulfill E < E(n) < E + b.E. The reason is that in a
macroscopic system with many particles (N ---+ 00, V ---+ 00, N/V = const) the
energy levels are very close together. We therefore do not have the information
about exactly which state vector the system is in. This lack of information
about the exact state vector can be accounted for if we allow each state vector
to be assumed with a certain probability pn. We write
(0) = (!li~(n)IOI!li~(n»)pn. (10.5)
The probabilities pn shall be normalized according to
(10.6)
n
This average can be rewritten, by inserting the completeness relation from
(10.2), as
In the last step we have rearranged the bra and ket vectors such that we can
furthermore write
(10.7)
where the density matrix P is defined as
(10.8)
n
where Pi is the probability of the possible outcome i and the sum runs over
all of them. This function is exactly zero if and only if there is no uncertainty
about the outcome of the experiment, i.e. if one of the Pi is 1 and all others
are zero. This means the outcome of the experiment is then only the special
event i. Note that lIn 1 = 0 and limX->CXl x In x = o. On the other hand 1i
is constructed such that the case of equal probabilities has the maximum
uncertainty. For an event with n possible outcomes we obtain Pi = lin and
10.2 Density Operator of a Many-Particle State 259
1i = - Ln
-n In n I
i=l
l
= -n n In n = In n .
J&
(10.12)
However, the Pi are not independent of each other and from the normalization
we have a constraint for the allowed variations 5Pi:
(10.14)
;=1 ;=1
If we multiply (10.14) with a Lagrange multiplier, Cl:, and add the resulting
equation to (10.13) we find
Now we can argue that the variations 5Pi may be assumed to be independent
of each other, such that each term of the sum must vanish itself if we account
later on for the normalization condition by a suitable choice of Cl:. Then we
find
In Pi = 1 - Cl: or Pi = e 1 -o<. (10.15)
The important fact is that the right-hand side of (10.15) is independent of
i so that all Pi are equal and the normalization then yields Pi = 1/ n for n
possible outcomes. We have shown in this way that our assumption of equal
probabilities in the micro canonical ensemble in a given small-energy shell
corresponds to the assumption of maximum uncertainty.
Now, if the macrostate of the system is not specified by the energy but
rather by the temperature, T, of the surroundings of the system (e.g. a heat
bath) the system can assume all possible energy values and not only a small
interval around E (we have assumed b.E « E!). The reason is that the
heat bath is assumed to be very large compared to the system of interest
(Ebath » E) and the exchange of energy between the heat bath and system
therefore leads to fluctuations of the energy of the system. Fluctuations are
generally deviations of an observable from its mean value. However, the mean
value of the energy, i.e. of the Hamiltonian, is still fixed:
U = (H) = L(1ji~(n)IHI1ji~(n))pn
n
(10.16)
n
An ensemble of systems in which the macrostate is specified by a fixed mean
value of the energy and fixed particle number and volume is called a canonical
260 10. Basics of Quantum Statistics
(10.17)
n
(10.18)
n
Now we have the additional constraint (10.16), which leads to the variational
condition
(10.19)
n
Here we can again assume all variations opn to be independent of each other,
so that
pn = ce-{3E(n) ,
Here the probabilities of the states drop exponentially with the energy. In
Thermodynamics and Statistical Mechanics 2 it is shown that from (10.16)
and the fact that the entropy of the system is given by
(10.21)
n
Now we see immediately that the pn of (10.20) can be obtained as the ex-
pectation value of an abstract operator
, e-{3H
Pc = Tr(e-{3H) , (10.24)
the canonical density operator. Here the trace can be rewritten in the basis
of the It[/~(n) as
(10.25)
n n
and in the last step we used the fact that the abstract operator e-{3H, which
is to be understood as
(10.26)
The mean particle number is then given as the average of the actual particle
number, N,
00
Furthermore, in most cases a particle reservoir acts also as a heat bath that
fixes a certain temperature, so our macroscopic knowledge about the system
is only the corresponding mean value, exactly as in the canonical ensemble:
00
N=O n
Again we can argue that each of the brackets must vanish separately if we
fulfill the subsidiary conditions (10.26)-(10.28) later on by appropriate choice
of the Lagrange multipliers. We find
pn,N = exp [-(3(E(n, N) - IlN)] (10.30)
gc 2:.";'=0 2:.n' exp [-(3(E(n',N') -IlN')] ,
where we have determined a from (10.26) and introduced the chemical po-
tential 11 by the relation 'Y = (311· The physical meaning of 11 = "//(3 as the
chemical potential must be derived from condition (10.27), which determines
this parameter and we refer the reader again to the lecture on thermodynam-
ics and statistical mechanics for this calculation. 3 The meaning of (3 is the
same as in the canonical ensemble and is given by (10.22). A system whose
macrostate is determined by a mean energy or temperature, a certain volume,
and a mean particle number or chemical potential is called a grand canonical
system and (10.30) is the corresponding probability distribution of the grand
canonical ensemble. This means that in a set of N systems (N -> 00) with
common temperature, volume, and chemical potential (T, V,Il) the probabil-
ity of finding one of these N systems exactly with N particles and in the
special microstate number n is given by (10.30). Exactly as for (10.24) for
the canonical density operator, we can also find the representation of the free
density operator of the grand canonical ensemble:
, exp[-(3(H - IlN)]
(10.31 )
pgc = Tr {exp[-(3(H -Il N )]} ,
where the trace now contains not only a sum over all expectation values with
a complete set of basis vectors in the N-particle Hilbert space, but also a sum
3 W. Greiner, L. Neise, H. Stocker: Thermodynamics and Statistical Mechanics,
2nd ed. (Springer, New York 1997).
10.2 Density Operator of a Many-Particle State 263
where we used here the complete set of energy eigenfunctions but, as we know,
any other complete set can also be used to evaluate the trace even though the
actual calculation will be more complicated.
It is clear that for other definitions of the macrostate other probability
distributions and other ensembles will be needed, but for most situations the
canonical or grand canonical ensemble is sufficient. Before we proceed it is
useful to state some general properties of the density matrix, which follow
directly from the general definition (10.8) and hold in all ensembles:
Trp = 1, (10.33)
since the trace of the exponential operators in (10.24) or (10.31) exactly can-
cels the c number in the denominator, which is given explicitly by (10.25) or
(10.32), respectively.
pt = p, (10.34)
i.e. the density matrix is an Hermitean operator, which follows from the fact
that p is a sum of Hermitean projection operators, (10.8), where the weights,
pn, are real probabilities.
p2 = P <=> P = Itj/~(~,N») (tj/~(~,N) I , (10.35)
i.e. if p2 = jJ then jJ describes a pure state of a
system in an exactly known
microstate (here state number n, and the system has exactly N particles). In
this case p is a projector onto this microstate, otherwise jJ2 # p. Finally we
remember that averages of observables have to be calculated as
(0) = Tr(pO) = Tr(Op) , (10.36)
where the cyclic invariance under the trace may be used. Note that
this average contains a purely quantum-mechanical average such as
(tj/~(~.N) 101tj/~(~,N») and additionally a statistical average over all these quan-
tities weighted with the probabilities, pn,N, according to our starting point
(10.5).
We can now write the entropy according to (10.21) as the average of
-klnjJ, i.e. 4
s (-klnp) = -kTr(jJlnp)
-k L(tj/~(n)ljJlnpltj/~(n») , (10.37)
n
4 The expression originates from ideas of L.E. Boltzmann and M. Planck; the gen-
eral interpretation in terms of information theory has been formulated by C.
Shannon and E.T. Jaynes (see Example 6.9).
264 10. Basics of Quantum Statistics
(10.38)
i.e. the states I!IiE(n)) are eigenstates of the density operator and the proba-
bilities pn are its eigenvalues. Therefore the matrix element in (10.37) can be
evaluated to be the number pn In pn such that (10.37) is only a more com-
plicated formulation of (10.21). This formulation has the advantage that it is
representation free and holds for all kinds of ensembles. Special forms of the
density operator and the density matrix will be exemplified in the following
problems. However, here we note some of the results here. Expressed by field
operators tf,t (x) and tf,( x) the density operator takes on the form
jJ(x) = tf,t(x)tf,(x) . (10.39)
Analogously, the operator for the density matrix reads
jJ(x,x') = tf,t(x')tf,(x). (10.40)
EXAMPLE
The starting point for our consideration is an element 1!Ii) of the N-particle
Hilbert space. In order to represent this rather abstract state vector we need
to have a complete basis of the Fock space. For this, arbitrary one-particle
states 1,8) can be used; we assume that they are eigenstates of an operator
B. ,8 stands for a corresponding set of quantum numbers. Out of states 1,8)
an arbitrary n-particle state can be constructed by taking direct products:
symmetric for bosons and antisymmetric for fermions. It is convenient to
know about the "unity operator" i in Fock space:
(1)
The various terms express the contributions from the ground state, the one-
particle, two-particle, ... states to the complete 1. They represent symmetric
or antisymmetric many-particle states for bosons or fermions, respectively.
An explicit distinction becomes necessary only once commutation relations
are derived; see Exercise 10.3.
We project the state 1!Ii) onto the Fock space i:
1 !Ii) i !Ii)
1
The quantum numbers 13 stand, for example, for momentum or position quan- Example 10.1.
tum numbers. The expansion (2) reduces to exactly one term for a pure n-
particle state: 1131··· 13n) = ItJi n). The quantities 1(131··· 13nltJi)j2 represent the
probabilities for the realization of a n-particle state with quantum numbers
131· . ·13n· Creation and annihilation operators b1 and b(3 are defined via their
action on a many-particle state, i.e.
b11 0 ) = 113), b(310) = 0, (3)
These operators create and annihilate a quantum at position x. They obey the
commutation relation (3.44), i.e. [~(x), ~t (x')]± = 8(x - x'). The operators
b1, b(3 and ~t (x), ~(x) are related via a unitary transformation of the form
~t(x) = Lb1(13lx) == L b1tJih(x), (9)
(3 (3
Example 10.1. It is possible to define the density operator with the help of the field
operators (9) and (10). With (7) and (8) we get
p(x) tPt(x)tP(x)
Ix)(xi + J d3xllxXl)(XlXI
Again, the various contributions result from the one-particle, two-particle, ... ,
n-particle subspace of the Fock space. The one-particle density describes the
probability of finding a particle in an n-particle state Iwn) at position x and
is represented by the matrix element
(w n Ip( x) Iwn)
nJ d3x2 ... d3xnwt (x, X2, ... ,xn)w(x, X2,· .. ,xn ) . (12)
In view of (12)-(14) it becomes clear that the operator p(x) in (11) acts on
n-particle Hilbert spaces. We also stress that the form of the density operator
(11) does not imply a space representation of p. The special appearance of
the density operator comes from its expression in terms of field operators.
The arguments x have to be viewed merely as parameters (position quan-
tum numbers) which just characterize this class of creation and annihilation
operators.
In order to make this point even clearer we discuss another form of the
density operator. According to the unitary transformation (9) and (10) this
leads to
p(X) = Lb~b{3(alx)(xl;3). (15)
a,{3
Let's take the expectation value in analogy to (12):
Due to the hermiticity of the matrix na(3 a unitary transformation (j can be Example 10.1.
found which diagonalizes (na(3):
1/
(xl L naIO')(O'lx) .
Then, the density operator becomes
(17)
Example 10.1. The one-particle density PI<pn) (x) in the n-particle state from (12) results as a
special case (diagonal element): PI<pn) (x) = PI<pn)(x,x). From (20) we deduce
the spatial representation for the nonlocal operator of the density matrix
[compare with (13)]:
n
P<p n (x, x') = L 8(x' ~ xi)8(x ~ Xi) . (21)
;=1
As in the case of the density operator the form
EXERCISE
and the expansion (7) for the field operator pt(x) (see Example 10.1) we find
pt(x) Ipt(x)
x (IX)(OI + J d3X1IxX1)(X11
Exercise 10.2.
This yields
+L J d3x11,6,61)(,6lx)(,61Ix1)(X11
J
(3{3,
x (,61,62Ix1X2)(X1X21 + ... ;
all the other matrix elements like (,610) and (,6,61Ix) vanish. Only the matrix
elements between states having the same number of particles are different from
zero. Keeping this in mind, we easily understand that the equation above can
also be written as
J
{3, {3,{32
x (10)(01 + d3X1Ix1)(X11
Remembering (5) of Example 10.1, we identify the expression within the first
bracket as the decomposition of the operator b1; the second term yields just
the i [see (1) in Example 10.1]. Thus, we obtain the wanted transformation
from b1 to ~t(x):
this leads to
b1 = Jd3x~t(x)(xl,6)· (2)
270 10. Basics of Quantum Statistics
Exercise 10.2. The corresponding transformation equations for the annihilation operators
are obtained by taking the adjoint of (1) and (2):
ir(x) = 2:)(3(xl(3), (3)
(3
EXERCISE
Solution. For the case of fermions we have to take the Fock space of the
antisymmetric many-particle states as a basis. The action of the antisym-
metrization opemtor A on a one-particle product state 1(31(32 ... (3N) creates
a completely antisymmetric many-particle state normalized to the particle
number N:
VNlAI(31) 1(32) .. ·1(3N)
VNlAI(31(32' .. (3N) , (1)
where the index 'a' symbolizes antisymmetry. We find for the matrix element
between two arbitrary many-particle states:
a ((3~(3~ ... (3'tv, 1(31(32 ... (3N) a = 0 for N' =J N (2)
and
(7)
In order to derive commutation relations we determine the operator products
b!3'b1 and b1b13' according to (5) and (6). Let us have an explicit look on the
first terms:
(8)
131/31
+ 2;: _(2~)2Ifilfi2)a a(fJ' filfi2IfJfJ~fJ~)a a(fJ~fJ~1 + ... . (9)
13;13;131132
The appearing matrix elements have to be analyzed according to (3) and yield
- ~ (2~)2 (lfJfJ~)aa(fJ'fJ~I-lfJ~fJ)aa(fJ'fJ~l)
Exercise 10.3. Calculating the next higher-order terms in (8) we convince ourselves that
those terms proportional to the factor bf3'f3 add up to the Fock space i [see
(4)].
We obtain
bf3 ,b1 = bf3f3,i -1,6)(,6'1- LI,6,6Daa(,6',6~1
f3;
- (more remaining terms) , (10)
where the remaining terms in the operator product b1bf3' originate with op-
posite sign. Furthermore,
(10.41 )
The dynamics of each (pure) state ItPn) of the ensemble is described by the
Schrodinger equation (Schrodinger picture)
(10.42)
Here, we have set It = 1, which is the same as replacing II by II lit: i.e.
II -; II lit. As we ask for the evolution of the whole ensemble we have to find
the evolution equation for the density operator. Directly from the Schrodinger
equation (10.42) and its adjoint equation it follows that the density operator
of the pure state (10.3) is
(iOt ItPn) )(tPn I (1IItPn))(tPnl,
ItPn)( -iOt (tPn I) ItPn)((tPnllI) ,
so that
10.3 Dynamics of a Quantum-Statistical Ensemble 273
Wt (I!p"n) (!p"n I)
i[(atl!p"n»)(!p"nl + I!p"n)(at(!p"nl)l
(HI!p"n»)(!p"nl-l!p"n)(H(!p"nl)
HI!p"n)(!p"nl-l!p"n)(!p"nIH
[HI!p"n) (!p"n Il-
and therefore
(10.43)
Since the weights pn are constants, the ensemble averaging (10.8) over (10.43)
can be performed directly:
LpnWtPI<lm) = Lpn[H,PI.pn)l·
nEE nEE
Finally we arrive with (10.8) at
WtPE = [H, PEl. (10.44)
Hence, we have derived an evolution equation for the density operator of
the quantum-theoretical mixture. It can be understood as the quantum-
mechanical analogue to Liouville's equation of classical many-particle mechan-
ics.
With the help of the evolution equation for the density operator we succeed
in deriving the equation for the time evolution of an arbitrary observable. 0
denotes the Schrodinger operator, which is assigned to the observable (O)E
and does not explicitly depend on time, i.e. dO / dt = aD/at = o. We consider
now the change with time of the ensemble average (10.7)
(O)E = Tr(pEO) ,
and indicate some of the intermediate steps. First,
mEE
+(!p"mlpEo(atl!p"m»)] .
Making use of the Schrodinger equation (10.42) and the evolution equation
(10.44) we deduce
EXERCISE
Problem. Determine the form of the density operator of a mixture for the
case in which information is given about its pure states.
it is the sum over projectors onto the pure states of the ensemble.
We can also provide another ansatz for the density operator, which is
equivalent to (1). For this we only have to take into account that on the one
hand the sum occuring in (1) represents a part of the complete unity operator
in Fock space and that on the other hand all expectation values always imply
ensemble averaging, i.e. taking the trace with the density operator. We write
(1) as
p = pi - pQ, (2)
where the operator Q simply represents the remaining sum over all projectors
onto pure states, which do not belong to the ensemble. Thus, it holds that
for nEE. (3)
Once the action of Q onto one l\]in) from the ensemble vanishes, this part in
the density operator does not contribute in all possible trace expressions that
contain p. In this respect, the form of the density operator equivalent to (1)
is simply given by
pi = pi. (4)
We still have to determine the weight p. It follows in a straightforward manner
from the general property Tr(fJ) = 1 of density operators. We arrive at the
result
pi i i (5)
Tr(i) N'
where N reflects the number of possible pure states of the ensemble.
With absolutely no information about the pure state of the system the
density operator can be set proportional to the unity operator.
10.3 Dynamics of a Quantum-Statistical Ensemble 275
EXERCISE
Solution. Only the spin part of a possible pure many-electron state is of im-
portance. The pure spin states of the system are the two possible eigenspinors
of the operator (j z. The two spinors
(1)
form the ensemble. For the construction of the density operator we need a
representation of the projectors I!lin) (!lin I onto the pure spinstates Xl±l in
2 2
first place. They result as dyadic products:
(1, 0) (~ )
(3)
(0, 1) (~)
Hence, we obtain the form
p = p+F+ + p_F_ (4)
for the density operator. Now we have to determine the weights p+ and p_,
i.e. the probabilities for the realization of a pure state with a z projection of
the spins of + ~ and - ~, respectively. In general it has to hold that
(5)
ms
In the preceding subsections we have learned how the existing lack of knowl-
edge about the state vector Itli) is reflected in the density operator for the
description of a state of a physical many-particle system. We have also real-
ized that the uncertainty determined by Pis restrained from below and above
by two, so-to-say, extreme density operators. The lower bound is fixed by the
density operator of the pure state:
(10.46)
This implies that the state of the system is entirely known. On the other hand
we have experienced the density operator
, i i
(10.47)
pu = Tr(i) = N'
which has to be chosen when there is full ignorance about the state of the
system (see Exercise 10.4 and 10.5); the index 'U' stands for 'unawareness'.
Often it is important to possess a meaningful measure for the 'deviation'
of the density operator fJ from its pure state PI '/I). This measure can also
be understood as a measure for the missing information about the system. In
accordance with information theory this measure, which we will simply denote
as deviation in the following and which is assumed to be a positive number
a, is defined as a functional a[fJ] of the density operator in the form
alp] = -Tr[pln(fJ)]· (10.48)
The deviation alfJ] can also be understood as a measure for the disorder of the
system. Here we will not justify (10.48) from the information theory point of
view and refer to Example 6.9 instead, but we will straightforwardly prove the
most important properties of a[fJ]. We proceed with the eigen representation
of the density operator:
fJltlin) = pnltlin) with Lpn = 1 and 0 <::;; pn <::;; 1.
n
(10.49)
n
10.4 Ordered and Disordered Systems: The Density Operator and Entropy 277
compare with (12) of Example 6.9. Since 0 :::; pn :::; 1 holds for all weights, we
deduce at first positive definiteness:
o-[p] 2: o. (10.50)
Next we show that the measures 0" [Pu] and O"[PIIP)] corresponding to the density
operators Pu and PIIP) are indeed upper and lower bounds for every other O"[p],
so that
(10.51)
Once all weights pn except one (p' = p = 1) are set equal to zero we obtain
from the eigen representation (10.49) the density operator PIIP) of a pure state.
The limit limx_o x In(x) = 0 leads to
(10.52)
As it should do, the deviation 0" vanishes exactly for the density operator of
the pure state since it implies a complete knowledge about the state vector of
the system. In order to determine the upper bound for 0" we have to calculate
O"[pu]. The density operator pu originates from (10.49) in such a way that we
set all weights equal to pn = p = liN. This yields
o-[pu] = - ~
N 1In (1)
N N = - NIn N~ 1
lIN
1
-N(1n1-lnN)N = In(N). (10.53)
:t O"[p] = O. (10.55)
We should mention another aspect here. The deviation 0" also represents a
measure for the disorder within a system. This becomes plausible once we
278 10. Basics of Quantum Statistics
remember, for instance, the example of the unpolarized electron system (Ex-
ercise 10.5). For this system the density operator was Pu = i/ Tr(i), which led
to the largest deviation IJ = In(2). Unpolarization means arbitrary orientation
of the single spins. With respect to spin orientation the system is completely
disordered. However, once a certain spin orientation is favored (magnetiza-
tion) this reflects a state of less disorder. Hence, the deviation IJ also becomes
smaller:
IJ = -[(1 - p) In(l - p) + pln(p)] :::: In(2), p :::: 1/2,
where p represents the weight of the less favored spin orientation. Once the
system is in a pure state, i.e. all spins are oriented in the same direction
(complete magnetization), IJ vanishes. Then the system is in a state of largest
order.
Another conclusion is at hand: (10.55) implies that for the case of an
isolated system, which is left to its own, the degree of disorder does not change.
Detailed studies show that the deviation IJ increases because of the influence
of external stochastic perturbations. The measure IJ behaves like an order
parameter. In fact, IJ is related to the thermodynamic entropy. The quantum-
mechanical definition of entropy is given by
S = kBIJ, (10.56)
where kB denotes Boltzmann's constant (see Examples 6.6 and 6.9). From
experience we can say that a system always tends to a state of maximum
entropy. In other words, the density operator p of a quantum-mechanical
mixture always maximizes the deviation dp]. In the next subsection we will
consider the extremal properties of IJ to pin down the density operators for
stationary ensembles. For further information on these subjects we refer the
reader to the lectures on thermodynamics and statistical mechanics. 5
where Gin = (y;nIOily;n). Since the variations 6pnn fulfill only the condition
L6 pnn = 0,
n
and are otherwise arbitrary, the relation (10.63) can be fulfilled only once
every square bracket vanishes for its own. This immediately leads to
(10.65a)
(1O.65b)
In addition to the mean energy the mean particle number (N)E = N is also
kept fixed for the so-called grand canonical ensemble. The corresponding vari-
ational equations
80'[.0] = 8 Tr[pln(p)] = 0 (10.68a)
together with the conditions
Tr(p) = 1, E, (1O.68b)
lead to the grand canonical density operator
EXAMPLE
and
(2)
PGk = _1_ exp[ -t3(if - /-IN)] = Zl exp (-t3 :L(t a - /-l)n a ). (5)
ZGk Gk a
where the sum runs over all possible configurations of occupation numbers
n a ,. The ith one-particle state may have quantum numbers ai' For a general
many-particle state everyone-particle state contained in it appears with occu-
pation numbers according to (3) or (4). Hence, we are allowed to interchange
the summation with the product series: first we sum over possible individual
282 10. Basics of Quantum Statistics
Example 10.6. occupation numbers n and then we take the product over all allowed sets of
quantum numbers ai. We get:
(8)
<> n
For fermions we arrive at
zb = II {I + exp [-,8(10<> -IL)]} (9)
n" = Z
1
Gk
L n" exp [-,8(E" -1L)n,,]
{".nV···nO i "·}
'"
of the ensemble considered.
11. Structure of Atoms
H =
pi + -p~ - -Ze 2 - -Ze 2 +
-
e2
. (11.1)
2m 2m Tl T2 ITl - T21
The first two terms in (11.1) describe the kinetic energy of the two electrons
with mass m. The next two terms represent the attractive interaction between
the electrons and the nucleus. Z is the nuclear charge and Tl and T2 represent
the position vectors of the electrons from the nucleus (see Fig. 11.1). The last
term appears because of the repulsive Coulomb interaction between the two
electrons. The kinetic energy of the nucleus is neglected in (11.1), because
the nucleus is assumed to stay motionless in space as a result of its large
mass. The Hamiltonian H is supplemented by Ha , indicating the spin of the
ao = ;,,2/me 2 is Bohr's atomic radius. Because the spatial part of the wave-
function is symmetric under exchange of the electron indices, the spin part
has to be antisymmetric. This can be fulfilled only by the singlet state
Eg = ---
Z2 2
e
(11.5)
ao
The superscript 0 refers to the zeroth order of perturbation theory. The sub-
script 0 reflects the ground state.
Now we will calculate the perturbative contribution of the electron-
electron interaction. In first order the change in the energy is
e2
D.E = ("pI I 11"p) , (11.6)
TI - T2
/j,E J
d3rl d3T2 [_1_ ( Z) ~l2 e-Z(rl +r2)/ao e2
Vii ao
T
ITl - T21
J
d3rl d3T2 ~ (~)6 e2 e-2Z(r,+r2)/aO. (11.7)
7f2 ao ITl - T21
To solve the integrals, we use the Fourier representation of 1/1Tl - T21 (see
Exercise 11.1):
h -
1
T21
= J d3 k e ik .(r,-r 2 ) 47f
(27f)3 k2 .
(11.8)
Here the integrals over Tl and T2 have been exchanged with the integration
over k. The integral over Tl is complex conjugated to the integral over T2,
which explains the last factor in (11.9). Now the following relation holds (see
Exercise 11.1):
Eo = - Ze 2 (Z -~) . (11.13)
ao 8
The calculated energy of the ground state represents an upper value for the
real energy. This can be seen as follows. The wavefunction in first-order per-
turbation theory can be regarded as a test wavefunction. If the expectation
value of the corresponding Hamiltonian is generated, an upper limit is always
288 11. Structure of Atoms
obtained. This statement is nothing more than the variational principle from
Ritz. 1
EXERCISE
rOO dx 57r
(c)
io (1 +x 2 )4 32
Solution.
(a) We calculate the integral
I
1
= J d3k eik .r 47r .
(27r)3 k2
(1)
4 1
h =~
00
k 2 dk j+~ cos8d8 127r dlP zeikrSinll.
1 (4)
(27r) 0 - ~ 0 k
After the intergration over lP is performed, the substitution x = sin 8 gives
h = ~ 27r {= dkj+l dxe ikrx
(27r)3 io -1
h =
47r
--27r2
00
dk - kr
sin -
1
(27r)3 0 kr
47r
--27r2-1 00 dTsin
--
T
(27r)3 roT
47r 1 7r 1
--27r2- - (6)
(27r)3 r 2 r
(b) For the calculation of the integral
47r 1 o
00 kr _
sin _
r 2 dre- ar _
kr k
47r 1 0
00
rdre-<>rsinkr. (8)
1
This results in
47r 8 [ e-<>r ] 00
--- (-asinkr - kcoskr)
k 8a a + k
2 2 0
47r 8 k B (2 2)-1
-TBaa2 +k 2 = -47r 8a a +k
2 2 -2 87ra
-47r(-1)(a +k) 2a = 2.
(a 2 + P)
(c) We employ the analytic continuation of the integrand to the complex
plane and apply the residue theorem in the following way:
lim
R->oo
f
-R
+R R
f(x)dx = 27riLRes(f(z),z=zk) ,
k=1
where 8'(Zk) > 0 is valid for all Zk, i.e. we close the integration contour in the 1m
upper half-plane.
If Zk is a pole of order n, the residue is determined by
g(z) (z _ i 4 1
) (z - i)4(z + i)4
and
g(z) (-4)( -5)( -6)(z + i)-7
the result
5
Res (J(z); z = Zl) = ~(-4)( -5)( -6)(2i)-7 25 i .
3!
The integral thus has the value
roo dx 5
-Jr.
io (1 + x 2)4 32
_ Ze 2 _ ~ (_ Zeffe 2 ) (11.18)
r - Zeff r .
Now the effective charge appears in the expectation value. The energy contri-
bution of the potential, i.e. the expectation value, is
We take this value and the energy of the ground state becomes
E2e(z) = - ( Z - -5)22
~ = -2 (Z5
- )
- 2 Ry. (11.22)
o 16 ao 16
For He we obtain
E6e (Z = 2) = 5.695 Ry. (11.23)
Compared to the preceding calculation (11.16b) the additional energy shift is
enormous. However, the accuracy is not sufficient, because the same method
leads to an incorrect result for the H- atom, for which we obtain
Eo (H-) = -0.945 Ry , (11.24a)
292 11. Structure of Atoms
The Hartree method is constructed in such a way that it works for large atoms.
It is a "mean-field" method. This means that the electrons create a central
potential as a mean field, in which they are moving. The mean potential is a
function of the electron wavefunctions. First we construct an ansatz for the
mean potential and solve the Schrodinger equation with it. Then a new mean
potential is reconstructed with the help of the wavefunctions obtained. With
the new mean potential the Schrodinger equation is solved again, to obtain
improved wavefunctions. These iteration steps are repeated several times until
convergence is obtained, i.e. until we reach consistency. We are now going to
explain this procedure in more detail.
To begin with we construct an ansatz for the wavefunction of an N-electron
system, which fulfills our physical expectations as well as possible. The sim-
plest ansatz is a product ansatz, which constructs the entire wavefunction
from None-particle wavefunctions:
(11.25)
ipi(k) represents the single-particle wavefunction of the kth electron. The
wavefunction (11.25) is not antisymmetric, so one has to be aware of unphys-
ical states. This can be taken into consideration: for example, each electron
is placed into another single-particle state. In this way the Pauli principle is
fulfilled.
Initially, the mean potential is constructed as follows. The mean charge
density of the electron in state ipj at the point r' is given by -elipj (r') 12.
Considering the Coulomb interaction between the electrons, we find that the
potential energy of another electron at position r is
J r'Ir ~2
d3 r'llipj(r)1 2 . (11.26)
2 See, e.g., H.A. Bethe, R. Jackiw: Intermediate Quantum Mechanics, 2nd ed. (Ben-
jamin, London 1968).
11.3 Thomas-Fermi Method 293
Vi'(r) = -1
47T
J . d'Psm19d19Vi(r). (11.29)
This method was introduced the first time by Hartree. It is therefore called
the H artree method.
To determine the mean behavior of the potentials Vi (r), Fermi and Thomas
developed a semi-classical approach, which can be applied to many-electron
atoms. First the mean potential (11.27) for the ith electron is approximately
given by
V (r) = J 2
d 3 r,_e_ _ '"
Ir -r'l L
J
I'P· (r')1 2
J
_ Ze 2
r
. (11.30)
Contrary to (11.27) the sum also runs over j = i: it is supposed that for
a many-electron atom the influence of one electron can be neglected. The
electron density is given by
L l'Pj (r')1 2
= p(r') . (11.31)
j
How does this density depend on the potential V (r)? This is a self-consistent
problem, since V(r) again is given by p(r'). According to the Thomas-Fermi
method, 'Pi(r) is replaced by a plane wave
'Pi (r) rv e(i/h)Pi(r).r. (11.32)
The justification for this is that the kinetic energy of the electrons in heavy
atoms is very large; the potential varies only slowly compared to the spatial
294 11. Structure of Atoms
change (oscillation) of the electrons. The total value of the linear momentum
Pi (r) depends on the energy ti of the ith electron via
r
Fig. 11.3. Electron dis-
tribution in the ground
state of the Thomas-
Fermi model. The kinetic
energy of an electron at
the Fermi edge EF is
marked for various dis-
tances r from the center
For a given r, electron states exist for which the kinetic energies are be-
tween 0 :::; Ekin (r) :::; tF - V (r). The maximum momentum is given by the
kinetic energy of the electron with energy tF, i.e.
PF (r) = {2m [EF - V (r)]} 1/2 . (11.35)
Only electrons with energy ti 2: V (r) are considered. Energetically lower lying
electrons are hardly contributing to the density at position r. The electron
density at r is then given by the sum over all electrons allowed to stay at r,
i.e.
p (r)
P~ (11.36)
37r 2 h3 .
In correspondence with (1l.32) we have used J d 3n = J d 3 kL 3 /(27r)3 for
the number of plane waves in volume L3; consequently the density becomes
J d 3n/L3 = (2/(27r)3) J d 3k. The factor 2 takes into account that two elec-
trons (with spin projection +1/2 and -1/2) can occupy a plane wave state.
With (11.35) and (11.36) we obtain
11.3 Thomas-Fermi Method 295
() _ {2m[EF-V(r)]}3/2 (11.37)
p r - 37r 2 n3
for the electron density. Insertion of (11.37) and (11.31) into (11.30) deter-
mines the mean potential V (r) of a many-electron system. The following in-
tegral equation results:
V(r) = JdVlr~2r'IP(r')
J d3r,e2 {2m [EF - V (r,)]}3/2
Ir - r'137r 2 n 3
(11.38)
There is also the possiblity to determine the potential V (r) with a differential
equation. To do this we transform (11.30) into a Poisson equation:
for r> 0, (11.39a)
where ,1 = V . V represents the Laplace operator. Here we have used the
relation ,1lr - r'I- 1 = -47r!5Ir - r'l. If we additionally require a rotationally
symmetric potential V(r), we obtain
1 a 2 a 4e 2 3/2
r2 orr or [-V (r)] = 37rn3 {2m [EF - V (r)]} (11.39b)
by using (11.37) and (11.39a). Notice that the angular part of the Laplace
operator disappears once applied to V(r). Equation (11.39b) represents the
determining equation for V (r). It is a nonlinear differential equation. To solve
the problem we use the asymptotic behavior of V(r) for r ~ 0 and r ~ 00.
In the first case we expect V(r) ~ -Ze 2/r, since for r ~ 0 the screening
of the Coulomb potential of the central nucleus by the electrons decreases.
If r is equal to the atomic radius R, we require EF - V(R) = 0 for neutral
atoms, i.e. from outside no charge is observed. In this case the electrons totally
screen the central charge. If the atom is k times ionized and possesses only
(Z - k) electrons, a Coulomb potential of the form (Z - k)e 2 /r has to show
up outside the atom. For simplicity we restrict ourselves to neutral atoms.
Since for neutral atoms we set V(R) = 0, EF = 0 is fixed, too. This will be of
use later. Let us make the ansatz
Ze 2
V (r) = --¢(x) (11.40a)
r
and introduce the new combination for the variable r and of the parameter
ao
x
(11.40b)
b =
With this we obtain for (11.38) and (11.39b) the differential equation
One boundary condition for ¢(x) is ¢(O) = 1, because for r ---+ 0 the potential
V(r) has to agree with the Coulomb potential. Unfortunately (11.41) can be
solved only numerically. The solution has the following form:
1 - 1.59x for x ---+ 0
¢(x) = { . (11.42)
144/x 3 for x ---+ 00
For x ---+ 00 this solution behaves unphysically. ¢(x) vanishes only for x ---+ 00
and not for x ;:: (R/b )Zl/3, i.e. for x larger than the radius R of the electron
distribution. For x (or r) ---+ 0 the potential has the form
Ze 2 e 2
V(r) ~ - - + 1.80Z4/ 3 _. (11.43)
r ao
The reason for the appearance of the last term in (11.43) originates from
the finite contribution of the electrons near the origin. The potential at the
origin increases because of the outer electron shell. There is another interest-
ing property predicted by the Thomas-Fermi model: Since ¢(x) in (11.40a)
does not depend on atomic parameters (radius, charge, etc.), the form of the
atom is always the same. Only the length scale is changed. Because the scale
parameter r is proportional to Z-1/3 in correspondence with (11.40b), the
extension of the atom decreases with Z. According to (11.43) the second term
dominates for large Z and r ~ ao, i.e. V(r) can be approximately set equal
to V(r) = 1.80(e 2 /ao)Z4/3 near r ~ ao. With this we obtain for the electron
density p(r) [see (11.37)] of neutral atoms (tF == 0) the behavior p(r) rv Z2.
Hence, according to (11.36) it follows a behavior rv Z2/3 for the mean elec-
tron momentum. A typical atomic density distribution, calculated with the
Thomas-Fermi model, is sketched in Fig. 11.4.
p(r)
140
120
100
80
Fig. 11.4. A typical den-
sity distribution of a large 60
atom (mercury) within
the Thomas-Fermi model. 40
r is given in atomic length
units 20
0 0.2 0.4 0.6 0.8 1.0 r
V.elf () _ V () l (l + 1) !i,2
r - r + mr
2 (11.44)
11.4 The Hartree-Fock Method 297
-
A bound state exists only if Veff(r) has only one minimum for 1= 0 states for
Z ::; 4; for 4 < Z ::; 19 it has a minimum for I = 0 and I = 1, and so forth.
This means that for Z ::; 4 only bound s states exist, but no p states, etc.
All this is in good agreement with experiments, e.g. the first bound p state
appears at Z = 5.
Finally it should be mentioned that for small and large distances from
the atomic nucleus the Thomas-Fermi approximation breaks down. For small
distances the solution of the potential varies too strongly and for large dis-
tances the behavior of the potential is unphysical (see remark given above).
A drawback of the Thomas-Fermi model is the neglection of the exchange
interaction of the electrons. 3
E
\
\
\ l(l +1),,2
\" --z;;;-;:2
\ Fig. 11.5. The sum of the
" ........ Coulomb potential V (r)
.......... _
-------
Vcrc(r)
and centrifugal potential
-- -------------
[1(1+ 1)n?l/2mr 2 yields the
.,..- effective potential Ve/f(r).
,/' If a minimum arises at
,I-- V(r)
E < 0, then stable atoms
I
with angular momentum
,
I
I exist. For very large an-
gular momenta this is not
the case; the atoms are un-
stable
We have not yet considered the exchange effect of the electrons; the Hartree-
Fock method will take care of that. For that purpose a test wavefunction
is constructed, which is already entirely antisymmetric. One uses the Slater
determinant of one-particle states:
All one-particle wave functions are already orthonormal, i.e. (<pil<pj) = Dij. The
expectation value of the Hamiltonian is calculated in the following way. The
kinetic energy is given by
3 Modern versions of the Thomas-Fermi model can be found in the book by R.M.
Dreizler, E.K.U. Gross: Density Functional Theory - An Approach to the Quan-
tum Many-Body Problem (Springer, Berlin, Heidelberg 1995).
298 11. Structure of Atoms
(11.46)
A partial integration has been used and Gauss' law was applied. The interac-
tion energy of the electrons with the nucleus is given by
~J d3 r l<pdr) I2 -:;:-.
- L..- Ze 2 (11.47)
.=1
In order to calculate the contribution of the electron-electron interaction it
is of advantage to employ second quantization, which we encountered in the
third chapter. There the electron-electron interaction takes on the following
form [see Chaps. 4 and 6, especially (6.1) and Exercise 6.1]:
1 "J
2:1 3 e3' At2
At ,A
/ ,A )
d rd r Ir_r'I\1/>s(r)1/>s' (r)1/>s' (r)1/>s(r) . (11.48)
The subscripts 8 and 8' attached to the field operators ¢!(r), ¢s(r) refer to
the spins. The expression inside the bracket "(- .. )" describes the annihilation
and subsequent creation of two electrons at points rand r' with spin 8 and
8', respectively. This interaction is weighted with the factor e2 /1r - r'l. The
bracket itself gives the expectation value. This expectation value provides the
so-called pair-correlation function:
Nl (r) ¢1, (r') ¢s' (r') ¢s (r»)
= N! (r) ¢s (r) )N!, (r') ¢8' (r') )
At A , At ' A
- bS8'(1/>s (r) 1/>8 (r ) )(1/;8 (r) 1/>s (r») . (11.49)
This will be proven in Exercise 11.2. The second term allows for the possible
exchange of electrons at positions rand r'. The density of the electrons at
point r is
p(r) = L Nl (r) ¢8 (r») (11.50)
EXERCISE
Solution. If <pier) are the orbital wavefunctions and i includes all quantum Exercise 11.2.
numbers needed, then the field operators ~!(r) and ~s(r) are given by
(2)
and
~s(r) = L<Pj(r)hjs (3)
j
[compare with (3.43)]. The left-hand side of (1) can then be written as
All states, from the lowest up to the highest, are occupied. By anticommuta-
tion on both sides (bra and ket) we find that 11/1) can be written as
(6)
which are the corresponding creation operators belonging to the two annihi-
lation operators in (4). Since we apply the anticommutation on bra and ket,
no sign is changed. The combination (ks') and (Is) does not appear in 11/1'),
since otherwise two electrons would be in the same orbit. Applying hks' his on
11/1) gives
(7)
The same result is obtained once h!);;s' is applied to the left-hand side. There
is another simplification: hks,hl s annihilates states (ks') and (ls). In order to
have the same state in the expectation value on the left-hand side (otherwise
the expectation value is 0), it has to hold that (is) = (ls) and (is') = (ks'),
or (is) = (ks') and (is') = (ls). The latter gives an additional minus sign,
because one has to transform the expectation value into the form
At At A A)
(blsbks,bks,bls . (8)
This leads to
(bi/bjk - bjlbikbss' )(b1kbss' - 1)(b1kbss' - 1) (9)
Inserting (9) into (4) gives
300 11. Structure of Atoms
Furthermore,
ij
Here we have used the fact that the expectation value is only different from
zero if the same state is annihilated in bra and keto For r' = r in (11) the
squared modulus appears. Inserting this formula into (10) yields
(if) = - L
,
j d r (h2I v :(rW+ Z;2 1'Pi(r)1 2)
3
2
+~2 "
~
j d3 rd 3 r' e2 l'Pi(rWI'P ·(r')1 2
Ir - r'l J
'J
Here 6SiSj means that the summation is only over such states i and j, that have
the same spin quantum number. In order to obtain the equation of motion for
'Pi(r), (11.52) has to be minimized. It needs to be guaranteed, that {'Pi(r)}
remains an orthonormal set. The variation with respect to 'Pt (r) immediately
provides the following nonlinear equation:
( - - -ze
-h2fJ.
2m r
2)
'Pi (r) + j 2
d3r, - -e - L"
Ir - r'l
'P* ( 'r)
. J
J
x ['Pj(r')'Pi(r) - 'Pj(r)'Pi(r')6 siS;l = Ei'Pi(r), i = 1, ... ,N. (11.53)
These are the Hartree-Fock equations. 4 They differ from the Hartree equations
(11.28) by the additional -'Pj(r)'Pi(r') part in (11.53). The antisymmetric
part
4 D.R. Hartree: Proc. Cambridge Phil. Soc. 24 (1928) 89, III; V. Fock: Z. Physik
61 (1930) 126; J.C. Slater: Phys. Rev. 35 (1930) 250.
11.4 The Hartree-Fock Method 301
indicates the amplitude with which the electrons in states i and j can be
found at positions rand r', respectively. This term automatically fulfills the
Pauli priciple in (11.53).
EXERCISE
fWi(r) = (-~11-
2m
ze
r
2
) 'Pi(r) + jd 2
3 r,_e_ - " 'P*(r')
Ir - r'l L
)
)
(1)
Obviously
V(r) = e 2+
-Z-
r
jd 3,
r -e -
Ir -r'l
2 (Z *" )'P(r) )
"'P(r
L)
)=1
)
(2)
represents the local potential energy consisting of the central potential energy
-Ze 2 /r and of the mean energy of the electrons in the field of all electrons
(Hartree energy)
where
2 Z
U(r, r') = -Ir ~ r'l L 'Pj (r')'Pj (r)t5 siSj •
(5)
)=1
Exercise 11.3. which enforces the reality of the eigenvalues ti. In order to convince ourselves
we rewrite (6) for <'ok(r):
After multiplication of (6) with <'ok(r) and (8) with <'oi(r), subsequent integra-
tion over r and subtraction of both equations yields
If the sum runs over all states (Le. Z ----> 00) and not only over the occupied
states, it would result in
00
and
(13)
The nonlocal interaction in (5) or (6) would then be local. Equation (13)
remains approximately valid for the nonlocal density (11). Often it is practical
to consider the mean potential
These relations define U(r) and Ueff(r). In Example 11.4 the exchange poten-
tial will be considered in more detail and a handy expression will be derived
within a reasonable approximation.
11.4 The Hartree-Fock Method 303
has two electrons in the Is orbit, two in the 2s shell, and three in the 2p shell.
The electron configuration is then given by (ls)2(2s)2(2p)3. The number in
front of s or p indicates the main quantum number n.
EXAMPLE
X = -~ I)SiSj
t,J
J d 3rd 3r'Ir ~2 r'l <p:(r)<Pi(r')<pj(r')<pj(r)
-e2Jd3rld3r2 ~ p;(rl,r2) , (1)
r12
where rl2 = Irl - r21 and ps(rl' r2) is given by (11.51). A factor 2 results,
because the combination Si = Sj appears twice in the sum over i and j. We
now introduce relative (r = rl - r2) and center-of-mass coordinates [R =
(rl + r2)/2]:
(2)
-1 L e.1 ik·(rj-r 2)
V .
J
l
(27r)3
47r kF (R) sin kr 2
-()3
27r 0 r
-k-k dk
1 1
27r 2 r3 (sin kp(R)r - kp(R)r cos kp(R)r)
12
X::::: -e 47r447r J r 2 dr d 3R ~1 [1 . kpr - kpr cos kpr)
r3 (sm ]2
-e2-1-Jd3Rk~(R) . (4)
47r
3
-J 3r'Ir ~ r'l L
d
J
rpj(r')rpj(r)rpi(r')8 si ,sj ---> _e 2 (~p(r) r/3 (6)
instead of the nonlocal term in (11.53). The idea of this approximation goes
back to Slater,6 the correct implementation leading to (6) to Kohn and Sham. 7
have similiar chemical properties. Hence, these atoms are arranged in the
same column of the periodic system. The last column on the right-hand side
indicates rare gases. For rare gases the outer shell is entirely filled, Le. all
electrons are paired. For this reason the chemical activity is low, or rather is
nonexistant. Alkali elements are presented in the first column of the periodic
system. They are characterized by a weakly bound s electron in the outer
shell. The alkali metals are for that reason chemically very active.
We draw particular attention to the recent discovery of elements with
Z = 104 and Z = 112.8 They are precursers of the predicted superheavy
elements,9, expected to be centered around Z = 114. Their chemical prop-
erties had also been predicted by utilizing Dirac-Hartree-Fock calculations.
The word "Dirac" in this context means that Hartree-Fock calculations as
described here are based on the relativistic Dirac-Hamiltonian. It will be in-
teresting to watch the discovery of further superheavy elements in the near
future. 10
8 See e.g. S. Hofmann, G. Miinzenberg, et al.: Z. Phys. A 350 (1955) 277 and 288,
and further references therein.
9 W. Greiner: Int. J. Mod. Phys. E 5 (1996) 1.
10 The nuclear aspects of superheavy nuclei are discussed in W. Greiner, J.A.
Maruhn: Nuclear Models (Springer, Berlin, Heidelberg 1996); J. Eisenberg, W.
Greiner: Nuclear Theory, Vols.I-3 (North-Holland, Amsterdam 1985).
11.6 Splitting of Orbital Multiplets 307
(11.58)
The triplet spin function XIMs is symmetric under exchange of both particles.
This situation is depicted in Fig. 11.6. If the electron-electron interaction is
neglected, then all states of the same main quantum number would be degen-
erate. The electron-electron interaction causes states belonging to the same
configuration but with different L or S to differ somewhat in energy. If various
configurations differ by several eV, then the difference between the states be-
longing to one configuration is given by severallO- I eV. States with the same
Land S are degenerate. Once the spin-orbit interaction is added, this degen-
eration vanishes. The splitting only amounts to 10- 2 -10- 3 eV for light atoms.
For heavy atoms the spin-orbit coupling becomes important and cannot be
neglected or treated perturbatively.
E
,..-------4= .... Spin-orbit splitting
(order of magnitude "" IO- L IO-3 eV).
" Different Land S multiplets
-.,...-f------f:::=:: .... [see (11.56)] of the same
\------t---'"": -- configuration.
'--------1= / They lie "" 10- 1 e V apart.
'T'<~"1 'V 5
Different configurations.
Fig. 11.6. Configurations
and their splitting into
'pm
L - S multiplets. The lat-
ter split further as a result
of the spin-orbit coupling
x
If [Yl-l(DIYll(D2) + 2Yoo(Dl)Yoo(D2)
+Yll(Dl)Yl-l (D2)] for M2 =0
If [YlO (f?I)Y1- 1(D 2) + Y1- 1(D 1)YlO (f?2)] for M2 =-1
for M2 = -2.
(11.59)
Here D 1 ,2 denote the polar angles of the electron 1 or 2, respectively. Obviously
the wavefuntions (11.59) are symmetric against exchange ofthe two electrons.
For the entire wavefunction to be antisymmetric the electrons have to be
coupled to spin 0:
XS=OM,=O =
, ~ (Xll(I)xLl(2)
v2 22 2 2
- XL1(I)Xll(2))
2 2 22
, (11.60)
z analogy, illustrated in Fig.ll.7. Two balls are connected with two strings at
point O. The two balls are kicked and obtain an orbital angular momentum. It
o depends only on the velocity v, because the distance R between the balls and
the z axis is determined by the velocity v (interplay between graviational and
centrifugal force). If the orbital angular momentum increases, the distance
between the two balls becomes larger. The same situation holds in an atom.
The larger the orbital angular momentum is, the larger the distance between
the electrons becomes. The role of the gravitational force is adopted by the
central Coulomb force of the nucleus. The electrons behave similarly to the
balls of our classical example. An increased distance leads to a lower repulsion
between the electrons.
Before we now consider the spin-orbit interaction more closely, a technical
point has to be mentioned. For the construction of the Hartree-Fock state, the
Fig. 11.7. Classical me- wavefunction is built from a determinant involving the orbital wavefunctions.
chanical model for illus- However, often many angular momentum states exist for the same configura-
tration of Hund's rules: tion. The entire wavefunction cannot be described by a single determinant, but
The faster the balls rotate
(i.e. the larger the angular
rather it is described by a linear combination of several determinants. Gener-
momentum becomes) the ally the Hartree-Fock state therefore is a linear combination of determinants.
further they fly apart: 'P As an example we consider the L = S = 0 state of two p electrons [consult
and, consequently, R in- the preceding discussion, (11.59) and (11.60)]. Except for the normalization,
crease the wavefunction is given by
[Yll (I)Y1 - 1 (2) + Y1 -l(I)Yll (2) - Y lO (I)YlO (2)]
x [XI(I)X!l(2) - X!l(I)XI(2)] . (11.63)
2 2 2 2
The factors attached to each term are given by the Clebsch-Gordan coeffi-
cients [see (11.59) and (11.60)]. The wavefunction (11.63) can be expressed
by the following linear combination of determinants:
1
Yll(I)X~l (1)
2
1
Yll (2)x~ l (2)
2
1 1
YlO (I)XI (1) YlO(I)X~l (1)
2
1 1
2
(11.64)
Y lO (2)XI (2) YlO(2)X~l (2)
2 2
EXERCISE
Solution.
Boron (B): As can be deduced from the periodic table there is a p electron.
This can be coupled to J = 1/2 or 3/2 (L = 1, S = 1/2). Since Land S are
fixed, only the third of Hund's rules is important. It states that the minimum
value of J has to be taken because the shell is filled to less than half. As a
consequence 2p 1 represents the ground state.
2
Carbon (C): The following orbits are allowed:
ID 2, 3p O, 3p 1 , 3p 2, and ISO,
which altogether amounts to 15 states. According to the first of Hund's rules
the state with the largest S is lying lowest. This is the case for the states
3p O, 3p 1 , and 3P 2. The second of Hund's rules makes no difference since it is
L = 1 for all states. The third rule states that 3p o is the energetically lowest
state, because the 2p shell (six possible states) is filled only to one third (two
electrons in the p shell).
Nitrogen (N): Nitrogen contains three electrons in the p shell. Altogether
we have (6 x 5 x 4)/(1 x 2 x 3) = 20 possible states. This number can be
deduced as follows. In a p orbit (two spin multiplied by three orbital degrees of
freedom) there are six possible states. The first electron then has six states to
choose from. The second can find a place in only five states (Pauli's principle)
and the third in only four. The denominator takes into consideration that
the electrons are undistinguishable (3!). Three p electrons can be coupled
to L = 0,1,2. L = 3 is not possible, because the wavefunction is totally
symmetric with respect to exchange of coordinates and we are not able to
construct an antisymmetric state with three electrons (this can be done for
two electrons at most). For the total wavefunction to be antisymmetric, L = 1
and 2 can belong only to S = 1/2 and L = 0 only to S = 3/2. This can be
seen if the states are explicitly coupled to L = 0,1 (or S = 1/2, 3/2 in spin)
and a permutation is applied to the functions. L = 2, S = 1/2 splits into
2D3/2 and 2D5/2 as a result of the spin-orbit interaction. L = 1 and S = 1/2
splits into 2P 1/ 2 and 2P 3/ 2; L = 0, S = 3/2 leads to 4S 3/ 2 . In total we have
20 states. The 4S 3/ 2 state possesses the largest value and therefore it is lowest
in energy.
Oxygen (0): Oxygen has four electrons in a p orbit, which therefore is more
than half full. Always two electrons have to be paired. The property of the
ground state is then determined by the two remaining electrons. The problem
therefore is equivalent to the C atom once we consider the holes instead of
the electrons. There are exactly two. By applying the first of Hund's rules, we
find that the 3p O, 3p 1 , and 3P 2 states are lowest. But now the shell is more
than half full, so according to the third of Hund's rules the state 3P 2 is lowest.
312 11. Structure of Atoms
Exercise 11.5. Fluorine (F): F is equivalent to the B atom. There are five electrons in the p
orbit, i.e. only 1 hole exists. It is L = 1, 8 = 1/2, i.e. the possible states are
2P1/2 and 2P 3 / 2 . Since the shell is more than the half full, the 2P 3 / 2 state lies
energetically lowest.
Neon (Ne): In Ne the p shell is full, i.e. all electrons are paired. There is only
one state: ISO.
The spin-orbit interaction is due to the magnetic moment of the electron with
the magnetic field caused by the electric current of the central nucleus. In the
electron's rest frame the nucleus is orbiting and thus causing a magnetic field
at the position of the electron. One may also understand it by remembering
that a particle (electron) moving with the velocity Ve through a electric field
E (of the central nucleus) sees a magnetic field proportional to Ve x E. We
shall derive the spin-orbit interaction from the relativistic Dirac equation in
Example 11.7. The result is
(11.65)
Here "i" represents the particle index and Si the spin of the ith electron.
G(ri) has the form
(11.68)
Here we have separated the spatial part from the spin part. The first matrix
element in (11.69) is the element of Gi between states I}/b'1ML
and 1}/2;~~;~.
The second matrix element is related to the spin part. Here it is useful to
apply the Wigner~Eckart theorem. It states, that
(11.70)
i.e. the matrix element of the vector operator Si is proportional to the matrix
element of the total spin S with respect to the same states. This is explained
in more detail in Example 11.6. The proportionality factor does not depend
on Ms and MS'. A similar relation holds for the spatial part:
(11.71)
With this, the matrix elements for the spin~orbit interaction of the ith electron
read
(NLSMLMsIGi · SiINLSM~LM's)
'" (LMLILILMD' (SMsISISM's) . (11.72)
Summing over all electrons we obtain
(QLSMLMsIHsoIQLSM'sM L )
~(QLS)(LMLILILMD . (SMsISISM's)
= ~(QLS)(QLSMLMsIL· SIQLSM~M's) . (11.73)
The proportionality factor ~(QLS) depends on the form of the operator G i ,
but it is the same for all states of the LS multiplet. We realize the usefulness
of the Wigner~Eckart theorem: From the relative complicated sum over one-
particle spin~orbit couplings (11.65), the structure L . S with total spin S
and total orbital angular momentum L is peeled off in (11.73); we would
not have expected such a remarkable relationship from the beginning. The
L . S operator is easy to diagonalize once the relationship to the total angular
momentum J2 = (L + S)2 is noted. We obtain
(J 2 _L2_S2)
L·S = (11.74)
2
The matrix element vanishes, if J' =1= J and/or M' =1= M. This is simply
a consequence of the fact that Hso is rotationally invariant, i.e. a scalar of
SU(2) [or 0(3)]' while J and M classify the states of the various multiplets and
those within these multiplets, respectively. The value of the matrix element
(11.73) is
314 11. Structure of Atoms
(QLSJMIHsoIQLSJM)
This equation describes the splitting of the energy levels resulting from the
spin-orbit interaction. The splitting of two neighboring levels J and J - 1
within the same LS multiplet (i.e. levels with identical Land S) is now easy
to calculate:
EXAMPLE
The states
IJM), M = -J, -J + 1, ... , J (1)
represent the 2J + 1 eigenvectors of the squared angular momentum operator
)2 and Jz:
)2IJM) = n?J(J + l)IJM), (2)
JzIJM) = nMIJM). (3)
They form a multiplet of SU(2) [or 0(3)] and J can be considered as the
Casimir operator classifying multiplets. If the representation of IJ M) is given
in a fixed coordinate system with the z axis as the quantization axis, then in
a system rotated by the Euler angles a, (3, 'Y these vectors read II
where yJjl with M = -J, ... , J represents the (2J + 1) components of the
irreducible tensor operator of order J.
With
ITJM) (9)
we denote quantum-mechanical states that fulfill the eigenvalue equations (2)
and (3). The quantity T includes all remaining quantum numbers and the
following is valid:
(10)
The Wigner-Eckart theorem now states the following about matrix elements
of vectors (9) with irreducible tensor operators:
(TJMly(klIT'J'M') = 1 (rJlly(k11IT'J')(J'kJIM'qM)
q ..j2J + 1
q = -k, -k + 1, ... , k . (11)
The quantity
(T Jlly(k 1liT' J') (12)
is the reduced matrix element. It depends only on T, T', J, J', and k and no-
longer on q. Its brackets are indicated by a double bar, like (II II). (J' kJIM' qM)
denotes a Clebsch-Gordan coefficient. Obviously the Wigner-Eckart theo-
rem states that dependence of a matrix element on the quantum numbers
M qM is solely contained in a Clebsch-Gordan coefficient. For its proof see
the literature. 14
Equivalent to the transformation property (8) of the irreducible tensor
operators is the validity of the following commutation relations:
12 E is the entire space, which is built up from vectors IJ M) for all J and M E
{-J, -J + 1, ... , J}.
13 See also W. Greiner, B. Muller: Quantum Mechanics - Symmetries, 2nd ed.
(Springer, Berlin, Heidelberg 1994).
14 M.E. Rose: Theory of Angular Momentum (Wiley, New York 1957).
316 11. Structure of Atoms
To = (Sz)i (15)
fulfill the commutation relations given in (13) with k = 1. Hence, (Sq)i,
q = -1,0,+1, represents the components of the irreducible vector operators
Si and S = L:~=l Si (n is the particle number) is the total spin operator.
Using the commutation relations (13) and the relations (14) we obtain
(16)
with q = -1,0, +1. The first matrix element on the right-hand side of (16)
is the reduced matrix element, which is independent of Ms, M~, and q. We
will now show the validity of (16). First we derive the following relations from
(13):
[Jo,T±l] ±nT±l,
[J±,T±l] 0, (17)
[J±,TCf1 ] nhTo.
In (17) we have used the abbreviations introduced in (15).
From (17) and (14) it follows that
(T1MIT±IT'lM') ±(M - M')(T1MIT±1IT' 1M')
= 8M ,M'±l (T1MIT±l IT'lM') .
Therefore this matrix element has to be proportional to (T1MIJ±IT'lM').
Furthermore, by using (14) again it follows from (17) that
(T1MI[l±,T±dIT'lM') = °
8M,M'±1((T1M' ± IIT±lIT' 1M')(lM' ± 21J±11M' ± 1)
- (T1M' ± 21T±llT'lM' ± 1)(JM' ± IIJ±11M')). (18)
This equation can be rewritten in another way:
11. 7 Spin-Orbit Interaction 317
"" C(T±1' J, T, r )6 MM ,
ltv 2
X ((JM'IJ'f IJM' ± l)(JM' ± 1IJ±IJM')
- (JM'IJ±IJM' =f l)(JM'I J'fI JM' =f 1))
C(T±l' J, T, r')6 MM ,(JMIJoIJM') . (21)
We realize that the coefficient C(T±1' J, T, T') has to be independent of the
component Tz . Therefore one can write
(TJMITqlr' JM') C(T±l' J, r,r')(JMIJ~IJM)
= (rJIITIIT' J)(JMIJ~IJM') ,
J~ = =f y'2J± , J~ = Jo .
This proves the validity of (16).
EXAMPLE
(1)
with
H HD - e[a· A(r) - Ao(r)] , (2)
2
HD ea· p + moc (J . (3)
A A
HD is the free Dirac Hamiltonian and a and j; represent the well-known Dirac
matrices
318 11. Structure of Atoms
Example 11. 7. A
U=
(0iT iT)
0 '
A
(3 =
(i -10) '
0 (4)
QT = L -yJ1.".tC/ w . (10)
J1.,11
An example is given by the interaction of a non-Dirac-like magnetic dipole
with the electromagnetic fields
F - 8AJ1. _ 8AlI (11)
/L II - 8x ll 8xJ1.·
In this case it is
(12)
where /-10 = eh/2moe, mo is the rest mass and 9 the coupling constant. For
electrons 9 = 0, but for nucleons 9 # o.
Example 11.7. These equations are still exact. Now we intend to investigate the nonrelativis-
tic motion of a spin-1/2 particle influenced by the external fields Bo(r) and
V (r). For this we take into consideration only terms up to order v 2 / c2 . An
expansion up to the first order in [E' - Bo(r) - V(r)]/2moc2 yields for x(r)
from (20)
iT· P ()
( 1 - E' - Bo (r) - V (r )) --<p
X (r ) = r . (21)
2moc2 2moc
Inserting this expression into (19), we obtain the differential equation
iT· P ( E' - Bo(r) - V(r))
[E' - Bo(r) + V(r)] <p(r) = -
2mo
1- 2
moc
2 iT· p<p(r)
(22)
for the two-component spinor function <p( r). Using the relations
(iT·A)(iT·B) = A·B+i(a-·[AxB]) (23)
and
(a- . p) f (r ) (a- . p) = f (r ) (a- . p) (iT . p) -
ili( a- . V f) (a- . p)
= f(r)p2-ili{(Vf·p)+i[a-·(Vfxp)]}) (24)
we can transform (22) into
H'<p(r) = E'<p(r) (25)
where
J pd 3 r = 1. (28)
which represents the proper normalization condition for 'P(r). On the other
hand, the normalization for a Schrodinger wavefunction 'lj; is given by
(32)
fl = (1 + p2
Sm6c2
) iI' (1 p2)
- Sm6 c2
p2 + Bo(r) _ V(r) _ [EI - Bo(r)]2 - V 2(r)
2mo 2m6c2
+ h{u· [VBo(r) x p]} + h{u· [VV(r) x p]}
4m6c2 4m6c2
h2 h2
---2-2 \J 2 Bo(r) + --2-2 \J2V(r) (3S)
Smoc Smoc
up to order v 2 / c2 . We have used the relations
p 2V(r) - V(r)p2 = -h2\J2V(r) - 2ih(VV(r)· p)
and
EI - Bo(r) + V(r)) ,2 ~ ,2 [EI - Bo(r)]2 - V 2 (r)
( 1- 2 P ~ P - 2 . (39)
2moc c
The first three terms in (3S) correspond to the known nonrelativistic Hamil-
tonian. The additional terms are relativistic corrections of order v 2 / c2 . We
distinguish them as
(40)
322 11. Structure of Atoms
(42)
The term
(43)
2m6c2
represents a correction to the kinetic energy as a result of the velocity depen-
dence of the particle mass. The last term
U3 = h{u· [V Bo(r) x p]} + Ii{u· [VV(r) x p]} (44)
4m6c2 4m6c2
describes the spin-orbit interaction. In a spherical symmetric field we find
dBo(r) r
VBo(r) ----
dr r
(45)
dV(r) r
VV(r) ---
dr r
With the help of the known expressions for the angular momentum operator,
t = n1 = [f x p], and the spin operator, 8 = (1i/2)iT, we obtain, with an
insertion of (45) into (44),
From this consideration alone we would deduce that the spin-orbit strength,
which is caused by relativistic effects of the central potential (which simulates
the mesonic exchange), provides a contribution that is too small to explain the
strong spin-orbit strenght in the shell model. The coupling has to be caused
by the mesonic field directly. At the moment the mesonic field theory is not
far enough developed to confirm or to exclude such additional terms with
11.7 Spin-Orbit Interaction 323
certainty. The spin-orbit coupling in the nucleon-nucleon interaction leads to Example 11. 7.
a corresponding term in the potential; its influence increases with the number
of nucleons. This agrees with the empirically secured observation that the
spin-orbit coupling for nearly completely full shells is stronger than at the
beginning of a new shell. For a continued discussion of the nuclear interaction
see the book by Eisenberg and Greiner.1 7
EXERCISE
where Li and 8i are the orbital angular momenta and the spins of the elec-
trons.
Solution. We assume that the potential, which generates the mean electric
field in the atom, is a central potential V (r). Then the electric field experienced
by the ith electron becomes
1 dV
eEi = -ViV(r) = -Ti--· (1)
ri dri
For Hso we obtain
H' so = L 1
--8i·
' (Ti , ) 1 dV
X Vi - - . (2)
i 2mc2 r-1.. dr-1.
Here the order of Ti and Vi have been exchanged. The orbital angular mo-
mentum Li of the ith electron is L; = m(Ti x Vi). Therefore we obtain for
Hso
,
Hso = --
1 L 1 dV,
--(8;· Li).
'
(3)
2m 2 c2 i r-1. dr-1.
This is the desired expression.
17 M. Eisenberg, W. Greiner: Nuclear Theory Vol. I: Nuclear Models, 3rd ed. (North-
Holland, Amsterdam 1987).
324 11. Structure of Atoms
---2--Vi x ri
2mc ri dri
lei 1 dV(ri)
- - - - - - r· x p.
A
2m 2 c2 ri dri • •
lei 1 dV(ri) t (11. 77)
2m2c2G~ i·
ti represents the orbital angular momentum of the ith electron. The spin-
orbit coupling part of the Hamiltonian is then written as
(11. 78)
= L
,6,6'
a~a,6'(Qj3l L
i
2 ; 2
m c ri ri
~V li . silQj3') . (11.80)
Only for 13 = 13' do we obtain nonzero matrix elements. This can be seen as
follows. The matrix element goes with a sum of one-particle operators. If we
pick one term, for example, the ith term, then it acts on the ith electron and
does not modify the states of the other electrons. This means that in IQj3)
and IQj3') all electrons must have the same one-particle wavefunctions except
11.8 Treatment of the Spin-Orbit Splitting in the Hartree-Fock Approach 325
for the ith electron. However, the ith electron also has to possess the same
wavefunction, since IQ,8) and IQ,8') are eigenstates of t z . If the ith electron
were not in the same state, ML would be different for IQ,8) and for IQ,8'), so
that the matrix element would vanish.
In order to calculate (Q,8IHsoIQ,8) we perform the sum over all occupied
states:
(Q,8I HsoIQ,8) (11.81)
= L
nlmlms
Jd3ripnlmt(r)2m;c2r ~~ Lipnlmt(r)· (mslslm s).
s has diagonal matrix elements (mslszlms) = lims only for sz. Therefore only
tz appears in (11.81) and we have
(Q,8I HsoIQ,8) (l1.82a)
with
'T/nl f 3 21 dV
d rIIPnlmt(r)1 ~dr' (l1.82b)
because the sum is performed over all ml and ms within a shell. In fact, for
a completely occupied shell for any given occupied state with ml, another
occupied state with -ml exists. Consequently (l1.82a) vanishes. Therefore
we only need to focus our attention on the last open shell. We denote it as nl
and we get
(11.85)
o. (11.87)
occupied nonoccupied all states
states states of a shell
In other words, ML and Ms of the nonoccupied states are the same as the
negative values of the corresponding values for the occupied states. Hence,
we switch from a particle to a hole picture, for which all holes are coupled
parallel in spin. Consequently, we obtain
Here the Bohr radius ao = h,2/(me 2 ) has been inserted (see Sect. 11.1). For ~
we then get
~ '" 2 Z2 2
aom c
(2e2ao ) = Z0:2Ry (11.91)
States with the same ML + 2Ms are degenerate in energy. Adding the spin-
orbit interaction results in an additional energy shift, which is
(11.104)
The total energy shift is then
eltB
LJ.EQLS(ML,Ms) = --(ML + 2Ms) + ~(NLS)MLMs· (11.105)
2mc
This combined splitting is called the Paschen-Back effect. If, however, the
magnetic field becomes comparable to the spin-orbit field, then Hmag + Hso
should be considered together. If the magnetic field becomes too large, A 2
terms (A for vector potential) have to be added and we are confronted with
a highly nonlinear problem.
11.9 The Zeeman Effect 329
EXERCISE
Here di is the dipole moment of the ith electron. All electrons feel the same
field E z .
The splitting of the nth energy level in first-order perturbation theory is
given by
(3)
However, the dipole operator d changes parity, so the matrix element in (3)
is different from zero only if Ina') has the opposite parity to Ina). Equation
(3) then contributes only if odd and even parities are mixed in the nth energy
level. For larger atoms the levels split because of the electron-electron inter-
action. For hydrogen the energy levels split rv E z . For larger atoms one has to
consider second-order perturbation theory. The energy splitting then is
=I:~k
ggg $~
!if!TIl4(
I::
0j; ~i ......
M ......
1~ <0
1 >-3
::r
1"1""'' ;;; ctl
&l N
&; it
S
§
tr:I
II ffi1
g.
=,
Fig. 11.8. Table of the periodic system of elements. Notice the incorporation of the recently discovered pre-superheavy elements with
Z = 104, ... , Z = 112. They were predicted by theory, as were their chemical properties.
w
w
.....
332 11. Structure of Atoms
him. The discovery of the splitting of the spectral lines in strong magnetic
fields is due to him and Back (Paschen-Back effect).
(12.2)
As for atoms the orders of magnitude are some eV. For the relative vibration
of the nuclei we assume a harmonic potential Mw2(R - R O)2/2. Here R - Ro
corresponds to the distance from the equilibrium position Ro (see Fig. 12.1).
If the relative distance between the two nuclei is (R - Ro) ;::j a, then the cor-
responding excitations are approximately equal to the electronic excitations.
Hence, we can set
1 li2
_Mw 2 a 2 ;::j --
2 2ma 2
and obtain an estimate for w:
Here l represents the angular momentum of the molecule and 8 its moment
of inertia (8 ~ Ma 2 ). Hence, we obtain
h 2 2l(l
Erot ~ ma + 1) (m)
M . (12.6)
Here the factor (m/M) appears; it lowers the rotational energy with respect
to the electronic excitation:
Erot ~ ( : ) Eel. (12.7)
The excitation of a molecule in general is given by the combination of all
excitations mentioned, i.e.
E = Eel + Evib + E rot . (12.8)
The ratio of the energies is
m)1/2 m
Eel: EVib : Erot = 1: ( M :M . (12.9)
Obviously the rotational energy is lowest: the excitation steps for a rotation
are smaller by a factor of Jm/M ~ J1/2000 ~ 1/40 than for those a vibra-
tion and these are again smaller by the same factor compared to those for the
electrons.
12.1 Born-Oppenheimer Approximation 337
To describe the various motions of the molecule we begin with the Schrodinger
equation. The Hamiltonian is given by
(12.10)
where
(12.11)
TN = L2~1I
11=1
(12.12)
is the kinetic energy of the nuclei. VeN represents the attractive electron-nuclei
potential. Vee describes the repelling electron-electron interaction. VNN indi-
cates the repelling Coulomb interaction between the nuclei. Since the masses
of the nuclei are very large, TN can be neglected. This step is called the
Born-Oppenheimer approximation. In the following, we will explain the ap-
proximation in more detail.
If we neglect the kinetic energy TN of the nuclei (static approximation:
fixed distance R of the nuclei), the relative distance R between the nuclei
only occurs as a parameter. The Schrodinger equation becomes
[Te + Vee(r) + VeN(r, R)] 'Pn(r, R)
= [€n(R) - VNN(R)] 'Pn(r, R). (12.13)
Here r indicates the position of the electron. The solutions 'Pn(r, R) depend
parametricaly on the distance between the nuclei. The energy of this state
is given by the electronic energy €n(R) lowered by VNN(R). The solutions
'Pn (r, R) represent a complete set of functions. The true wavefunction 1jJ( r, R)
can be expanded within this set:
(12.14)
m
Now we multiply from the left-hand side with 'P~ (r, R), integrate over the full
space, and get
338 12. Elementary Structure of Molecules
with
CnmrPm(R) -n2 L 2~
'" '"
f d3T'P~(r,R) (12.1gb)
(12.20)
6R indicates the shift from the equilibrium position. The factor M is canceled
by l/M in (12.1gb) and the contribution is proportional to the vibrational
energy Iiw. As noted earlier, this goes like'" (m/M)1/2. As a summary, the
C nm term can be neglected or treated with the help of perturbation theory.
Without the C nm term, (12.1ga) reduces to
(12.21)
This equation has an interesting interpretation: the energy of the electron
states €n(R) acts like an effective potential in R. We imagine that the elec-
trons build a "medium" in which the atomic nuclei move. This medium acts
as an elastic band. If the nuclei try to leave the equilibrium position, they will
be drawn back. There is an equilibrium position where €(R) has a minimum
deep enough to generate binding. The elastic band behavior is then nothing
other than the expansion up to the order (R - Flo) 2 .
12.2 The Ht Ion as an Example 339
The C nm produce a mixing between different states 'Pn and 'Pm. This
mixing between the 'Pn(R) states can be neglected in lowest order, because
the C nm are small [of order (m/M)1/2, as explained previously]. Accordingly
the wavefunction is approximately given by
(12.22)
Here 1/ stands for all quantum numbers of level n. Env indicates the energy
of the molecule, which is calculated from (12.21).
In order to describe vibrations and rotations of the molecule cn(R) is
expanded in coordinates describing vibration and rotation, respectively. The
expansion in (5R = IR - Rol up to the squared order leads to a harmonic
vibrational potential (see Fig. 12.1). cN(R) does not depend on the angles
(Euler angles). Hence the rotations of the molecule are free. An excitation
of the molecule is a combination of excitations of the harmonic vibrational
oscillator and of the rotations.
We summarize: in the Born-Oppenheimer approximation, first the energy
levels of the electrons are determined for fixed distances R of the nuclear
centers. The electron energy cn(R) plays the role of a potential, in which the
nuclei are moving. If this potential has one or several deep enough minima,
one or several bound states of the molecule can exist. If the minima are only
weak or do not exist at all, then the molecule is not bound.
(12.23)
The first term is the kinetic energy of the electron, the second and third terms
describe the attractive Coulomb interactions between electron and nuclei 1
and 2 with position vectors Rl and R2, respectively. The last term represents
the repulsive interaction between the two protons (nuclei). Furthermore, r
describes the electron's position and Rl and R2 the positions of the two
protons (see Fig. 12.2). It is impossible to determine the eigenfunctions of
(12.23) exactly and analytically. Hence, test wavefunctions are constructed
and the energy is minimized by variation.
In case of the Ht ion we use a test wavefunction that is a linear combi-
nation of hydrogen wavefunctions referring to the first (Vh) and second ('l/J2)
proton:
(12.24)
This crude method is particularly instructive and transparent. For the lowest
state we take the Is wavefunction, i.e.
(12.25a)
340 12. Elementary Structure of Molecules
and, correspondingly,
1 = N~ [2 ± 2Jd 3r1/J!(r)1/J2(r)]
With the so-called overlap integral
U(R) = J d3r1/J!(r)1/J2(r)
= (1 + R+ R:)
ao 3a o
e-R/ao
(1IHI1) = J d3 r'ljl[(r)H'IjIl(r)
2
Cl+ eR - J e
d3rl'ljll(rWlr_RI'
2
(12.30)
Cl is the energy of the Is state in the hydrogen atom, which is -1 Ry. This
contribution comes from the first two terms in (12.23). The Coulomb energy
e2/ R, with R = IRI - R21, results from the repulsive interaction between
both protons and the last term in (12.30) comes from the contribution of
the interaction between the electron and the second proton. Inserting the
wavefunction, we arrive at (see Exercise 12.1)
(1IHI2) = J d3 r'ljl[(r)H'IjI2(r)
A = J d3 r 'IjI[(r)'IjI2(r) Ir ~2R21
-
_ -
e
ao
2
(1 +- R) e - Rjao .
ao
(12.33)
U = (1 + R
ao
+ R2) e-Rjao
3ao
we can calculate the energy c±(R) as a function of R. The behavior of c±(R) is
depicted in Fig. 12.3. The state with positive parity is bound, since c+(R) has
a minimum. On the other hand, the energy of the state with negative parity
has no minimum; this state is never bound. Experimentally one finds a binding
energy of -2.8 eV for 'IjI+ (r) when the distance between the two protons is
342 12. Elementary Structure of Molecules
1.06 A. Our calculated energy is still to high, which is to be expected with the
crude test wavefunction (12.26).
The wavefunction of an electron, for example, 'l/J±(r), is called a molecular
orbit wavefunction. The form of the test wavefunction (12.24) is also called
a linear combination of atomic orbits (LCAO). 'l/J+ is called the bound and
'l/J- the unbound orbit. 'l/J- is not bound, which can be seen from the following
argument. Since 'l/J- has negative parity, 'l/J-(r) = -'l/J-( -r), the wavefunction
has a zero at r = O. However, the electron profits especially from the attractive
interaction with the protons, which is at a maximum between the protons.
For very large proton distances the approximated function 'l/J+ (r) becomes
more realistic. For R -> 0 the wavefunction (12.26) becomes the wavefunction
for a He+ atom, because for R « ao the electron practically sees a point nu-
cleus with charge number Z = 2; indeed 'l/J+(r) becomes the Is wavefunction
of the hydrogen atom. This means that for small proton distances the ap-
proximation (12.26) becomes inaccurate. This is the reason why the binding
energy calculated within this method is too small (see Fig. 12.4).
R
Fig. 12.4. Schematic com-
parision between calcu-
lated and realistic energy
E+CR) for the bound orbit realistic
EXERCISE
Problem. Calculate
Solution.
(a) The overlap integral is
U(R) = ~Jd3rexp(-lr-RII/ao-lr-R21/ao)
rra o
The introduction of relative R = Rl - R2 and center-of-mass coordinates
Rs = !(Rl + R 2 ) of the two nucleis gives
11 = Jd3rexp(-lr-~R-Rsl/ao-lr+~R-Rsl/ao)
J d 3 rexp ( -Ir - ~RI/ ao -IT + ~RI/ao)
because of the obvious translation invariance of 11. The azimuthal symmetry
of the integrand of h offers the use of prolate-elliptical coordinates: l
R
x "2 [(e - 1)(1 _1J2)]1/2 cos cp,
Exercise 12.1. with ~ E [1, (0), TJ E [-1,1]' cP E [0,21r) and R = IRI - R21. Using
h = (~) 3 1 2
1< dcp [II dTJ [JO d~ (~2 - TJ2)
U(R) = (~+ R
2 ao
+3a6R2) e-R/ao .
HI = ~Jd3rexp(-2Ir-R11/ao)
'Irao
IR e R I
1 -
2
2
e2
R
with R = IR1 - R21.
The expectation value of the third term of H is
-
H2 = -3
1
1rao
J 3
d rexp(-2Ir - Rli/ao) I
_e 2
R I.
r- 2
The introduction of prolate-elliptical coordinates [see part (a) of this exercise]
gives
12.2 The Ht Ion as an Example 345
Exercise 12.1.
With the help of parts (a) and (b) of this exercise, the contribution of H12
can be written down immediately; from the terms 1, 3, and 4 of H we get
H12 = coU(R)
2
+ eR U (R)+-31
1rao
J
d r
3
Ir
_e 2
-
R I
1
The next, more difficult molecule is the hydrogen molecule, with two electrons.
Again we choose a test wavefunction and calculate the expectation value of
the Hamiltonian. We construct a symmetric test wavefunction and make the
following ansatz:
1
tPs(I,2) = 2[1 + U(R)] [tPl(rl) + tP2(rl)][tPl(r2) + tP2(r2)] Xsingiet .
(12.34)
The index "S" stands for "singlet". The subscripts 1 and 2 on tP indicate
the atomic centers with respect to which the one-particle wavefunctions are
defined. The arguments rl and r2 in the wavefunctions describe the positions
of the two electrons. We choose again a symmetric wavefunction, since we
want to have a state that is energetically as low as possible. Consequently,
we set both electrons in the same molecular orbit tP+(r) = tPl(r) + tP2(r),
which is in accordance with our last section about the HI ion. Because both
electrons are now in the same spatial state, their spins have to be anti parallel
due to the Pauli principle: therefore we have a state Xsingiet with S = o. If we
want a triplett state, i.e. S = 1, we have to put one electron into the tP+ orbit
and the other one into the tP- orbit. Since tP- is energetically unfavored, the
triplett states are higher in energy.
The test wavefunction (12.34) has two disadvantages. First, for very small
distances between the two protons the results have to be identical to those for
the Helium atom. This is not the case here, because for R ----> 0 both electrons
are in a Is orbit of a H atom and not in the Is orbit of the He atom. We see
the second disadvantage once we multiply out the wavefunction (12.34):
tPs(I,2) '" [tPl(rdtPl(r2) + tP2(rdtP2(r2)]
+ [tPl(rl)tP2(r2) + tPl(r2)tP2(rd]· (12.35)
The expression in the first bracket [ ... ] describes two electrons both attached
to the vicinity of the first or second proton. The wavefunction of the second
bracket stands for one electron attached to the first proton and the other
electron to the second proton. There is some problem with the wavefunction
of the first bracket because it will be quite improbable that the two electrons
are bound to the same proton for large distances R between the atomic centers.
This is mainly because the energy of the H- ion is higher than that of the
two H atoms. For large separations R, the true two-electron wavefunction will
therefore be better described by the terms in the second bracket in (12.35).
An alternative method for determining the energy of the H2 molecule is
given by the valence binding or Heitler-London method. In (12.35) only the
term for which one electron can be found around each proton is used, i.e. for
the singlet state
(12.36)
12.3 The Hydrogen Molecule 347
(12.37)
Both functions are already normalized. U(R) is the overlap integral given after
(12.26).
We take the expectation value of the Hamiltonian in order to estimate the
energy:
(H)±
(12IHI12) + (21IHI21)
2(1 ± U2)
(12IHI21) + (21IHI12) (12.38)
± 2(1 ± U2) .
Furthermore we use
(12IHI12) (21IHI21)
and
(12IHI21) (21IHI12) .
Hence, (12.38) is simplified to
(12IHI12) (21IHI12)
e± = 1 ± U2 ± 1 ± U2 . (12.39)
(12.40)
The first two terms describe the kinetic energies of the first and second elec-
tron, respectively. The next four terms represent the attractive Coulomb in-
teraction between electrons and nuclei. The second from last term represents
the repulsive interaction between the nuclei, and the last term that between
the two electrons. It holds that
, e2
(12IHI12) = 2el + R + Vc(R), (12.41)
where R == IRI - R21 . Here we have taken advantage of
(11-:l-ITl~2RlI11) = el;
Vc(R) = J d 3r l d3r21'lfI(Tl)121'lf2(T2W
x
(
h- e2 e2
T21 - IT2 - RII - ITI -
e2
R21
)
'
(12.42)
348 12. Elementary Structure of Molecules
and
with
(12.47)
Now it turns out, that VeeR) + e2/ R is always positive and A(R) + U 2 e 2 / R
is mostly negative. For this reason E+(R) is lower in energy than c(R). If
the "binding energy" c (R) - 2El or E+ (R) - 2EI is a function of R, the first
expression shows no and the last one minimum (see Fig. 12.6).
From this result it is obvious that the hydrogen molecule is bound in the
state '0+, i.e. the ground state is a spin singlet state. Note that the binding is
due to the negative term A(R) + U 2 e 2 / R. It is thus the large overlap of the
wavefunctions that causes the binding. The strength of the binding is, as a
12.4 Electron Pairing 349
For the H2 molecule we have seen that a large overlap of the two electron
wavefunctions causes binding. This has been the case for the the spin singlet
state. In the triplet state the overlap is small, which leads to a repulsion.
Now we can imagine why there is no H-He molecule: the two electrons of
the He atom in the ground state are in a spin singlet state. Both electrons
are coupled antiparallel. If both nuclei are very close to each other, the three
electrons (one from H and two from He) see a Li nucleus. Since the Is orbit is
completely occupied, the third electron has to go into an orbit with one main
quantum number higher. In the Li- ion the last electron has an ionization
energy of only 0.4 Ry. As a consequence we expect that there is no binding in
the H-He molecule. If H and He are separated (see Fig. 12.7), the electron of
the H atom has the possibility of interacting both with the electron of the He
atom that is parallel to the electron of the H atom and with the electron that
is oriented antiparallel. The last case does not work, since the interaction is
spin independent and cannot change the spin in the exchange integral (see
Fig. 12.7). Therefore it can only interact with the electron of the He atom
that is aligned parallel. This is similar to the triplet state of He, where (by
definition) two electrons are aligned parallel and interact with each other. The
triplet state is not bound: just as it is not bound the H-He molecule. In other
words, there is no H-He atom.
The reason for this can also be seen directly from the wavefunctions. For
this we build the Slater determinant of the three-electron system:
350 12. Elementary Structure of Molecules
CB-®
H He
N- 2 = 6 J
d3rl d3r2 d3r3¢i (1)¢i (2)¢~(3)
x [¢1(l)¢2(3) -¢2(1)¢1(3)]¢1(2)
6(1 - U 2 ) . (12.49)
N is the normalization constant and U denotes the overlap integral between
¢l and ¢2:
U = J d 3r¢i(r)¢2(r).
Only electrons with the same spin direction can interact with each other
because of the spin-independent Coulomb interaction. Look again at the spin
wavefunctions in (12.48)! Two electrons in the same spatial state but with
opposite spin are called paired electrons. In the example of the H-He molecule
we observed that paired electrons interact repulsively with other electrons.
This explains why rare gases show no chemical activity. In rare gases the
shells are all closed, so all electrons are paired. The chemical activity arises as
a result of non paired electrons. They still have a "free" place, which can be
occupied by an electron belonging to another atom. The number of possible
bindings is determined by the number of valence electrons, i.e. the nonpaired
electrons in the outer shell. In addition, there is still the repulsive interaction
with the paired electrons belonging to deeper shells. But this can be neglected,
because the overlap of the outer-shell electrons with the inner-shell electrons
is small.
The chemically active electrons are mostly outer (i.e. from higher shells)
s or p electrons. The d and f electrons are often lying too close to the in-
dividual nucleus to bind with electrons of other atoms. The binding results
in all electrons being paired for a molecule. Hence, such molecules have total
spin zero. One exception is given by the molecules of the transition elements
(with unpaired d and f electrons). Since the unpaired d and f electrons hardly
overlap, they remain unpaired and cause little or no binding.
The alkaline earth metals, which are found in the second main group of the
periodic system, have closed orbits in their ground states: for example, Be has
the occupation 1s22s2. In this state they are chemically inactive. However, it
does not cost much energy (some eV) to lift an s electron into the next p orbit.
The p electron now has three different possibilities for orientation according
to the projection onto the z axis (me = 0, ±1). Here, we have placed the z axis
along the line connecting the two atomic nuclei. Figure 12.8 shows the state
for m = O. For m = ±1 the wavefunction is perpendicular to the connecting
z axis. If the p electron binds with an another p electron (of the other atom),
very strong binding results as long as both p electrons are in a m = 0 state.
Then the two electrons can interact actively at a relatively large two-center
distance, and therefore keep the Coulomb repulsion of the nuclei relatively
low.
We call states orientated along the x, y, and z axis Ipx), Ipy), and IPz); P
stands for the p orbit. The spatial part of the wavefunction has an angular tnl='±!
dependence of the form
Fig. 12.8. Qualitative
picture of the angular dis-
{3 ~,
YlO =
v{3
4.;
cos(} =
V4.;r
(12.51a) tribution of a p elctron
with angular-momentum
1M2(Yl l
v,t,
- Y1-t) = {f;Y
--
47T r '
(12.51b)
projection ml = 0 and
ml =±1
352 12. Elementary Structure of Molecules
1 (3x
Ipx) = J2(Yn + Y1- 1 ) = Y4;-;::- . (12.51c)
Here the cartesian coordinates are expressed in terms of the spherical ones.
In order to see how the bindings come about in a p orbit, we look, for
example, at the C 2 molecule. In the ground state C has two unpaired 2p
electrons, i.e. the valence number is 2. The two 2s electrons can be excited
and also contribute to binding; we will not take this into account for the
present consideration. The question is: Which state (configuration) is lowest in
energy? We search for an arrangement with the largest overlap of the electron
wavefunctions. This is the case if both electrons are in the IPz) state (see
Fig. 12.9).
a bonding
Fig. 12.9. (j bond of two
C atoms: the IPz) orbits of
the two carbon nuclei have
a large overlap at a rela-
tively large distance R be-
tween the C nuclei
Once this binding has occured, there will be no place anymore for another
electron because the interaction with it would be repulsive. The binding de-
scribed above is also called u binding (u bond), since it possesses me = 0, in
analogy to the notation of the s orbit with I = O. Since the spherical symme-
try is violated for a molecule, the molecular orbits cannot be classified by the
angular momentum. On the other hand, the projection onto the z axis (L z )
still represents a good quantum number for two-atom molecules.
The next p electrons cannot occupy the IPz) state anymore as they have
to switch over to the Ipx) and Ipy) states. As can be seen in Fig. 12.10 these
electrons will have a weaker binding since the overlap is lower: With the
reduction in the central distance R the overlap increases but, in addition, the
Coulomb repulsion of the nuclei increases. The wavefunction of this binding
has the projection me = ±1 onto the z axis. Thus it is also called 7r binding
(7r bond). So the C 2 molecule has two bindings, a u and a 7r binding.
Nitrogen has three 2p valence electrons. One takes a (J bond and the other
two have to switch over to the IPx) and Ipy) orbits. They experience a weaker
7r binding.
An interesting exception is the O 2 molecule. The 0 atom has four 2p va-
lence electrons. The first three take one (J and two 7r bonds as for the nitrogen
molecule. For the fourth electron there is now no binding orbit available. To-
gether with the fourth electron of the other atom it forms a state with an
overlap that is as low as possible: one electron in Ipx) and the other in Ipy).
Both electrons now form an antibound state (as for 1f;- in H2). A triplet state
has spin 1. The O 2 molecule represents an exception from the rule that states
that all two-atom molecules for which the atoms are identical and do not
belong to transition elements have spin 0 in the ground state.
12.6 Hybridization
One way to amplify the (J binding is to deform the orbit in such a way that
the overlap becomes larger (see Fig. 12.lla,b).
This only works if different angular momenta are mixed with each other.
It becomes possible because the rotational symmetry is broken for a two-atom
molecule. An example is the Li2 molecule. Li has one 2s electron in the outer
shell. But only little energy is needed to shift the 2s electron into a 2p orbit.
We denote the s orbit by Is) and the p orbit by Ip); then we are able to
construct a test wavefunction
(12.52)
A is the variation parameter. The s wavefunction is spherically symmetric (see
Fig. 12.12). The p wavefunction has positive sign in the range z > 0 and a
negative one for z < 0 [see Fig. 12.12 and (12.51a)]. We have put the p electron
into the IPz) state.
For z > 0 both wavefunctions amplify each other. For z < 0 a negative
interference (attenuation) appears. For this reason the shape appears on the
right-hand side of Fig. 12.12. The overlap with the electron of the other atom
is now larger and therefore the binding is stronger. The phenomenon of mixing
two atomic orbits described here is called hybridization.
354 12. Elementary Structure of Molecules
Next we consider the water molecule H 2 0. The atoms are not lined up,
which leads to a larger dipole moment. In the following, we will explain the
phenomenon and follow two opposite extremes: no hybridization and maxi-
mum hybridization. The truth lies somewhere in between.
Without hybridization we can assume the following: The 0 atom has four
p electrons; if no hybridization occurs, they remain P electrons. We place one
electron into the Ipx) and one into the Ipy) state. The last two are paired in
the IPz) state. The electrons in Ipx) and Ipy) bind with the electrons of the
H atoms. Since Ipx) and Ipy) are oriented perpendicular to each other, the
calculated angle of the binding is 90°. Experimentally one finds lOso. If one
takes into account that the two protons of the H atoms repel each other, then
the angle becomes larger. Thus, we can imagine how the lOso arises.
The other way of arguing is to assume that the 2s and 2p orbits are
energetically degenerate. (In reality the 2p orbit is somewhat higher.) Now
mixing can occur. The states with lowest energy, which are obtained by a
variation procedure, are
11) ~(Is) + Ipx) + Ipy) + IPz)) , (12.S3a)
12) ~(Is) + Ipx) - Ipy) - IPz)) , (12.S3b)
13) ~(Is) - Ipx) + Ipy) - IPz)) , (12.S3c)
14) ~(Is) - Ipx) - Ipy) + IPz)) . (12.53d)
They have maximum mixing.
Now we consider the spatial orientation of these states (12.S3a-d). In
(12.S3a) the p part has the form [see (12.S1a-c)]
one of the corners, for example, in the upper corner (see Fig. 12.14). If the 2p
electrons of the nitrogen atom were only in the Ipx), Ipy), and IPz) orbits, their H
,, H
angle would be 90 Via mixing with the 2s state, the angle becomes larger.
0 •
In the ammonia molecule the ground state splits into two energetically close ,, IH I
configurations. This can be explained by the fact that the N atom can be on , I I
either side of the pyramid (see Fig. 12.14). The wavefunctions representing the , I I
,-¥I
N atom on the upper or lower side may be added or subtracted. The antisym-
N'
metric wavefunction belongs to the ground state of NH 3. The symmetrically
Fig. 12.14. The two,
combined wavefunction slightly above lies. One speaks of a splitting of the
nearly degenerate ammo-
ground-state configurations as a result of tunneling of the N atom through nia configurations (NH3).
the plane spanned by the three hydrogens. N can be located either
We can understand some of the properties of water from the electronic above or below the plane
properties of the H2 0 molecule. The molecule has ten electrons altogether, spanned by the three hy-
drogens
two from the two H atoms and eight from the oxygen atom. The two Is
electrons in the 0 atom can be neglected in the following consideration. They
belong to a lower-lying closed shell and are, so to speak, inert and therefore
inactive. The remaining eight electrons fill the hybridized orbits (12.53a-d).
In addition the oxygen attracts the electrons, which implies polarization of
the water molecule.
Altogether the 0 atom has four charged legs, two in the directions of the
H atoms and two away from the 0 atom (see Fig. 12.15). If now a second H 2 0
molecule comes close to the first one, the H atoms, which now appear to be
more positively charged, can bind the oxygen atom with a negative leg (see
Fig. 12.15). This kind of binding is called a hydrogen bond. With only 0.2eV it
is quite weak. The mean distance between a H atom and an 0 atom of another
molecule is about 1.8 A, whereas the distance between Hand 0 within the
same molecule is about 1 A. This explains the low binding energy! Each water
molecule may obviously undergo hydrogen bonding to four neighbors. This
mechanism accounts for the accumulation of many water molecules of the
form H 2n O n = (H 2 0)n in water (cluster formation).
356 12. Elementary Structure of Molecules
There are many more molecules that form hydrogen bonds. They belong
to the water-soluble substances because they untergo hydrogen bonding with
the H20 molecules of the water. Substances that do not form hydrogen bonds,
are not water soluble. Oils belong to this category. Heating causes the hydro-
gen bonds to break and the substance becomes more fluid, i.e. its viscosity
decreases.
12.7 Hydrocarbons
H The considerations of the last sections have shown that the geometry of a
CH4 slruc lure molecule depends on the structure of the electron orbits and the hybridiza-
tion. An excellent example of how electron configurations and the geometry
\09.6° of molecules are connected with each other can be found in hydrocarbons.
H In carbon the ground state is given by the configuration (ls)2(2s)2(2p)2. In
H this state there are two p valence electrons. But the C atom has the most
bonds via the (ls)2(2s)1(2p)3 configuration, i.e. one 2s electron is shifted into
Fig. 12.16. Structure of the 2p orbital. Now the C atom has four valence electrons, which can bind
the methane (CH 4 ) mole- with electrons of other atoms. We again use the completely hybridized states
cule: One s electron and (12.53a-d) and occupy each of the four states with one electron. These elec-
three p electrons of the C
are hybridized and form a
trons form (5 bonds with the electrons of the H atom and give rise to the
bonds with the electrons methane (CH 4 ) molecule. The angle between the bonds corresponds exactly
of the H to the calculated angle of 109.6 0 (see Fig. 12.16).
In the C 2 H4 molecule, ethylene, a planar structure occurs (see Fig. 12.17).
This is explained by the following hybridization of the orbits:
11) IPz) , (12.55a)
electrons in states 13) and 14) form a bond with the electrons of the hydrogen
atoms.
The electrons in states 11) and 12) form a bond with the electrons of the
other C atom. The orbits of the second half (CH 2 ) of the C 2 H4 molecule are
mirror-reflected to the first half. The electrons in 12) form a (J bond and the
ones in 11) a 1': bond, since they are oriented perpendicular to the binding axis.
The reason for the planar structure of the C2H4 molecule is the 1': bond. This
bound becomes effective only if both CH 2 parts of the molecule are located
in the same plane. The orbits for the 1': bond are oriented perpendicular to it
(along the z axis in Fig. 12.18) Rotation around the x axis causes the 1': bond
to break up (see Fig. 12.19). Thus, the 1': bond enforces the coplanarity of the
two CH 2 complexes.
(12.56)
H In the carbon compounds discussed so far all bonds were localized, i.e. we
know which bonds appear between which atoms. A particularly interesting
situation appears in the case of the "aromatic" compound benzene. It consists
of a ring with six carbon atoms, on each of which a hydrogen atom is bound
(see Fig. 12.21).
The bonds C-H are all hybridized a bonds. One bond between the C atoms
is also of a type. Hence, six out of eight valence electrons of two neighboring
carbon atoms are involved in a bonding, the other two will form a 11' bond.
c Therfore, between the six C atoms of the benzene-ring only three 11' bonds
U
H
will be possible. This is illustrated in Fig. 12.21 by thicker lines. The states of
the electrons causing the 11' bonds are oriented perpendicular to the a bonds,
Fig. 12.21. The structure
of the benzene molecule i.e. out of the plane of the benzene ring. Because of the symmetry it is not
CsHs. It is ring-shaped possible to say between which C atoms the 11' bond appears. In Fig. 12.22 two
and lies in a plane possibilities are depicted.
Indeed, the state with lowest energy is represented by a mixing of these
two possibilities. The electrons of the 11' bonds can be visualized as moving
~c
c ]'( c freely in the ring. Due to the larger extension available for these electrons
]'(1 their kinetic energy decreases (uncertainty principle!). With the lower kinetic
energy the state is additionally lowered and, hence, stronger bonds result.
c c
~C Summarizing, we can say that one can already reach good qualitative
understanding of the structure of molecules by simple arguments such as the
or overlap of the electrons in singlet states and hybridization or by increasing the
number of valence electrons by exciting an atom (e.g. carbon atom). But this
procedure is not sufficient to make accurate, quantitative predictions of the
structure of molecules. This can be accomplished only by large-scale computer
calculations.
1 R.P. Feynman, A.-R. Hibbs: Quantum Mechanics and Path Integrals (McGraw-
Hill, New York 1965); C. Itzykson, J.-B. Zuber: Quantum Field Theory (McGraw-
Hill, New York 1980); W. Greiner, J. Reinhardt: Field Quantization (Springer,
Berlin, Heidelberg 1996).
defines the so-called action functional, which assigns a defined value for any
path x(t) connecting the fixed end points Xa = x(t a ) and Xb = X(tb). Within
an infinite number of such paths the classical trajectory of a particle is rep-
resented by the special path x(t), for which the action (13.1) takes on an
extremum, more precisely a minimum. The necessary condition for this is
that for a small variation 8x of the classical path x the corresponding action
remaines unchanged in first order in 8x:
8S = S[x + 8x] - S[x] = o. (13.2)
As we know this leads to
8S = 8x aL
a±
l
t
t"
b + t
ita
b
dt' (aL _
ax
~ aL) 8x.
dt' a±
(13.3)
For fixed end points 8x(ta) = 8X(tb) = o. Hence the Euler Lagrange equation
for the classical trajectory follows:
aL d aL
ax - dt ax = O. (13.4)
In classical mechanics the form of the action functional is most interesting,
and not the value of the action itself. We consider the action along a classical
trajectory as a function of the upper bound
From this it becomes plausible that the action S(x) enters somehow into the
phase of probability amplitudes.
These preliminary remarks suffice and we now introduce the Schrodinger
propagator for later use. We define the Greens operator by
( ata,)
iii
, t')
- G(t, H =
~
iii 1 5(t - t') . (13.9)
The Greens function in space and time results from this as a matrix element
G(x, tj x', t') = B(t - t')(xl exp ( -~iT(t - t')) Ix') . (13.13)
From here we will derive later the path integral description for the Greens
function (13.13).
364 13. Feynman's Path Integral Formulation of Schrodinger's Wave Mechanics
EXERCISE
Problem. Determine the momentum and energy at any end point of a clas-
sical trajectory.
Solution. The starting point is the classical action along the trajectory x(t)
as a function of the upper bound:
8S = 8x fJL
fJi;
It
to
+ rt dt' 8x (fJLfJx _ ~
lto
fJL)
dt' fJi;
(2)
Along the classical trajectories the integral term vanishes. Assuming, further-
more, 8x(t o) = 0 and p = fJLlfJi; yields
8S=p8x. (3)
This equation has to be understood in the following way. It compares the
variation of the action with respect to paths that differ by the endpoints x(t)
taken at the same time t. Equation (3) tells us that the partial derivative of
the action with respect to the coordinate of the upper end point is equal to
the corresponding momentum, i.e.
(4)
Analogously, we can take the action as an explicit function of time as we
consider paths that end in the given location x at different times t.
The definition of the action (1) means the total time derivative of the
action along a trajectory is
-- + -
dS fJSclass. fJSclass. -'-
dt class. - --x (6)
fJt fJx
if we understand Sclass. in the above mentioned sense as a function of coordi-
nates and time. From this it follows that
fJS .
at = L - px = -E. (7)
13.2 Transition Amplitude as a Path Integral 365
1) All possible paths contribute equally, Le. formally in the same way as the
amplitude; but different paths contribute with different phases.
2) The phase of the contribution of a given path is determined by the action
S along this path (measured in Ii).
The contribution of each path has a phase proportional to the action S along
the path, i.e.
Xa Xb x
stationary. Hence, infinitesimal neighboring paths have the same phase in first
order. Compared to the classical one other paths lead to strongly oscillating
phases (see Fig. 13.2). The corresponding contributions in (13.14) interfere de-
structively and cancel each other. Therefore the amplitude (13.14) essentially
reduces to
NE tb - ta , tb > ta ,
Let X a , ta and Xb, tb be the two fixed end points. We now construct a certain
path by choosing special points Xi for all intermediate time points ti and
connect the selected points by straight lines (see Fig. 13.3). By refining this
t
~+---------------L
ti+l +-___________ ~
tj +-----------R-
lattice, we can approximate every path with any desired quality. Hence, it is
natural to define the sum over all paths as a multiple integral over all values
of Xi (i = 1, ... , N - 1; XQ, XN fixed):
Let us now define the sum implied in (13.14) more precisely. In Riemann's
integral E can be made smaller and smaller. Here one cannot directly generate
a well-defined limiting value in this way. In order to force convergence, a
normalization factor A(E), depending on E, has to be introduced. Summation
leads to the path integral:
K(b,a) =
. 1
!~ A
JA'"
dXl dXN-l
- A - exp
{ i
/is[b,a]
}
, (13.18)
N--->oo
S[b, a] =
Ito
tb dt' L(x, x, t') .
S is a line integral along a route dXl ... dXN-l through the points Xi. The
above construction of the path integral is not unequivocal. Instead of connect-
ing the Xi by straight lines it is more useful in general to choose sections of
classical paths. Such a construction is possible for any Lagrange function.
We introduce the abbreviated notation:
For the amplitude (13.18) and (13.19), an important property can be derived.
Let us consider the path integral for two successive events (see Fig. 13.4) with
ta < tc < tb. The action fulfills the obvious property:
S[b, a] = S[b, c] + S[c, a] .
tc+----~.""
dXN-I
x -A-exp (iIi(S[b,c] + S[c,a]) ) , (13.20)
Therefore the amplitude K(b, a) can be calculated in such a way that first
the path integration is performed from a to a fixed intermediate point c and
afterwards from c to b and then over all possible intermediate points c. Equa-
tion (13.21) expresses therefore, how amplitudes of successive time events are
multiplied. We can immediately generalize this to
K(i+l,i) = ~exp(~S[i+l,il)
~exp [~lti+l dt' L[X(t'),X(t'),t']]
~
1
A [ifL(XHI+Xi XHI-Xi
exp Ii 2' f ' 2
f)] (13.23)
Thus the integrand from (13.22), i.e. the amplitude for any complete path,
can be written as
N-I
¢[x(t)] = lim
<--->0
II K(i + 1, i) . (13.24)
N--->oo i=O
Let us specify (13.22) for the case of a particle's motion in a potential V(x).
According to the discretized form implied by (13.23) the corresponding La-
grangian reads
13.2 Transition Amplitude as a Path Integral 369
J
r-
K(b,a) = lim dXl ... dXN-l
<--+0
N-->oo
EXERCISE
Problem. Determine the transition amplitude K(b, a) for a free particle with
the help of (13.26).
with Xo = Xa, XN = Xb, and N E = (tb - ta). The various integrations appearing
in (1) are reducible to simple Gaussian integrals by means of the quadratic
supplement in the exponent. The successive performance of the N - 1 inte-
grations leads to a set of Gaussian integrations:
1 00
-00
dye a (x- y )2+.6(z-y)2 = (~) 1/2 exp [~(X
a. + j3 a. + j3
_z)2] (2)
1:
Let us begin with the integration over Xl (J-L = im/2nf):
1:
Integration over X2 gives:
dXl dX2 el'[(X3- X2)2+(X2 -x,)2+(Xl -xo)2]
( -7r) 1/2
2J-L
1
-00
00
dX2 el'(X2-Xo)2 /2+I'(X3- X2)2
Exercise 13.2.
i:
Performing the N - 1 integrations one after the other yields the expression
K(b,a) = ~~
. (1)
A(f)
N (27rill£) N /2
~
N ..... oo
and obtain:
G(Xata;Xbtb) = lim (xbl(e->-T/N e->-V/N)Nlxa). (13.30)
N--->oo
From here only a few more steps lead to the path integral. Inserting a complete
set of spatial states
yields
lim
N--->oo
J dXl ... dXN-l
N-l
X II (xi+11 e->-T/N e->-v/Nlxi) , (13.31)
i=O
(13.32)
In order to calculate the spatial matrix element of the operator e->-T/N we
insert a complete set of momentum eigenstates:
_1_
21rh
/00 dpe->-p2/ 2mN eip
-00
(X;+l- X ;) .
1:
Such Gaussian integrals can be calculated, resulting in
Thus, we obtain
lim
<->0
J dXl .. . dXN-l ( m )~2
--.-
27rlru:
N->CXJ
With this we have deduced the path integral expression for the Schrodinger
propagator. It is identical in form with the expression for the transition ampli-
tude K(b, a) in (13.26). Now the undetermined normalization factor appearing
r
in (13.26) can be identified as
A(t:) = C%lit: /2
(13.35)
Note that we were led to the Greens function K(b, a) of the Schrodinger
equation exclusively by the path integration method.
EXERCISE
C(a) = ~ (a)n
~ n!
(dnC)
dan
I (2)
n=O a=O
In the following, a useful operator identity will be applied:
, aA' -aA ~ (a)n , ,
K(a) = e Be = ~ - , [A,B](n), (3)
n.
n=O
with [A,E](o) = 13, [A,E](1) = [A,E], [A, 13](2) [A, [A, 13]], .... For the
proof of (3) the coefficients (dnkjdan)la=o of the Taylor series have to be
calculated:
k(a) = ~ (a)n
~ n!
(d nk)
dan
I .
n=O a=O
13.3 Path Integral Representation of the Schrodinger Propagator 373
n=2
d2 KI
d0: 2 a=O
= [A, [A, Bll = [A, B](2) . (9)
dn K I
-n
" [A, Ell
= [A,··, , (10)
do: a=O "---v---'
n tinles
Inserting (10) into the Taylor series for K(o:) yields the identity (3).
Turning to the operator function 0(0:) = e- aT e- av and calculating ex-
plicitly the first terms of its Taylor series, we obtain:
n=O
O(o:)la=o i·,
n=1
dO
(-)i'0(0:) + (_)e-aTVe- aV (11)
do:
(_ )i'0(0:) + (-) e-aTV eaT e- aT e- av (12)
, ,
(-)TG(o:) ~ -----:mr-[T,v](m)
+ (-) ('V + ~ (_o:)m " ) G(o:)
' , (13)
(15)
374 13. Feynman's Path Integral Formulation of Schrodinger's Wave Mechanics
Exercise 13.3.
" ~ (-a)m " ) dC
( (-I)(T+ V) + (-1) ~l ~[T, VjCm) da (16)
00 ( )m-l
2 '" -a " ,
+ (-1) ~l (m _ I)! [T, VjCm)G(a) , (17)
(18)
In the way indicated, all higher derivatives can be determined. Then one gets
dnCI
dan 0=0 = (-I)n(T
' + v)n
, + {commutator terms}.
Inserting this into the Taylor expansion (2) and performing the summation
one obtains
2
C(a) = F(a) + ~ [T, Vj + 0(a 3 ) (19)
Hence we find
[F(a)t - (C(a))N = 0(a 2 ), (20)
i.e. the above difference is at least proportional to a 2 = ).,2/ N2. In the limit
N --> 00 the right-hand side of (20) vanishes, which proves the validity of
Trotter's formula (1).
In the preceding subsection we convinced ourselves that the path integral for
the transition amplitude is identical with the Greens function for the one-
dimensional Schrodinger equation
The Greens function K(b, a), or if we specify the arguments in more detail,
K(xt, x't'), represents the probability amplitude for the propagation of a par-
ticle from place x' at time t' to x at a later time t. The probability amplitude
for finding a particle at time t at position x is described by the wavefunc-
tion P(x, t). If we are not interested in the particle's past, we can define a
wavefunction by
P(x, t) = J
dx' K(xt, x't')P(x', t') . (13.37)
The integral core K(xt, x't') propagates the wavefunction from time t' to time
t. We now consider the special situation in which t and t' differ only by an
13.4 Alternative Derivation of the Schrodinger Equation 375
ll/(x, t + f) =
['>0 dx , exp [if
Ii1 Loo li,L (x+x' x-x' f)]
- 2 - ' - f - ' t + "2 If/(x, t).
(13.38)
The Lagrangian is given by
L = ; j;2 - V (x, t) .
= i:
+ f)
d~~expG:e)exp[-~v(x+~,t+~)]If/(x+~,t),
we expand the integrand and the left-hand side of the above equation, keeping
only terms up to linear in f and quadratic in ~. This leads to
i:
The integrals are given by
1 00
-00 d~ exp e~
em) 0, (13.41)
i:
2lif
If/(x, t)
a
+ I' at If/(x, t) (13.43)
= Ii1 [( --:;:;:;:-
27rilif) 1/2 (
If/(x,i
t) f a
ilif ax
- Ii, V(x, t)ll/(x, t) + 2m
2
2 1f/(x, t) ) 1.
376 13. Feynman's Path Integral Formulation of Schrodinger's Wave Mechanics
The normalization factor A(f) has to be choosen in such a way that (13.43) is
fulfilled in the limit f ---+ O. Therefore we have to choose A(f) = (27rnf/m)-1/2,
which we have seen already [see (13.35) and Exercise 13.2J. It follows that
8 k ~ 82
tIt + f 8t tIt tIt - fi VtIt - 2i m 8x 2 tIt .
In first order in f this equation is fulfilled only if
8 n2 82
in-tIt = - - - t I t + VtIt
at 2m 8x 2
is valid. This is the Schrodinger equation.