0% found this document useful (0 votes)
541 views154 pages

Lecture Notes: Introduction To Condensed Matter Theory

This document provides an outline for a lecture series on condensed matter theory. The lectures will cover topics ranging from basic concepts in quantum statistical mechanics and symmetries to more advanced topics like topological insulators, spin-orbit coupling, and density functional theory. The lectures separate the study of condensed matter into different energy scales and make use of symmetries to tackle the many-body problem. Specific topics that will be covered include Bloch's theorem, Fermi liquid theory, magnetism, interacting instabilities like superconductivity, and relativistic effects in band structures. Hands-on sessions are also planned to demonstrate computational tools for analyzing crystal and band structures.

Uploaded by

谭志阳
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
541 views154 pages

Lecture Notes: Introduction To Condensed Matter Theory

This document provides an outline for a lecture series on condensed matter theory. The lectures will cover topics ranging from basic concepts in quantum statistical mechanics and symmetries to more advanced topics like topological insulators, spin-orbit coupling, and density functional theory. The lectures separate the study of condensed matter into different energy scales and make use of symmetries to tackle the many-body problem. Specific topics that will be covered include Bloch's theorem, Fermi liquid theory, magnetism, interacting instabilities like superconductivity, and relativistic effects in band structures. Hands-on sessions are also planned to demonstrate computational tools for analyzing crystal and band structures.

Uploaded by

谭志阳
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 154

Lecture Notes: Introduction to Condensed Matter Theory

Titus Neupert1
1
Department of Physics, University of Zurich,
Winterthurerstrasse 190, CH-8057 Zürich, Switzerland
(Dated: May 28, 2018)

1
CONTENTS

Plan of the lectures 5

I. Introduction 6

II. From quantum statistical mechanics to condensed matter 9


A. Basic concepts 9
1. Second quantization 9
2. Hamiltonians and partition functions 12
3. Thermodynamic potentials 15
4. Spontaneous symmetry breaking 16
5. Mean-field approximation 19
6. Ginzburg-Landau theory for the Ising model 21
7. Critical exponents 23
B. Linear response 24
C. Symmetries in condensed matter 26
1. Non-local vs. local, unitary vs. antiunitary 26
2. Symmetries of a crystal 28
3. Reciprocal space 29

III. Noninteracting electron systems 32


A. Bloch’s theorem 32
B. Nearly free electron approximation 33
C. Tight-binding approximation 35
D. Wannier states 38
E. Interlude: Group representations and character tables 40
F. Symmetries in band structures 42
G. Obtaining information about band structures and symmetries 46
H. Filling of bands and density of states 47
I. Coupling to an external electro-magnetic field 48
1. Semiclassical equations of motion 49
2. The integer quantum Hall effect 51

2
3. Quantum oscillations 57
J. Phonons 59
1. Quantizing lattice vibrations 59
2. Peierls instability in 1D 61

IV. Interactions in itinerant systems 65


A. Particle-hole excitations 65
1. In semiconductors 65
2. In metals 68
B. Fermi liquid theory 73
1. Quasiparticles 74
2. Basics of Landau Fermi liquids 75
3. Specific heat – Density of states 77
4. Compressibility and susceptibility 78
5. Galilei invariance – effective mass and F1s 80
6. Stability of a Fermi liquid 82
7. Microscopic justification 83
8. Distribution function 86
9. Fermi liquid in one dimension? 87
C. Transport in metals 89
1. Linear response and complex analysis 89
2. Boltzmann transport theory 91
3. Conductivity 92
4. Optical properties 94
5. Computing the relaxation time 95
6. Kondo effect 98
7. Anderson localization in one dimension 100
D. Interacting instabilities 105
1. The fractional quantum Hall effect 105
2. Superconductivity 107
3. Density waves 118
4. Mott insulators 118

3
V. Magnetism in solids 125
A. Antiferromagnetism 125
1. Superexchange in the Hubbard model 125
2. Mean-field theory and its shortcomings 126
3. Spin-wave theory 128
B. Stoner magnetism 133
1. Mean-field theory 133
2. Susceptibility on the paramagnetic side 136
3. Magnons 137

VI. Advanced Topics 140


A. Relativistic effects in solids: spin-orbit coupling 140
B. Effective relativistic dispersions in band structures: Topological insulators and
Weyl semimetals 142
1. Topological insulators in three dimensions 143
2. Weyl semimetals 147
C. Density functional theory 148
1. Variational principle and Hartree-Fock approximation 148
2. Hohenberg-Kohn theorems 150
3. Kohn-Sham equation 152
4. Local density approximation and beyond 153

4
PLAN OF THE LECTURES

1. 22. Feb: I. Intro; II. Basic concepts

2. 23. Feb: II. Basic concepts

3. 1. Mar: II. Basic concepts, II. Linear response

4. 2. Mar: II. Symmetries in condensed matter

5. 8. Mar: III. Bloch theorem, III. Nearly free electrons

6. 9. Mar: III. Tight binding, III. Wannier states

7. 15. Mar: III. Group representations and character tables; III. Symmetries in band
structures; HANDS ON with ICSD, Materials project, Vesta and Bilbao server;

8. 16. Mar: III. Filling of bands and DOS, III. Coupling to EM field

9. 22. Mar: III. Coupling to EM field

10. 23. Mar: III. Coupling to EM field

11. 29. Mar: III. Phonons

12. 30. Mar– 6. Apr: (Easter break – no lecture)

13. 12. Apr: IV. Particle-hole excitations

14. 13. Apr: IV. Fermi liquid theory I

15. 19. Apr: IV. Fermi liquid theory II

16. 20. Apr: IV. Fermi liquid theory III

17. 26. Apr: IV. Transport I (lecture by Frank Schindler)

18. 27. Apr: IV. Transport II (lecture by Frank Schindler)

19. 3. May: IV. Transport III

20. 4. May: V. Interacting instabilities I: FQHE/supercondutivity

5
21. 10. May: (Auffahrt – no lecture)

22. 11. May: V. Interacting instabilities II: superconductivity

23. 17. May: V. Interacting instabilities III: Mott insulators

24. 18. May: VI. Antiferromagnetism

25. 24. May: VI. Stoner magnetism

26. 25. May: VI. Advanced topics: Spin-orbit coupling

27. 31. May: VI. Advanced topics: Topological insulators

28. 1. Jun: VI. Advanced topics: DFT

I. INTRODUCTION

The study of quantum condensed matter is a very hard problem. We consider in gen-
eral interacting many-body systems of 1024 degrees of freedom, typically represented by the
electrons in a solid. Since the Hilbert space is exponentially large in the number of degrees
of freedom, any ’brute force’ theoretical approach using the resources of classical computers
to this problem must fail. A quantum computer might help, but is not (yet) available. Nev-
ertheless, impressive progress has been made in the theoretical study of solid state systems,
thanks to two bold simplifications:
Separation of energy scales: In contrast to high-energy physics, where new physics
appears at ever higher energies, in condensed matter physics the phenomena of interest
typically appear at very low energies. To put it boldly, these two energy frontiers are where
we can expect new physics to appear. Everything in between is, at least on a fundamental
level, understood. Different physical phenomena can be separated by the energy scale at
which they appear, and their impact on the phenomena at smaller energy scales can often be
incorporated in some effective form. For example, it is in many cases sufficient to consider the
crystal as a rigid lattice of ions in which the electrons move or to only consider the coupling
between the electrons and the low-energy vibration (phonon) modes of the crystal. As
another example, the Coulomb interaction between electrons can in some cases be neglected
so that one solves only the problem of free electrons moving in the background potential

6
of the ions. A plethora of more sophisticated approximations has been developed, some of
which we will encounter in this course.
Use of symmetries: The formation of a crystal breaks (spontaneously) the Lorentz
symmetry of open space. At first sight, this appears to be problematic, as one cannot take
advantage of this symmetry group to describe the low-energy properties of electrons, in the
same way as one can identify allowed terms in a Lagrangian in high-energy physics, say.
However, the crystal breaks the symmetry not completely, but retains a discrete translation
symmetry and potentially other symmetries such as rotations, mirror reflections, inversion,
time-reversal symmetry and so on. In addition, at low energies effective symmetries can
emerge that are not shared by the whole crystal. For example, while a cubic crystal has
only four-fold rotational symmetry, the dispersion of electrons at low electron-density might
still be well described as parabolic, with an effective continuous rotation symmetry. This
renders the situation much better than, say, in bio-chemistry, where one cannot take much
advantage of constraining the problems by symmetries.
Thus, the ’master equation’ for solid state systems in equilibrium is given by the Hamil-
tonian

H = H0,e + H0,i + He−e + Hi−i + Hi−e ,


X
H0,e = (α · pj + βme ),
j
X Pj2
H0,i = , (1)
j
2M j

X e2 X e2 Zl X e2 Zj Zl
He−e = , He−i = , Hi−i = ,
j,l
|rj − rl | j,l
|rj − Rl | j,l
|Rj − Rl |

where pj and rj are momentum and position operator of the j-th electron, me is the electron
mass, Pj and Rj are momentum and position operator of the j-th nucleus, Zj and Zj are
its charge and mass. The Dirac form of the electronic kinetic Hamiltonian was chosen to
include the physics of spin-orbit coupling, which is important to topics of recent interest
in solid state physics. Hamiltonian (1) is essentially impossible to solve without further
approximations. (What is not included in Hamiltonian (1) is the physics associated with
the nuclear spins.)
To make progress in understanding a specific phenomenon or system, one often works

7
with a toy model. While such models can be motivated using qualitative or quantitative
arguments, solid state theorists can become very creative in dreaming up new models. Some-
times, these models are still very hard to solve, like the paradigmatic Hubbard model, for
example. In other cases, the advantage of a specific model might be its exact solubility, as
is the case with the transverse-field Ising model, for example.

8
II. FROM QUANTUM STATISTICAL MECHANICS TO CONDENSED MATTER

A. Basic concepts

1. Second quantization

In first quantization, we denote the (non-degenerate) quantum state of a many-body


system of N identical particles by a wave function

Ψ(α1 , α2 , · · · , αN ), (2)

where the αi , i = 1, · · · , N run over labels for a basis of the single-particle Hilbert space
H1 and each αi denotes the single-particle state that the i-th particle is in. For example, α
could label the position of a particle and in addition some internal degrees of freedom such as
spin. The Pauli principle, the axiom underlying many-body quantum mechanics, says that
equivalent particles are indistinguishable. Any observables, such as the probability density,
may thus not depend on the order in which the states αi of indistinguishable particles are
listed in the arguments of Ψ. For the exchange of only two particles, this amounts to impose
the equality

|Ψ(α1 , · · · , αi , · · · , αj , · · · , αN )|2 = |Ψ(α1 , · · · , αj , · · · , αi , · · · , αN )|2 . (3)

Thus, upon exchange, Ψ could be multiplied by an arbitrary phase eiϕ(αi ,αj ) . However,
exchanging the particles twice needs to bring the wave function back to itself, in order for
the wave function to be single valued, i.e., eiϕ(αi ,αj ) = ±1. (Non-degenerate wave functions
that are not single-valued are not permitted.) As a result, there are only tow possibilities

Ψ(α1 , · · · , αi , · · · , αj , · · · , αN ) = ±Ψ(α1 , · · · , αj , · · · , αi , · · · , αN ), (4)

which correspond to bosons (the + sign) and fermions (the − sign).


We recall that the N -particle Hilbert space HN is a direct product of N copies of the
single-particle Hilbert space, and the Fock space F is a direct sum over these

N
O ∞
M
HN := H1 , F := HN , (5)
i=1 N =1

9
where H0 is a one-dimensional space spanned by the vacuum |0i alone. A basis for the
N -particle Hilbert space is then given by

1 X
Ψ(α1 , α2 , · · · , αN ) = √ (±1)σ(p) ψα1 (p1 )ψα2 (p2 ) · · · ψαN (pN ), (6)
N ! p∈PN

where the sum runs over all permutations p of the first N integers, σ(p) denotes the sign of the
permutation, and the signs (+1) and (−1) apply to bosons and fermions, respectively. Here,
ψα are a set of basis functions of the single-particle Hilbert space H. The basis functions (6)
of HN are thus fully symmetrized (antisymmetrized) products of N single-particle basis
functions ψα in the case of bosons (fermions). The latter can be written in the form of a
determinant of the matrix with elements Mij = ψαi (j), i, j = 1, · · · , N , the so-called Slater
determinant. In bra-ket notation, the state corresponding to Eq. (6) is often written in
the occupation number representation by listing the number of particles that are occupying
a each single particle state α by nα . For fermions, nα = 0, 1, for the fully antisymmetric
wave function (6) vanishes if two or more particles occupy the same single-particle state. For
bosons, nα can be any nonnegative integer. For example, in a system with a five-dimensional
single-particle Hilbert space H1 (i.e., in which α can only take 5 different values), we would
denote the state with 3 particles, that occupy the first, third, and forth single particle state
by
Ψ(1, 3, 4) or |{nα }i = |1, 0, 1, 1, 0i. (7)

c†α and
The key object of the second-quantized formalism are particle creation operators b
annihilation operators b
cα . For fermions, these operators act as

c†α |n1 , · · · , nα , · · · i = δnα ,0 (−1)n1 +···+nα−1 |n1 , · · · , nα + 1, · · · i,


b
(8)
cα |n1 , · · · , nα , · · · i = δnα ,1 (−1)n1 +···+nα−1 |n1 , · · · , nα − 1, · · · i,
b

and obey the relations

{b c†α0 } = δα,α0 ,
cα , b {b cα0 } = {b
cα , b c†α0 } = 0,
c†α , b (9)

where {A,
b B}
b =A
bBb+B
bAb is the anticommutator. For bosons, they act as


c†α |n1 , · · · , nα , · · · i =
b nα + 1|n1 , · · · , nα + 1, · · · i,
√ (10)
cα |n1 , · · · , nα , · · · i =
b nα |n1 , · · · , nα − 1, · · · i,

10
and obey the relations

[b c†α0 ] = δα,α0 ,
cα , b [b
cα , b
cα0 ] = [b c†α0 ] = 0.
c†α , b (11)

c†αb
For example, the occupation number of a state α can be measured with the operator b cα ,
c†αb
because b cα |n1 , · · · , nα , · · · i = nα |n1 , · · · , nα , · · · i. Any state in the basis of F can be
generated by application of a product of c†α operators to the vacuum state |0i, i.e.,
Y
|{nα }i = c†α )nα |0i,
(b (12)
α

c†α )0 is the identity.


where it is understood that (b
The second quantized operators transform natural under a basis change of the single
particle Hilbert space H1 . Let uα,a = hα|ai be the matrix elements of the unitary operator
that encodes the change from a basis ψα of H1 to a different basis ψa . Then,
X X
c†a =
b c†α ,
uα,ab ca =
b u∗α,ab
cα . (13)
α α

This transformation also applies if the label a is a continuous variable, such as the position
r of a particle, in wich case we obtain the field operators
X X
φb† (r) = c†α ,
hα|rib φ(r)
b = hr|αib
cα (14)
α α

that obey in the case of fermions

{φ(r),
b φb† (r 0 )} = δ(r − r 0 ), {φ(r),
b b 0 )} = {φb† (r), φb† (r 0 )} = 0,
φ(r (15)

and in the case of bosons

[φ(r),
b φb† (r 0 )] = δ(r − r 0 ), [φ(r),
b b 0 )] = [φb† (r), φb† (r 0 )] = 0.
φ(r (16)

For example, a transformation of the form (14) is relevant to the case of a particle on a
ring of circumference L, i.e., r ∈ [0, L) with periodic boundary conditions. The momentum
k takes discrete values k = 2πl/L, l ∈ Z (like the index α), while r is continuous. The
coefficients hk|ri are nothing but the Fourier modes and we get
1 X ikr †
φb† (r) = √ ck ,
e b (17)
L k
with the sum running over k = 2πl/L, l ∈ Z, as the transformation law between the operator
c† that creates a particle with momentum k on the ring and the operator φb† (r) that creates
bk

a particle at position r on the ring.

11
2. Hamiltonians and partition functions

Most of the time, we are interested in the properties of the unperturbed condensed matter
system, or in its repose to small perturbations. In the former case, we can treat the system
as closed, neither exchanging energy nor particles with the environment (corresponding to
the microcanonical ensemble of statistical mechanics). In the latter case, we can use the
isolated system as a good starting point for perturbative calculations within linear response
theory.
Within the Hamiltonian formalism, the energy conserving system is described by a Hamil-
tonian H
b that acts on the Hilbert or Fock space. Finding the eigenvalues and eigenstates of

this operator amounts to solving the problem, a task that is often too hard to be achieved
without further approximations. The Hamiltonian can be given either in first or second
quantization. For the free particle of mass m on the ring with periodic boundary conditions,
its first quantized version is
pb2
Hµ =
b − µ, (18)
2m
d
where pb is the momentum operator that has the representation pb = −i dr in the position
basis of the single particle Hilbert space. (Throughout these notes, we will set ~ = 1, and
only reinstate it in specific places to connect to well known-results in the right units.) The
real number µ is the chemical potential the meaning of which will be discussed below. In
the second quantized formulation, we can express Hµ either in the operators φ(r)b or b
ck
Z L
1 d2 X k2
   
Hbµ = dr φb† (r) − 2
− µ φ(r)
b = −µ b c†k b
ck , (19)
0 2m dr k
2m
where we used that Z L
dr i(k−k0 )r
e = δk,k0 . (20)
0 L
b := P b †
Notice that the term multiplying µ is nothing but the operator N k ck b
ck that counts
the total number of particles in the system. From Eq. (19) we see that we have diagonalized
the Hamiltonian in momentum space, and its eigenvalues when acting on the single-particle
Hilbert space H1 are the energy levels of the system, given by εk = k 2 /(2m) − µ, with
k = 2πl/L, l ∈ Z. The 2π/L discretization is due to the finite size of the system. One
frequently encounters this splitting when considering small systems that can be solved (e.g.,
numerically), from which the behavior of the thermodynamically large system is to be ex-
trapolated.

12
An important difference between the two Hamiltonians (18) and (19) is that the former
always acts on a single-particle Hilbert space (as we only wrote the momentum operator
for one particle – for N particles we should have written the sum N b2i over their N
P
i=1 p

momentum operators instead), while the latter can act on any HN or the Fock space F.
b µ at zero temperature, when acting on F. When
Let us understand the ground state of H
µ < 0, it is just the vacuum |0i, as occupying any state would cost energy. For µ > 0 we have
to separately consider the case of bosons from that of fermions: When considering bosons,
an infinite number of particles would occupy the single-particle eigenstate of lowest energy
0 . In the case of fermions, every state can only be occupied once due to the Pauli exclusion
principle. At the same time, it is only energetically advantageous to occupy single-particle
eigenstate with negative energy εk < 0. (We don’t consider here the accidental case when
on of the εk equals 0, in which case the ground state would be degenerate.) The fermionic
zero-temperature ground state, the so-called Fermi sea (FS), is thus
!
Y
|FSi = c†k
b |0i. (21)
k: εk <0

Finite-temperature properties of the system at inverse temperature β = 1/(kB T ) can be


computed from the partition function

Zβ,µ = TrF e−β Hµ . (22)


b

Equation (22) can be rationalized by recalling that the partition function is a sum of the
unnormalized probablities e−βEs over all microstates s of the system, with associated energy
Es . Evaluating the trace TrF in the basis spanned by the eigenstates of H b µ readily yields

this connection. With Eq. (22) we compute the grand canonical partition function, i.e.,
the partition function for the grand canonical ensemble that is defined by not having a
fixed particle number. This is reflected by the fact that we take the trace over the Fock
space F instead of a Hilbert space HN of fixed particle number N , and by the fact that
the Hamiltonian contains the term −µNb . We indicate it with the subscript µ to remind

us of the dependence on the chemical potential. The canonical partition function Zβ,N
for a system with fixed particle number N can be computed by dropping the latter term
in the Hamiltonian and by restricting the trace in Eq. (22) over HN . Equation. (22) is
a completely general formula that can be applied to any, possibly interacting, quantum

13
system. The finite-temperature expectation value of an operator O
b is given by

b β,µ = 1 TrF Oe
h i
hOi b −β Hbµ . (23)
Zβ,µ

As an example, we consider the momentum occupation number n c†k b


bk := b ck for noninter-
acting particles in one dimension (an example would be the system governed by Hamiltonian
(19)). We first compute the grand canonical partition function. To do so, we break up the
trace
TrF = TrH0 + TrH1 + TrH2 + · · · + TrHN + · · · (24)

to compute

Zβ,µ = TrF e−β Hµ


b

X X X
=1 + e−βεk + e−βεk1 e−βεk2 + · · · + e−βεk1 × · · · × e−βεkN + · · ·
k k1 ,k2 k1 ,··· ,kN (25)
!
Y X
= e−βεk nk
k nk

For fermions, the momenta in each sum in the second line of Eq. (25) have to be all distinct
k1 6= k2 6= · · · and the nk -sum in the last line is only taken over nk = 0, 1. For bosons, the
sums are unrestricted and nk runs over all nonnegative integers. As a result, we obtain
Y Y 1
F
Zβ,µ = (1 + e−βεk ), B
Zβ,µ = , (26)
k k
1 − e−βεk

for fermions and bosons, respectively. Note that (1 + e−βεk ) and 1/(1 − e−βεk ) are the
partition functions for a system of only a single degree of freedom (i.e., a system where
the single-particle Hilbert space is restricted to only one momentum state). In general, the
partition function of a system of independent degrees of freedom (i.e., degrees of freedom not
coupled by the Hamiltonian through interactions), are given by the product over the partition
functions of each of these degrees of freedom. Note that the form of the partition functions
in Eq. (26) is general for any noninteracting fermion/boson system. The momentum k
could be replaced by any label of the energy eigenvalues of the Hamiltonian acting on the
single-particle Hilbert space.
Using the last line of Eq. (25), we can compute hb nk iβ,µ as
!  
1 X Y X 1 ∂
hb
nk iβ,µ = nk e−βεk nk  e−βεk0 nk0  = − ln Zβ,µ (27)
Zβ,µ n k0 6=k n
β ∂εk
k k0

14
and obtain using Eq. (26)
1 1
nk iFβ,µ =
hb , nk iB
hb β,µ = , (28)
eβεk + 1 eβεk − 1
for fermions and bosons, respectively. The functions fF-D (ε) := 1/(eβε + 1) and fB-E (ε) :=
1/(eβε − 1) are the Fermi-Dirac and Bose-Einstein distribution functions, respectively. They
dictate how the zero-temperature ground state occupations (i.e., the Fermi sea in the case
of fermions) are smeared out by thermal fluctuations at finite temperature.

3. Thermodynamic potentials

From the partition function, one can derive the thermodynamic potentials that are useful
quantities to characterize the state of a system and to compute its thermodynamic properties,
such as the specific heat or the response to perturbations in the form of susceptibilities. Here,
we will only discuss the quantities that will be relevant in the later course of the lecture.
From the canonical partition function, we can compute the Helmhotz free energy as
1
Fβ,N := − ln Zβ,N . (29)
β
Analogously, from the grand canonical partition function, we can compute the grand poten-
tial as
1
Ωβ,µ := − ln Zβ,µ . (30)
β
The internal energy U is given by
∂ ∂
UN = − ln Zβ,N , Uµ = − ln Zβ,µ , (31)
∂β ∂β
and can be used to compute the specific heat
∂U
C= . (32)
∂T
We can further compute susceptibilities in of the system to external perturbations. If we
perturb the system with an external parameter A and want to know how another parameter
B of the system reacts to it, we compute
d2 F
χAB =− . (33)
dAdB
For example, the magnetic susceptibility of an isotropic system is obtained by setting A =
B = H, where H is the magnetic field.

15
4. Spontaneous symmetry breaking

One of the central questions of condensed matter physics is to classify different phases
of matter, study their properties and the transitions between them. The term “phase” can
be used on different levels of rigor, but generically denotes a region in the space of external
system parameters (such as applied electromagnetic fields, pressure, temperature etc) in
which the properties of the system do not change. Hence, a phase is associated with some
rigidity or stability against perturbations. At transitions between phases, system properties
may change in a non-smooth fashion.
Some refined considerations apply to the definition of phases at zero temperature. Such
phases and the transitions between them are purely associated with quantum fluctuations,
rather than thermal fluctuations. Phase transitions at zero temperature are therefore called
quantum phase transitions, or if happening at a point in the phase space this is called
quantum critical point. At zero temperature, one can further distinguish gapless and gapped
phases of matter, depending on whether or not the ground state is separated from all excited
states by a finite energy gap in the thermodynamic limit.
One paradigm to classify phases of matter is that of spontaneous symmetry breaking,
associated with the names Ginzburg and Landau. The system undergoes a phase transition
from a high-symmetry, high-temperature phase, to a low-symmetry, low-temperature phase
at a critical temperature phase Tc . Associated with the transition is a local observable, the
so-called order parameter. Its expectation value is zero for T > Tc and nonzero for T < Tc .
One can deduce properties of the phase transition by general group-theoretic considerations.
Denote by G the symmetry group of the system above Tc and by G 0 ⊂ G the symmetry group
below Tc . The cardinality of the quotient G/G 0 determines the degeneracy of the ordered
phase. The order parameter manifold is a representation of this quotient group. It can be
either a continuous or discrete group.
Let us consider two examples, an Ising and a Heisenberg magnet. The Hamiltonian of
the Ising model is given by
bI = − J
X
H sbi sbj (34)
2
hi,ji

and the Hamiltonian of the Heisenberg model is given by

bH = − J
X
H Sbi · S
bj , (35)
2
hi,ji

16
where the sum over i, j runs over the sites of a d dimensional square lattice and the hi, ji
notation means that we only sum over nearest neighbor sites of the lattice. The coupling
constant J is chosen to be positive for a ferromagnet and negative for an antiferromagnet.
We assume the Ising spins sbi to be operators with a spectrum ±1 that all commute with
one another. The Heisenberg spins Sbi are an operator-valued 3-vector with components that

obey
[Sbi,α , Sbj,β ] = iα,β,γ δij Sbi,γ , bi · S
S bi = S(S + 1), (36)

where S is a positive half-integer or integer constant. This renders the Ising problem com-
pletely classical (all operators entering the Hamiltonian commute), while the Heisenberg
model could in principle show quantum-mechanical behavior.
In either case, we can guess the zero-temperature ground states for the ferromagnetic
couplings J > 0. In the Ising case, they are given by

|+i := |si = +1 on all sitesi, |−i := |si = −1 on all sitesi. (37)

In the Heisenberg case, they are given by

|ni := |n · S
b = +1 on all sitesi, (38)

where n is a unit vector in any chosen direction. These are two states for the Ising case
and a continuum of states that represent the rotation group SO(3) in spin space. It is not a
coincidence that we obtain these degenerate ground states. Hamiltonian (34) has a so-called
global Z2 symmetry, which physically corresponds to time-reversal: It is invariant under
the spin-flip sbi → −b
si ∀i. The Hamiltonian (35) has a global SO(3) symmetry, because
Q iw·Sbi /S
the spin-rotation i e , that rotates all spins around the axis w by an amount |w|
commutes with the Hamiltonian for all choices of the constant vector w. For that reason
all states at a given energy must span a subspace of the Hilbert space that is invariant
under the application of the symmetry operation. Therefore, if |+i is a ground state, there
must be a degenerate ground state |−i, and likewise for the Heisenberg case. In fact the
Hamiltonians (34) and (35) have larger symmetry groups that we denote by GI and GH ,
respectively, and which includes lattice translations and spatial rotations, for example. We
factorize these groups as GI = GI0 × Z2 and GH = GH0 × SO(3) into a part GH,I
0
of which each
ground state is a trivial representation and the nontrivial part discussed above.

17
In a real system, the symmetries are never exactly present, but broken by small local
perturbations. (The emphasis on locality comes from the physical requirement that terms
in the Hamiltonian should not couple degrees of freedom that are far away, or if they do
so, then with a strength that decays sufficiently fast with their distance.) For example, the
spin-rotation symmetry can be broken by magnetic impurities, or even by coupling to the
earth’s magnetic field. As a result of these perturbations, the system will choose one of the
(almost) degenerate ground states. Once chosen, this ground state is robust, even if the
perturbation is removed or reversed, because changing between, say, |+i and |−i requires
reorganizing a macroscopic number of degrees of freedom. This constitutes a large energy
barrier as domain walls have to propagate through the system. In fact, this process has
vanishing probability if the system size is taken to infinity (the thermodynamic limit).
The process of spontaneous symmetry breaking can be cast in more rigorous terms as
follows. We define an order parameter, the magnetization

c= 1 c= 1
X X
M sbi , M S
bi , (39)
N i N i

where N is the number of lattice sites. We further add perturbations to the Hamiltonians
X X
H
b I,h = H
bI + hb
si , H
b H,H = H
bI + H ·S
bi (40)
i i

0
that break the symmetry subgroup GI,H /GI,H . Spontaneous symmetry breaking occurs if

0 6= lim lim hM
ciβ,H ,
I,h
0 6= lim lim hM
c iβ,H ,
H,H
(41)
h→0 N →∞ H→0 N →∞

while it is zero if the limits were taken in the reverse order. The expectation value at finite
inverse temperature β was defined in Eq. (23). Equation (41) holds for the ferromagnetic
case J > 0. For the antiferromagnet, we would need to use the staggered magnetization as
an order parameter and apply a staggered perturbation. To be able to define a staggered
magnetization, it is important that our square lattice is bipartite in any dimension, i.e.,
the sites can be divided into two sets A and B such that all nearest neighbors of a site
in A are in B. For instance, a triangular lattice in d = 2 is not bipartite and we say the
antiferromagnet is geometrically frustrated on this lattice. We will study these magnetic
orders in more detail in Chapter V.
An important distinction is made between spontaneous symmetry breaking of a discrete
and a continuous symmetry group. The latter are accompanied by Goldstone modes (one

18
mode per generator of a continuous symmetry), which are gapless excitations above the
symmetry-breaking ground state manifold. This implies that the ground state of the Ising
ferromagnet can have (and has in fact) a finite gap to all other excitations, while the ground
state of the Heisenberg ferromagnet has gapless excitations that are spin-wave deformations
of the ferromagnetic pattern.
A natural question is: Under which conditions can spontaneous symmetry breaking oc-
cur? This question is answered by the Mermin-Wagner theorem, stating that a continuous
symmetry cannot be spontaneously broken at any finite temperature in d ≤ 2 (with suffi-
ciently short-ranged interactions). In d = 2, a continuous symmetry can be spontaneously
broken at zero temperature. In d = 1, a continuous symmetry can never be broken sponta-
neously.

5. Mean-field approximation

States with spontaneously broken symmetry are often treated within the so-called mean-
field approximation. Above, we have seen that in a symmetry-broken ground state, observ-
ables that have a vanishing expectation value in any eigenspace of the Hamiltonian (which
is necessarily invariant under the symmetries of the system) acquire a non-vanishing expec-
tation value. For the example of the spin operator in the Ising model, we can relate it to
the classical magnetization M of the system

sbi = M + δb
si . (42)

The mean-field approximation consists of dropping higher orders of the operators δb


si from
the Hamiltonian, so that the system becomes soluble.
We will go though this mean-field calculation for the example of the Ising model, where
si = sbi − M for all lattice sites i. The Hamiltonian is rewritten as
δb

b =− J
X X
H [M + (bsi − M )][M + (b
si − M )] − h sbi
2 i
hi,ji
(43)
X M2
 X
=−J zM sbi − z −h sbi + O(δb
si δb
sj ),
i
2 i

where z is the coordination number of the nearest neighbors on the lattice (on a hypercubic
lattice z = 2d). Neglecting the last term is justified if hδb sj i/(hδb
si δb si ihδb
sj i) = hδb sj i/M 2 
si δb

19
1. The mean field Hamiltonian
X z
b mf = −
H sbi heff − N J M 2 + O(δb
si δb
sj ), (44)
i
2

which is the Hamiltonian for an ideal paramagnet (a collection of independent spins) in a


magnetic field heff = JzM + h. The partition function for this Hamiltonian is
2
Y X
ZN (β, M, h) = e−βJzM N/2 eβheff si
i si =±1 (45)
−βJzM 2 N/2 N
=e [2 cosh (βheff )] ,
from which we find the free energy
z
F (β, M, h) = −β −1 log ZN = N J M 2 − N β −1 log [2 cosh (βheff )] . (46)
2
To find out whether a stable mean-field solution for M exists, we are looking for a minimum
of the free energy at finite M . Indeed
∂F
0= = N JzM − N Jz tanh (βheff ) (47)
∂M
has such a solution iff the self-consistency equation

M = tanh [β (JzM + h)] (48)

has one. We particularize on the case of zero magnetic field, and expand the tanh to linear
order in M in order to find the phase boundary. Solutions to Eq. (48) exist when the slope of
the function on the righthand side at M = 0 is larger than 1. This linearized self-consistency
equation
M = βJzM (49)

yields from this condition the critical temperature

Tc = Jz/kB . (50)

We can use this to expand the free energy as


" 3 #
M2
 
Tc 1 Tc
F (T, M, h = 0) ≈ F0 (T ) + N Jz 1− + M4
T 2 12s2 T
 2 (51)
M4
 
T M
≈ F0 (T ) + N Jz −1 +
Tc 2 12s2
where for the last step we took into account that our expansion is only valid for T ≈ Tc .
Moreover, F0 = −N kB T log 2. This form of the free energy expansion is the famous Landau
theory of a continuous phase transition which we will analyze in more detail now.

20
6. Ginzburg-Landau theory for the Ising model

We have used the Landau expansion of the free energy above to discuss phase transitions
in the vicinity of the critical temperature where M was small. This can be extended to
a highly convenient method which allows us to discuss phase transition more generally, in
particular, those of second order. Landau’s concept of the disorder-to-order transition is
based on symmetry and spontaneous symmetry breaking.
A further important aspect emerges when long-length scale variations of the order pa-
rameter are taken into account. This can be easily incorporated in the Ginzburg-Landau
theory and allows to discuss spatial variations of the ordered phase as well as fluctuations.
For the Ising model of the previous section, we can identify M as the order parameter.
The order parameter M is not invariant under time reversal symmetry K b,

b M = −M .
K (52)

The two states with positive and negative M are degenerate. The relevant symmetry group
above the phase transition is
G =G×K (53)

with G as the space group of the lattice (simple cubic) and K, the group {E, K
b } (E

denotes the identity operation). As for the space group we consider the magnetic moment
here independent of the crystal lattice rotations such that G remains untouched through the
transition so that the corresponding subgroup is

G0 = G ⊂ G (54)

The degeneracy of the ordered phase is given by the order of G/G 0 which is 2 in our case.
The Ginzburg-Landau free energy functional has in d dimensions the general form
Z  
d A 2 B 4 κ 2
F [M ; h, T ] = F0 (h, T ) + d r M (r) + M (r) − h(r)M (r) + [∇M (r)]
2 4 2
Z (55)
d
= F0 (h, T ) + d r f (M, ∇M ; h, T )

where we choose the coefficients according to the expansion done in (51) as


 
Jz T Jz Jz
A= d − 1 = − d τ and B = d . (56)
a Tc a 3a

21
Here a is the lattice constant. The form of all the terms that are allowed to appear in the
free energy functional can be determined by symmetry: They must transform trivially under
all elements of G. We have introduced the spatial continuum limit for the order parameter
M which is based on the procedure of coarse graining. We take a block of sites with the
volume Ldb with Lb much larger than the lattice constant a and define
1 X Ldb
M (r) = hsi i with Nb = (57)
Nb ad
i∈Λb (r)

and Λb (r) is the set of sites in the block around the center position r. Here we assume that
hsi i is changing slowly in space on the length scale Lb .
Under this condition we can now also determine κ from the model Hamiltonian using the
following consideration. (This argument here is a short cut, the more consistent calculation
would involve making a Hubbard-Stratonovic transformation. However, the results are the
same.) The variational equation of the free energy functional is given by
δF ∂f ∂f
0= ⇒ 0= −∇· = −κ∇2 M + AM + BM 3 − h. (58)
δM ∂M ∂∇M
We seek for comparison of this equation with another approach. Let us go back to Eq. (43)
and keep M formally as a spatially varying function M (ri ) with slow variations for one
lattice site to the other. We consider the case h = 0 for simplicity. Evaluating the partition
function for this case yields
 
J X Y
ZN (β, {M (ri )}, h = 0) = exp −β M (ri )M (rj ) [2 cosh (βJzM (ri ))] (59)
2 i
hiji

For the free energy we obtain


JX 1X
F (β, {M (ri )}, h = 0) = M (ri )M (rj ) − log [2 cosh (βJzM (ri ))] , (60)
2 β i
hiji

which we still kept in the form of a function, not a functional, having discrete lattice points
in mind. The derivative with respect to a fixed M (ri ) at one site gives
∂F (β, {M (ri )}, h = 0) J X
= M (rj ) − β(Jz)2 M (ri ) = 0, (61)
∂M (ri ) 2 j∈NN
i

where we have already linearized in M Going to the continuum limit, we can now insert
that
X X 1 X X ∂2
M (ri + a) = zM (ri ) + a · ∇M (ri ) + aµ aν M (ri ). (62)
2 ∂rµ ∂rν
{a}NN {a}NN {a}NN µ,ν=x,y...

22
P
The sum {a}NN runs over nearest-neighbor sites. Note that the second term on the right-
hand side vanishes due to symmetry. We obtain
 
T
0 = Jz − 1 M (r) − Ja2 ∇2 M (r) , (63)
Tc
where factors of 1/2 are eaten up by accounting for double counting in the sums over nearest
neighbors. Comparison of coefficients leads to

κ = Ja2−d . (64)

We may rewrite the equation (63) as


a2 kB Tc
0 = M − ξ 2 ∇2 M with ξ 2 = (65)
zkB (T − Tc )
where we introduced the length ξ which is equal to the correlation length for T > Tc . It
diverges as the phase transition is approached.

7. Critical exponents

Close to the phase transition at Tc various quantities have a specific temperature or field
dependence which follows power laws in τ = 1 − T /Tc or h with characteristic exponents,
called critical exponents. We introduce here the exponents relevant for a magnetic system
like the Ising model. The heat capacity C and the susceptibility χ follow the behavior

C(T ) ∝ |τ |−α and χ(T ) ∝ |τ |−γ (66)

for both τ > 0 and τ < 0. Also the correlation length displays a powerlaw

ξ(T ) ∝ |τ |−ν . (67)

For τ > 0 (ordered phase) the magnetization grows as

M (T ) ∝ |τ |β . (68)

At T = Tc (τ = 0) the magnetization has the field dependence

M ∝ h1/δ . (69)

These exponents are not completely independent but are related by means of so-called scaling
laws:

23
• Rushbrooke scaling: α + 2β + γ = 2

• Widom scaling: γ = β(δ − 1)

• Josephson scaling: νd = 2 − α

Let us now determine the exponents within mean field theory. The only one we have not
found so far is δ. Using the Ginzburg-Landau equations for τ = 0 leads to

BM 3 = h ⇒ δ=3 (70)

Thus the list of exponents is

1 1
α=0, β= , γ=1 δ=3, ν= (71)
2 2

These exponents satisfy the scaling relations apart from the Josephson scaling which depends
on the dimension d.
The critical exponents arise from the specific fluctuation (critical) behavior around a
second-order phase transition. They are determined by dimension, structure of order param-
eter and coupling topology, and are consequently identical for equivalent phase transitions.
Therefore, the critical exponents incorporate universal properties.

B. Linear response

Linear response studies the change in an observable A


b due to a small change H
b b (t) in
B

the static Hamiltonian H 0 of the system. The perturbation can be time-dependent with,
b

say, a single frequency ω and is adiabatically slowly switched on, i.e., the time-dependence
is taken to be H b e−iωt+ηt with η a small positive real number. The Schroedinger
b b (t) = B
B

equation for the perturbation is then

d b
i A =[A
b, H
b0+ H
b b (t)]
B
dt (72)
=[A
b, H
b 0] + [ A b ]e−iωt+ηt .
b, B

Taking the thermal average of this equation h A b e−β Hb 0 ]/Tr[e−β Hb 0 ] yields


b i = Tr[ A

b i = h[ A
(ω + iη)h A b 0 ]i + h[ A
b, H b, B
b ]i, (73)

24
which is subsequently solved to linear order in B
b . Here it was assumed that the expec-
b and its commutators acquire the same time dependence e−iωt+ηt as the
tation values of A
perturbation, and we are only probing this one Fourier component.
Consider the example of perturbing a free electrons gas (a metal in condensed matter
language) with a spatially and temporally varying potential (e.g., caused by an external
electric field). We are interested in the response of the electron density. The density operator
in momentum space is given by
Z
ρb q = dr φb † (r) φb (r)e−iq·r
1X †
= c b c k+q (74)
Ω k k
b

1X
≡ ρb k,q .
Ω k

The setting corresponds to


X 1X †
H
b0= c †k b
k b c k, H
b
B = ρb †q V (q, ω)e−iωt+ηt = ρb k,q V (q, ω)e−iωt+ηt (75)
Ω k
b
k

b = ρb q . Using [AB, BD] = A{B, C}D − {A, C}BD − C{A, D}B + CA{B, D} we
and A
obtain the commutator

[ ρb k,q , ρb †k0 ,q ] = δk,k0 ( b


c †k b c †k+q b
ck − b c k+q ) (76)

and
[ ρb k,q , ρb †k0 ,0 ] = (δk+q,k0 − δk,k0 ) ρb k,q (77)

we have
nk+q iF − hb
(ω + iη)h ρb k,q i = (k+q − k )h ρb k,q i + (hb nk iF )V (q, ω), (78)

where hb c †k b
nk iF = h b c k i is determined by the Fermi-Dirac distribution. Thus, summing over
k, we find the change in the electron density due to the external perturbation as
1X
δρ(q, ω) = h ρb k,q i = χ0 (q, ω)V (q, ω), (79)
Ω k

where we defined the linear response function as


1 X hbnk+q iF − hb
nk iF
χ0 (q, ω) = , (80)
Ω k k+q − k − ω − iη

which is known as the Lindhard form.

25
C. Symmetries in condensed matter

1. Non-local vs. local, unitary vs. antiunitary

In particle physics, the fundamental symmetries are Lorentz symmetry, C (charge conjuga-
tion), P (parity or inversion), T (time-reversal symmetry), as well as the gauge symmetries
of the standard model. Of these symmetries, only T and P are important in condensed
matter physics. Wigner proved that any symmetry is either represented by antiuntiary or
unitary operators in quantum mechanics (i.e., by operators that contain complex conjuga-
tion or not). Time-reversal symmetry is of the antiuntary kind. To see this, consider the
Schrödinger equation: In order to flip ∂t to −∂t in general, the only option we have is to send
i → −i. The fundamental representation of time reversal on systems with bosonic degrees
of freedom or ‘spinless’ fermions is T = K. For spin-1/2 fermions, in contrast T = K(iσy ),
where σy is the second Pauli matrix acting on each fermion’s spin degree of freedom. Most
other symmetries that we will be concerned with are unitary (another fundamental exception
is the charge conjugation symmetry C in high energy physics mentioned above).
A local unitary symmetry U can be written as U = U1 U2 · · · Un , where each U1 acts
only on a small region in space and the number of operators n grows proportional to the
system size if the latter is taken to infinity. As an example, consider the spin rotations
Q
Rα = spins i exp[iα · Si ], that would rotate all spins in the Heisenberg model that we
discussed above by the same amount.
In contrast, a nonlocal unitary symmetry cannot be decomposed in the above way. Typ-
ical examples for such symmetries include spatial symmetries like reflections, rotations of
configuration space.
Such spatial symmetries shift the degrees of freedom (like lattice sites) around in space,
but also act on atomic orbitals and on the spin degree of freedom of an electron on each
site. For the latter part of their action, inversion (I), mirror M eb , and rotations Rα act on
a spin-1/2 as
I = σ0 , M eb = i eb · σ, Rα = exp[iα · σ/2]. (81)

As a side remark, time-reversal (and other anti-unitary symmetries) can be seen as a


non-local symmetry, since the complex conjugation acts non-locally. However, up to this
subtlety it is acting as a local symmetry. Time-reversal has to commute with all spatial

26
a) b) c)
K
b2 b1
a1 a2 M
K0

a1 a2

Figure 1. a) The honeycomb lattice with two different choices of unit cells. The lattice is not
primitive, i.e., the repeating unit cell contains more than one (here, two) sites marked in green and
yellow. They are called the two sublattices of the honeycomb lattice. The hexagonal unit cell is
the Wigner-Seitz cell. b) Symmorphic and nonsymmorphic one-dimensional lattice with unit cells
marked in blue. The nonsymmorphic symmetry of the latter is a glide mirror for which half of a
lattice translation is combined with a reflection about the one-dimensional axis of the chain. c)
Brillouin zone of the honeycomb lattice. High-symmetry points are marked. The Γ point is also left
invariant by the six-fold rotational symmetry, the M point has two-fold rotational symmetry and
the K point has three-fold rotational symmetry. Planes left invariant under the mirror reflection
about the dotted line in a) are marked by dotted lines.

symmetries like rotations, mirror reflections, and inversion.

Finally, in condensed matter Lorentz symmetry is broken by the existence of a crystal


lattice, that is fixed (or vibrates only in small perturbations – phonons and sound waves) and
therefore the most prominent example of a broken symmetry state. However, the Lorentz
group is broken down to a discrete subgroup rather than completely broken in a crystal.
This makes our life a lot easier and we will explore these remaining symmetry properties of
a crystal in the next section.

27
2. Symmetries of a crystal

A crystal is regular lattice of repeating structures (the unit cell) attached to all positions
Rn

Rn = n1 a1 + n2 a2 + n3 a3 , (82)

with linearly independent basis vectors ai , i = 1, 2, 3 and n ∈ Z3 .


The space group of a crystal R is the group of all symmetry operations that leave the
crystal invariant: translations, rotations, inversion, reflections, and combinations thereof.
[Recall that a group G is a set of elements a, b, c, · · · ∈ G with a multiplication ? such that
it is: closed a ? b = c ∈ G; associative a ? (b ? c) = (a ? b) ? c; ∃ identity e, such that ∀a,
a ? e = e ? a = a; ∀a ∃ an inverse a−1 such that a ? a−1 = a−1 ? a = e. If a ? b = b ? a
the group is Abelian, otherwise it is non-Abelian; space groups are often non-Abelian, as
for example rotations do not commute.] In 2D there are 17 space groups (the so-called
wallpaper groups), in 3D there are 230 space groups. These counts apply to systems with
time-reversal symmetry, in magnetic systems the numbers are larger.
The unit cell of a lattice is not fixed uniquely. The Wigner-Seitz cell is a specific choice
that consists of the region of space that is closer to a given lattice point than to any other.
[See Fig. 1 a)]
Using the so-called Wigner notation, we can write a general spatial symmetry transfor-
mation of a crystal as a combination of a translation and another spatial symmetry

r 0 = gr + a ≡ {g|a}r, (83)

where the elements g form the (smaller) point group P. It is tempting to think that all
allowed operations in a crystal are either translations {E|a} or rotations, mirror reflections,
inversions {g|0} in which case R = P × trn, i.e., the space group is the direct product
of the point group elements and translations. This holds for so-called symmorphic space
groups (in 3D these are 73 of the 230). The remaining nonsymmorphic space groups also
contain screw rotations and glide mirror symmetries {g|a}, where a is a fractional lattice
translation. [See Fig. 1 b)] A screw rotation is a rotation combined with a translation a
along the rotation axis. A glide mirror is a mirror reflection combined with a translation
within the mirror plane. The associative multiplication rule for symmetry transformations

28
Table I. List of all space groups in 3D time-reversal symmetric crystals.

crystal system angles equal sides space group numbers point groups

triclinic α, β, γ 0 1–2 C1 , C1̄


π π
monoclinic 2, 2, β 0 3–15 C2 , Cs , C2h
π π π
orthorhombic 2, 2, 2 0 16–74 D2 , C2v , D2h
π π π
tetragonal 2, 2, 2 2 75–142 C4 , S4 , C4h , D4 , C4v , D2d , D4h
π π
trigonal 2, 2, 4 π3 2 143–167 C3 , S6 , D3 , C3v , D3d
π π
hexagonal 2, 2, 4 π3 2 168–194 C6 , C3h , C6h , D6 , C6v , D3h , D6h
π π π
cubic 2, 2, 2 3 195–230 T , Th , O, Td , Oh

in the Wigner notation is


{g|a}{g 0 |a0 } = {gg 0 |ga0 + a}. (84)

The group of lattice translations {E|Rn } is an Abelian subgroup of R.


A generic point in space is mapped to some other point in space by any of the symmetry
operations of the space group. However, certain points, lines, or planes in the unit cell
are left invariant by a subgroup of the space group symmetries. For example, in a crystal
with inversion symmetry, the inversion center is left invariant under inversion. These high-
symmetry points are called Wyckoff-positions and are all tabulated. In real crystals, atoms
are often sitting on such higher-symmetry Wyckoff-positions.

3. Reciprocal space

Discrete translational symmetry of a system implies that eigenstates of the Hamiltonian


can be labelled by the eigenvalues of the translation operator, which are the momenta k.
We will now explore the reciprocal space, in which the k live.
If we are interested to evaluate a function f only on lattice points Rn , i.e., in discretized
space, two terms fk and fk0 in a Fourier expansion
X
f (Rn ) = fk eik·Rn (85)
k

0
are linearly dependent if ei(k−k )·Rn = 1 for all Rn in the lattice. This allows to define the

29
reciprocal lattice as the collection of points G in momentum space with

1 = eiG·Rn , ∀Rn . (86)

The reciprocal lattice is spanned by bi , i = 1, 2, 3 with

aj × ak
bi = 2π (87)
ai · (aj × ak )

for which ai · bj = 2πδij


Gm = m1 b1 + m2 b2 + m3 b3 . (88)

A function g that is expanded as


X
g(r) = gG ei G·r (89)
G

and evaluated at r in continuous real space will have the periodicity of the lattice. If we
have a chance to probe such a lattice-periodic function g(r) with a plane wave, say in a
scattering experiment, we are frequently compute
Z
F [g] = dreik·r g(r)
XZ
= dxeik·(x+R) g(x + R) (90)
R unit cell
X
= eik·R S(k),
R

where Z
S(k) = dxeik·x g(x) (91)
unit cell

is called the structure factor.


The Brillouin zone (BZ) is the unit cell of the reciprocal lattice. Certain points, lines,
or planes in the BZ are left invariant under some of the point group symmetry operations.
By definition (since k are the eigenvalues of a translation operator), translations leave each
point in reciprocal space invariant. Mirror reflections leave the planes perpendicular to the
mirror normal invariant. Inversion leaves the so-called TRIM points (time-reversal invariant
momenta) k = 0 (the Γ point), k = b1 /2, k = b2 /2, k = b3 /2, k = b1 /2 + b2 /2, k =
(b1 + b3 )/2, k = (b2 + b3 )/2, k = (b1 + b2 + b3 )/2 invariant, because it maps k to −k,
but k ≡ k + G. The same points are left invariant by the time-reversal transformation.

30
Rotations leave the points along some lines parallel to the rotation axis invariant. [See
Fig. 1 c) for the honeycomb lattice as a 2D example.]
The group of symmetry operations that leave a point, line or plane in momentum space
invariant is called its little group.

31
III. NONINTERACTING ELECTRON SYSTEMS

A. Bloch’s theorem

We explore the consequences of the Abelian subgroup of translations of the point group
of a crystal. A Hamiltonian H b that commutes with all translation operators Tb Rn , i.e,

[Hb , Tb Rn ] = 0 is considered. The translation operator is defined through its action on

single-particle operators in the position basis

Tb Rn φb † (r) Tb −1 b†
Rn = φ (r + Rn ), Tb Rn φb (r) Tb −1
Rn = φ (r + Rn ),
b Tb −1
Rn = T −Rn .
b

(92)
We are, however, going to formulate things in first quantized language now. Consider the
single particle Hamiltonian
pb 2
H =
b + V ( rb ), (93)
2m
with the periodic potential V (r + Rn ) = V (r) that could stem from a lattice of ions. Hb

and Tb Rn are all commuting and hence have a common eigenbasis. Bloch’s theorem says
that the eigenstates of Tb Rn have eigenvalues on the unit circle in the complex plane in order
not to blow up and to be extended. More precisely, the eigenstates of Tb a are plane waves
i

Tb ai |k, mi = eik·ai |k, mi, (94)

where m labels a possible degeneracy of the eigenspace.


A wave-function (notice the subtle distinction between wave function and state) of the
form
1 1
hr|k, mi ≡ ψm,k = √ eik·r um,k (r) ≡ √ eik·r hr|uk,m i (95)
Ω Ω
satisfies this property if
um,k (r + Rn ) = um,k (r) (96)

because then
ψm,k (r + Rn ) = eik·Rn ψm,k (r). (97)

We choose |k, mi to also diagonalize H


b

b |k, mi = n,k |k, mi,


H (98)

where m is the so-called band index, k is the pseudo-momentum, and um,k (r) the Bloch
wavefunction. Because eik·Rn = ei(k+Gm )·Rn , two momenta that differ by a reciprocal lattice

32
vector Gm are in the same eigenspace of the translation operator and we can restrict k to
the first BZ. Using

pb |k, mi = pb eik· rb |uk,m i = eik· rb ( pb + k)|uk,m i, (99)

we find the Bloch equation


( pb + k)2
 
+ V ( rb ) |uk,m i = m,k |uk,m i, (100)
2m
which has now to be solved for periodic states, i.e., TRn |uk,m i = |uk,m i only. (Notice that,
had we not set ~ = 1, the momentum shift would read ~k.) Taking the overlap with a
position eigenstate from the left, we obtain the alternate form in terms of wave functions
(−i∇r + k)2
 
+ V (r) uk,m (r) = m,k uk,m (r). (101)
2m
The fact that differential equations involving periodic functions have solutions that have the
same periodicity is a theorem named after Gaston Floquet (Floquet theory) in math. It was
rediscovered in the condensed matter context by Felix Bloch. In physics, Floquet’s name is
attached to the study of systems that are driven periodically in time.
We will now use two strategies to attack this problem, the nearly free electron approxi-
mation and the tight-binding approximation.

B. Nearly free electron approximation

In the nearly free electron approximation we consider electrons in a weak potential that
has the periodicity of the lattice
X
V (r) = VG eiGr . (102)
G

We want to solve the Bloch equation using perturbation theory in V (r). In configuration
space, the potential is a real function. If in addition V (r) = V (−r) (inversion symmetry),
then VG = VG∗ = V−G . Using the periodicity of the Bloch wave function, we can expand
X
uk,m (r) = cG e−iG·r (103)
G

to obtain
(k − G)2
  X
− m,k cG + VG0 −G cG0 = 0, (104)
2m G 0

33
which presents an eigenvalue problem for the matrix VG0 ,G = VG0 −G in infinite dimensions.
The zeroth order energy eigenvalues are given by the free electron parabolas

(0) (k − G)2
G,k = , (105)
2m

where we can use the reciprocal lattice vectors G as band labels. The first order result is
given by
(1) (0)
X |VG−G0 |2
G,k = G,k + (0) (0)
, (106)
G0 6=G G,k − G0 ,k

valid if the bands are nondegenerate. If bands are degenerate, as it happens for example at
the zone boundary, we have to resort to degenerate perturbation theory (as we will exemplify
below).
1D example: Consider a 1D system perturbed by the potential with general coefficients
VG  G2 /(2m), where G = nb and b the unit vector spanning the reciprocal lattice. Let us
first compute the corrections to the lowest band for small k, i.e., near the zone center k  b.
We obtain

(1) k2 X |VG |2
0,k = −
2m G6=0 [(k − G)2 − k 2 ]/(2m)
(107)
k2
≈ ? + E0 ,
2m

which we recast into a parabolic term with altered mass (the so-called effective mass) and
an overall energy shift
" #
1 1 X  2m|VG | 2 X 2m|VG |2
= 1−4 , E0 = − . (108)
m? m G6=0
G2
G6=0
G2

Next, let us consider what happens to the next-higher bands G = ±b at the zone center,
(0)
i.e., for k  b. This problem has to be tackled with degenerate perturbation theory, for b,k
(0)
and −b,k are degenerate at k = 0. Degenerate perturbation theory amounts to diagonalizing
the Bloch equation in the degenerate subspace first (note that we cannot use the reciprocal
lattice vectors anymore as a label for the eigenstates, the perturbation is now really mixing
them)
  
(k−b)2
− k V−2b cb
 2m    = 0. (109)
(k+b)2
V2b 2m
− k c−b

34
The characteristic equation of the above yields the eigenvalues
 s 
2 2
2
1 (k + b) + (k − b)
 (k + b) + (k − b)
2 2 
±,k = ± + 4|V2b |2
2 2m 2m 
(110)
2 2
k b
= ?
± |V2b | + ,
2m± 2m

with the effective mass


b2
 
1 1
= 1± , (111)
m? m m|V2b |
which due to |V2b |  b2 /(2m) have opposite sign. Thus the two bands obtain opposite
curvature. A band gap has opened. At k = 0, where the nontrivial part of the above matrix
is proportional to σ1 , the eigenstates are given by cb = ±c−b yielding the real-space Bloch
wave functions
uk=0,+ (x) = cos(bx), uk=0,− (x) = sin(bx), (112)

i.e., they are even and odd under the parity operation.
We can analyze the momenta at the zone boundary in a similar fashion using degenerate
perturbation theory, where we would likewise find the opening of a band gap. [See Fig. 2
a).]

C. Tight-binding approximation

We are now attempting to solve the Bloch equation in exactly the opposite limit, where
electrons are bound to a regular lattice of ionic potentials and the quantum mechanical
tunneling process of an electron between two atoms is seen as the perturbation. This so-
called tight-binding approximation is of much higher practical relevance than the nearly free
electron approximation.
Let us consider atoms positioned on sites Ri that could form a simple lattice or a lattice
with a basis. An isolated atom at R is described by a Hamiltonian
2
b atom,R = pb + Vatom ( rb − R)
H (113)
2m

which we can formally diagonalize

b atom,R |α, Ri = α |α, Ri,


H (114)

35
a) ✏ b)
M
3

2
3

2
1
1

band X 0 ✏k
ky 0
gap -1

-2
-1 -3

-2
band
gap -3
-3 -2 -1 0 1 2 3
k
2b b 0 b 2b kx
first BZ van Hove singularities

Figure 2. a) Band structure of a 1D system obtained within the nearly free electron approxima-
tion. b) Band structure of a tight-binding model on the square lattice with only nearest-neighbor
hopping. The blue dots mark the position of van Hove singularities where the density of states
diverges. Together with the black high-symmetry points, they from the set of independent time-
reversal invariant momenta in this BZ. The contours correspond to the different Fermi lines as the
band filling is changed.

noting that the eigenstates (the atomic orbitals) in the position basis hr|α, Ri are functions
of r − R only. We understand Vatom as the (rotationally symmetric) atomic potential and
subsume all quantum numbers (orbital/angular momentum and spin) in the index α.
A single-particle Hamiltonian of the whole lattice is then given by

2
b = pb +
X
H Vatom ( rb − Ri )
2m R i (115)
b atom,R + ∆Vatom,R ( rb )
=H j j

where we have defined


X
∆Vatom,Rj (r) := Vatom (r − Rj ) (116)
Ri 6=Rj

as the perturbation of the Rj atom potential due to the presence of all other atoms. Treating
this as a small term is precisely the spirit of the tight-binding approximation.
We are thus lead to computing the matrix elements of the Hamiltonian in the eigenstates

36
of two atoms j and j in order to solve the perturbed problem

hα, Rj | H
b |β, Rl i = α δα,β δR ,R + hα, Rj |∆ Vb atom,R |β, Rl i.
j l j
(117)

The last term describes the tunneling of an electron in orbital α of atom j to orbital β of
atom l. This is only likely to happen if Rj and Rl are close to one another.
In order to make progress, let us directly assume a form for these matrix elements. Let
us focus on atoms placed on a simple square lattice in 2D with lattice constant a, Nx × Ny
sites and periodic boundary conditions. We consider only the lowest s orbital of each atom.
If we also neglect the electron spin degree of freedom (implicitly assuming that everything
is trivially spin-degenerate), we can drop the index α on each atom. Now, set




−t if Rj and Rl are nearest neighbors

hRj |∆ Vb atom,Rj |Rl i = −t0 if Rj and Rl are next-nearest neighbors (118)




0 else.

We also drop the irrelevant global shift of all onsite energies.


This is a great occasion to introduce second quantized notation, even though it is abso-
lutely not needed for this problem. However, the hopping processes are intuitively under-
stood in the second quantized language. We rewrite the tight-binding problem as

X X
b tight−binding = −t
H c†Rj cRl − t0 c†Rj cRl , (119)
hRj ,Rl i hhRj ,Rl ii

where c†Rj (cRj ) is the creation (annihilation) operator for an electron on site Rj and the
summation subscripts hRj , Rl i (hhRj , Rl ii) are frequently used to indicate sums over nearest
(next-nearest) neighbors. Rewriting the second quantized operators using their Fourier
transform
1 X
c†Rj = p eik·Rj c†k , (120)
Nx Ny k

where the sum is taken over the first BZ k = 2π/a(n/Nx , m/Ny ), n = 0, · · · , Nx − 1,


m = 0, · · · , Ny − 1, we obtain

X
b tight−binding = −
H {2t[cos(kx a) + cos(ky a)] + 4t0 [cos(kx a) cos(ky a)]} c†k ck . (121)
k

37
We observe that the Hamiltonian is readily diagonal in momentum space so that we can
read off the energy band dispersion

k = − 2t[cos(kx a) + cos(ky a)] − 4t0 [cos(kx a) cos(ky a)]


(122)
0 k2
= − 4t − 4t − ?
+ O(k4 ),
2m
where the effective mass m? = (2t + 8t0 )−1 near k = 0. Expanding the Hamiltonian and
energy bands in small deviations p around a momentum point k is called a k · p approxi-
mation.

D. Wannier states

The Fourier transform of Bloch wave functions

1 X −ik·R
|R, ni = √ e |k, ni (123)
Ω k

is called a Wannier function and provides an alternative basis for the single particle Hilbert
space, where n is a band index. It has the properties

Wn (r − R) ≡ hr|R, ni, (124)

i.e., it is a function of (r − R) only and


Z Z
∗ 0 1 X i(k·R−k0 ·R0 ) ∗
drWn (r − R)Wm (r − R ) = e drψn,k 0 (r)ψm,k (r)
Ω k,k0
1 X i(k·R−k0 ·R0 ) (125)
= e δk,k0 δn,m
Ω k,k0

=δR,R0 δn,m .

It is important to stress the role of a gauge freedom here. In momentum space, we can
change the gauge of the Bloch states by multiplying with any momentum-dependent U(1)
phase factor ϕ(k)
|k, ni → eiϕ(k) |k, ni. (126)

This local transformation in momentum space has to leave all physical observables invari-
ant. However, following Eq. (123) it changes the Wannier functions shape in real space,
i.e., it changes the shape of the real space basis functions. In a multi-band system either a

38
U(1)× · · · ×U(1) gauge transformation or, for N degenerate bands, a U(N ) gauge transfor-
mation is allowed.
1D example: Consider a dimerized chain in 1D, where the hopping between neighboring
sites is alternating between t and t0 . This makes the unit cell to be 2 sites large. We call
these two sites (sublattices) A and B. The Hamiltonian is then given by
X  X † 
b = −t
H c †A,j b
b c †B,j b
c B,j + b c A,j − t0 c B,j b
b c †A,j+1 b
c A,j+1 + b c B,j . (127)
j j

Using the Fourier transform

1 X −ijk †
c †A/B,j = √
b e c A/B,k
b (128)
N k

we obtain the momentum space Hamiltonian


t X X −i(k−k0 )j  † †

H =−
b e c A,k b
b c B,k0 + b c B,k b
c A,k0
N j k,k0
t0 X X −i(k−k0 )j  ik0 † 
− e e b c A,k0 + e−ik b
c B,k b c †A,k b
c B,k0
N j k,k0 (129)
  
0 −ik
X †  0 t + t e c
c †B,k    A,k 
b
=− c A,k , b
b
0 ik
k t+te 0 c B,k
b

which has the spectrum


q
εk,± = ± t02 sin2 k + (t + t0 cos k)2 . (130)

For two cases we can write down the Bloch wave functions particularly easily
   
1 1 1 1
t0 = 0 : |k, −i = √   , t=0: |k, −i = √   . (131)
2 1 2 eik

For these two cases we can also express the Wannier states |J, −i := N −1/2 e−ikJ |k, −i
P
k

in a simple way: For t0 = 0, we have

1 1
hjA|J, −i = √ δj−J , hjB|J, −i = √ δj−J . (132)
2 2
For t = 0, we have

1 1
hjA|J, −i = √ δj−J , hjB|J, −i = √ δj−(J−1) . (133)
2 2

39
We observe that the Wannier functions are not centered at a site of the lattice (in general
they can be centered at different Wyckoff positions than the atoms).The difference between
the two cases is that the Wannier functions are centered in the middle of the unit cell and at
the boundary of the unit cell, respectively. Since the model has inversion symmetry, these
are the only places where they could be located. For generic values of t and t0 , the Wannier
functions would simply smear out over more lattice sites but keep their center.

E. Interlude: Group representations and character tables

The group of symmetry operations of a crystal constrains the electronic structure in


momentum space, as we shall see in the next section. To be prepared for this, we need to
quickly learn or recall some facts about the representation of groups.
A representation of a group over some vector space V is a collection of linear operators
on this vector space, which satisfy the same algebra as the group elements. As we are
only concerned with finite groups, we can always think of these operators as matrices. An
irreducible representation is a set of such matrix operators that is simultaneously block-
diagonalized and cannot be diagonalized into smaller blocks anymore.
To be concrete and explicit, we will illustrate the terminology that is introduced using
the example of the symmetry group of an equilateral triangle, C3v (the latter is called the
Schoenflies symbol for the group). The symmetry group consists of an identity element E,
clockwise and anti-clockwise rotations by ±2π/3, denoted by C3 and C32 , as well as reflections
σv , σv0 , and σv00 that exchange a pair of corners each. We now represent these operations on
V = R3 as
   
0 1 0 0 0 1
   
E = 113×3 , C3 = 0 0 1 , C32 = 1 0 0
   
   
1 0 0 0 1 0
      (134)
1 0 0 0 1 0 0 0 1
0 00
σv = 0 0 1 , σv = 1 0 0 , σv = 0 1 0 .
     
     
0 1 0 0 0 1 1 0 0
If the three components of the vector in V represent the corners of the triangle (A, B, C),
then we directly see that for instance C3 (A, B, C)T = (B, C, A)T implements the desired
rotation (cyclic permutation).

40
The above representation of the symmetry group is reducible. To see that, we perform
the similarity transformation
 √ √ 
1/
 √ 3 2/ 6 0
√ √ 
R = 1/ 3 −1/ 6 1/ 2  , (135)
 
 √ √ √ 
1/ 3 −1/ 6 −1/ 2

e = R−1 GR.
which brings any of G ∈ {E, C3 , C32 , σv , σv0 , σv00 } to a block-diagonal form G
Concretely
   
1 0 √0  1 0 0 

e3 = 0 − 1 3 2
e3 = 0 − 1 − 3 
E
e = 113×3 , C , C
  
2 2   √2 2 
 
 √
0 − 23 − 12 0 23 − 12
      (136)
1 0 0 1 0 0 1 0 0
   √   √ 
0 3 00
ev = 0 1 0  ,
σ σ 1
ev = 0 − 2 2  , ev = 0 − 2 − 23  .
σ 1
     
   √   √ 
0 0 −1 0 23 12 0 − 23 12

We have managed to split up the representation of the group into two irreducible rep-
resentations, one with dimension one and one with dimension two. Information about the
irreducible representations is summarized in character tables.

C3v E C3 C32 σv σv0 σv00


A1 1 1 1 1 1 1
E 2 −1 −1 0 0 0

The entries of the table (the characters) are the trace of the respective irreducible repre-
sentation. Group element representations of the same class have the same trace and are
thus usually summarized in a character table. We also note that we have not discovered one
more irreducible representations via our considerations, which corresponds to rotations in
three-dimensional space. The full character table of C3v thus reads as follows

C3v E 2C3 3σv basis functions


A1 1 1 1 z, x2 + y 2 , z 2
A2 1 1 −1 Rz
E 2 −1 0 (x, y), (Rx , Ry ), (x2 − y 2 , xy), (xz, yz)

41
The first column consists of the Mulliken symbols of the irreducible representations and we
have added the basis functions to lowest power in the coordinates in the last column. These
basis functions transform under the symmetry operations as the irreducible transformations
do. Note that the entry in the column E that corresponds to the identity element of the
group always gives the dimension of the irreducible representation. The number of classes
of group elements is always equal to the number of irreducible representations. There are
various further constraints on the entries of the character table. For example, the sum of the
squares of the dimensions of the irreducible representations equals the order of the group
(i.e., the number of group elements).
The characters also contain information about how the irreducible representations trans-
form under the symmetry operations. In groups that contain inversion, the subscript ‘u’
and ‘g’ with the Mulliken symbol stands for even and odd irreducible basis functions under
inversion.
Below we will need the character tables for the groups Oh and D4h which are given in
Fig. 3.

F. Symmetries in band structures

We have already explored the symmetries of crystal lattices and the implications for
functions defined on such a lattice. It is now imperative to ask how this carries over to
quantum-mechanical band structures and Bloch states.
Let S{g,a} be the representation of an element of the space group when acting on a wave
function
S{g,a} ψk (r) = ψk ({g, a}−1 r) = ψk (g −1 r − g −1 a) (137)

where the inverse appears because Sb {g,a} |ri = |gr + ai, but hr| Sb {g,a} = hg −1 r − g −1 a|. We
now show that the transformed wave function ψk (g −1 r − g −1 a) is also an eigenstate of the
translation operator

Ta0 S{g,a} ψk (r) =S{g,a} Tg−1 a0 ψk (r)


−1 a0 )
=S{g,a} e−ik·(g ψk (r) (138)
0
=e−i(gk)·a S{g,a} ψk (r),

but with an eigenvalue determined by gk rather than k. This suggests that ψk (g −1 r − g −1 a)

42
Figure 3. Character tables for the point groups Oh and for D4h , which is a subgroup of the former.

is (proportional to) ψgk (r). Indeed, because the Bloch wave functions have the periodicity
of the lattice S{g,a} uk (r) = ugk (r) we have

1 −1 −1
S{g,a} ψk (r) = √ eik·(g r−g a) ugk (r)
Ω (139)
−i(gk)·a
=e ψgk (r).

From this transformation property, we can infer, for example, the symmetries of the band
structure
b |ki = hk| Sb −1 H
k = hk| H b Sb {g,a} |ki = hgk| H
b |gki = gk , (140)
{g,a}

which defines a ‘star’ of k points whose band eigenvalues are related.


Notice that the above calculation assumed nondegenerate bands. If the bands are degen-
erate then Sb {g,a} may also act nontrivially on this internal degree of freedom. The action

43
that leaves k unchanged is the aforementioned little group symmetry of k. Assume that we
have a set of d degenerate states at some momentum

b |n, ki = k |n, ki,


H n = 1, · · · , d. (141)

For every element of the little group of k, Sb {g|a} we have a representation of the symmetry
X
Sb {g|a} |n, ki = |n0 , kiMn,n0 ;k , Mn,n0 ;k = hn0 , k| Sb {g|a} |n, ki. (142)
n0

If the representation is irreducible, its dimension is that of the Bloch states at the momentum
k.
Example: Consider a tight-binding model for p-orbitals on a cubic lattice spanned by
the basis vectors ax , ay , and az . On each lattice site there are three atomic orbitals with
the from

hr|x, Ri =(R − r)x f (|R − r|),

hr|y, Ri =(R − r)y f (|R − r|), (143)

hr|z, Ri =(R − r)z f (|R − r|),

which due to the symmetry of the simple cubic lattice remain degenerate in energy. For
nearest neighbors, the overlaps from Eq. (117) evaluate to

tδα,β

for γ = α σ-bonding
hα, Rj |∆ Vb atom,Rj |β, Rj + aγ i = (144)
−t̃δα,β

for γ 6= α π-bonding,

with α, β, γ ∈ {x, y, z}. For next-nearest neighbors, the overlaps from Eq. (117) evaluate to
(for γ 6= ρ)




 t0 δα,β for γ = α

hα, Rj |∆ Vb atom,Rj |β, Rj + aγ ± aρ i = −t̃0 δα,β for γ 6= α (145)


−(±)t̃˜0 δ δ


α,γ β,ρ

with α, β, γ ∈ {x, y, z}. Notice that some hoppings are zero by symmetry, for example the
hopping along the x direction from an x to a y orbital.
Translating the above into a second quantized Hamiltonian in momentum space yields

44
Figure 4. Generic band structure for s and p orbitals placed on the sites of cubic lattice, including
the irreducible representations at each high-symmetry point.

(we set the lattice constant to unity)


  
Ex (k) ˜ 0 ˜ 0
−4t̃ sin kx sin ky −4t̃ sin kx sin kz  cpx ,k 

(cpx ,k , c†py ,k , c†pz ,k ) −4t̃˜0 sin kx sin ky −4t̃˜0 sin ky sin kz  cpy ,k 
X †
Hbp= Ey (k)
  
k
  
˜ 0 ˜ 0
−4t̃ sin kx sin ky −4t̃ sin ky sin kz Ez (k) cpz ,k
(146)
with

Ex (k) = 2t cos kx −2t̃(cos ky +cos kz )+4t0 cos kx (cos ky +cos kz )−4t̃0 cos ky cos kz , (147)

and the analogous expressions for Ey (k) and Ez (k). We have neglected an overall on-site
energy shift which is equal for all p-orbitals. The resulting band structure is shown in Fig. 4.
We can now analyze the degeneracies of this band structure at high-symmetry points.
At the Γ point, k = 0, we have the full crystal symmetry of the Oh group, the Bloch matrix
simplifies to
(2t + 8t0 − 4t̃ − 4t̃0 )113×3 , (148)

and accordingly all three bands are degenerate belonging to the representation Γ−
15 of Oh .

At the X point, k = (π, 0, 0), the little group is D4h (a tetragonal symmetry) and the Bloch

45
matrix is given by
diag(−2t − 8t0 − 4t̃ − 4t̃0 , 2t + 4t̃0 , 2t + 4t̃0 ), (149)

which is separated into a two-fold degenerate irreducible representation [X5− , corresponding


to basis functions (x, y)] and a singly degenerate one [X2− , corresponding to basis function
z]. In fact, the degeneracy between two bands persists for any momentum k = (kx , 0, 0)
that lies on the line connecting the Γ and the X point. The Bloch matrix is given by

diag 2(t + 4t0 ) cos kx − 4(t̃ + t̃0 ),


2[t + 2t̃0 − t̃ + (2t0 − t̃ − 2t̃0 ) cos kx ], (150)

2[t + 2t̃0 − t̃ + (2t0 − t̃ − 2t̃0 ) cos kx ] ,




Such a degeneracy between two bands is called a nodal line. Depending on the space group
and irreducible representation, band structures can also posses nodal points, i.e., point-like
degeneracies, or (less common) nodal surfaces, i.e., degeneracies on a surface in momentum
space. Since these degeneracies are related to the irreducible representations of bands, they
are not an artifact of the type of tight-binding model that we have chosen, and could, e.g., not
be lifted by longer-range hoppings or other perturbations compatible with the point-group
symmetries.

G. Obtaining information about band structures and symmetries

Several big repositories and tables have recently emerged on the web which help to find
materials with a desired electronic structure or to help understand the electronic structure of
a given material. Here, we mention three repositories with different focus and functionality

• https://materialsproject.org: The Materials Project is a database with gives


rough information about the electronic structure of many materials. It is not the most
complete database, but it allows to take a quick look even at the band structure and
diffraction spectra. Exercise: Try to find the crystal and band structure of diamond
and (rock)salt. A list of the space groups is for example here: http://img.chem.ucl.
ac.uk/sgp/large/sgp.htm.

• https://icsd.fiz-karlsruhe.de/search/: The ICSD is a more complete and accu-


rate database of previously synthesized materials, but it requires institutional access.

46
One can search for a material and then find all relevant references in which the struc-
ture of the compound was experimentally determined. One can then download a
structure file (*.cif) which can be handled by programs computing band structures or
by the free VESTA software (http://jp-minerals.org/vesta/en/) to visualize the
structure.

• http://www.cryst.ehu.es/#retrievaltop: The Bilbao Crystallographic Server


contains very comprehensive information about the symmetry representations in band
structures. It does not contain any database related to materials.

Exercise: We want to understand the band structure of tungsten carbide (WC), a


material traditionally used for hardened tools and jewelry. Determine (maybe with the help
of google and Wikipedia) which space group WC is in and find the correct entry in the
Materials Project. Use the Bilbao Crystallographic Server to determine the labels of high-
symmetry points in the Brillouin zone for this space group. Along the Γ-A line two bands
cross in an X-like fashion above the Fermi energy. Why is this not an avoided crossing?
Use the Bilbao Crystallographic Server to find possible irreducible representations of the
bands along this line in momentum space and determine to which of them the two bands in
question belong. Picture the three-dimensional band structure around the degeneracy point
of the bands.

H. Filling of bands and density of states

So far we have only considered a single particle picture of electronic bands. In a many-
body Hilbert space, fermionic states can all be at most singly occupied. In the ground
state at zero temperature, all states below the Fermi level in a solid are singly occupied,
all states above are empty. The occupied states form the Fermi sea. If the Fermi level
is located such that it cuts though a band (or more generally, there are excited states ate
arbitrarily low energy available) one speaks of a metal. For a noninteracting system, the
locations in momentum space of these single-particle excitations with arbitrary low energy
and momentum transfer, form the Fermi surface. Materials with a finite excitation energy
above the ground state are called semiconductors (for small excitation energies) or insulators
(for large excitation energies). At finite temperature, the single-particle levels dictated by

47
the band structure are occupied according to the Fermi-Dirac distribution.
When counting the number of states available, it is always important to pay attention
whether spin-degeneracies have to be taken into account or not. If all bands are spin degen-
erate (as is the case in an electronic model which neglects spin-orbit and Zeeman couplings),
each state can be occupied by a spin-up and a spin-down electron.
To better understand the relation between the band dispersion and the number of occu-
pied states, we return to the example of the simple square lattice and restrict it to nearest
neighbor hopping only. The dispersion is k = −2t cos kx − 2t cos ky . We want to compute
the density of states, i.e., dN/d, where N is the number of occupied states. One interesting
point is the band bottom and the zero energy around which k ≈ k2 /(2m? ). We have in 2D

dN = 2πkdk and k = 2m?  such that
r
dN dN dk m?
= = 2πk = 2πm? , (151)
d dk d 2

i.e., it is a constant. The same calculation yields in 1D dN/d ∝ 1/  and in 3D dN/d ∝

. The second interesting point to consider in 2D is near k = (π, 0) and symmetry
equivalent. The effective band dispersion is k ≈ (ky2 − kx2 )/(2m? ), i.e., it is a saddle point.
The density of states is given by
Z
dN
∝ dkx dky δ[ − (ky2 − kx2 )/(2m? )]
d
Z
1
∝ dkx p
2m?  + kx2 (152)
h p i Λ
∝ log kx + kx + 2m 
2 ?
0

∝ (regular part) − log ,

i.e., we find a log divergence as the energy approaches the saddle point in 2D. This is called
a van Hove singularity. [See Fig. 2 b).]

I. Coupling to an external electro-magnetic field

Electrons couple to a magnetic field in two ways which are considered distinct in the
realm of condensed matter physics (of course they are unified on the level of relativistic
quantum mechanics): For one due to their orbital motion to the electromagnetic gauge field
and on the other hand due to the Zeeman effect to the electronic spin. The former can be

48
accounted for by minimal coupling, i.e., the replacement

b ( pb , rb ) → H
H b ( pb − eA( rb ), rb ). (153)

The latter is an additive term

H b ( pb , rb ) + µB g ? S
b ( pb , rb ) → H b · B, (154)

b is the spin operator and g ? is the effective Landé g factor, which is g = 2 for the
where S
free electron but can deviate substantially in the electronic environment of crystals, where
values of g ? = 20 are easily possible. Depending on the physics one is interested in, often
only one of the two effects is considered at a time. Typically, the orbital coupling has the
more dramatic effect on the electronic structure and we will explore its consequences in the
following sections.

1. Semiclassical equations of motion

A minimal way to include quantum-mechanical phenomena in the description of electrons


in solids is to consider the dynamics of wave packets which are extended both in position
and momentum space. We write a wave function of the form
0
X
ψk (r, t) = gk (k0 )eik ·r−ik0 t , (155)
k0

where gk (k0 ) is a function centered around k with a characteristic width ∆k much smaller
than the size of the BZ. In turn, by the uncertainty principle the wave packet ψk (r, t) is
spread over many unit cells in position space. Assuming that the system is perturbed weakly
by electric and magnetic fields, E and B, such that no interband transitions are induced,
one can derive the following equations of motion
∂n,k
ṙ ≡ vn (k) = − k̇ × Ω(k)
∂k (156)
k̇ = −eE(r, t) − eṙ × B(r, t).
Notice that k is the pseudo momentum of the electron (i.e., related to the expectation value
of the translation operator), not the expectation value of the momentum operator. The
quantity Ωn (k) is a purely quantum-mechanical contribution known as the Berry curvature.
From the Bloch eigenstates of the bands, it is defined as

Ωn,α (k) = −α,β,γ i h∂kβ un,k |∂kγ un,k i. (157)

49
From the form of Eq. (156) it is suggestive to interpret the Berry curvature as a ’magnetic
field in momentum space’, that has physical consequences for the electron motion in position
space.
Let us consider a 1D system with dispersion k = −2 cos k in a static electric field E.
Equation (156) reads for this case

ẋ = 2 sin k, k̇ = −eE, ⇒ ẋ = −2 sin (eEt) (158)

thus, the position of the wave packet oscillates according to


2
x(t) = cos (eEt) , (159)
eE
which are called Bloch oscillations. They can be observed in very clean systems.
Now, consider a system which for simplicity is taken to have a quadratic band disper-
sion (such that ∂n,k ∂k = k/m) with applied homogeneous electric field E. We study
two situations (1) with an applied external homogeneous magnetic field B and (2) with a
nonvanishing intrinsic Berry curvature Ω.
For (1), the equations of motion reduce to

mv̇ = −e(E + v × B). (160)

We are interested in steady state solutions v̇ = 0, i.e.,

v × B = −E. (161)

The electric current is j = −en0 v, where n0 is the carrier density. Comparing the above to
the definition of the conductivity jα = σα,β Eβ , we find that the system has a non-vanishing
Hall response. Taking B = (0, 0, Bz ) and E = (Ex , 0, 0), we have

σxy = jy /Ex = n0 e/Bz = νe2 /h, (162)

where ν = n0 hc/(Be) is called the filling fraction and we have reinstated the appropriate
factors of c and h to obtain the well known form of the Hall conductivity. Measuring the Hall
response can thus determine both the carrier density and the electron or hole like nature of
the carriers.
For (2) we find
k
v= + eE × Ω, (163)
m

50
i.e., the Berry connection induces a Hall response similar to an external magnetic field. For
a 2D system in the x-y plane with only one (partially) filled band the area of the Fermi sea
SFS is the density of carriers n0 . Taking the latter to be (2π)2 , we find

e2 d2 k
Z 
j= SFS Ω(k) × E, (164)
h/(2π) FS (2π)2

where the integral is taken over the Fermi sea (FS). Thus, the Hall conductivity for such a
system is given by
e2
Z
SFS
σxy = 2π d2 kΩz (k). (165)
h (2π)2 FS

Notice that if the band is completely filled, i.e., the system is an insulator SFS = (2π)2 and
R
the Hall response can still be nonzero. In fact, it is then given by e2 /h 2π d2 kΩz (k), which
for deep mathematical reasons is always an integer called the Chern number.

2. The integer quantum Hall effect

Quantum Hall effects are one of the driving thrusts of condensed matter research of the
past decades. Here, we will first consider the integer quantum Hall effect (IQHE) which can
be understood without considering electron-electron interactions. Later, we will mention
the fractional quantum Hall effect (FQHE), which needs interactions to exist.
The possibility of an IQHE was first noted in a theoretical paper by Ando et al. in 1975,
but the authors doubted that their result connects to the realm of experiments. The Nobel-
prize winning experimental discovery of the IQHE by von Klitzing et al. thus came as a
big surprise. In the experimental setup, electrons of charge e < 0 and effective mass m are
confined to a 2D plane in a high-quality GaAs heterojunction and a large magnetic field B is
applied perpendicular to that plane. In this situation, the electronic density of states consists
of discrete peaks, the so-called Landau levels. These Landau level bands are broadened
by disorder and equally spaced in energy with spacing given by the cyclotron frequency
ωc := eB/m. If the broadening is small enough such that there exist insulating regions
between the peaks, the transverse conductivity σH is quantized in units of the resistance
quantum e2 /h whenever the chemical potential lies within such an insulating region. One
year after the experimental discovery, Laughlin gave a theoretical explanation for the high-
precision quantization of the conductivity. However, his description did not account for the

51
topological nature of the IQHE. The deep connection to topological quantum numbers was
made one year later by Thouless, Kohmoto, Nightingale, and den Nijs (TKKN).
The completion of the theoretical understanding of the IQHE coincided with the next
groundbreaking and Nobel-prize winning experimental discovery made by Störmer, Tsui and
coworkers in 1982. They observed that plateaus in the Hall resistance develop at fractional
fillings ν of the Landau level, accompanied by a dip or vanishing of the longitudinal resis-
tance. This FQHE turned out to have a much richer phenomenology than the integer effect
with more and more examples of stable filling fractions discovered as the sample quality
improved. As with the IQHE, it was Laughlin who gave a theoretical explanation for the
occurrence of the principal filling fractions ν = 1/n, n ∈ Z, based on the specific form
of the quantum state of the two-dimensional electron gas. However, he did not make the
connection to the topological character of the state which was later established by Wen and
Niu.
We consider a 2D electron gas in the x-y plane in a magnetic field B = (0, 0, −B) that is
orbitally coupled to the motion of the electrons. Such a situation can occur in heterostructure
interfaces. We have

b = 1 ( pb x − eAx ( rb ))2 + ( pb y − eAy ( rb ))2 ,


 
H (166)
2m

where m is not necessarily the bare electron mass, but an effective mass. We now introduce
the operators

` `
a † = √ [( pb x − eAx ( rb )) + i( pb y − eAy ( rb ))] ,
b a = √ [( pb x − eAx ( rb )) − i( pb y − eAy ( rb ))] ,
b
2 2
(167)
where `2 = (eB)−1 is the magnetic length. Their algebra

i`2 e i`2 e
ab
b a† − b
a†b
a =− ([ pb x , Ay ( rb )] − [ pb y , Ax ( rb )]) + ([ pb y , Ax ( rb )] − [ pb x , Ay ( rb )])
2 2
`2 e `2 e
= (∂y Ax (r) − ∂x Ay (r)) − (∂x Ay (r) − ∂y Ax (r))
2 2
= `2 eB

=1
(168)

allows them to be interpreted as bosonic lowering and raising operators. We can define a

52
second set of operators

bb = √1
n o
2 2
rb x − ` ( pb y − e A y ) + i[ rb y + ` ( pb x − e A x )] ,
b b
2`
(169)
bb † = √1
n o
rb x − `2 ( pb y − e A
b y ) − i[ rb y + `2 ( pb x − e A
b x )] ,
2`
which realize an independent bosonic algebra for

[b
a , bb ] = 0, a , bb † ] = 0,
[b [ bb , bb † ] = 1. (170)

The Hamiltonian rewritten in terms of these operators reads


 
† 1
H = ωc b
b a ba+ . (171)
2

where again ωc = eB/m is the cyclotron frequency. The energy spectrum is that of a
harmonic oscillator, En = ωc (n + 1/2), n = 0, 1, 2, · · · . Each of these energy levels is called
Landau level. However, due to the commuting operator b, every level is highly degenerate, as
its single-particle eigenstates can be written |n, mi = (n!m!)−1/2 b
a † n bb † m |0i. To understand
this better, consider the eigenstates of the lowest Landau level in a symmetric gauge A =
(−Bry , Brx , 0)/2

1  z m 2 2
hr|0, mi = √ e−|z| /(4`) , m = 0, 1, 2, · · · , (172)
2π2 m!` `
m

where z = rx − iry . We have h0, m| rb 2 |0, mi = 2(m + 1)`2 . The amplitude of the wave

function hr|0, mi is peaked at a radius rm = 2m`. This means that the area taken by
a “Hall droplet” with all states m < m0 filled, is proportional to m0 and the inverse field
strength. Thus the Landau level degeneracy m0 is given by the area S of the system divided
by the area taken by one magnetic flux quantum m0 ∼ S/(2π`2 ). (The magnetic flux
quantum is Φ0 = hc/e.)
To study the transport response, it is easier to use the Landau gauge A = (0, Brx , 0).
Then, ky is a good quantum number (instead of the angular momentum in the case of the
symmetric gauge) and
1 −[rx −x0 (ky )]2 /(2`2 ) iky ry
ψ0,ky = √ e e , (173)
2π`
with x0 (ky ) = `2 ky , are the lowest Landau level wave functions. The energy is independent
of ky . We now apply an electric field Ex in x-direction, which enters the Hamiltonian as the
additive term −eEx rb x . Clearly, this term can be absorbed by shifting the origin of the x

53
a) b)
En En

5!c /2 ry
n=2
3!c /2 L
n=1
Er
!c /2 I'
n=0
ky ky rx

kkB
c) N (E) d)
En,kz 10 Cn 2

8 Cn 1

6 Cn
4

2 Cn
kz E/!c
Cn 1
0 1 2 3 4 5

Figure 5. a) Landau level spectrum of a 2D electron gas without and with a linear potential in
the x direction. The dispersion of the eigenstates in presence of the potential can be interpreted
as an edge state propagating unidirectional at the sample boundary. b) Corbino disk geometry
for Laughlin’s flux insertion argument. c) Landau level band structure of a 3D electron gas and
its density of states without (blue) and with magnetic field (orange). d) 3D Fermi sphere with
extremal circle (dotted) and allowed semiclassical trajectories (blue lines) that are enclosed by
wave packets in presence of a magnetic field.

axis, which amounts to the replacement x0 (ky ) → x00 (ky ) := `2 ky +eEx /(mωc2 ). However, the
Landau level degeneracy is now lifted by the extra term such that the energies corresponding
0
to the eigenstates ψ0,k y
read [see Fig. 5 a)]
2
m2

ωc Ex
E0 (ky ) = − eEx x00 (ky ) + , (174)
2 2 B

which allows us to directly read off the velocity

dE0 (ky )
vy (ky ) = = −Ex /B. (175)
dky

There follows the Hall conductivity as

e2
σxy = ν , ν = n0 2π`2 . (176)
h

54
a. Hall plateaus and disorder Von Klitzing’s experimental observation was that the
longitudinal conductivity σyy vanished for certain ranges of the magnetic field, which pre-
cisely corresponded to integer fillings ν ∈ N. At the same time, the Hall conductivity is
quantized very precisely to the integer multiple of e2 /h that corresponds to the number of
filled Landau levels. What was observed, however, is that the quantization forms stable
plateaus over a full range of magnetic fields, not only for the exact field value that corre-
sponds to a specific filling. This behavior can only be explained by considering the role of
disorder in the system. For simplicity, we return to the symmetric gauge and add a rotation-
ally symmetric disorder potential to the system of the form V (r) = Cr 2 . Its effect can be
absorbed in the wave function (172) by replacing ` with `0 = `(1 + 2mC)−1/4 . The Landau
level degeneracy is now lifted to
  2
`2
 
ωc `
E0,m = m 02 − 1 + 02 , (177)
2 ` `

which for m  1 and a weak potential 2mC  1 can be recast as

ωc 2
E0,m ≈ + Crm , (178)
2

where rm = 2m`0 is the wave function localization radius. The locking between the wave
function localization and the equipotential lines means that the quasiclassical trajectories
trace out equipotential lines. A weak electric field cannot induce a motion of these wave
packets in the orthogonal direction to the concentric rings. This picture can be elevated to
a completely disordered sample as follows: At a specific chemical potential, equipotential
lines that form a closed loop enclose states that do not contribute to bulk longitudinal
transport. Only at specific values of the chemical potential, where the equipotential lines
connect to form a delocalized channel that spans over the entire sample, connecting two leads,
bulk transport can happen. Thus, when slightly doping away from the chemical potential
corresponding to a filled Landau level, only small disconnected islands of equipotential lines
appear and no transport is possible. The sample remains insulating and we observe a plateau
in the Hall conductivity.
b. Edge states We now return to Eq. (174), namely the spectrum with an applied
electric field in the x direction. This mathematical result allows for a second very different
physical interpretation. For that, we reinterpret the source for the additional term in the
Hamiltonian as being a (linear) confining potential at the sample boundary [see Fig. 5 a)].

55
We are then lead to the conclusion that the sample boundary hosts a set of states ψ0,ky with
linear and unidirectional dispersion E0,ky ∝ ky that propagate along the boundary. The fact
that these modes can only move in one direction is very specific to the quantum Hall effect
and cannot be realized in a system confined to strictly one dimension (a 1D systems hosts
always as many right-moving modes as left-moving ones). While the velocity of these modes
is not universal, the number of them (each filled Landau level contributed one branch of
boundary modes) is in equal to the integer that determines the bulk Hall conductivity.
c. Laughlin’s argument Laughlin gave an argument as to why the Hall conductivity of
an insulator has to be quantized. The 2D insulator that is to be probed has the shape of a
so-called Corbino disk, that is, a disk with a hole [see Fig. 5 b)]. The magnetic flux Φ trough
the center of the disk is varied in time. This flux does not change the field distribution in
the disk and can therefore only be seen by states that extend fully around the disk. A
change in the flux by δΦ can be accounted for by a change in the ϕ component of the
vector potential Aϕ → Aϕ + δΦ/(2πr). The symmetric gauge wave function (172) changes
by ψ0,m → ψ0,m eiϕ δΦ/Φ0 . This change can be absorbed by m → m − δΦ/Φ0 . Hence, when
increasing Φ by one flux quantum, the index m has to be decreased by 1. This implies that
the wave functions are all continuously shifted in their localization radius during the flux
insertion. As a result, one electron is pumped from the outer to the inner edge of the system
after one flux quantum insertion.
When an electric field Er is applied radially, the charge transfer results in a potential
energy change −eEr L, where L is the distance between the two edges of the disk, which we
assume to be much smaller than the inner radius. A second energy change Iϕ δΦ results from
the current Iϕ that is caused by the difference in angular momentum between the electronic
states that were filled and emptied, respectively. After insertion of a full flux quantum, the
energy of the system should remain unchanged, and hence these two energetic contributions
must add up to zero. This leads to the relation eEr L = Iϕ Φ0 . Returning to cartesian
coordinates and reinstating all constants we obtain the Hall conductivtiy

jy Iy e2
σxy = = = . (179)
Ex LEx h

This argument is topological in nature and thus valid independently for any Landau level
and also for disordered systems.

56
3. Quantum oscillations

In studying the quantum Hall effect, we considered for simplicity a dispersion p2 /(2m) and
introduced the magnetic field by minimal substitution. For other effective 2D dispersions, a
similar Landau level structure emerges, where the energies of the degenerate Landau levels
depend on the dispersion. For example, a linear dispersion as found around the K point in

graphene leads to Landau level energies ∝ n.
Now, we want to extend the problem to a translationally invariant 3D systems in a homo-
geneous magnetic field, which we choose without loss of generality B = (0, 0, B). Working
again the Landau gauge A = (0, Brx , 0), the system retains its translational invariance along
the z direction even in presence of the magnetic field. It can thus be seen as a collection of
2D Landau level problems parametrized by the good quantum number kz . The Hamiltonian
2
b = [ k − eA( rb )] ,
b
H (180)
2m
has eigen wave functions
1 2 )2 /(2`2 )
ψn,ky ,kz (r) = √ eiky ry eikz rz Hn [(rx − ky `2 )/`]e−(rx −ky ` (181)
2n n! 2π `2
with energies
kz2
 
1
En,kz = + ωc n + . (182)
2m 2
where n is the Landau level index [see Fig. 5 c)]. The spectrum thus takes a quasi one-
dimensional form of dispersing Landau levels – a general feature of 3D systems in a magnetic
field. We have to keep in mind that the Landau levels are still highly degenerate due to the
ky quantum number (this not being kx simply reflects an arbitrariness in the gauge choice).
Let us compute the density of states of a system of size L3
dN 1 X L2 X
(E) = 3 δ(E − En,kz )
dE L n 2π`2 k
z

kz2
Z   
1 X dkz 1
= δ E− − ωc n + (183)
2π`2 n 2π 2m 2
(2m)3/2 ωc X 1
= 2
p .
8π n E − ωc (n + 1/2)
Comparing this to the 3D system without magnetic field,
dN 1 X √
(E) = 3 δ[E − k2 /(2m)] = (2m)3/2 E/(2π 2 ), (184)
dE L k

57
we observe that the magnetic field enforces a dramatic redistribution of states in the spec-
trum [see Fig. 5 c)]. The density of states develops a series of singularities located at the
bottom of each dispersing Landau level. This has profound implications for the magnetic
field dependence of all quantities that are sensitive to the density of states. If the Fermi
level is held fixed and B is increased, the bottom of Landau levels will pass the Fermi level
successively, each time providing a singular density of states. This leads to magnetic field
dependent oscillations of observables. Such oscillations measured in the magnetization of
the system go under the name de-Haas-van-Alphen effect. Oscillations in the conductivity
are called Shubnikov-de-Haas effect.
We return to semiclassics to get some analytical insight, neglecting both Berry and electric
fields
∂n,k ∂n,k
ṙ = k̇ = −eṙ × B ⇒ k̇ = −e × B. (185)
∂k ∂k
Since k̇ is perpendicular to ∂k n,k , the wave packet will stay on the Fermi surface and
encircle a closed path in a plane perpendicular to the applied magnetic field. Semiclassical
Bohr-Sommerfeld quantization yields the condition
I
p · dr = 2π(n + γ), (186)
Cn

on allowed paths Cn for the wave function to be single valued. Here γ is an extra phase that
could be accumulated along the path (for example due to the here neglected Berry curvature
contributions). Replacing the momentum

p = k − eA = −e(r × B + A), (187)

we obtain
I I I
p · dr = − e A · dr + eB r × dr
Cn Cn Cn

= − eΦn + 2eΦn (188)

=2π(n + γ),
where Φn = B⊥ Sn is the magnetic flux through the area Sn in configuration space that is
enclosed by the path Cn . Thus,
Φn = (n + γ)Φ0 . (189)

The path Cn also encloses an area An in momentum space, which due to

dk = eB × dr ⇒ |dk| = eB⊥ |dr| (190)

58
is related as An = (eB⊥ )2 Sn , where B⊥ is the magnetic field component perpendicular to
the path in momentum space [see Fig. 5 d)]. Thus

B⊥
An = e2 B⊥ Φn = 2πeB⊥ (n + γ) = 4π 2 (n + γ). (191)
Φ0

Now, we change the mindset and keep the area A fixed, while changing the magnetic field.
Let combinations (n, Bn ) and (n + 1, Bn+1 ) correspond to paths enclosing the same area.
Subtracting the two equations

1 4π 2 1 4π 2
= (n + γ), = (n + γ + 1) (192)
Bn AΦ0 Bn+1 AΦ0

yields
1 1 4π 2
− = . (193)
Bn+1 Bn AΦ0
This means that the frequency of oscillations as a function of 1/B directly yields the area
in momentum space of the enclosed path. Since the largest (diverging) density of states
results from the bottom of Landau levels passing through the Fermi energy, which appear at
the extremal areas of the Fermi surface (minima and maxima in diameter perpendicular to
the applied field direction), the dominant contribution to the above frequency measures the
extremal area of the Fermi surface. Rotating the external field allows for a fairly complete
map of the Fermi surface shape, if it is of sufficiently simple form.

J. Phonons

1. Quantizing lattice vibrations

So far, we have considered the electrons as dynamical degrees of freedom that are moving
in a rigid background of the ionic lattice potential. However, these ion positions are not
necessarily as rigid and their motion can back-react on the electronic properties of the crystal.
The study of the dynamics of ionic positions around their equilibrium position (where the
displacements are small enough so that the crystalline order is not destroyed) is the physics
of phonons, i.e., coherent lattice vibrations.
Classically, and in a long-wavelength limit (comparing the wavelength of displacements
with the lattice spacing) we can consider a displacement field of a continuous medium
u(x) = x0 (x) − x, where x is the absolute measure of space while x0 (x) is the new position of

59
an infinitesimal part of the medium as a result of the displacement. For simplicity, are only
going to be concerned with the case of a 1D system here and only consider displacements
along this 1D directions (which will give rise to so-called acoustic phonons). Displacements
perpendicular to the direction of propagation, which are not captured by our continuous
medium approximation, are called optical phonons.
The classical displacement field u(x, t) has both elastic (or potential) and kinetic energy
Z Z
λ 2 ρ0
Eel = dx [∂x u(x, t)] , Ekin = dx [∂t u(x, t)]2 , (194)
2 2

where ρ0 is the mass density of the moving ions (i.e., the mass of an ion times their in-
verse distance). We now want to quantize this theory. To that end, we first expand the
displacement field in Fourier modes (such that the displacement is real)

1 X
qk (t)eikx + qk∗ (t)e−ikx

u(x, t) = √ (195)
Ω k

and re-express the Lagrangian L = Ekin − Eel using qk (t). Taking the derivative with respect
to qk∗ (t) yields the equation of motion

∂t2 qk (t) + ωk2 qk (t) = 0, qk (t) = qk,± e±iωk t (196)


p
where ωk = cs |k| is the acoustic phonon dispersion and cs = λ/ρ0 is the speed of sound.
We us the equation of motion to substitute the time derivative in the expression of the
kinetic energy to obtain
X
E = Eel + Ekin = ρ0 ωk2 [qk (t)qk∗ (t) + qk∗ (t)qk (t)] . (197)
k

To obtain a Hamiltonian description, we deduce the canonically conjugate variables

√ √
Qk ≡ ρ0 (qk + qk∗ ), Pk ≡ ∂t Qk = −iωk ρ0 (qk − qk∗ ) (198)

in terms of which
1 X 2
Pk + ωk2 Q2k .

E= (199)
2 k
Canonical quantization now amounts to elevating these quantities to (hermitian) operators
and imposing the commutation relation

[Q
bk , Pbk0 ] = iδk,k0 . (200)

60
The problem is essentially a quantum Harmonic oscillator, and, as we just saw in the Landau
level eigenfunctions, it is convenient to rewrite the operators in a pair of bosonic creation
and annihilation operators

bbk = √ 1 bb† = √ 1
   
ωk Q
bk + iPbk ,
k ω Q
k k
b − i P
b k (201)
2ωk 2ωk

which satisfy
h i h i h i
bbk , bb† 0 = δk,k0 , bbk , bbk0 = 0, bb† , bb† 0 = 0. (202)
k k k

We thus obtain that the elementary excitations of the crystal, the phonons, are bosons
created by bb† with a dispersion ωk . They are governed by the Hamiltonian
k
 
X †b 1
H
b= ωk bk bk +
b . (203)
k
2

The displacement of ions is now endowed with the uncertainty principle, reflected by the
fact that their displacement becomes an operator

1 X 1 
i kx † −i kx

b(x) = √
u √ b k e + bk e
b b . (204)
Ω k 2ρ0 ωk

2. Peierls instability in 1D

Phonons couple to electrons by the deformation of the ionic background potential. This
coupling can have various effects on the electronic structure and on the lattice structure.
The electron-phonon coupling renormalizes various quantities such as the speed of sound in
the material. Here, we want to exemplify possible implications of electron-phonon coupling
by a very dramatic effect that appears preferably in (quasi-)1D systems, the so-called Peierls
instability. It consists of a static deformation of the crystal that goes along with a opening
of a gap in the electronic spectrum. The lattice distortion costs energy, but the opening of a
gap in the electronic spectrum yields energy so that the combination of the two effects can
be advantageous. A 1D electronic system with a simple k 2 /(2m) dispersion has two Fermi
points at k = ±kF . To open a gap, one needs to couple these two points, i.e., one needs to
provide a momentum difference of Q ≡ 2kF to scatter the electrons. The idea is that this
momentum is provided by the static lattice deformation, which thus needs to have the same
wave vector.

61
With our understanding that the phonons are bosons, a static lattice deformation requires
a state of the lattice with macroscopic occupation of the phonon state with momentum Q.
This is realized by a coherent state
∞ b† n
−|α|2 /2
X (bQ )
|α, Qi = e αn |0i (205)
n=0
n!

which has the properties


hα, Q|bb†QbbQ |α, Qi = |α|2 (206)

and
1
αeiQx + α∗ e−iQx .

hα, Q|b
u(x)|α, Qi = (207)
2Lρ0 ωQ
The state can be thought of as a Bose condensate of momentum Q phonons. We will now
determine whether such a lattice deformation can be energetically favorable. Since the mode
occupation is macroscopic, it suffices to treat the distortion classically and assume

u(x) = u0 cos Qx, (208)

with an amplitude u0 that is to be determined.


The Hamiltonian describing phonons, electrons (which we take spinless for simplicity),
and their coupling reads in real space
X k2 † λ
Z
2
Z
H=
b ck b
b ck + dx [∂x u(x)] − n0 dxdx0 V (x − x0 )[∂x u(x)]φb† (x0 )φ(x
b 0 ), (209)
k
2m 2

where V is the Coulomb interaction. In momentum space, and particularizing on the single
phonon mode at momentum Q, we have
X k2 † Ωρ0 2 2 X †
H
b= ck b
b ck + ωQ u0 + iQṼQ n0 u0 (b
ck+Qb c†k−Qb
ck − b ck ), (210)
k
2m 4 k

where ṼQ is the respective Fourier component of the Coulomb interaction.


We now assume that u0 is small so that we only consider scattering between the phonon
states to first order in u0 . The electronic part of the spectrum is then approximately deter-
mined by the eigenvalues of the matrix
 
k2
2m

 2
 (211)
∆∗ (k−Q)
2m

62
where ∆ ≡ −iQu0 nṼQ . They are given by
s 2
2 2
(k − Q) + k (k − Q)2 − k 2

k = ± + |∆|2 . (212)
4m 4m

The total energy of the Fermi sea and the deformed lattice is thus
X λLQ2 2
E(u0 ) = 2 −
k + u0 , (213)
0≤k<Q
4

where the factor of 2 arises from the changing the summation originally ranging from −Q
to Q toward only positive k values. To find out whether a finite u0 is favored, we minimize
the energy with respect to u0

1 dE(u0 )
0=
L du0
Z Q
2 dk 1 λ
= − 16Q mn0 ṼQ2 u0 q + Q2 u0
0 2π [(k − Q)2 − k 2 ]2 + 16m2 Q2 n20 ṼQ2 u20 2
q
(214)
 
3 4 + 16m2 Q2 n2 Ṽ 2 u2
1 −Q + Q Q
 + λ Q2 u0
0 Q 0
=(−)2 16Q2 mn0 ṼQ2 u0 log  q
4πQ +Q3 + Q Q4 + 16m2 Q2 n20 ṼQ2 u20 2
!
8Qmn20 ṼQ2 mn0 ṼQ u0 λ
≈ u0 log + Q2 u0 .
π kF 2

The solution u0 = 0 is not of interest to us, but


" #
kF kF πλ
u0 = exp −
mn0 ṼQ 8mn20 ṼQ2
(215)
2F −1/ν(F )g
= e
kF n0 ṼQ

is. Here, ν(F ) = 2m/πkF is the density of states at the Fermi level and g = 4n20 ṼQ2 /λ
can be interpreted as a coupling constant that describes a phonon-induced electron-electron
interaction (hence the power 2 of ṼQ ). The exponential dependence of the ‘order parameter’
u0 on the density of states and the coupling constant is a characteristic feature of such
weak-coupling instabilities of the Fermi sea, of which we will encounter more examples in
the next chapter. As promised, the Peierls instability will open a finite gap in the electronic
spectrum for
− −1/N (F )g
∆ = +
kF − kF = 2|∆| = 8F e . (216)

63
Thus, the 1D metal has turned into an insulator/semiconductor. As a result of this insta-
bility, the charge density of the electrons actually becomes modulated with a wave vector
Q. This is called a charge density wave. We will discuss density wave instabilities in more
detail in the next chapter.

64
IV. INTERACTIONS IN ITINERANT SYSTEMS

A. Particle-hole excitations

1. In semiconductors

What is a semiconductor? In theory, there is no sharp distinction between the notion of


an insulator and a semiconductor: both have a gapped band structure and hence (without
electron-electron interactions considered) a gapped zero temperature ground state. The
difference is quantitative, namely, the size of the gap in relation to the practically relevant
energy scale of thermal activation at room temperature (300 K ∼ 0.25 meV). Materials with
multi-eV band gaps (like diamond with ∆ = 5.5 eV) remain insulating at room temperature
as thermal activation is exponentially suppressed in ∆/(kB T ). Semiconductors with gaps of
up to 1 eV gain carrier densities of n ∼ 103 –1011 cm−3 at room temperature, meaning that
they are not fully insulating.
A band structure is gapped if (at zero temperature) the minimum of the fully unoccupied
conduction band is above the maximum of the fully occupied valence band. We speak of a
direct gap if the minimum of the conduction band and the maximum of the valence band
coincide at the same momentum. Otherwise it is an indirect gap.
The obvious elementary excitations in semiconductors are electron-hole excitations. To
study what interactions do to these excitations, we take
X † †

H =
b k,v b
c v,k b
c v,k + k,c b
c c,k b
c c,k , (217)
k

c †v,k ( b
as a spinless model of a semiconductor. Here b c †c,k ) creates an electron in the va-
lence (conduction) band at momentum k and we take the dispersions of the two bands, for
simplicity, as a k · p approximation around a momentum with direct gap ∆
k2 k2
k,v = − , k,c = + ∆. (218)
2m 2m
In general there is no need for the two effective masses to be equal. The zero temperature
(many-body!) ground state is
Y
|Ψ0 i = c†v,k |0i. (219)
k
Particle-hole excitations above it

|k, qi = c†c,k+q cv,k |Ψ0 i, (220)

65
carry a possibly nonzero momentum q and are eigenstates of the Hamiltonian with energy
Ek,q = E0 + c,k+q − v,k compared to the ground states energy E0 . For a fixed q, due to
the freedom to choose k, such electron-hole excitations are possible for a range of energies.
The spectrum
X
I(q, E) = δ[E − (c,k+q − v,k )] (221)
k

forms the so-called electron-hole continuum above a q dependent energy threshold, which
for direct gap semiconductors has a minimum at q = 0 where it equals the gap ∆.
We now study how electron-electron interactions can alter the nature of the ground state
by favoring a coherent superposition of electron-hole pairs, so-called excitons, energetically.
We add the Coulomb interaction term

e2
Z X
H
b int = drdr 0 0
φb †β (r 0 ) φb †α (r) φb α (r) φb β (r 0 ), (222)
α,β
|r − r |

where α, β = 1, 2 are local degrees of freedom (like orbitals or sublattices) that gave rise to
the two bands of the semiconductor. Their Fourier transforms are unitarily rotated to the
band eigenstates by the Bloch vector un,α;k in momentum space

1 X X
φb α (r) = √ un,α;k eik·r b
c n,k . (223)
Ω n=c,v k

We now consider a general trial wave function for an exciton state

X
|Ψq i = Aq (k)|k, qi, (224)
k

parametrized by the function Aq (k). We can solve the eigenvalue problem ( H b int )|Ψq i =
b +H

(E0 + Eq )|Ψq i in the space of two-particle excitations with momentum q above the ground
state
X
hk, q| H b int |k0 , qiAq (k0 ) = (E0 + Eq )Aq (k).
b +H (225)
k0

The matrix elements are

b |k0 , qi = δk,k0 Ek,q


hk, q| H (226)

66
and
e2
Z X
hk, q| H 0
b int |k , qi = drdr 0 hk, q| φb †β (r 0 ) φb †α (r) φb α (r) φb β (r 0 )|k0 , qi
α,β
|r − r 0 |
e2
Z
1 X X X 0
= drdr 0 ei(k1 −k2 )·r ei(k3 −k4 )·r
Ω2 α,β k1 ,k2 ,k3 ,k4 n,n0 ,m,m0 =c,v
|r − r 0 |

c †m0 ,k4 b
× u∗n0 ,α;k2 un,α;k1 u∗m0 ,β;k4 um,β;k3 hk, q| b c †n0 ,k2 b c m,k3 |k0 , qi.
c n,k1 b
(227)

Since we are only interested in small momentum transfers (the main contributions come
from around k ∼ 0), we can approximate
X
u∗m0 ,α;k1 um,α;k2 ≈ δm,m0 (228)
α

yielding
e2
Z
b int |k0 , qi = 1 0
X X
hk, q| H drdr 0 ei(k1 −k2 )·r ei(k3 −k4 )·r
Ω2 |r − r 0 |
k1 ,k2 ,k3 ,k4 n,m=c,v (229)
c †m,k4 b
× hk, q| b c †n,k2 b c m,k3 |k0 , qi.
c n,k1 b

The expectation value of the second quantized operators can, for k 6= k0 be replaced by

c †m,k4 b
hk, q| b c †n,k2 b c m,k3 |k0 , qi → − δm,v δn,c δk+q,k2 δk,k3 δk0 ,k4 δk0 +q,k1
c n,k1 b
(230)
− δn,v δm,c δk+q,k4 δk,k1 δk0 ,k2 δk0 +q,k3

resulting in
e2
Z
2 0 0
hk, q| H int |k , qi = − 2 drdr 0 ei(k −k)·(r−r )
b 0
Ω |r − r 0 |
(231)
2 2π e2
=− .
Ω (k − k0 )2
We now include the electrostatic environment of the medium phenomenologically by in-
troducing the dielectric constant, replacing e2 → e2 / in the interaction strength. The
eigenvalue equation for Aq (k) now reads
1 X 4πe2
(Eq,k − E)Aq (k) − Aq (k0 ) = 0. (232)
Ω k0 (k − k0 )2

Returning to real space


1 X
Ãq (r) = √ Aq (k)eik·r (233)
Ω k

67
a) b) c) (q, 0)
E ! (0, 0)

electron-hole plasma electron-hole 1D


continuum resonance continuum 2D
1
!p 3D
exciton states

q 2kF q 2kF q

Figure 6. a) Excitation spectrum of a direct gap semiconductor with a gap ∆ and interaction-
induced exciton bound states in the gap. b) Excitation spectrum of a 3D metal as deduced
from the charge susceptibility with electron-hole continuum and the interaction-induced plasma
resonance. c) Form of the singularity of the static (ω = 0) charge susceptibility of metals in
different dimensions.

and using the dispersions Eq. (218) we obtain (shifting k to k − q/2)

∇2 e2 q2
   
− − Ãq (r) = E − E0 − ∆ − Ãq (r), (234)
2(m/2) |r| 2(2m)

which is exactly the Schroedinger equation for the hydrogen atom with reduced mass µex =
m1 m2 /(m1 + m2 ) = m/2 and total mass Mex = 2m. The spectrum is given by

µex e4 q2
Eq = E0 + ∆ − + , (235)
22 n2 2Mex

which implies the existence of bound states below the electron-hole continuum. [See Fig. 6 a)]
The fact that these excitations have a sharp dispersion relation Eq qualifies them be called
a quasiparticle in condensed matter. This quasiparticle is called exciton. Quasiparticles
are a very powerful concept covering many interaction effects in many-body systems. A
quasiparticle can only exist because of the many-electron background. In the case at hand,
they are a collective excitation that includes the polarization from all electrons. Excitons
carry no charge and are bosonic in character, as they are made of two fermions. Bose-Einstein
condensation of excitons has been observed experimentally.

2. In metals

a. RPA susceptibility As in semiconductors, the elementary excitations in metals are


particle-hole excitations. However, in metals they can appear at arbitrarily small energies.

68
As a result, the effective Coulomb interaction is not only rescaled as is the case in semicon-
ductors due to the dipolar electric environment, but substantially modified taking the form
e−r/l /r, as we will derive below.
We recall that the noninteracting linear response of a metal to potential perturbation is
given by
δρ(q, ω) = χ0 (q, ω)V (q, ω), (236)

with
1X nk+q − nk
χ0 (q, ω) = , (237)
Ω k k+q − k − ω − iη
where nk = 1/(eβk + 1) is the occupation number of the band according to the Fermi-Dirac
distribution. We now want to see how the Coulomb interaction changes this response. The
redistribution of electron density δρ(q, ω) due to the external potential V (q, ω) sources an
additional Coulomb potential if interactions are considered. The latter is determined by the
Poisson equation
4πe2
∇2 VC (r, t) = −4πe2 δρ(r, t), VC (q, ω) = δρ(q, ω). (238)
q2
The total potential felt by the electrons is thus the sum of the two contributions

Vtot (q, ω) =V (q, ω) + VC (q, ω)


4πe2 (239)
=V (q, ω) + 2 δρ(q, ω),
q
which is a self-consistent equation for δρ(q, ω) when considered together with

δρ(q, ω) = χ0 (q, ω)Vtot (q, ω). (240)

Solving for Vtot (q, ω), we can write


1
Vtot (q, ω) = V (q, ω), (241)
(q, ω)
where
4πe2
(q, ω) = 1 − χ0 (q, ω) (242)
q2
is called the dynamical dielectric function that describes the renormalization of the external
potential due to the electrons in the metal.
Inserting this in the equation for δρ(q, ω), we find

δρ(q, ω) = χ(q, ω)V (q, ω), (243)

69
where response functions including interaction effects is given by

χ0 (q, ω) χ0 (q, ω)
χ(q, ω) = = 2 , (244)
(q, ω) 1 − 4πe
q2
χ0 (q, ω)

which is called the random phase approximation (RPA) form of the response function. (The
name stems from the fact that, when χ(q, ω) is computed from a perturbative expansion
in the interaction strength, certain terms are exactly summed in the above form to infinite
order, while others which are assumed to cancel on average due to random relative phases
are not summed.)
Both χ0 (q, ω) and χ(q, ω) are complex valued quantities. The real and imaginary parts
contain distinct information about the physics of the system. Divergences in the real part
signal instabilities of the ground state toward symmetry-breaking states. χ(q, ω) can diverge
if the interaction is sufficiently strong such that the denominator goes to zero (evidencing
that interactions are necessary for such an instability). We will defer the study of instabilities
for later. The imaginary part contains information about excitations of the metal which will
be our primary interest for now.
First, for the noninteracting case of χ0 (q, ω), we can use

1
lim+ = P(x−1 ) + iπδ(x), (245)
η→0 x − iη

where P denotes the Cauchy principal value to obtain

nk+q − nk
 
1X
Re χ0 (q, ω) = P
Ω k k+q − k − ω
(246)
πX
Im χ0 (q, ω), = (nk+q − nk ) δ(k+q − k − ω).
Ω k

The imaginary part is exactly what Fermi’s golden rule would yield for the transition rate
from the ground state to an excited state with an electron-hole pair excitation. In a 3D
metal with spherical Fermi surface, excitations with arbitrarily small energies are possible
in a range of momenta |q| ≤ 2kF , where kF is the Fermi wave vector. For small momentum
transfer and finite frequencies ω, there are no excitations in the particle hole continuum.
We will now explore a collective excitation induced by the Coulomb interaction. This so-
called plasma resonance appears at small momenta and finite frequency, where no continuum
of particle-hole excitations is allowed. To study it, we consider the RPA susceptibility and

70
expand all quantities entering it in small momenta |q|  kF

k+q = k + q · ∇k k + · · ·
∂n (247)
nk+q = nk + q · ∇k k .
∂
At zero temperature, ∂ n = −δ() (keep in mind that all energies are measured with respect
to the Fermi energy, or the chemical potential) and we can write q · ∇k k = vF q cos θ, where
θ is the angle between the (fixed) vector q and k. The bare susceptibility becomes
Z
dk vF q cos θ δ(k )
χ0 (q, ω) = −
(2π) vF q cos θ − ω − iη
3

(2π) π kF2
Z
vF q cos θ
=− dθ sin θ
(2π) 03 vF vF q cos θ − ω − iη
1
kF2
Z
ξ
=− dξ
(2π) vF −1 ξ − ω+iη
2
v q
(248)
" F 2 #
2
kF2
  
2 vF q 3 vF q
= 2
1+ + O(q 4 )
(2π) vF 3 ω + iη 5 ω + iη
" 2 #
2

nkF q 3 vF q
= 2
1+ + O(q 4 ).
2m(ω + iη) 5 ω + iη

We used the integral


Z 1  
ξ 2 2 2 3 2
dξ = 2 − arctan(b) = − b 1 + b + O(b6 ). (249)
−1 ξ − 1/b b 3 5

We first need the q 2 term to find that, following Eq. (242), the dielectric function has a
characteristic frequency dependence

ωp2
lim (q, ω) = 1 − (250)
q→0 ω2

where the so-called plasma frequency is given by ωp2 = 4πe2 nkF /m.
Second, we use Eq. (248) to rewrite the RPA susceptibility as

nkF q 2 R(q, ω)2


χ(q, ω) ≈
2m(ω + iη)2 − 8πe2 nkF R(q, ω)2
(251)
nkF q 2 R(q, ω)
 
1 1
= −
4mωp ω + iη − ωp R(q, ω) ω + iη + ωp R(q, ω)

with
3vF2 q 2
R(q, ω)2 = 1 + . (252)
5ω 2

71
Now, we can use Eq. (245) to compute
πnkF q 2 R(q, ω)
Im χ(q, ω) = [δ (ω − ωp R(q, ωp )) − δ (ω + ωp R(q, ωp ))] . (253)
2mωp
This has an excitation with well-defined dispersion relation
 
3vF 2
ω(q) = ωp R(q, ωp ) ≈ ωp 1 + q , (254)
10 ωp2

i.e., a quasiparticle. It is called plasma resonance and the associated plasmon is again
bosonic in character. The plasma resonance coexists with the electron-hole continuum at
lager q, but is damped due to the decay into electron-hole excitations. [See Fig. 6 b)] Actual
values for the plasma frequency are typically in the range of a few eV. The plasma resonance
can be understood from classical electrodynamics. If the mobile electrons are shifted with
respect to the ions by a rigid displacement r, there results a polarization-induced electric
field E = 4πernkF . This exerts a force on the electron cloud and the equation of motion
becomes that of a harmonic oscillator

mr̈ = −eE = −4πe2 rnkF . (255)

The eigenfrequency of that oscillator is precisely the plasma frequency


4πe2 nkF
ωp2 = . (256)
m
b. Thomas-Fermi screening In closing this section, we return to our initial question,
namely the form of the effective interaction potential, once the screening by the mobile
electrons in the metal is taken into account. This is most easily found by taking the ω → 0
limit, i.e., the static limit, of the self-consistent potential using the dielectric function. We
start from the first line of Eq. (248) (we multiply the susceptibility by 2 here to account for
the spin degree of freedom and connect to known results below)
4π kF2 3nkF
χ0 (q, 0) = − 3
=− (257)
(2π) vF 2F
and, following Eq. (242) we find the static dielectric function
2
kTF
(q, 0) = 1 + , (258)
q2
2
written in terms of the Thomas-Fermi wave vector kTF = 6πe2 nkF /F which quantifies the
screening by the mobile electrons. To see how the profile of the Coulomb interaction is

72
modified by this screening, consider the potential of a bare charge V (q) = 4πe2 /q 2 and how
it modifies the total potential
4πe2
Vtot (q) = 2
. (259)
q 2 + kTF
2
We observe that kTF is a cutoff of the potential for small q, i.e., long distances. This becomes
even clearer in the Fourier transformed version

e2 −kTF r
Vtot (r) = e , (260)
r2

showing the exponential screening of the Coulomb potential at long distances. In a metal,
the screening length is comparable to the Fermi momentum. The Thomas-Fermi screening
−1
implies that electric fields cannot penetrate a metal on distances larger than kTF , i.e., a few
interatomic distances.
c. Singularities in lower dimensions The bare static susceptibility can be computed
exactly in different dimensions for a fully isotropic Fermi surface and a quadratic dispersion.
The results are

(1D) 1 s + 2
χ0 (q, 0) =− log
,
2πq s − 2
(2D) 1 h p i
χ0 (q, 0) = − 1 − 1 − 4/s Θ(s − 2) ,
2 (261)
2π    
(3D) kF s 4 s + 2
χ0 (q, 0) = − 2 1 − 1 − 2 log
,
2π 4 s s − 2

with s = q/kF . At s = 2, i.e., for scattering processes across the Fermi surface, χ0 is not
smooth. [See Fig. 6 c)] However, the order of non-smoothness depends on the dimension and
is more severe for smaller dimensions. In particular, in 1D the susceptibility is diverging at
q = 2kF signaling an instability of the electronic system.

B. Fermi liquid theory

What happens to a noninteracting fermi sea ground state of a metal when interactions are
added? This is a highly nontrivial question and a multitude of instabilities are conceivable,
including symmetry breaking states (magnetism, superconductivity, density waves), Mott
insulators and others. The main message of this section, and the core statement of Fermi
liquid theory is: In many cases not all that much happens. The character and key properties
of the Fermi sea survive the presence of interactions with some conceptual modifications.

73
This bold statement is valid in 3D for weak repulsive interactions. It is a blessing for us,
since we rely very much on the conducting properties of metals. If they were unstable, it
would pose a technological challenge.

1. Quasiparticles

At the basis of the Fermi liquid theory are so-called quasiparticles. They are a sharply
defined concept for excitations above and very close to the Fermi sea. To get some intuition
about this, we consider a system with a filled Fermi sea, dispersion k = k2 /(2m), and Fermi
momentum kF . We consider a state with this Fermi sea filled and one additional electron at
some k with |k| > kF . This state is an eigenstate of the noninteracting Hamiltonian, but if
electron-electron interactions
Z X
H int = drdr 0
b V (r − r 0 )φ†β (r 0 )φ†α (r)φα (r)φβ (r 0 ), (262)
α,β

are considered it will have a nonvanishing decay amplitude into other states. Specifically,
we consider the screened Coulomb interactions
4πe2
V (q) = . (263)
q 2 + kTF
The lowest order effect of the interaction that couples to the electron at k is to create an
extra particle-hole pair with momenta (k0 + q) and k0 and at the same time to shift the
momentum of the extra electron to (k − q). [See Fig. 7 a)] Momentum is conserved in this
process because
k = (k − q) − k0 + (k0 + q) (264)

and energy conservation requires

k = k−q − k0 + k0 +q . (265)

(We will use a convention in which k is measured relative to the Fermi energy, i.e., it
includes the chemical potential and is zero near the Fermi surface.)
We now want to calculate the total decay rate of the initial state by summing over all
possible final states (i.e., all q and k0 ). Fermi’s golden rule yields
1 2π X
=2 2 |V (q)|2 nk0 (1 − nk−q )(1 − nk0 +q ) δ (k − k−q + k0 − k0 +q ) , (266)
τk Ω k0 ,q

74
a) b)
k q
q n0,k,
quasiparticles
k 1
electrons
Fermi sea Fermi sea
Z kF
k0

k0 + q kF k

Figure 7. a) Dominant decay processes for a single particle excitation above the filled Fermi
sea. b) Momentum distribution function at zero temperature in an interacting Fermi liquid for
the original electrons (orange) and the quasiparticles (blue). The latter follow the Fermi-Dirac
distribution, while the former have a smaller step-like singularity that equates to the quasiparticle
weight ZkF .

where the factor of 2 is for the spin degeneracy. The factors nk0 (1 − nk−q )(1 − nk0 +q ) ensure
that the final state is indeed available by the Pauli principle. After some algebra, that
we relegate to an exercise, we obtain the following approximation for the lifetime of the
excitation
Z
1 N (F ) 2
=  dq |V (q)|2 , (267)
τk 8π vF2 k
where N (F ) = m? kF /π 2 is the density of states at the Fermi level.
Exercise: Fill in the missing steps to compute the quasiparticle life time.
The key feature of this result is that τk ∝ −2
k , i.e., the lifetime becomes infinitely long

close to the Fermi surface, while excitations at higher energy decay much faster. This is
indicative of a well-defined quasiparticle concept at energies close to the Fermi level. It
comes about because of the reduced phase space available for scattering processes that obey
momentum conservation, energy conservation, and the Pauli principle simultaneously for
small excitation energies.

2. Basics of Landau Fermi liquids

Landau’s theory of Fermi liquid can be motivated from microscopic considerations, but
for efficiency, we present it as an axiomatic effective theory here. The basic axiom is that
there is a one-to-one mapping between the excitations above the noninteracting Fermi sea

75
(free electrons) and the excitations above the interacting many-body ground states (quasi-
particles). We denote the momentum distribution function of the quasiparticles by nk,σ and
that of the electrons by ñk,σ , where σ = ± denotes the spin. The total number of electrons
equals the number of quasiparticles
X X
N= nk,σ = ñk,σ . (268)
k,σ k,σ

One now postulates that the distribution function of the quasiparticles in equilibrium follows
the Fermi-Dirac distribution n0,k,σ . As a consequence, the electronic distribution function
becomes distorted. In particular, the jump at kF at T = 0 is reduced from 1 to a smaller
value (but there is still a jump). The jump height is called quasi-particle weight ZkF . [See
Fig. 7 b)]
The object of interest in the Fermi liquid theory are the deviations from the equilibrium
quasiparticle distribution function

δnk,σ = nk,σ − n0,k,σ , (269)

which is a small expansion parameter in the theory. Central is an expansion of the energy
as a functional of this parameter
X 1 XX
E[δn] = E0 + σ,k δnk,σ + fk,k0 ;σ,σ0 δnk,σ δnk0 ,σ0 + O(δn3 ), (270)
k,σ
Ω k,k0 σ,σ0

where E0 is the ground state energy, and σ,k as well as fk,k0 ;σ,σ0 are phenomenological param-
eters at this stage which can be determined from a microscopic derivation or experimental
input. The variational derivative with respect to δn yields a renormalized effective dispersion
of the quasiparticles due to the interaction with the perturbed background quasiparticles

1 X
˜k,σ = k,σ + fk,k0 ;σ,σ0 δnk0 ,σ0 . (271)
Ω k0 ,σ0

To obtain a more practical set of parameters in a perturbative regime, one ignores the
radial dependence of fk,k0 ;σ,σ0 on k and k0 so that the only relevant variable is the angle θ
between k and k0 , which are both assumed to lie on the Fermi surface. We can then expand
fk,k0 ;σ,σ0 in Legendre polynomials

X
fk,k0 ;σ,σ0 ≈ fθ;σ,σ0 = (fls + σσ 0 fla )Pl (cos θ), (272)
l=0

76
where we have also split up each coefficient into a spin-symmetric fls and a spin-asymmetric
part fla . When rescaled by the density of states at the Fermi level ν(F ), these parameters
are known as the Landau parameters of the theory

Fls = ν(F ) fls , Fla = ν(F ) fla . (273)

The density of states at the Fermi surface is defined as


2X k2 m? kF
ν(F ) = δ(k − F ) = 2 F = (274)
Ω k π vF π2

and follows from the dispersion k of the bare quasiparticle energy


kF
∇k |kF = vF = (275)
m?
where for a fully rotation symmetric system we may write k = k2 /(2m? ) with m? as an
“effective mass”, although we will be only interested at the spectrum in the immediate
vicinity of the Fermi energy.

3. Specific heat – Density of states

Since the quasiparticles are fermions, they obey Fermi-Dirac statistics fF−D () = 1/(eβ +
1) which is their equilibrium distribution function
1
n0,k,σ (T ) = (276)
e(˜k −µ)/(kB T ) +1
with the chemical potential µ. For low temperatures and an otherwise undisturbed system,
their deviation must obey
δnk,σ = n0,k,σ (T ) − n0,k,σ (0) (277)

which, using Eq. (278) leads to ˜k,σ = k,σ because here


1 X
0= fk,k0 ;σ,σ0 δnk0 ,σ0 , (278)
Ω k0 ,σ0

due to the fact that in δnk,σ there are as many particles (positive) as holes (negative)
such that they cancel in the correction term that measures the overall charge imbalance.
Therefore we rewrite the Fermi-Dirac distribution (276) as
1
n0,k,σ (T ) = . (279)
e(k −F )/(kB T ) +1

77
In doing so, we have also replaced the chemical potential µ by the Fermi energy F which
is justified at low temperatures because µ = F + O(T 2 ). We are interested in leading
corrections, which are usually of the order T 0 and T 1 , and corrections of higher order may
therefore be neglected in the low-temperature regime. In order to discuss the specific heat,
we employ the expression for the entropy of a fermion gas. For each quasiparticle species
with a given spin there is one state labelled by k. The entropy density of independent
degrees of freedom may be computed from the distribution function
X
S = −kB nσ,k ln(nσ,k ) + (1 − nσ,k ) ln(1 − nσ,k ). (280)
k,σ

Taking the derivative of the entropy S with respect to T , the specific heat is obtained as

∂S
C(T ) = T
∂T
eξk /(kB T )
 
kB T X ξk nk,σ
=− ln
Ω k,σ (eξk /(kB T ) + 1)2 kB T 2 1 − nk,σ (281)
kB T X 1 ξk ξk
=− 2 ,
Ω k,σ 4 cosh [ξk /(2kB T )] kB T 2 kB T

where we introduced ξk = k − F . In the limit T → 0 we find

ξ2
Z
C(T ) ν(F )
≈ dξ
T 4kB T 3 cosh2 [ξ/(2kB T )]
kB2 ν(F )
Z
y
= dy 2
(282)
4 cosh (y/2)
2 2
π kB ν(F )
=
3
which is the well-known linear behavior C(T ) = γT for the specific heat at low temperatures,
with γ = π 2 kB2 ν(F )/3 the Sommerfeld coefficient. Since ν(F ) = m? kF /π 2 , the effective mass
m? of the quasiparticles can directly determined by measuring the specific heat of the system.

4. Compressibility and susceptibility

As an application of the Fermi liquid theory, we compute its compressibility and magnetic
susceptibility. The former is defined by
 
1 ∂Ω
κ=− , (283)
Ω ∂p T,N

78
where p is the uniform pressure exerted on the system. The quasiparticle energy levels are
affected by the pressure as
∂k,σ
δk,σ = δp
∂p
∂k,σ ∂k ∂Ω
= · δp
∂k ∂Ω ∂p (284)
k
= vk · κ(0) δp
3
2F (0)
≈ κ δp,
3
where we assumed to be close to the Fermi surface to obtain the last line and we used
k = n2π/Ω1/3 . We are after the analogous equation for the renormalized quasiparticle
dispersion and the compressibility of the Fermi liquid κ
2F
δ˜k,σ = κδp. (285)
3
To obtain it, we start with the definition of ˜k,σ and use Eq. (284) for the first term (here,
δ˜k,σ = ˜k,σ − k,σ )
2F (0) 1 X
δ˜k,σ = κ δp + fk,k0 ;σ,σ0 δnk0 ,σ0
3 Ω k0 ,σ0
2F (0) 1 X ∂nk0 ,σ0
= κ δp + fk,k0 ;σ,σ0 δp
3 Ω k0 ,σ0 ∂p
(286)
2F (0) 1 X ∂nk0 ,σ0
= κ δp + fk,k0 ;σ,σ0 δ˜k0 ,σ0
3 Ω k0 ,σ0 ∂˜k0 ,σ0
2F (0) 1 X 2F
= κ δp − fk,k0 ;σ,σ0 δ(˜k0 ,σ0 − F ) κδp.
3 Ω k0 ,σ0 3

Since ˜k0 ,σ0 is isotropic, this brings about exactly the l = 0 Landau parameter and we have

(0) κ(0)
κ=κ − κF0s , κ= . (287)
1 + F0s

Without derivation we also state the unperturbed compressibility of the Fermi liquid 1/κ(0) =
2nF /3.
By similar considerations, we can compute the magnetic susceptibility of the Fermi liquid
1 ∂M
χ=
Ω ∂B
(288)
µ2B ν(F )
= .
1 + F0a

79
Common to Eqs. (287) and (288) is that the Fermi liquid parameter appears in the denom-
inator, indicating the possibility of a divergence (when it reaches the value −1), i.e., an
a/s
instability of the system. However, for small F0 , i.e., weak interactions, the Fermi liquid is
stable, as promised. This proof can be extended to all angular momentum channels where
a similar form is found. A divergence of the magnetic susceptibility indicates a tendency
to (ferro-)magnetic order. This constitutes the so-called Stoner instability which we will
encounter again from a more microscopic viewpoint.

5. Galilei invariance – effective mass and F1s

We initially introduced by hand the effective mass of quasiparticles in k,σ . In this section
we show, that overall consistency of the phenomenological theory requires a relation between
the effective mass and one Landau parameter, namely F1s . The reason is, that the effective
mass is the result of the interactions among the electrons. This self-consistency is connected
with the Galilean invariance of the system.
To illustrate this, we consider a situation in which the momenta of all particles are shifted
by q as a perturbation to the Fermi liquid (|q| shall be very small compared to the Fermi
momentum kF in order to remain within the assumption-range of the Fermi liquid theory).
The distribution function is given by

(0) (0) (0)


δnσ,k = nσ,k+q − nσ,k ≈ q · ∇k nσ,k . (289)

(0)
This function is strongly concentrated around the Fermi energy, as the derivative of nσ,k
tends to zero far away from it.
(0)
The current density can now be calculated, using the distribution function nσ,k = nσ,k +
δnσ,k . Within the Fermi liquid theory this yields,

1X
jq = vk nσ,k (290)
Ω k,σ

with

vk =∇k ˜k,σ
1 X (291)
=∇k k,σ + ∇k fk,k0 ;σ,σ0 δnk0 ,σ0
Ω k0 ,σ0

80
With this we obtain the current density
1X k 1 X X  (0) 
jq = nσ,k + 2 n + δnσ,k ∇k fk,k0 ;σ,σ0 δnσ0 ,k0
Ω k,σ m? Ω k,σ k0 ,σ0 σ,k
1X k 1 XX (0)

2
= δn σ,k − ∇ k σ,k fk,k0 ;σ,σ 0 δnσ 0 ,k0 + O(q )
n
Ω k,σ m? Ω2 k,σ k0 ,σ0
(292)
1X k 1 XX k0
= ?
δnσ,k + 2 δ (k ,σ − F ) ? fk,k0 ;σ,σ0 δnσ,k + O(q 2 )
0
Ω k,σ m Ω k,σ k0 ,σ0 m

≡ j1 + j2 ,

where in the second line we integrated by parts and neglected terms quadratic in δn and to
reach the third line, we used fk,k0 ;σ,σ0 = fk0 ,k;σ0 ,σ as well as
(0)
(0) ∂nσ,k k
∇k nσ,k = ∇k k,σ = −δ(k,σ − F )∇k k,σ = −δ(k,σ − F ) ? . (293)
∂k,σ m

The current j1 denotes the quasiparticle current, while the second term j2 can be interpreted
as a drag-current, i.e., an induced motion (backflow) of the other particles due to interactions.
From a different viewpoint, we consider the system as being in the inertial frame with a
velocity |q|/m, as all particles received the same momentum. Here m is the bare electron
mass. The current density is then given by

N q 1Xk 1Xk
jq = = nσ,k = δnσ,k (294)
Ω m Ω k,σ m Ω k,σ m

Since these two viewpoints have to be equivalent, the resulting currents should be the same.
Thus, we compare Eq. (292) and Eq. (294) and obtain the equation,

k k 1 X k0
= ?+ fk,k0 ;σ,σ0 δ(k0 ,σ0 − F ) ? (295)
m m Ω k0 ,σ0 m

k = k/kF then leads to


which with b

k ·b k0
Z
1 1 dΩ bk 0 X s b
= ? + ν(F ) fl Pl (cos θ)
m m 4π l m?
Z (296)
1 dΩ bk 0 X s P1 (cos θ)
= ? + ν(F ) fl Pl (cos θ)
m 4π l m?

or by using the orthogonality of the Legendre polynomials,

m? 1
= 1 + F1s , (297)
m 3

81
where 1/3 = 1/(2l + 1) for l = 1 originates from the orthogonality relation of Legendre-
polynomials. Therefore, the relation (297) has to couple m? to F1s in order for Landaus theory
of Fermi liquids to be self-consistent. Generally, we find that F1s > 0 so that quasiparticles
in a Fermi liquid are effectively heavier than bare electrons.

6. Stability of a Fermi liquid

Upon inspection of the renormalization of the quantities treated previously


γ m?
= ,
γ0 m
κ m? 1
= , (298)
κ0 m 1 + F0s
χ m? 1
= ,
χ0 m 1 + F0a
with
m? 1
= 1 + F1s , (299)
m 3
and the the response functions of the non-interacting system are given by
kB mkF 3m mkF
γ0 = , κ0 = , χ0 = µ2B , (300)
3 nkF2 π2
one notes that the compressibility κ (susceptibility χ) diverges for F0s → −1 (F0a → −1),
indicating an instability of the system. A diverging spin susceptibility for example leads to
a ferromagnetic state with a split Fermi surface, one for each spin direction. On the other
hand, a diverging compressibility leads to a spontaneous contraction of the system. More
generally, the deformation of the quasiparticle distribution function may vary over the Fermi
surface, so that arbitrary deviations of the Fermi liquid ground state may be classified by
the deformation
∞ X
X l
δnσ, bk = δnσ,l,m Yl,m (θ bk , φ bk ). (301)
l=0 m=−l

Note that we allow here formally for complex distribution functions. For pure charge density
deformations we have δn+,l,m = δn−,l,m , while pure spin density deformations are described
by δn+,l,m = −δn−,l,m . The general response function for a redistribution δnσ, bk with the
anisotropy Yl,m (θ bk , φ bk ) is given by
(0)
χl,m
χl,m = Fls,a
(302)
1+ 2l+1

82
Stability of the Fermi liquid against any of these deformations requires

Fls,a
1+ >0 (303)
2l + 1

If for any deformation channel l this conditions is violated one talks about a “Pomeranchuck
instability”. Generally, the renormalization of the Fermi liquid leads to a change in the
Wilson ratio, defined as
R χ γ0 1
= = (304)
R0 χ0 γ 1 + F0a

where R0 = χ0 /γ0 = 6µ2B /(π 2 kB2 ). Note that the Wilson ratio does not depend on the
effective mass. A remarkable feature of the Fermi liquid theory is that even very strongly
interacting Fermions remain Fermi liquids, notably the quantum liquid 3 He and so-called
heavy Fermion systems, which are compounds of transition metals and rare earths. Both
are strongly renormalized Fermi liquids. For 3 He one observes that the higher the applied
pressure is, the denser the liquid becomes and the stronger the quasiparticles interact. As
a result, the Fermi liquid parameters and the effective mass increase with pressure. Ap-
proaching the solidification the compressibility is reduced, the quasiparticles become heavier
(slower) and the magnetic response increases drastically. Finally the heavy fermion systems
are characterized by the extraordinary enhancements of the effective mass which for many
of these compounds lies between 100 and 1000 times the bare electron mass (e.g., CeAl3 ,
UBe13 ). These large masses lead the notion of almost localized Fermi liquids, since the large
effective mass is induced by the hybridization of itinerant conduction electrons with strongly
interacting (localized) electron states in partially filled 4f - or 5f -orbitals of Lanthanide and
Actinide atoms, respectively.

7. Microscopic justification

A rigorous derivation of Landaus Fermi liquid theory requires methods of quantum


field theory and would go beyond the scope of these lectures. However, plain Rayleigh-
Schroedinger perturbation theory applied to a simple model allows to gain some insights
into the microscopic fundament of this phenomenologically based theory. In the follow-
ing, we consider a model of fermions with contact interaction U δ(r − r 0 ), described by the

83
Hamiltonian
X Z
H
b = c †σ,k
k b c σ,k +
b d3 rd3 r 0 φb †↑ (r) φb †↓ (r 0 )U δ(r − r 0 ) φb ↓ φb ↑ (r)
k,σ
(305)
X U X †
= c †σ,k
k b c σ,k + c c† 0 b c ↓,k0 b
c ↑,k
Ω k,k0 ,q ↑,k+q ↓,k −q
b b b
k,σ

where εk = k2 /2m is a parabolic dispersion of non-interacting electrons. We previously no-


ticed that, in order to find well-defined quasiparticles, the interaction between the Fermions
has to be short ranged. This specially holds for the contact interaction.
Starting form the Hamiltonian (305), we will determine Landau parameters for a cor-
c †σ,k b
responding Fermi liquid theory. For a given momentum distribution nσ,k = h b c σ,k i =
(0)
nσ,k + δnσ,k , we can expand the energy resulting form equation (305) following the Rayleigh-
Schroedinger perturbation method,

E = E (0) + E (1) + E (2) + · · · (306)

with
X
E (0) = k nσ,k
k,σ
UX
E (1) = n↑,k0 n↓,k (307)
Ω k,k0

(2) U 2 X n↑,k n↓,k0 (1 − n↑,k+q )(1 − n↓,k0 −q )


E = 2
Ω k,k0 ,q k + k0 − k+q − k0 −q

The second order term E (2) describes virtual processes corresponding to a pair of particle-
hole excitations. The numerator of this term can be split into the corresponding four different
contributions. We first consider the term quadratic in nk and combine it with the first order
term E (1) , which has the same structure,

U2 X n↑,k n↓,k Ũ X
Ẽ (1) = E (1) + ≈ n↑,k n↓,k0 (308)
Ω2 k,k0 ,q k + k0 − k+q − k0 −q Ω k,k0

In the last step, we defined the renormalized interaction Ũ through


U2 X 1
Ũ = U + (309)
Ω q k + k0 − k+q − k0 −q

In principle, Ũ depends on the wave vectors k and k0 . However, when the wave vectors
are restricted to the Fermi surface (|k| = |k0 | = kF ), and if the range of the interaction

84
` is small compared to the mean electron spacing, i.e., kF `  1, this dependency may be
neglected. (We should be careful with our choice of a contact interaction, since it would
lead to a divergence in the large-q range. A cutoff for q of order Qc ∼ `−1 would regularize
the integral which is dominated by the large-q part.)
Since the term quartic in nk vanishes due to symmetry, the remaining contribution to
E (2) is cubic in nk and reads
Ũ 2 X n↑,k n↓,k (n↑,k+q + n↓,k0 −q )
Ẽ (2) = − (310)
Ω2 k,k0 ,q k + k0 − k+q − k0 −q

We replaced U 2 by Ũ 2 , which is admissible at this order of the perturbative expansion. The


variation of the energy E in Eq. (306) with respect to δn↑,k can easily be calculated

Ũ X Ũ 2 X n↓,k0 (n↑,k+q + n↓,k0 −q ) − n↑,k+q n↓,k0 −q


˜k,↑ = k + n↓,k0 − 2 , (311)
Ω k0 Ω k0 ,q k + k0 − k+q − k0 −q

and an analogous expression is found for k,↓ . The coupling parameters fk,k0 ;σσ0 may be
determined using the definition (278). Starting with fkF ,kF0 ;σσ0 with wave-vectors on the
Fermi surface (|kF | = |kF0 | = kF ), the terms contributing to the coupling can be written as
(0) (0)

Ũ 2 X n↓,k0 −q − n↓,k0 0
k+q→kF 1X Ũ 2 X n↓,k0 −q − n↓,k0
n↑,k+q → n↑,kF0
Ω2 0 k + k0 − k+q − k0 −q Ω 0 Ω 0 k0 − k0 −q 0

k ,q kF k q=kF −kF

1X Ũ 2
=− n↑,kF0 χ0 (kF0 − kF )
Ω 0 2
kF

(312)
(0)
where we consider n↑,kF0 = n↑,k0 + δn↑,kF0 . Note that the part in this term which depends on
F
(0)
n↑,k0 will contribute the ground state energy in Landaus energy functional. Here, χ0 (q) is
F

the static spin susceptibility. With the help of Eq. (278), it follows immediately, that

Ũ 2
fkF ,kF0 ;↑↑ = fkF ,kF0 ;↓↓ = χ0 (kF − kF0 ) (313)
2
The other couplings are obtained in a similar way, resulting in
Ũ 2
fkF ,kF0 ;↑↓ = fkF ,kF0 ;↓↑ = Ũ − [2χ̃0 (kF + kF0 ) − χ̃0 (kF − kF0 )] (314)
2
where the function χ̃0 (q) is defined as
(0) (0)
1 X n↑,k0 +q + n↓,k0
χ̃0 (q) = (315)
Ω k0 2F − k0 +q − k0

85
If the couplings are be parametrized by the angle θ between kF0 and kF , they can be expressed
as
"  
Ũ cos θ 1 + sin(θ/2)
fθ;↑↓ = 1 + ũ 1 + ln δσσ0
2 4 sin(θ/2) 1 − sin(θ/2)
   # (316)
1 sin(θ/2) 1 + sin(θ/2)
− 1 + ũ − ln σσ 0 ,
2 4 1 − sin(θ/2)

where ũ = Ũ ν(F )/2. With this, we are in the position to determine the most important Lan-
dau parameters by matching the expressions (314) and (314) to the parametrization (316),
  
s 1
F0 = ũ 1 + ũ 1 + (2 + ln 2) = ũ + 1.449ũ2 ,
6
  
a 2 (317)
F0 = − ũ 1 + ũ 1 − (1 − ln 2) = −ũ − 0.895ũ2 ,
3
2
F1s = ũ2 (7 ln 2 − 1) = 0.514ũ2 ,
15
We see that all Landau parameters increase in magnitude with increasing strength of a
repulsive interaction. Since the Landau parameter F1s is responsible for the modification
of the effective mass m? compared to the bare mass m, m? is enhanced compared to m
for both attractive (U < 0) and repulsive (U > 0) interactions. Obviously, the sign of the
interaction U does not affect the renormalization of the effective mass m? . This is so, because
the existence of an interaction (whatever sign it has) between the particles enforces the
motion of many particles whenever one is moved. The behavior of the susceptibility and the
compressibility depends on the sign of the interaction. If the interaction is repulsive (ũ > 0),
the compressibility decreases (F0s > 0), implying that it is harder to compress the Fermi
liquid. The susceptibility is enhanced (F0s < 0) in this case, so that it is easier to polarize
the spins of the electrons. Conversely, for attractive interactions (ũ < 0), the compressibility
is enhanced due to a negative Landau parameter F0s , whereas the susceptibility is suppressed
with a factor 1/(1 + F0a ), with F0a > 0. The attractive case is more subtle because the Fermi
liquid becomes unstable at low temperatures, turning into a superfluid or superconductor, by
forming so-called Cooper pairs. This represents another non-trivial Fermi surface instability.

8. Distribution function

Finally, we examine the effect of interactions on the ground state properties, using again
Rayleigh-Schr?odinger perturbation theory. The calculation of the corrections to the ground

86
state |Ψ0 i, the filled Fermi sea can be expressed as

|Ψi = |Ψ(0) i + |Ψ(1) i + · · · , (318)

where
|Ψ(0) i = |Ψ0 i

(1) U XX b c †k0 −q,σ0 b


c †k+q,σ b c k0 ,σ0 b
c k,σ (319)
|Ψ i = |Ψ0 i
Ω k,k0 ,q σ,σ0 k + k0 − k+q − k0 −q

The state |Ψ0 i represents the ground state of non-interacting fermions. The lowest order
correction involves particle-hole excitations, depleting the Fermi sea by lifting particles vir-
tually above the Fermi energy. How the correction (319) affects the distribution function,
c †k,σ b
will be discussed next. The momentum distribution nσ,k = h b c k,σ i is obtained from the
expectation value
c †k,σ b
hΨ| b c k,σ |Ψi (0) (2)
nσ,k = = nσ,k + δnσ,k + · · · , (320)
hΨ|Ψi
(0)
where nσ,k is the unperturbed distribution function Θ(kF − |k|) and

− U 22 P (1−nk1 )(1−nk2 )nk3
k1 ,k2 ,k3 (k +k3 −k1 −k2 )2 δk+k3 ,k1 +k2 , |k| < kF

(2) Ω
δnσ,k = (321)
 U 22 P nk1 nk2 (1−nk3 )
k1 ,k2 ,k3 (k +k −k −k )2 δk+k3 ,k1 +k2 , |k| > kF

Ω 3 1 2

(0)
where we used the notation nk ≡ nσ,k for brevity. This yields a modification to the distri-
bution functions. It allows also to determine the size of the discontinuity of the distribution
function at the Fermi surface,

nσ,k+ − nσ,k− = 1 − ũ2 ln 2, (322)


F F

where
 
(0) (2)
nσ,k± = lim nσ,k + δnσ,k . (323)
F |k|−kF →0±

The jump of nσ,k at the Fermi surface is reduced independently of the sign of the interaction.
The reduction is quadratic in the perturbation parameter ũ. This jump is also a measure
for the weight of the quasiparticle state at the Fermi surface.

9. Fermi liquid in one dimension?

Within a perturbative approach the Fermi liquid theory can be justified for a three-
dimensional system and we recognize the one-to-one correspondence between bare electrons

87
and quasiparticles renormalized by (short-ranged) interactions. Now we would like to show
that within the same approach problems appear in one-dimensional systems, which are
conceptional nature and hint that interaction Fermions in one dimension would not form
a Fermi liquid, but a Luttinger liquid, as we will motivate briefly below. The Landau
parameters have been expressed above in terms of the response functions χ0 (q = kF − kF0 )
and χ̃0 (q = kF − kF0 ). For the one-dimensional system, the relevant contributions come
from two configurations, since there are two Fermi points only (instead of a two-dimensional
Fermi surface),
(kF , kF0 ) ⇒ q = kF − kF0 = 0, 2kF . (324)

We find that the response functions (such as the charge susceptibility) show singularities for
some of these momenta and we obtain

f↑,↑ (±kF , ∓kF ) = f↓,↓ (±kF , ∓kF ) → ∞, (325)

as well as
f↑,↓ (±kF , ∓kF ) = f↓,↑ (±kF , ∓kF ) → ∞, (326)

giving rise to the divergence of all Landau parameters. Therefore the perturbative approach
to a Landau Ferm liquid is not allowed for the one-dimensional Fermi system.
The same message is obtained when looking at the momentum distribution form which
had in three dimensions a step giving a measure for the (reduced but finite) quasiparticle
weight. The analogous calculation as in Sec. IV B 8 leads here to

1 U 2 log k+

k > kF
(2) k−kF
nk,σ ≈ 2 2 × , (327)
8π vF −log k− k < kF

k−kF

where k± are cutoff parameters of the order of kF . Apparently the quality of the perturbative
calculation deteriorates as k → kF± , since we encounter a logarithmic divergence from both
sides.
Indeed, a more elaborated approach shows that the distribution function is continuous
at k = kF in one dimension, without any jump. Correspondingly, the quasiparticle weight
vanishes and the elementary excitations cannot be described by Fermionic quasiparticles
but rather by collective modes. Landaus Theory of Fermi liquids is inappropriate for such
systems. This kind of behavior, where the quasiparticle weight vanishes, can be described by

88
q= e S=0
charge
excitation

q=0 S = 1/2
spin
excitation

Figure 8. Spin-charge separation in a one-dimensional antiferromagnetic chain

the so-called bosonization of fermions in one dimension, a topic that is beyond the scope of
these lectures. However, a result worth mentioning, shows that the fermionic excitations in
one dimensions decay into independent charge and spin excitations, the so-called spin-charge
separation. This behavior can be understood with the naive picture of a half-filled lattice
with predominantly antiferromagnetic spin correlations. In this case both charge excitations
(empty or doubly occupied lattice site) and spin excitations (two parallel neighboring spins)
represent different kinds of domain walls, and are free to move at different velocities (see
Fig. 8).

C. Transport in metals

In principle, transport is not a concept that requires electron-electron interactions. We


still place this section in the chapter on interacting electron systems, as it naturally builds up
on the principles of Fermi liquid theory (which is intrinsically interacting), and the results
on particle-hole excitations in metals. What follows is often applicable to an interacting
Fermi liquid as much as to a system of noninteracting electrons.

1. Linear response and complex analysis

When studying the charge transport properties of a material, the natural object of interest
is the conductivity which in linear response is defined by the equation

X
jα (q, ω) = σα,β (q, ω)Eβ (q, ω), α, β ∈ {x, y, z} (328)
β

89
Here, we introduced σα,β (q, ω) in all generality as a tensorial quantity. However, in the
following discussion we specialize for simplicity on isotropic systems where σα,β is diagonal
with equal diagonal elements σ(q, ω).
We can relate σ(q, ω) to the susceptibility in the following. For this, we note that current
j(q, ω) and charge density ρ(q, ω) satisfy the continuity equation

ρ̇(r, t) + ∇ · j(r, t) = 0, (329)

which in reciprocal space reads

ωρ(q, ω) − q · j(q, ω) = 0. (330)

Inserting this into Eq. (240) (note that we divide ρ by (−e) to relate it to the number density
that was used as δρ in Eq. (240))
ρ(q, ω)
χ0 (q, ω) = −
eV (q, ω)
q · j(q, ω)
=−
eωV (q, ω)
(331)
σ(q, ω) q · E(q, ω)
=−
eωV (q, ω)
σ(q, ω) iq 2 V (q, ω)
=− ,
ω eV (q, ω)
where we used the first Maxwell equation eE(q, ω) = −iqV (q, ω). In summary
q2 4π
χ0 (q, ω) = −i σ(q, ω), (q, ω) = 1 + i σ(q, ω). (332)
e2 ω ω
From our discussion of the plasma resonance, we had deduced that for small q  kF the
dielectric function takes the form (0, ω) = 1 − ωp2 /ω 2 , which implies for the conductivity in
this limit
ωp2
σ(ω) = i . (333)
4πω
Is the conductivity purely imaginary in the limit of small frequency and momenta? This is
not the case, as we can directly see from computing the Kramers-Kronig relation between
imaginary and real part
Z ∞ 0

1 0 Im σ(ω )
Re σ(ω) = − P dω
π −∞ ω − ω0
Z ∞ 0
 (334)
1 0 Re σ(ω )
Im σ(ω) = + P dω .
π −∞ ω − ω0

90
Using the from of Eq. (333), we obtain for any finite ω that Re σ(ω) = 0 simply by shifting
the integration variable ω 0 by ω/2. For ω = 0 we obtain a divergence which using the Cauchy
principal value yields

ωp2 ωp2
Re σ(ω) = δ(ω), Im σ(ω) = . (335)
4 4πω

This tells us that the metal is perfectly conducting (finite conductivity comes results from
disorder scattering, which will be considered below). Furthermore, the conductivity obeys
the so-called f sum rule following from complex analysis
∞ ∞ ωp2 πe2 n
Z Z
0 0 1
dω Re σ(ω ) = dω 0 Re σ(ω 0 ) = = , (336)
0 2 −∞ 8 2m

where n is the carrier density.

2. Boltzmann transport theory

To include the effect of impurity scattering, we utilize Boltzmann’s transport theory,


which is an efficient semiclassical tool to compute transport responses. The central object
of interest is the distribution function

f (k, r, t) (337)

in phase space which gives the number of (quasi-)particles residing in a phase space volume
d3 rd3 k/(2π)3 at a point (k, r) and at time t. For this semiclassical picture of phase space to
be applicable, we restrict ourselves to phenomena at small wave numbers and small energies

q  kF , ω  F . (338)

From this definition there follows the total number of particles as

d3 k 3
Z
N =2 d r f (k, r, t) (339)
(2π)3

and the equilibrium distribution function (i.e., in absence of perturbations to the system)
as the Fermi-Dirac distribution

1
f0 (k, r, t) = . (340)
eβ k +1

91
The key equation obeyed by f (k, r, t) is the Boltzmann equation which says that the total
time derivative of f (k, r, t) equals a functional called the collision integral which incorporates
the scattering processes
d  
f (k, r, t) = ∂t + ṙ · ∇r + k̇ · ∇k f (k, r, t) = (∂t f )coll (k, r, t). (341)
dt
If scattering was absent (i.e., the right-hand side zero), the equation would simply be the
continuity equation for f (k, r, t) required by particle number conservation. Hence, all the
magic is in the collision integral. For scattering off static potentials, it can be written as
Z 3 0
dk
(∂t f )coll (k, r, t) = − {W (k, k0 )f (k, r, t)[1 − f (k0 , r, t)] − (k ↔ k0 )} . (342)
(2π)3
Here, W (k, k0 ) is the probability of a scattering process of a quasiparticle from momentum
k to k0 . The factors f (k, r, t) and [1 − f (k0 , r, t)] are the probability of there being a
quasiparticle at k that can be scattered in the first place and the probability of the state at
k0 being empty, respectively. The second term k ↔ k0 describes scattering into (as opposed
to out of) the phase space volume near k from a state at k0 and therefore comes with a
negative sign. If a system has time-reversal symmetry, we have W (k, k0 ) = W (k0 , k) and
thus
d3 k0
Z
(∂t f )coll (k, r, t) = − W (k, k0 ) [f (k, r, t) − f (k0 , r, t)] . (343)
(2π)3
An important approximation to make progress is the so-called relaxation-time approxi-
mation. We assume that the system is close to an equilibrium distribution f0 (k, r, t), which
is equal to the Fermi-Dirac distribution, but with a varying chemical potential µ(r, t) and
temperature β(r, t), which are the source of a position and time dependence in f0 (k, r, t).
The collision integral is then written using the ansatz
f (k, r, t) − f0 (k, r, t)
(∂t f )coll = − , (344)
τ (k )
i.e., we replace the integral with a simple linear function of f (k, r, t) in the limit of small
deviations from equilibrium. The quantity τ (k ) is the relaxation time, quantifying the time
scale of scattering events.

3. Conductivity

As a simple application of the Boltzmann theory using the relaxation time approximation,
we compute the conductivity as a response to applying a small uniform electric field E(t).

92
Then f (k, r, t) = f (k, t) is independent of r. In absence of a magnetic field and for vanishing
Berry curvature, the semiclassical equations of motion

k
ṙ = , k̇ = −eE, (345)
m

allow to introduce the electric field in the Boltzmann equation (341). The applied electric
field leads to a change δf (k, t) in the distribution f (k, t) = f0 (k) + δf (k, t). We can thus
write the Boltzmann equation (341) to linear order in δf (k, t) and E(t) using their Fourier
transforms
Z ∞ Z ∞
dω dω
f (k, t) = f (k, ω)e−iωt , E(t) = E(ω)e−iωt , (346)
−∞ 2π −∞ 2π

as
δf (k, ω)
− iω δf (k, ω) − eE(ω) · ∇k f0 (k) = − . (347)
τ (k )
This is an algebraic equation for δf (k, ω) solved as

eτ (k )E(ω) ∂f0 ()


δf (k, ω) = · ∇k k . (348)
1 − iωτ (k ) ∂

Once we have obtained the distribution function, we can use it to calculate observables. For
example, the quasiparticle current is given by

d3 k
Z
j(ω) = −2e vk f (k, ω). (349)
(2π)3

As an aside, were we interested in the thermal response of a material, we could instead


compute the heat current

d3 k
Z
jh (ω) = −2e k vk f (k, ω). (350)
(2π)3

For the case at hand, using vk = ∇k k ,

e2 τ (k )E(ω) · vk ∂f0 (k )


Z
j(ω) = − 3 d3 k vk . (351)
4π 1 − iωτ (k ) ∂k

Comparing to Eq. (328), we find the conductivity

e2
Z Z
∂f0 () τ () vk;α vk;β
σα,β (ω) = − 3 d dΩk k 2 . (352)
4π ∂ 1 − iωτ () |vk |

We now discuss the result in different frequency regimes.

93
a. Limit of large frequencies If ωτ  1 and for temperatures small compared to the
Fermi energy, the energy integrand becomes a delta function multiplying i/ω. In the case of
an isotropic system of mass m, we have σα,β = δα,β σ with
e2
Z
kF 2
σ(ω) =i 3 dΩk k
4π ω m F,x
e2 kF3 4π
=i 3
4π ω m 3 (353)
e2 n
=i

ωp2
=i ,
4πω
where n = kF3 /(3π 2 ) (spin degeneracy!). This agrees with our previous result Eq. (333).
b. Limit of small frequencies (dc limit) Before, our perfect conductor had a real delta
function conductivity at small frequencies. Using the Boltzmann theory, we can obtain a
more realistic result in this limit, namely
e2 n
Z
∂f0 ()
σ(ω) = d τ ()
m ∂
e2 nτ̄ (354)
=
m
ωp2 τ̄
= ,

where we simply defined the averaged relaxation time τ̄ as the integral with the derivative
of the Fermi Dirac distribution, which is strongly peaked at the Fermi surface. This is
the Drude from of the conductivity, which can also be derived with much more elementary
means. Its important feature is that the conductivity is proportional to the relaxation time.
c. Limit of nearly constant relaxation times If the relaxation time depends only
weakly on energy, we can use the approximation used to derive the Drude from also at finite
frequencies to obtain
ωp2 τ̄
σ(ω) =
4π 1 − iωτ̄ (355)
ωp2 τ̄ ωp2 τ̄ 2 ω
= + i .
4π 1 + ω 2 τ̄ 2 4π 1 + ω 2 τ̄ 2

4. Optical properties

With the approximations above, there follows the dielectric function


ωp2 τ̄
(ω) = 1 − (356)
ω i + ωτ̄

94
directly from the conductivity. It is related to the complex index of refraction n + iκ via
 = (n + iκ)2 , which determines how light propagates inside the metal. Light with vacuum
wave number k = ω/c has wave number k(n + iκ) in the material. Hence, the penetration
depth
δ = c/(ωκ) (357)

determines how far light can penetrate the metal.


Straight-forward evaluation yields in the regime of small frequencies ωτ̄  1  ωp τ̄ that
r
c 2
δ(ω) ≈ , (358)
ωp ωτ̄
yielding the dependence of the so-called skin depth of a metal δ ∝ ω −1/2 .
For intermediate frequencies 1  ωτ̄  ωp τ̄ the penetration depth is only weakly de-
pendent on frequency as δ(ω) ≈ c/ωp , which is in the range of hundreds Angstrom. (This
regime does not include visible light.)
At ultraviolet frequencies ω ∼ ωp the imaginary part of  is very small yielding a strong
increase in δ. The metal becomes transparent in the ultraviolet.
Note that we have only considered a single band model for a metal here. More dis-
criminating optical properties, like the shiny colors of some metals like gold in the visible
frequency range stem from interband transitions.

5. Computing the relaxation time

So far, we have introduced the relaxation time as a phenomenological function. We want


to improve on that by considering actual scattering mechanisms now. We consider time-
reversal invariant, isotropic, time-independent, and homogeneous systems, such that the
relevant equation from the collision integral is
Z 3 0
f (k) − f0 (k) dk
= 3
W (k, k0 ) [f (k) − f (k0 )] . (359)
τ (k ) (2π)
Let the only source of perturbation be an external electric field E, such that

f (k) = f0 (k) + A(k) k · E (360)

to lowest order by isotropy. Then

f (k) − f (k0 ) = A(k) (k − k0 ) · E (361)

95
because elastic scattering implies that |k| = |k0 |. Let θ and θ0 be the angles between E and k
as well as between k0 and k, respectively. Then k0 · E = k 0 E(cos θ cos θ0 + sin θ sin θ0 cos φ0 ),
where φ0 is the polar angle between k and k0 . The latter term drops out of the integral by
isotropy and
f (k) − f0 (k)
Z
1
= A(k) k E cos θ dΩk0 (1 − cos θ0 )W (k, k0 )
τ (k ) (2π)3
Z (362)
1
= 3
[f (k) − f0 (k)] dΩk0 (1 − cos θ0 )W (k, k0 ).
(2π)
The relaxation time reduces to
d3 k0
Z
1
= (1 − cos θ0 )W (k, k0 ). (363)
τ (k ) (2π)3
The factor (1−cos θ0 ) gives more weight to the backscattering (θ0 = π) processes as compared
to forward scattering (θ0 = 0).
a. Impurity scattering If we get microscopic information about an impurity potential
Vb , we can compute W (k, k0 ) by applying Fermi’s golden rule
D E 2
W (k, k ) = 2πnimp k Vb k0 δ(k − k0 ),
0
(364)

where nimp is the density of impurities, which is assumed to be small enough from them
to be considered as independent. Screened point charge scatterers of charge Ze have the
potential
D E 4πZe2
k Vb k0 = . (365)

2
|k − k0 |2 + kTF
In the limit of strong screening, kTF  kF , we can drop the (k − k0 ) in the matrix element
and obtain
2 Z 3 0
4πZe2

1 dk
= 2πnimp 2
(1 − cos θ0 )δ(k − k0 )
τ (k ) kTF (2π)3
2 (366)
4πZe2

= π ν(F ) nimp 2
.
kTF
This yields the temperature-independent residual resistivity of a metal
m
ρ= , (367)
e2 n τ (F )
the ratio of which with the value of the resistivity at 300 K is the residual resistance ratio
RRR, which is a measure for the amount of impurities in the system and hence a measure
for sample quality. Typical values are in the tenths, clean materials reach RRR values in
the thousands.

96
b. Electron-electron scattering The scattering rate between electrons is suppressed for
energies close to the Fermi energy, because of the reduced phase space available for elastic
scattering there. The life-time (here interpreted as the relaxation time) takes the form
 2
1 1 
= , (368)
τ () τe F
where τe is a constant and  is measured with respect to the chemical potential, i.e.,  = 0
at the Fermi level. Inserting this in the first line of Eq. (354) to compute τ̄ , we find
Z
∂f0
τ̄ = d τ ()
∂
Z (369)
β τ ()
= d .
4 cosh2 (β/2)
The latter integral is divergent for small .
In order to regulate it, we also consider coexisting impurity scattering. To show how one
properly accounts for different scattering mechanisms we use Matthiessen’s rule. It states
that different independent scattering processes can be added at the level of the scattering
rates, i.e.,
W (k, k0 ) = W1 (k, k0 ) + W2 (k, k0 ). (370)

For the relaxation times and the resulting resistivity contributions this translates into
1 1 1
= + , ρ = ρ1 + ρ2 . (371)
τ τ1 τ2
It thus looks like we are coupling resistors in serial circuit. Matthiessen’s rule can be inap-
plicable for 1D systems and if the τ ’s are strongly k-dependent.
Returning to our integral (369), we replace 1/τ () = 2 /(2F τe ) + 1/τ0 , where τ0 comes
from impurity scattering
τ02 2
Z  
β 1
τ̄ ≈ d 2 τ0 −
4 cosh (β/2) τe 2F
(372)
π 2 τ02
= τ0 − .
3(βF )2 τe
The resistivity then becomes temperature dependent due to the electron-electron scattering
as 2
π2 m

kB T
ρ(T ) = ρ0 + . (373)
3 ne2 τE F
The T 2 behavior is the main feature here which is often seen as an identification criterion
for a Fermi liquid.

97
c. Sources of momentum loss Devil’s advocate could argue at this point that if all
scattering processes conserve momentum, no resistance should occur. This line of argu-
mentation ignores the important role played by the lattice. In the case of electron-electron
scattering, it also allows for Umklapp processes in the scattering for which

k = k0 + G, (374)

where G is a reciprocal lattice vector. Such a process leads to a static deformation of the
lattice, which absorbs the momentum mismatch.
A second source of resistivity due to the lattice, which we have not touched upon is the
scattering between electrons and phonons, i.e., elementary lattice vibrations. The resulting
relaxation time has the characteristic temperature dependence
 5
−1 T
τ ∝ , T  ΘD ,
ΘD (375)
T
τ −1 ∝ , T  ΘD ,
ΘD
where the Debye temperature kB ΘD ≈ cs kF with cs the speed of sound.

6. Kondo effect

There are impurity atoms inducing so-called resonant scattering. If the resonance occurs
close to the Fermi energy, the scattering rate is strongly energy dependent, inducing a
more pronounced temperature dependence of the resistivity. An important example is the
scattering off magnetic impurities with a spin degree of freedom, yielding a dramatic energy
dependence of the scattering rate. This problem was first studied by Kondo in 1964 in order
to explain the peculiar minima in resistivity in some materials. The coupling between the
local spin impurities Si at Ri and the quasiparticle spin s(r) has the exchange form
X 
VK=
b VK
b
i
i
X
=J b i · sb (r) δ(r − Ri )
S
i
X 1 b+ − 1 b− +
 (376)
z z
=J S i sb (r) + S i sb (r) + S i sb (r) δ(r − Ri )
b
i
2 2
J X h bz  †  i
−i(k−k0 )·Ri
= Si b c k↑ b c †k↓ b
c k0 ↑ − b c k0 ↓ + Sb +
i c
b †
k↓ c
b k
b− b
0↑ + S
i c †
k↑ c
b k 0↓ e
2Ω k,k0 ,i

98
Here, it becomes important that spin flip processes, which change the spin state of the
impurity and that of the scattered electron, are enabled. The results for the scattering rate
are presented here without derivation,
 
0 2 D
W (k, k ) ≈ J S(S + 1) 1 + 2Jν(F ) log , (377)
|k − F |
where D is the bandwidth and we have assumed that Jν(F )  1. The relaxation time is
found to be  
1 2 D
≈ J S(S + 1)ν(F ) 1 + 2Jν(F )log (378)
τ (k ) |k − F |
Note that W (k, k0 ) does not depend on angle, meaning that the process is described by
s-wave scattering. The energy dependence is singular at the Fermi energy, indicating that
we are not dealing with simple resonant potential scattering, but with a much more subtle
many-particle effect involving the electrons very near the Fermi surface. The fact that
the local spins Si can flip, makes the scattering center dynamical, because the scatterer
is constantly changing. The scattering process of an electron is influenced by previous
scattering events, leading to the singularity at F . This cannot be described within the first
Born approximation, but requires at least the second approximation or the full solution. As
mentioned before, the resonant behavior induces a strong temperature dependence of the
conductivity. Indeed,
e2 kF3
Z
1
σ(T ) = 2 d 2 τ ()
6π m 4kB T cosh ( − F )/(2kB T )
(379)
e2 n J 2 S(S + 1)[1 − 2Jν(F ) log(D/˜)]
Z
≈ d˜ .
8mkB T cosh2 (˜/2kB T )
A simple substitution in the integral leads to
e2 n
 
D
σ(T ) ≈ JS(S + 1) 1 − 2Jν(F ) log (380)
2m kB T
Usual contributions to the resistance, like electron-phonon scattering, typically decrease
with temperature. The contribution (380) to the conductivity is strongly increasing, induc-
ing a minimum in the resistance, when we crossover from the decreasing behavior at high
temperatures to the low-temperature increase of ρ(T ). At even lower temperatures, the
conductivity would decrease and eventually even turn negative which is an artifact of our
approximation. In reality, the conductivity saturates at a finite value when the temperature
is lowered below a characteristic Kondo temperature TK ,

kB TK = D e−1/[Jν(F )] , (381)

99
a characteristic energy scale of this system. The real behavior of the conductivity at tem-
peratures T  TK is not accessible by simple perturbation theory. This regime, known
as the Kondo problem, represents one of the most interesting correlation phenomena of
many-particle physics.

7. Anderson localization in one dimension

Transport in one spatial dimension is very special, since there are only two different
directions to go: forward and backward. We introduce the transfer matrix formalism and
use it to express the conductivity through the Landauer formula. We will then investigate
the effects of multiple scattering at different obstacles, leading to the so-called Anderson
localization, which turns a metal into an insulator.
a. Landauer Formula for a single impurity The transmission and reflection at an ar-
bitrary potential with finite support in one dimension can be described by a transfer matrix
T [see Fig. 9 a) and b)]. A suitable choice for a basis of the electron states is the set of plane
waves {e±ikx } moving in the positive (negative) x-direction with wave vector +k (−k). Only
plane waves with the same |k| on the left (part I1 ) and right (part I2 ) side of the scatterer
are interconnected. Therefore, we write

ψ1 (x) = a1+ eikx + a1− e−ikx ,


(382)
ikx −ikx
ψ2 (x) = a2+ e + a2− e ,

where ψ1 (ψ2 ) is defined in the area I1 (I2 ). The vectors ai = (ai+ , ai− ), i ∈ {1, 2} are
connected via the linear relation,
 
T11 T12
a2 = Tb a1 =   a1 (383)
T21 T22

with the 2 × 2 transfer matrix Tb . The conservation of current (J1 = J2 ) requires that Tb is
unimodular, i.e., detT = 1. Here,
dψ ∗ (x)
 
i ∗ dψ(x)
J= ψ(x) − ψ (x) (384)
2m dx dx

such that, for a plane wave ψ(x) = (1/ L)eikx in a system of length L, the current results
in
J = v/L (385)

100
with the velocity v defined as v = k/m. Time reversal symmetry implies that, simultaneously
with ψ(x), the complex conjugate ψ ∗ (x) is a solution of the stationary Schroedinger equation.
∗ ∗
From this, we find T11 = T22 and T12 = T21 ,such that
 
T11 T12
Tb =  . (386)
∗ ∗
T12 T11

It is easily shown that a shift of the scattering potential by a distance x0 changes the coeffi-
cient T12 of Tb by a phase factor ei2kx0 . Meanwhile, the coefficient T11 remains unchanged.
With the ansatz for a right moving incoming wave (∝ eikx ), producing a reflected (∝
re−ikx ) and transmitted part (∝ teikx ) part, the wave functions on both sides of the scatterer
read
1 1
ψ1 (x) = √ eikx + re−ikx , ψ2 (x) = √ teikx .

(387)
L L
The coefficients of Tb can be determined explicitly in this situation, resulting in
 
1/t∗ −r∗ /t∗
Tb =  . (388)
−r/t 1/t

Here, the conservation of currents imposes the condition 1 = |r|2 + |t|2 . Furthermore, we
can find a relation between the parameters (r, t) of the potential barrier and the electric
resistivity. For this, we notice that the incoming current density J0 is split into a reflected
Jr and transmitted Jt part, each given by
1
J0 = − ve = −n0 ve,
L
|r|2
Jr = − ve = −nr ve, (389)
L
|t|2
Jt = − ve = −nt ve,
L
with the electron charge −e, and the particle densities n0 , nr , and nt corresponding to the
incoming, reflected and transmitted particles respectively. The electron density on the two
sides of the barrier is given by
1 + |r|2
n1 = n0 + nr = ,
L (390)
|t|2
n2 =nt = .
L
From this consideration, a density difference δn = n1 − n2 = (1 + |r|2 − |t|2 )/L = 2|r|2 /L
results between the left and the right side of the scatterer. The resistance R of the barrier

101
a) V (x) b) c)

I1 I2 a1+ a2+
T T1 T2
a1 a2
x

Figure 9. a) and b) Transfer matrices are sufficient to describe potential scattering in one dimen-
sion. c) Two spatially separated scattering potentials with transmission matrices Tb 1 and Tb 2 ,
respectively.

is defined by the ratio between the voltage drop over the resistor δV and the transmitted
current Jt , i.e.,
δV
R= (391)
Jt
Consequently, a relation between δV and the electron density δn remains to be established to
determine R. The connection is easily found via the existing energy difference δE = −eδV
between the two sides of the resistor, such that the expression

dn
δn = δE
dE (392)
dn
= (−eδV )
dE

produces the wished relation. Here, (dn/dE)dE is the number of states per unit length in
the energy interval [E, E + dE] and we find

k2 k2
  Z  
dn 1X dk 1
= δ E− =2 δ E− = (393)
dE L k,σ 2m 2π 2m π v(E)

The resistance R is finally obtained from the Eqs. (391), (392), and (393), leading to

h |r|2
R= , (394)
e2 |t|2

where we have reinstated ~ to obtain the usual form of the von Klitzing constant RK =
h/e2 ≈ 25.8 kΩ, the resistance quantum named after the discoverer of the Quantum Hall Ef-
fect. The result (394) is the famous Landauer formula, which is valid for all one-dimensional
systems and whose application often extends to the description of mesoscopic systems and
quantum wires.

102
b. Scattering at two impurities We consider now two spatially separated scattering
potentials, represented by Tb 1 and Tb 2 each characterized by r1 , t1 and r2 , t2 , respectively
[see Fig. 9 c)]. The particles are multiply scattered at these potentials in a unknown manner,
but the global result can again be expressed via a simple transfer matrix Tb = Tb 1 Tb 2 , given
by the matrix multiplication of each transfer matrix. All previously found properties remain
valid for the new matrix Tb 12 , given by
   
1 r1∗ r2 r∗ r∗ 1 ∗
t∗1 t∗2
+ t∗1 t2
− t∗2t∗ − t∗1t2 t∗
− rt∗
Tb =  1 2 1
r1 r2∗
 =  (395)
− t1r2t2 − r1
t1 t∗2
1
t1 t2
+ t1 t∗ − rt 1
t
.
2

For the ratio between reflection and transmission probability we find

|r|2 1
= 2 −1
|t|2 |t|
∗ 2

1 r 1 r t
2 2
= 1 + −1 (396)
|t1 | |t2 |2
2 t∗2
2
r1 r2∗ t2 r1∗ r2 t∗2

1 2 2
= 1 + |r1 | |r2 | + + −1
|t1 |2 |t2 |2 t∗2 t2

Assuming a (random) distance d = x2 − x1 between the two potential barriers, we may


average over this distance. Note, that for x1 = 0, we find r2 ∝ e−2ikd , while r1 , t1 , and t2 are
independent of d. Consequently, all terms containing an odd power of r2 or r2∗ vanish after
averaging over d. The remainders of Eq. (396) can be collected to

|r|2

1 2 2

= 1 + |r 1 | |r2 | −1
|t|2 avg |t1 |2 |t2 |2
(397)
|r1 |2 |r2 |2 |r1 |2 |r2 |2
= 2 + +2 2
|t1 | |t2 |2 |t1 | |t2 |2

Even though two scattering potentials are added in series, an additional non-linear com-
bination emerges beside the sum of the two ratios |ri |2 /|ti |2 . It results from the Landauer
formula applied to two scatterers, that resistances do not add linearly to the total resistance.
Adding R1 and R2 serially, the total resistance is not given by R = R1 + R2 , but by

R1 R2
R = R1 + R2 + 2 > R1 + R2 . (398)
RK

This result is a consequence of the unavoidable multiple scattering in one dimensions. This
effect is particularly prominent if Ri  h/e2 for i ∈ {1, 2}, where resistances are then
multiplied instead of summed.

103
c. Anderson localization Let us consider a system with many arbitrarily distributed
scatterers, and let ρ be a mean resistance per unit length. R(`) shall be the resistance
between points 0 and `. The change in resistance by advancing an infinitesimal δ` is found
from Eq. (398), resulting in
R(`)
dR = ρd` + 2 ρd`, (399)
RK
which yields
Z ` Z R(`)
dR
ρd` = , (400)
0 0 1 + 2R/RK
and thus,
h
ρ` = log (1 + 2R(`)/RK ) . (401)
2e2
Finally, solving this equation for R(`), we find

RK 2ρ`/RK 
R(`) = e −1 . (402)
2

Obviously, R grows almost exponentially fast for increasing `. This means, that for large
`, the system is an insulator for arbitrarily small but finite ρ > 0. The reason for this
is that, in one dimension, all states are bound states in the presence of disorder. This
phenomenon is called Anderson localization. Even though all states are localized, the energy
spectrum is continuous, as infinitely many bound states with different energy exist. The
mean localization length ξ of individual states, related to the mean extension of a wave
function, is found from Eq. (402) to be ξ = ρ/RK . The transmission amplitude is reduced
on this length scale, since |t| ≈ 2e−`/ξ for `  ξ. In one dimension, there is no linearly
increasing electric resistance, R(`) ≈ ρ`. For non-interacting particles, only two extreme
situations are possible. Either, the potential is perfectly periodic and the states correspond
to Bloch waves. Then, coherent constructive interference produces extended states that
propagate freely throughout the system, resulting in a perfect conductor without resistance.
On the other hand, if the scattering potential is disordered, all states are strictly localized.
In this case, there is no propagation and the system is an insulator. In three-dimensional
systems, the effects of multiple scattering are far less drastic and the Ohmic law is applicable.
Localization effects in two dimensions is a very subtle topic and part of todays research in
solid state physics.
Another contemporary aspect is the impact of interactions on Anderson localization.
Naive arguments may suggest that Anderson localization would be destroyed by interactions,

104
which was a long-lasting implicit believe. However, at least in one dimensions localization
seems to be robustly stable even in presence of electron-electron interactions, a phenomenon
now dubbed many-body localization. The phenomenon entails a breakdown of some core
concepts of (quantum) statistical mechanics, such as the eigenstate thermalization hypoth-
esis. When a many-body localized system is initialized in a generic state (not an eigenstate
of the Hamiltonian), it keeps a memory of its initial conditions for arbitrarily long times.

D. Interacting instabilities

1. The fractional quantum Hall effect

We deduced that a 2D electron gas in a (strong) perpendicular magnetic field undergoes a


dramatic spectral reorganization, where density of states is concentrated in highly degenerate
Landau levels. The Hall conductivity σxy = νe2 /h scales with the filling of the Landau levels
ν. However, this result is at odds with the experimental observation of quantized values of
the Hall conductivity σxy = ne2 /h with n ∈ Z. To reconcile theory and experiment, we
had to account for the subtle role disorder plays in these systems: It broadens the Landau
levels and leads to a quantization of the Hall conductivity by forming many localized states
which do not contribute to transport. Only a single percolating state exists per Landau level
which corresponds to the location of the jump in the Hall conductivity as a function of filling.
Clearly, this noninteracting system is far away from the starting point of our discussion of
Fermi liquids, for example. There is no well defined Fermi surface in momentum space.
Moreover, many single-particle states are nearly degenerate in energy. It is thus imperative
to ask how interactions can alter the nature of the ground state of a quantum Hall system.
It is the great achievement of Laughlin to propose a variational wavefunction without
free parameters for the incompressible states that were observed experimentally at filling
ν = 1/n, with n an odd integer. We can construct the Laughlin wave function for Np
electrons in the following way for the disk geometry. The Np -particle wave function for the
p
filled lowest Landau level confined to a droplet of radius 2Np ` is given by the single Slater
determinant  
Np Np
Y X
ΨLLL (z1 , · · · , zNp ) ∝ (zi − zj ) exp − |zl |2 /(4`2 ) . (403)
i<j l

105
The electron density per unit area inside the droplet is constant and given by 1/(2π`2 ). By
raising the holomorphic part of the wave function to the power n
 
Np Np
Y X
Ψn (z1 , · · · , zNp ) ∝ (zi − zj )n exp − |zl |2 /(4`2 ) , (404)
i<j l

one obtains the Laughlin wave function describing a droplet that

• has all electrons in the lowest Landau level due to the holomorphic structure

p
• covers an n-times larger area of radius 2n Np `,

• is a featureless liquid with electron density reduced by a factor 1/n,

• supports excitations of fractional charge e/n,

• includes strong correlations between the electrons.

By inspection of the Laughlin wavefunction, one can gain some intuition as to why it might
be an energetically favorable state for repulsively interacting electrons in the lowest Landau
level. As any two electrons approach each other, the wavefunction has a n-th order zero,
while the configurations with the electron positions well separated has the highest weight.
Further, any two electrons with coordinates z1 and z2 have a relative angular momentum of
mrel = n or higher. Here, the relative angular momentum of a two-particle state (z1 −z2 )m is
the exponent m. Remarkably, the elementary excitations are in some sense already contained
in the analytic form of the Laughlin wavefunction. For example, the wavefunction of a
quasihole with fractional charge −e/n localized at z0 is given by
Np
Y
Ψqh
n (z1 , · · · , zNp |z0 ) ∝ Ψn (z1 , · · · , zNp ) (z0 − zl ). (405)
l

As the Laughlin wave function is variational in nature, two immediate questions are: (i)
How close is it to the actual ground state of the partially filled Landau level, when electrons
are subject to Coulomb interactions? (ii) Does there exist a Hamiltonian for which Ψn is an
incompressible ground state? The answer to (i) is given by numerical exact diagonalization
studies of finite systems, which showed that the Laughlin state is in the same universality
class as the exact ground state, the so-called Coulomb ground state.

106
Trugman and Kievelson showed how to construct a Hamiltonian for which Ψn is an exact
and unique zero-energy ground state. Crucial to their derivation is the fact that any two
particles have relative angular momentum mrel = n or higher in the Laughlin wave function.
As an immediate consequence, Ψn is annihilated by the projector on states with angular
momentum mrel < n. This projector is precisely the so-called pseudopotential Hamiltonian
that they propose. In configuration space, its matrix elements are derivatives of delta-
functions. Physically, it corresponds to a highly local repulsive interaction.

2. Superconductivity

The theory of superconductivity is a very rich field, and as a result we can only give a
very short account of the theoretical description of superconductors. We will focus on the
derivation of the superconducting ground state and its elementary excitations.

Many materials turn superconducting at low temperatures (ironically, many nobel metals
and the excellent conductor copper don’t). The phenomenology of the superconducting state
is two-fold: (1) The resistivity of a superconductor vanishes exactly. (2) The magnetic field is
expelled from the bulk of a superconductor (or channelled in very thin flux tubes in so-called
type-II superconductors). The latter is called the Meissner-effect. Only the combination of
properties (1) and (2) yield a well-defined thermodynamic state. It also implies the existence
of persistent currents in a superconductor, which do not decay.

a. Cooper pairs The elementary building block of superconductors are Cooper pairs,
that is, coherent combinations of two-electron states. The formation of Cooper pairs results
from some effective attractive interaction between electrons, which overcomes their repulsive
Coulomb interaction. In so-called conventional superconductors, this attractive interactions
is mediated by phonons (lattice distortions). An electron passing through the crystal leaves
behind a slowly relaxing lattice distortion (the slowness is determined by the large mass
ration Mion /m of the ionic and electronic mass). This polarized crystal environment is seen
by the other electron of the Cooper pair as an attracting force.

Here, we do not detail the phononic origin of the interaction and simply assume that
there exists some attractive interaction between electrons. To understand the formation of

107
Cooper pairs more quantitatively, we consider the following wave function ansatz
X
|Ψ12 (K)i = ak,σ,σ0 b c †−k+K,σ0 |0i,
c †k,σ b (406)
k,σ,σ 0

which describes two electrons (k, σ) and (K − k, σ 0 ) added to the filled Fermi sea ground
state |0i. Since we expect the system to remain translationally invariant, we have assigned
a well-defined momentum K to this state. Anticipating a minimization of the energy of
this state, we can further refine this ansatz. We assume that attractive interactions exist
between electrons only in a small energy range ω, which for the case of phonons can be
taken as the Debye frequency of the material, i.e., if |k − K−k | ≤ ω. Figure 10 a) suggests
that most electron pairs can take advantage of this interaction if K = 0. We furthermore
assume that the ground state in this two-electrons-above-the-Fermi-sea subspace has even
orbital character (as is usually the case for symmetric attractive potentials). To obtain a
valid fermionic two-body wave function (which is antisymmetric under particle exchange),
we thus have to make the spin part antisymmetric, i.e., the two electrons in our ansatz have
opposite spin, yielding
X
|Ψ12 i = c †k,↑ b
ak b c †−k,↓ |0i. (407)
k
We want to study this family of states subject to the Hamiltonian
X V X0 †
H
b = c †k,σ b
k b c k,σ − c †−k−q,−σ b
c k+q,σ b
b c −k,−σ b
c k,σ (408)
k,σ
2 k,q,σ

where we take V 6= 0 only for |k − K−k | ≤ ω, which is indicated by the primed sum. Notice
also that we have already restricted the interaction to the so-called ‘Cooper channel’, i.e.,
we have dropped all terms of a generic translation symmetry preserving interaction that do
not connect states of the form (407). The energy is given by

E = hΨ12 | Hb |Ψ12 i
X X0 (409)
=2 k |ak |2 − V a∗k+q ak .
k k,q

|ak |2 = 1,
P
We minimize it subject to the constraint k
!
∂ X
E−λ |ak |2 (410)
∂a∗k k

yielding
X0
(2k − λ)ak = V ak 0 . (411)
k0

108
E
a) b)

! K k
2
vF kF

Figure 10. a) Areas in momentum space in which quasiparticles can interact when forming Cooper
pairs with center of mass momentum K are marked in blue. The largest contribution is obtained
for K = 0. b) Excitation spectrum (blue) of the Bogoliubov quasiparticles in a superconductor.

Dividing Eq. (411) by (2k − λ), summing both sides over k (with a primed sum), we can
X0
shorten ak0 and obtain
k0
X0 1
1=V . (412)
0
2k −λ
k

Multiplying the complex conjugate of Eq. (411) by ak and summing over k, we obtain
X X0
2 k |ak |2 − λ = V a∗k0 ak , (413)
k k0 ,k

which, when compared with Eq. (409) gives λ = E. Thus,


Z ω
X0 1 ν()
1=V =V d . (414)
k0
2k − E 0 2 − E

Setting ν() ≈ ν(F ), we can compute the integral and obtain


 
2ωexp − ν(2F )V
E=−  
2
1 − exp − ν(F )V (415)

≈ − 2ω e−2/[ν(F )V ] .

The two main features of this result are (1) the negative sign, indicating that it is indeed
energetically favorable to form Cooper pairs, and (2) the energy gain is exponential in the
product of interaction strength and density of states. One can show that even if the ansatz
is less restrictive than ours, this is the solution minimizing the energy of the model.
b. Superconducting ground state In the next step we want to find the ground state
of the Hamiltonian (408), i.e., minimize it over the full fermionic Fock space instead of

109
the two-particle excitation state considered so far. Implicitly, we will perform a mean-field
approximation for the case of small interaction V . We start out with a completely trivial
rewriting of the fermionic creation and annihilation operators into new linear combinations
as
c †−k,↓ ,
c k,↑ − vk b
αk = uk b

α−k = uk b c †k,↑ ,
c −k,↓ + vk b
(416)
αk† c †k,↑
= uk b − vk b
c −k,↓ ,

α−k c †−k,↓ + vk b
= uk b c k,↑ ,

where we have introduced the real (and k even) functions uk and vk . Their reality is merely
a gauge fixing. We have also explicitly locked the sign of k in α to the spin degree of freedom.
One verifies that the αk obey the usual fermionic anticommutation relations if

u2k + vk2 = 1, (417)

which shall be imposed from here on.


Let us rewrite the Hamiltonian in the new operators. We start with the noninteracting
part
X
H
b0= c †k,σ b
k b c k,σ
k,σ
X
= c †k,↑ b
k ( b c †−k,↓ b
c k,↑ + b c −k,↓ ) (418)
k
X
= k [2vk2 + (u2k − vk2 )(αk† αk + α−k

α−k ) + 2uk vk (αk† α−k

+ α−k αk )].
k

The interacting part gives

b int = − V
X0 †
H c k+q,σ b
b c †−k−q,−σ b c −k,−σ b
c k,σ
2 k,q,σ
X0 †
= −V b c †−k0 ,↓ b
c k0 ,↑ b c −k,↓ b
c k,↑
k,k0
X0  † † † † (419)
= −V uk vk uk0 vk0 (1 − α−k 0 α−k0 − αk0 αk0 )(1 − α−k α−k − αk αk )

k,k0
† † † †
+ (u2k − vk2 )uk0 vk0 (1 − α−k 0 α−k0 − αk0 αk0 )(α−k αk + αk α−k )


+ (more forth-order terms in α) .
To proceed, we first neglect the residual terms of fourth order in α. This seems arbitrary
and highlights a general problem of mean-field approximations: If the ansatz (in this case

110
the expressions for the αs) is not a good description of the physics, we have no chance of
proceeding in a controlled way. The other problem are terms quadratic in the αs but of the
form α† α† or the hermitian conjugate. What our mean-field solution needs to achieve is that
all these terms (or better put, their prefectors) vanish. Notice that such terms appear both
in Hb 0 and in Hb int . The prefactor of terms of this form is
X0
2k uk vk − V (u2k − vk2 ) uk0 vk0 . (420)
k0
X0
To have it vanish, we sum over (the primed set of) k and set uk0 vk0 = ∆/V (thereby
k0
defining the constant ∆). This yields

2k uk vk = ∆(u2k − vk2 ), (421)

which has the normalized solution


1 + ξk 1 − ξk k
u2k = , vk2 = , ξk = p 2 . (422)
2 2 k + ∆2
The transformation from b
c operators to the α operators is called a Bogoliubov transforma-
tion. Finally, to calculate ∆ from this, we start from its definition
X0
∆ =V uk vk
k
V X0
q
= 1 − ξk2 (423)
2 k
V X0 ∆
= p .
2 k 2k + ∆2
Dividing it out and going to an integral representation
V ω
Z
ν()
1= d √
4 −ω  + ∆2
Z ω (424)
V ν(F ) 1
≈ d √ ,
4 −ω  + ∆2
yields a solvable integral that
∆ = 2ω e−2/[ν(F )V ] . (425)

With the off-diagonal terms vanishing, the total Hamiltonian, omitting the fourth order
terms is of the from
X X0
H
b0+ H
b int = k 2vk2 − V uk vk uk0 vk0
k k,k0
Xq (426)

+ 2k + ∆2 (α−k α−k + αk† αk ).
k

111
Let us first take a closer look at the constant terms, which encode the shift in ground state
energy. With the help of some more, less interesting integrals one can compute them in the
limit of small ∆ as
− ν(F ) ∆2 /4. (427)

Again the main feature of this expression is its negative sign, signaling that our mean-field
state is indeed lower in energy than the noninteracting Fermi sea. It is the condensation
energy of the superconducting state.
The second line in Eq. (426) is the Hamiltonian for a noninteracting Fermi liquid with
only manifestly positive energies. The ground state is thus the state annihilated by all αk
and α−k operators. However, we should not forget that these operators are superpositions
of electron creation and annihilation operators. A first guess would be to write the ground
state as
Y
|0̃i = αk α−k |vaci
k
Y (428)
= c †k b
(uk vk + vk2 b c †−k )|vaci.
k

This state is not normalized, though. The properly normalized state is


Y
|0i = c †k b
(uk + vk b c †−k )|vaci. (429)
k

This many-body state has no definite particle number. The particle number conservation
is spontaneously broken in a superconductor: The condensate can absorb and emit Cooper
pairs.
c. Excitations The elementary excitations above this ground state can be directly read
off thanks to the noninteracting structure of the mean-field Hamiltonian. The quasiparti-
cles created by the αk† operators, called Bogoliubons, are fundamentally different from the
quasiparticles in a Fermi liquid in that they create a coherent superposition of a particle
and a hole. The uk gives the particle-like weight and the vk gives the hole-like weight. Far
away from the Fermi surface, they tend to vk → 1 and uk → 0 inside the Fermi sea, while
the opposite happens far outside the Fermi sea. Right at the Fermi surface they are an
equal amplitude superposition of electron and hole. The spectrum of these single-particle
like excitations
q
2k + ∆2 (430)

112
is gapped, i.e., the smallest excitation above the ground state has energy gap 2∆. [See
Fig. 10 b).]
From this observation, we can conclude that the superconductor must support persistent
currents. We can view a state which carries a current j in momentum space as displacing the
Fermi sea by an amount proportional to the current strength. After switching off the electric
field that drove the current, elastic scattering processes from one side of the displaced Fermi
sea to the opposite side (at impurities, phonons, etc.) lead to a relaxation of the Fermi
sea to its original form. If, however, there is an excitation gap 2∆ and the displacement of
the Fermi sea is small enough such that none of the states on the left end of the displaced
Fermi sea has an energy of 2∆ or more above the energy of any state on the right side of
the displaced Fermi sea, no such elastic scattering is possible. We can be more quantitative
by considering a displacement of the Fermi sphere
m
δk = −mδv = j. (431)
ne
Elastic scattering processes require
(kF + δk)2 (kF − δk)2
− ≥ 2∆. (432)
2m 2m
From this, we can deduce the critical current density above which the superconductivity
breaks down as
2en∆
jcrit = . (433)
kF
Inserting typical values, we get jcrit ∼ 107 A/cm2 , which is a pretty sizable value.
d. Unconventional superconductivity – gap function We have discussed the simplest
example of superconducting instabilities so far, where the order parameter is in the A1 irre-
ducible representation of the point group, as so-called s-wave superconductor (that is, the
gap ∆ does not depend on the value of the relative momentum k of the two electrons that
make up the Cooper pair). Historically these were also the first classes of superconductors
discovered and the first ones theoretically understood. However, in the search for materials
with higher transition temperatures, superconductors with generalized pairing order param-
eters were found. By now far more research is done on these unconventional superconductors
than on the conventional ones and they also become more important technologically.
The generalized BCS theory relies on an extended form of the microscopic interaction
where we consider again only the scattering of electron pairs with vanishing total momentum

113
which is attractive. The corresponding Hamiltonian can be written as
X 1 X0 X
Hb = c †k,σ b
k b c k,σ + Vk,k0 ;σ1 ,σ2 ,σ3 ,σ4 b c †−k,σ2 b
c †k,σ1 b c −k0 ,−σ3 b
c k0 ,σ4 (434)
k,σ
2 0
k,k σ ,σ ,σ ,σ1 2 3 4

with the pair scattering matrix elements


D E
Vk,k0 ;σ1 ,σ2 ,σ3 ,σ4 = k, σ1 ; −k, σ2 Vb − k0 , σ3 ; k0 , σ4 . (435)

Due to the Fermionic anticommutation rules it should have the following symmetries

Vk,k0 ;σ1 ,σ2 ,σ3 ,σ4 = −V−k,k0 ;σ2 ,σ1 ,σ3 ,σ4 = −Vk,−k0 ;σ1 ,σ2 ,σ4 ,σ3 = V−k,−k0 ;σ2 ,σ1 ,σ4 ,σ3 . (436)

We are again considering weak attractive interactions that only act in an energy range ω,
i.e., scattering matrix elements are only nonzero for −ω < k , k0 < ω, where k is measured
relative to the Fermi energy F while F  ω as measured from the band edge. We signify
this restriction by the primed sum. In analogy to the conventional case, we introduce the
off-diagonal mean-field
bk,σσ0 = h b c k,σ0 i
c −k,σ b (437)

to rewrite the Hamiltonian as


X 1 X0 X h i
b0=
H c †k,σ b
k b c k,σ − c †k,σ1 b
∆k,σ1 σ2 b c †−k,σ2 + ∆∗k,σ1 σ2 b
c k,σ1 b
c −k,σ2 + K
b , (438)
k,σ
2 k σ ,σ 1 2

where K
b subsumes constant terms (quadratic in bk,σσ0 ) and terms of fourth order in the

fermion operators which are considered small in our mean-field approximation. We will
neglect K
b from here on. The generalized gap function is defined as
X
∆k,σσ0 = − Vk,k0 ;σ,σ0 ,σ3 ,σ4 bk,σ3 σ4
k0 ,σ3 σ4
X (439)
∆∗k,σσ0 = − Vk0 ,k;σ1 ,σ2 ,σ0 ,σ b∗k0 ,σ2 σ1
k0 ,σ1 σ2

We can rewrite the gap function as a complex 2 × 2 matrix in spin space


 
∆k,↑↑ ∆k,↑↓
∆k =  . (440)
∆k,↓↑ ∆k,↓↓

The structure of the gap function is related to the wave function of the Cooper pairs bk,σσ0 .
Let us consider the case of no spin-orbit coupling and with inversion symmetry, in which we
can separate this wave function into an orbital and a spin part

bk,σσ0 = φk χσσ0 (441)

114
The parity of the orbital part and the spin configuration are linked due to the antisymmetry
condition of the many-Fermion wave functions as mentioned above:

Even parity φk = φ−k ⇔ χσσ0 = (| ↑↓i − | ↓↑i)/ 2



 | ↑↑i

 √ , (442)
Odd parity φk = −φ−k ⇔ χσσ0 = (| ↑↓i + | ↓↑i)/ 2



| ↓↓i

which corresponds to spin-singlet and spin-triplet pairing, respectively. The gap function
obeys 
∆−k,σσ0 = −∆k,σ0 σ

even
∆k,σσ0 = −∆−k,σ0 σ = (443)
−∆−k,σσ0 = ∆k,σ0 σ

odd
or in matrix notation ∆k = −∆T−k . Following this, a spin-singlet gap function can always
be represented by a single function ψk = ψ−k as
 
0 ψk
∆k =   = iσy ψk . (444)
−ψk 0

A triplet gap function can be represented by a vector-valued function dk = −d−k as


 
−dk,x + idk,y dk,z
∆k =   = i (dk · σ) σy . (445)
dk,z dk,x + idk,y

Let us now discuss some examples of unconventional pairing. For that it is useful to also
consider the superconducting gap that appears in the spectrum and is related to many
measurable quantities. In general it is given by

1   |ψk |2

spin singlet
2 †
|∆k | = tr ∆k ∆k = (446)
2 |dk |2 spin triplet.

This result holds as long as we consider so-called unitary pairing states with d∗k ∧ dk = 0.
Non-unitary pairing states, in contrast, break time-reversal symmetry and show a net spin
polarization. As a result, the band structure of the BCS Hamiltonian becomes spin-split
and each band ± has a different gap

|∆k,± |2 = |dk |2 ± |d∗k ∧ dk |. (447)

Examples of paring states include

115
• The spin-singlet s-wave paring is what we discussed for conventional superconductors
and is in this scheme represented by a k-independent gap function

ψk = ∆, (448)

which yields an isotropic gap |∆| everywhere on the Fermi surface.

• There is also a simple spin-triplet gap function that gives an isotropic gap on a spherical
three-dimensional Fermi surface. It is called the Balian-Werthammer-state and given
by

dk = k, (449)
kF
i.e., the d-vector is isotropically pointing outwards on the Fermi surface. The gap is
given by |dk |2 = |∆|2 |k|2 /kF2 = |∆|2 , as promised. This state appears as the superfluid
B-phase of 3-He at low temperatures (below about 2 K).

• One of the most studied unconventional pairing states appears in the cuprate high-
temperature superconductors. It is spin-singlet pairing with l = 2 relative angular
momentum, i.e., a gap function with d-wave symmetry

∆ 2
ψk = 2
(kx − ky2 ). (450)
kF

This gap function has line nodes on the Fermi surface for (kx , ky ) k (1, 1) or (1, −1).

• Another spin-triplet state of great conceptual importance is a chiral (i.e., time-reversal


symmetry breaking) p-wave state


dk = zb (kx ± iky ) (451)
kF

with
kx2 + ky2
|∆k |2 = |∆|2 . (452)
kF2
This state has point nodes on the north and south pole of the Fermi surface and finite
(total, not relative) orbital angular momentum Lz = ±1 along the z-axis. The state
is realized in the superfluid A-phase of 3-He induced under pressure and may appear
in the quasi-two-dimensional metal Sr2 RuO4 .

116
e. Low-temperature properties The question arises how to experimentally distinguish
superconductors with these various pairing symmetries. The form of the gap function man-
ifests itself in various response functions (magnetic susceptibility, compressibility, specific
heat, ...) mainly through a characteristic behavior in the density of states at low tempera-
tures. It is defined as
2X
q
ν(E) = δ(Ek − E), with Ek = 2k + |∆k |2 . (453)
Ω k
We decompose the k-integral into a radial energy part ξ and the angular part (average over
the Fermi surface):
Z Z
dΩk p 
ν(E) = ν(F ) dξ δ ξ 2 + |∆gk |2 − E

Z
dΩk E
= ν(F )
(454)
p
4π E + |∆gk |2
2
* +
E
= ν(F ) p .
E 2 + |∆gk |2 |k|=k
F

Here, ∆ is the maximal gap and gk is a function with gk ≤ 1. The density of states for an
isotropic gap gk = 1 is straight-forward

0
 |E| < |∆|
ν(E) = ν(F ) (455)
√
 E
|∆| < |E|.
E 2 +|∆|2

No state can be found with energies below ∆ and a characteristic square-root singularity
signals the onset of continuous spectrum above ∆. We are used to such a singularity from
one-dimensional systems. Here it appears in three dimensions because the bottom of the
Bogoliubov-quasiparticle band forms a sphere at nonzero momenta (with radius kF ) as com-
pared to the usual case where the band bottom is a single point in momentum space. At
higher energies the density of states approaches the normal state value, so that the influence
of superconductivity is restricted to an energy range of several times the gap.
Turning to anisotropic gap functions we find an important change in the density of states,
since “subgap” states appear. First we consider a gap with line nodes. As a simple example
we take ∆k = ∆ cos θ which has a line node in the x-y-plane. We obtain

!
Z +1
E π/2 |E| < |∆|

E 1
ν(E) = ν(F ) dx Re p = ν(F )
|∆| −1 (E/∆)2 − x2 |∆| 
 
arcsin |∆| |∆| < |E|.
E
(456)

117
Indeed a finite density of states is found below the maximal gap, down to zero energy.
However, the density of states vanishes in a characteristic way at E = 0, In the case of
line nodes it is a linear behavior. The singularity at E = |∆| is replaced by a cusp. The
anisotropy smoothens the singularity found for the isotropic gap. In the case of point nodes,
a similar calculation gives ν(E) ∝ E 2 at low energies and a logarithmic singularity at E = ∆.
See Fig. 11 for a summary of these results.
These different behaviors are directly manifest in the low-temperature specific heat
2 X df (Ek )
C(T ) = Ek
Ω k dT
Z
df (E)
= dEν(E) E
dT
Z 2
E 1
= dEν(E) 2 2
kB T 4 cosh [E/(2kB T )] (457)




 T −3/2 e−∆/(kB T ) isotropic gap

∝ T3 point nodes .



T 2

line nodes

This exponential behavior is typical of a gaped system (thermally activated), like in a semi-
conductor. The gap sets a natural energy scale which can be derived from the exponential
behavior. It is important to note that this behavior is only really valid for T  Tc , and
is not easily observed in experiments, since various other influences can complicate the be-
havior. In particular, impurity scattering changes the low-energy density of states strongly.
More generally, thermodynamic quantities are governed by the density of states so that they
usually have a powerlaw behavior for nodal superconductors.

3. Density waves

Will be discussed in the exercise.

4. Mott insulators

With the exception of the (fractional) quantum Hall effect, we have considered models
which have weak interactions and where a description in momentum space was most conve-
nient and appropriate. In this section, we will be concerned with the paradigmatic Hubbard

118
isotropic gap point nodes line nodes
⌫(E) ⌫(E) ⌫(E)

⌫(✏F ) ⌫(✏F ) ⌫(✏F )

| | E | | E | | E

Figure 11. Low-energy density of states in unconventional superconductors with different nodal
structures of the gap function.

model, which is very important in the theoretical study of cuprate superconductors. The
Hubbard model forces us to take a different angle on the problem: It involves comparably
strong interactions and the lattice plays a crucial role. In some regimes a configuration space
picture is more practical, in others we will work in momentum space.

Simple theoretical models are often introduced because they illustrate a physical effect
while being exactly soluble. The Hubbard model is a very different case. Neither is it
analytically soluble, nor have numerical approaches been successful in establishing its ground
state properties. The numerical investigation of the Hubbard model is still an active area
of condensed matter research.

The story of the Hubbard model will consist of a charge and a spin part. We relegate the
spin part to the next chapter, where we study magnetic instabilities of matter.

The charge degrees of freedom of the Hubbard model feature a metal-insulator transition.
The interesting side of this will be the insulator, which is termed a Mott insulator. So
far, we have mostly been talking about (true or mean-field induced) band insulators (with
the exception of the fractional quantum Hall effect). The Mott insulator, in contrast, is
interaction induced but does not allow for a band insulator description. Furthermore, it
does not break any symmetries of the system spontaneously.

The Hubbard model defined on a square lattice, with one (spinful) orbital per site is

119
a) ✏ b) ✏

N (✏) N (✏)
U

Figure 12. Density of states of the 3D Hubbard model a) in the limit U  t displaying the lower
and upper Hubbard band and b) in the metallic band limit U = 0.

defined, within a tight-binding approximation, by the Hamiltonian


X X
b = −t
H c †i,σ b
(b c †j,σ b
c j,σ + b c i,σ ) + U n
b i,↑ n
b i,↓ , (458)
hi,ji,σ i

where the first sum runs over directed nearest-neighbor links on the lattice and the density
operators are defined as n c †i,σ b
b i,σ = b c i,σ . We will study the model at half filling, i.e., in a
Hilbert space with as many electrons as there are lattice sites (filling 1 would be twice as
many electrons, as we can put an up-spin and a down-spin electron on each site). There are
two obvious limits of the model:
a. Metallic limit For U = 0 the Hamiltonian reduces to
X
H
b U =0 = c †k,σ b
k b c k,σ (459)
k,σ

with dispersion
k = −2t (cos kx + cos ky + cos kz ) . (460)

and the unique ground state


Y
|ΨU =0 i = c †k,↑ b
Θ(−k ) b c †k,↓ |0i. (461)
k

We now wrote the model in 3D, where the van Hove point that we encountered in 2D is not
a singularity in the density of states. The bandwidth 2D = 12t in 3D. To study the state at
small but finite U is challenging and there are no controlled approximations available. An
instructive variational approach is the Gutzwiller approximation. We define the following

120
densities

1 electron density
s↑ density of singly occupied lattice sites with spin ↑
s↓ density of singly occupied lattice sites with spin ↓

d density of doubly occupied sites


h density of empty sites

Not all of these quantities are independent. The following relations must be satisfied

s
h = d, s↓ = s↑ ≡ , 1 = s + 2d. (462)
2

The classical energy (density) cost due to the interaction is given by U d.


The Gutzwiller approximation includes correlation effects by renormalizing hopping prob-
abilities in a statistically average setting. We focus on hopping processes that keep the
density of doubly occupied sites constant (and with it the interaction energy U d). Let’s first
consider a hopping process from a singly occupied site (i) to an empty site (j). The hopping
probability depends on the availability of the initial configuration. Let P0 (↑i , 0j )/P0 (↓i , 0j )
be these probabilities to find the initial state in absence of interactions. We relate it to the
renormalized probabilities P (↑i , 0j )/P (↓i , 0j ) via a factor gt as

P (↑i , 0j ) + P (↓i , 0j ) = gt [P0 (↑i , 0j ) + P0 (↓i , 0j )]. (463)

We thus have
P (↑i , 0j ) + P (↓i , 0j ) = sh = sd = (1 − 2d)d. (464)

In absence of correlations, this probability can be computes explicitly

1
P0 (↑i , 0j ) = ni,↑ (1 − ni,↓ )(1 − nj,↑ )(1 − nj,↓ ) = , (465)
16

and similarly for ↓. We can now compute

gt = 8d(1 − 2d). (466)

The other hopping processes that leave d constant are (↑↓, ↑) → (↑, ↑↓), as well as the one
with spin reversed. One finds the same renormalization factor gt = 8d(1 − 2d) for these
processes. We do not consider processes of the form (↑↓, 0) → (↑, ↓) that change d. The

121
factor gt is used to renormalize the hopping probability, i.e., t → gt t. We thus find the
energy of a sector of constant d to be
Z 0
1
E(d) = gt kin + U d, kin = dν() . (467)
N −D

Note that this is not a variational calculation in a strict sense, as E(d) is not guaranteed to
be an upper bound to the ground state energy. Nevertheless, we can minimize E(d) with
respect to d and obtain
   2
1 U U
d= 1− , gt = 1 − (468)
4 Uc Uc
with
Uc = 8|kin | ≈ 25t ≈ 4D. (469)

The theory thus naturally produces a critical interaction strength, and for U > Uc double
occupancy and with it any hopping is completely suppressed. This provides a qualitative
description for the metal-insulator transition in a Mott insulator. Obvious shortcomings are
that only local correlations are included to renormalize parameters, as opposed to corre-
lations between different lattice sites, and the spin degree of freedom plays no role in this
treatment.
We can argue more precisely why the system turns insulating by splitting up the con-
ductivity into two parts that correspond to different d sectors. Using the conductivity from
Eq. (335), we write
ωp?2
Re σ(ω) = δ(ω) + Re σ high energy (ω), (470)
4
where the first term is the conductivity associated with the Hamiltonian
X
H
b ren = c †k,σ b
gt k b c k,σ + U d (471)
k,σ

while the second part corresponds to sectors with at least one more doubly occupied site.
Since ωp2 ∝ 1/m, it renormalizes as ωp?2 = gt ωp2 . We can now make use of the f sum rule (336)
Z ∞ ωp2 ωp2
dω Re σ(ω) = gt + Ihigh energy = . (472)
0 8 8
According to Eq. (468), the weight of the low energy contribution vanishes as U → Uc , and
the f sum rule dictates that it is sifted to the high-energy contribution (i.e., to large ω).
The system becomes insulating.

122
b. Insulating limit For t = 0, the ground state with zero energy has exactly one electron
per site. However, this ground state is not unique due to the spin degree of freedom. In
fact, any state of the from
Y
|Ψ{σi }i = c †i,σi |0i
b (473)
i

with any spin configuration {σi } is a ground state. The ground state degeneracy is 2N . It
is therefore a bit dangerous to call this an insulator, without knowing how this degeneracy
is lifted by small perturbations. However, since all these ground states do not differ in the
occupation numbers of each site, we can already anticipate that no electric transport will be
possible no matter how the degeneracy is lifted. The lifting of the ground state degeneracy
by small t will be the subject of the spin part of the Hubbard model story contained in
the next chapter. Excited states at energy U feature exactly one empty and one doubly
occupied site.
We now slightly depart from the limit t = 0 and consider the insulating state with
t  U . The excitations of one site being doubly occupied and one site being empty are
called doublon and holon, respectively. When t is finite, these excitations can actually hop
and move through the lattice. This lifts some of the degeneracy of the “excited band” and
gives a dispersion
Ek,k0 = U + k + k0 > U − 12t. (474)

Upon increasing t, the two sectors (i.e., the ground state sector and this doublon-holon
band) start to overlap. When not overlapping, the two bands are called the lower and upper
Hubbard band. [See Fig. 12 a).] A holon is a hole in the lower Hubbard band and a doublon
is a particle in the upper Hubbard band. This density of states structure (there are no actual
energy bands here, the name is a complete abuse of language) is called a Mott insulator.
The gap is entirely due to the interaction, without the need for any mean-field treatment or
the like. It is therefore fundamentally different from the band insulators that we considered
before. We should keep in mind that we have neglected the role played by the spin.
c. The metallic state as a Fermi liquid In the uncorrelated case, we have

1 X
kin = k . (475)
N k∈FS

Within the Gutzwiller we found kin → gt kin . One can furthermore show (we don’t give
a proof here) that within the same approximation a finite interaction strength induces a

123
nonzero and constant occupation number nout outside the Fermi surface while at the same
time the density inside the Fermi surface nin is reduced below 1 but remains constant. The
total must correspond to the filling, i.e.,

1 1 X 1 X nin + nout
= nin + nout = , (476)
2 N k∈FS N k6∈FS 2

where we used that the Fermi sea and its complement are equal in size. This gives the
constraint
1 X 1 X
gt kin = nin k + nout k . (477)
N k∈FS N k6∈FS

We can furthermore use the particle-hole symmetry of the band structure

X X
k = − k . (478)
k∈FS k6∈FS

Then we can determine from 1 = nin + nout and gt = nin − nout the densities

1 + gt 1 − gt
nin = , nout = . (479)
2 2

We thus find that there is a step in the distribution function (i.e., a quasiparticle picture
makes sense), as long as U < Uc . Without derivation, we give two Fermi liquid parameters
of the model

U ν(F ) 2Uc + U U ν(F ) 2Uc − U


F0a = − Uc , F0s = Uc . (480)
4 (U + Uc )2 4 (U − Uc )2

One interesting feature about them is the divergence of F0s as U → Uc . We recall that F0s
appears in the denominator of the compressibility

µ2B ν ? (F ) ν ? (F )


χ= , κ= . (481)
1 + F0a n2 (1 + F0s )

The renormalized density of states at the Fermi level ν ? (F ) = m? kF /π 2 diverges due to
the diverging renormalized mass m? = m/gt ∝ (U − Uc )−1 . However, the divergence in
F0s is stronger so that, as expected for an insulator, the compressibility tends to zero. The
spin susceptibility does diverge at the metal-insulator transition, due to the formation of
completely independent local spins.

124
V. MAGNETISM IN SOLIDS

A. Antiferromagnetism

1. Superexchange in the Hubbard model

As discussed in the previous chapter, the Hubbard model has an extensive ground state
degeneracy in the limit t = 0. We now study how this degeneracy is lifted when a small
kinetic term t  U is present. In order to derive an effective Hamiltonian, we consider
the hopping process between two neighboring sites in second order perturbation theory (to
get back to a ground state configuration, two hopping processes need to be involved). The
two neighboring sites have four degenerate ground state configurations | ↑, ↑i, | ↑, ↓i, | ↓, ↑i,
| ↓, ↓i. Acting with the kinetic part of the Hamiltonian on each of them yields

b kin | ↑, ↑i = H
H b kin | ↓, ↓i = 0,
(482)
b kin | ↑, ↓i = − H
H b kin | ↓, ↑i = −t| ↑↓, 0i − t|0, ↑↓i,

where the relative minus sign in the last line is required by choosing a consistent ordering
of the second quantized operators.
This way we obtain the matrix elements

hσ1 , σ2 | H b kin |σ 0 , σ 0 i
b kin | ↑↓, 0ih↑↓, 0| H
1 2
Mσ1 ,σ2 ;σ10 ,σ20 = −
h↑↓, 0| H U | ↑↓, 0i
b
(483)
hσ1 , σ2 | H b kin |σ 0 , σ 0 i
b kin |0, ↑↓ih0, ↑↓ | H
1 2
− ,
h0, ↑↓ | Hb U |0, ↑↓i

where H
b U is the interaction part of the Hamiltonian. The denominator is U . The only

nonzero matrix elements are allocated in the block


   
M M +1 −1
 ↑,↓;↑,↓ ↑,↓;↓,↑  = −2t2 /U  . (484)
M↓,↑;↑,↓ M↓,↑;↓,↑ −1 +1

which has the eigensystem


   
1 1 1 1 4t2
√   , E = 0; √  , E=− . (485)
2 1 2 −1 U

The states | ↑, ↑i and | ↓, ↓i are two more eigenstates with zero energy. Together, the zero-
energy states for a degenerate triplet, as required by the SU(2) spin-rotation symmetry of

125

the Hamiltonian. The triplet | ↑, ↑i, (| ↑, ↓i+| ↓, ↑i)/ 2, | ↓, ↓i furnishes the eigenspace with

eigenvalue S(S+1) for S = 1 of the operator (S b1 +S b2 )2 . The singlet state (| ↑, ↓i−| ↓, ↑i)/ 2

corresponds to the S = 0 eigenvalue. Using that

(S b2 )2 = S
b1 + S b2 + S
b2 + 2S
b1 · S
b2
1 2
3 3 (486)
= b1 · S
+ + 2S b2
4 4
we can thus recast the effective Hamiltonian as
4t2
 
b eff;1-2 = J Sb1 · S
b2 − 1
H , J= > 0. (487)
4 U
This spin-spin coupling mediated by a pair of virtual hopping processes, that we derived
using second-order perturbation theory goes back to P.W. Anderson and is called superex-
change. It can be used to derive this coupling between any two nearest-neighbor sites in the
Hubbard model, yielding up to constant the effective Heisenberg Hamiltonian with antifer-
romagnetic coupling
X
H
bH = J bi · S
S bj . (488)
hi,ji

As mentioned above, the model has a global SU(2) spin rotation symmetry, represented by
" #
X
Rα = exp −iα · Sbi . (489)
i

2. Mean-field theory and its shortcomings

We will now explore the ground state of the antiferromagnetic Heisenberg Hamilto-
nian (488). Surprisingly, there is a fundamental difference between the antiferromagnetic
and the ferromagnetic Heisenberg model, even though they are related by just a global sign
change in the Hamiltonian. The ferromagnetic problem essentially classical: the ground
state is of the form | ↑↑ · · · ↑i, we can compute its energy exactly. For the antiferromag-
net, this is not true. The ground state deviates from | ↑↓ · · · ↑↓i. The reason for this
striking difference lies in the fact that the order parameter commutes with the Hamiltonian
in the ferromagnetic case, while it does not commute in the antiferromagnet. If the order
parameter commutes with the Hamiltonian, both have common eigenstates, and the ground
state (necessarily an eigenstate of the Hamiltonian) can saturate the order parameter, as is
uniquely realized by the | ↑↑ · · · ↑i state in the ferromagnetic case. For the antiferromagnet,

126
by the lack of commutation such a ideal ground state cannot be found. We say quantum
fluctuations reduce the order parameter from the fully saturated value – the problem is
intrinsically quantum-mechanical.
Let us be a bit more explicit about the (missing) commutation and the order parameters.
The Heisenberg model is defined on a bipartite lattice, i.e., we can group the lattice sites in
two sublattices A and B with a checkerboard arrangement, such that each site in A has only
nearest neighbors in B and vice versa. The order parameters for ferro and antiferromagnet
are then defined as
X X
M
cfm = S
bi + S
bj ,
i∈A j∈B
X X (490)
M
cafm = bi −
S S
bj
i∈A j∈B

The lack of commutation can be seen from (for nearest neighbors i ∈ A and j ∈ B)

[Sbx,i Sbx,j + Sby,i Sby,j , Sbz,i ] = iSby,i Sbx,j − iSbx,i Sby,j ,


(491)
[Sbx,i Sbx,j + Sby,i Sby,j , Sbz,j ] = iSbx,i Sby,j − iSby,i Sbx,j .

If the two terms are summed up as in Mcfm , the right-hand sides cancel, if we take their

differences, as in M
cafm , they don’t.

While the symmetry is spontaneously broken in the antiferromagnetic ground state


hM
cafm i 6= 0, mean-field theory is not going to deliver the exact ground state. Neverthe-

less, it is instructive to perform a mean-field calculation for the problem. We write

S b i − M eb z ),
b i = M eb z + ( S i∈A
(492)
b j = −M eb z + ( S
S b j + M eb z ), j∈B

introducing the mean field M . The effective Hamiltonian is


X X z
b mf = −JzM
H Sbz,i + JzM Sbz,i + J M 2 N, (493)
i∈A i∈A
2

where we returned to a general case of coordination number z. The partition function

ZN (β, M ) = tr eβ Hmf
b

z 2N N (494)
= e−βJ 2 M eβJM z/2 + e−βJM z/2

is identical to the ferromagnetic case of Eq. (45), which shows again that the mean-field the-
ory is not capable of detecting the differences between antiferromagnetic and ferromagnetic

127
order. There follows the identical transition temperature, which for the antiferromagnet is
called Neel temperature
Jz
TN = . (495)
4kB
Clearly, the larger the coupling constant J and the more neighbors are present, the more
stable is the ordered state. Close to the transition, we can expand the free energy in M
"   3 #
Jz TN 2 TN
F (M, T ) = F0 + 1− M2 + M4 . (496)
2 T 3 T

This effective free energy describes a second-order phase transition. Minimization of F


with respect to M yields the characteristic mean-field temperature dependence of the order
parameter
s  
1 T T
M (T ) = 3 1− , T ≤ TN , (497)
2 TN TN

while M (T ) vanishes for T > TN .

3. Spin-wave theory

We are going to study spin-wave excitations (magnons) on top of the ferromagnetically


and antiferromagnetically ordered state. We will treat this problem using the so-called
Holstein-Primakoff transformation, which allows to rewrite the spin operators at each lat-
tice site in terms of creation and annihilation operators of a boson. The virtue of this
transformation is that the algebra of bosons is much simpler than that of spins. However,
the price to pay is that the bosonic operators are acting on a larger local Hilbert space at
every lattice site than the spin operators. In this way, one might end up with solutions of the
bosonic problem, that do not map back to physical states in the spin variables. This problem
is avoided in the approximation by which spins only deviate slightly from the ferromagnetic
or antiferromagnetic orientation, as we shall see below.
Our goal is to derive the dispersion relations of spin waves and to understand their
connections to the symmetries of the problem. For small momenta k, the dispersions will
be quadratic in k in the ferromagnet and linear in k in the antiferromagnet. We will discuss
the general spin-S case and particularize on the spin-1/2 model discussed so far in the end.
On every lattice site i, the components Sbi,α with α = x, y, z of the spin operator act on the

128
(2S + 1)-dimensional local Hilbert space

His = span{| − Sii , | − S + 1ii , · · · , |S − 1ii , |Sii }, (498)

which we have chosen to represent with the eigenbasis of Sbi,z ,

Sbi,z |mii = m|mii , m = −S, −S + 1, · · · , S − 1, S. (499)

In order to perform the Holstein-Primakoff transformation, we consider introducing a


a†i
bosonic degree of freedom at every lattice site i with creation and annihilation operators b
and b
ai , respectively, that act on the local infinite-dimensional Hilbert space
n o
Hib = span |n)i := (n!)−1/2 b a†i n |0)i n = 0, 1, 2, · · · ; b
ai |0)i = 0 , (500)

and obey
[b a†j ] = δi,j ,
ai , b (501)

with all other commutators vanishing.


a. Ferromagnetic case The Holstein-Primakoff transformation is defined by introduc-
ing the following substitutes for the spin operators
q q
Sbi+ := a†i b
2S − b ai b
ai , a†i
Sbi− := b a†i b
2S − b ai , a†i b
Sbiz := S − b ai . (502)

One verifies that these spin wave operators obey the same algebra

[Sbi+ , Sbi− ] = 2Sbiz , [Sbi± , Sbiz ] = ∓Sbi± (503)

on Hib as the original spin operators do on His (where Sbi± := Sbi,x ± iSbi,y ). Now, the central
assumption that will allow us to simplify the theory when written in the bosonic variables
is that the fraction of reversed spins above the ferromagnetic ground state is small

a†i b
hb ai i/S  1, ∀i. (504)

Within the range of validity of this assumption, it is justified to expand the square-roots
a†i b
that enter Eq. (502) in b ai /(2S). Of course, this will be better justified, the larger S is,
i.e., the more “classical” the spin behaves.
It is furthermore useful to introduce the Fourier transforms of the bosonic operators
1 X −iri ·k † 1 X iri ·k
c†k = √
b e ai ,
b ck = √
b e b ai . (505)
N i N i

129
Using these together with Eq. (502), and expanding the square root to lowest order, the
Hamiltonian
X X
b fm = −J
H bi · S
S bj − B Sbi,z (506)
hi,ji i

becomes
X  γk + γ−k 
b 0 = −JN z S 2 − BN S − JzS
X †
Hfm −1 bc†k b
ck + B b ck
ck b (507)
2 k
2 k

where
2 X ik·δ
γk = e , (508)
z δ

δ are the directed connection vectors to nearest neighbors and z is the coordination number.

Using γk = γ−k and dropping the constant terms in this Hamiltonian, we arrive at
X
H 00
b fm = c†k b
[JzS(1 − Reγk ) + B] b ck . (509)
k

From this we can directly read off the dispersion of the ferromagnetic magnons as

ωk =JzS(1 − Reγk ) + B
(510)
≈B + JSa2 k2 ,

where we assumed a hypercubic lattice with lattice constant a to obtain the last line. This
is equivalent to the dispersion of a free massive particle with mass m? = 1/(2JSa2 ).
b. Antiferromagnetic case For the antiferromagnetic case, we have to account for the
two sublattices when performing the Holstein-Primakov transformation and we have to re-
place the source field B by a staggered one. We are thus interested in the Hamiltonian
X X X
H
b afm = J bi · S
S bj − B Sbi,z + B Sbj,z , (511)
hi,ji i∈A j∈B

and introduce on sublattice A the operators


q q
Sbi+ := a†i b
2S − b ai b
ai , a†i
Sbi− := b a†i b
2S − b ai , a†i b
Sbiz := S − b ai . (512)

while on sublattice B we use the operators


q q
Sbj+ := bb†j 2S − bb†jbbj , Sbj− := 2S − bb†jbbj bbj , Sbjz := bb†jbbj − S. (513)

130
a†j and bb†j obey the usual bosonic algebra. It is again convenient to introduce
Both the b
Fourier transforms of these bosonic operators
1 X −iri ·k † 1 X iri ·k
c†k = √
b e ai ,
b ck = √
b e bai ,
NA i∈A NA i∈A
(514)
1 X −irj ·kb† 1 X irj ·kb
db†k = √ e bj , dbk = √ e bj ,
NB j∈B NB j∈B

where NA = NB = N/2. Furthermore allowing for a linear transformation

α ck − vk db†k ,
b k = uk b c†k ,
βbk = uk dbk − vk b (515)

with 1 = u2k − vk2 , which ensures [b bk† ] = [βbk , βbk† ] = 1, we can cast the Hamiltonian (511),
αk , α
expanding in the bosonic operators, in the form

0 z 1 X  † †b

Hafm = −JN S(S + 1) − BN (2S + 1) +
b ωk α
bk α
bk + βk βk + 1 .
b (516)
2 2 k

The antiferromagnetic magnon dispersion is given by


p
ωk = (JzS + B)2 − (JzS Re γk )2
√ (517)
≈ 2 3JSa|k|,

where we have expanded in small momenta a|k|  1 for the case of a hypercubic lattice for
B = 0 in the last line. Notice that in contrast to the ferromagnetic case, this is a gapless
dispersion.
This sharp difference between the ferro- and antiferromagnetic case is rooted in the prop-
erty that the order parameter commutes and does not commute with the Hamiltonian in
the two cases, respectively.
We can expand on this observation by considering the symmetries of Hamiltonians (506)
and (511) in the limit B = 0. The Hamiltonians, if the lattice Λ has the space group PΛ as
symmetry group, has the symmetry group

GΛ = SO(3) × Z2 × PΛ , (518)

where SO(3) are the proper rotations in spin space and Z2 is time-reversal. For the anti-
ferromagnetic order, we anticipate a reduction of the point group symmetry to PΛA of the
A sublattice only, where PΛ = Z2 × PΛA . Notice that all symmetries but time reversal are
unitary here.

131
For the ferromagnet, the order parameter M
c is actually one of the generators of the

continuous global symmetry group SO(3). This is the reason why it commutes with the
Hamiltonian. When aquiring a nonzero expectation value, M
c breaks SO(3) down to SO(2)

(rotations around the ferromagnetic axis) and at the same time breaks time-reversal sym-
metry. The symmetry-breaking pattern is thus

GΛ = SO(3) × Z2 × PΛ −→ HΛ,fm = SO(2) × PΛ . (519)

For the antiferromagnet, the order parameter similarly breaks SO(3) down to SO(2) and
breaks time-reversal. It does, however, preserve an ‘effective time-reversal symmetry’, that is
a composition of time-reversal symmetry and a translation that interchanges the sublattices
A and B. This yields the symmetry breaking pattern

GΛ = SO(3) × Z2 × PΛ −→ HΛ,afm = SO(2) × Z2 × PΛA . (520)

The fundamental difference between HΛ,fm and HΛ,afm is that the latter still contains an
antiunitary symmetry (the effective time reversal symmetry).
As stated before, the ferromagnetic ground state coincides with the quantum-mechanical
one, but this is not true for the antiferromagnet. To quantify this we compute the devia-
tion ∆M of the fully polarized classical sublattice magnetization N S/2 with the quantum
mechanical one within the approximation (504), i.e.,
* +
NS X
∆M = − Sbi,z , (521)
2 i∈A

where the brackets stand for the ground state expectation value. With the form of Hamil-
tonian (516) this quantity can be computed as
!
1X 1
∆M = p −1
2 k 1 − (Reγk )2
Z
N 1 3 1 (522)
=− + d k p
4 2(2π)3 1 − (Reγk )2
N
≈0.0784 ,
2

meaning that we obtain a nearly 8 percent reduction from the fully polarized magnetization.

132
B. Stoner magnetism

In this section we are going to explore magnetism from a different angle, namely starting
from itinerant electrons. Magnetism arises here as an instability of a Fermi liquid, without
the need for a picture of localized moments and thus without the underlying lattice playing
an important role. Examples of magnets for which this mechanism is at work include
the ferromagnets iron, cobalt, and nickel, in all of which 3d-orbital electrons dominate the
states at the Fermi energy. The ferromagnetic order is a spontaneous symmetry breaking
phenomenon appearing at some finite Tc , the Curie temperature. The phase transition is of
second order in the Landau classification of phase transitions, thus featuring a discontinuity
in the specific heat but none in the entropy and volume.
Stoner magnetism is driven by repulsive (Coulomb) interactions. The energy gain can be
understood from Hunds rule: Alignment of the electronic spins in a favored direction allows
them to reduce the Coulomb energy. We recall that within Fermi liquid theory we have
found
m? χ0
χ= , (523)
m 1 + F0a
for the magnetic susceptibility and concluded that a ferromagnetic instability would occur
if F0a → −1. We will now use mean-field theory to derive this more microscopically.

1. Mean-field theory

We start from the Hamiltonian with a repulsive contact interaction


X † Z
H=
b k b ck,σ + U d3 rd3 r 0 ρb↑ (r)δ(r − r 0 )b
ck,σ b ρ↓ (r 0 ), (524)
k,σ

where ρbσ (r) = φb†σ (r)φbσ (r) is the density operator with φb†σ (r) the electron field operators.
We use this contact interaction as a simple model for screened Coulomb interaction. It is
clear that electrons of the same spin are not affected by the interaction, which gives rise to
the Hund’s rule mentioned above. We make the mean-field ansatz

ρσ (r) − n̄σ ],
ρbσ (r) = n̄σ + [b (525)

with n̄σ = hb
ρσ (r)i and the brackets represent the thermal average. To justify a mean-field
Hamiltonian, we assume

h[b
ρ↓ (r) − n̄↓ ][b
ρ↑ (r) − n̄↑ ]i  n̄↓ n̄↑ . (526)

133
Then
X Z
H
b mf = c†k,σ b
k b ck,σ +U d3 r [b
ρ↑ (r)n̄↓ + ρb↓ (r)n̄↑ − n̄↓ n̄↑ ]
k,σ
X (527)
= c†k,σ b
(k + U n̄−σ ) b ck,σ − ΩU n̄↓ n̄↑ .
k,σ

This mean-field Hamiltonian describes electrons of one spin type σ moving in the background
of the opposite spin electrons −σ. We thus reduced the problem again to a single-particle
like problem. This is at the heart of the so-called Hartree-Fock approximation. To find a
self-consistent solution, we calculate the spin density
1X †
n̄↑ = hbc b ck,↑ i
Ω k k,↑
1X
= nk +U n̄↓
Ω k
Z (528)
1X
= d δ( − k − U n̄↓ )n
Ω k
Z
1
= d ν( − U n̄↓ )n ,
2
where n is the Fermi Dirac distribution. The analogous equation holds for n̄↓ . These
equations allow to self-consistently determine the mean densities n̄σ . Solutions must obey
particle number conservation. To implement this, it is convenient to split up the spin
densities in average particle densities n̄ and magnetization m̄, the latter being related to the
measurable magnetization density M = µB m̄. That is

n̄ + σ m̄
n̄σ = . (529)
2

The self-consistency equations can then be cast in the form


Z
1
n̄ = d [ν( − U n̄↓ ) + ν( − U n̄↑ )] n
2
Z (530)
1
m̄ = d [ν( − U n̄↓ ) − ν( − U n̄↑ )] n
2
or
Z
1 X
n̄ = d ν( − U n̄/2 − σU m̄/2)n
2 σ
Z (531)
1 X
m̄ = d σ ν( − U n̄/2 − σU m̄/2)n .
2 σ

134
To make analytical progress from here, we assume that the magnetization is small, i.e.,
m̄  n̄. We absorb the shift −U n̄/2 in the Fermi energy and expand (for the case of n̄,
where the m̄-odd term vanishes by the sum over σ
 2
1 U m̄
ν( − σU m̄/2) = ν() + ν 00 () + O(m4 ). (532)
2 2

We furthermore use the so-called Sommerfeld expansion


∞ µ
π2 0
Z Z
1
h() β(−µ) = h()d + 2 h (µ) + O(βµ)−4 . (533)
−∞ e +1 −∞ 6β

Lastly, we have to allow for a m̄-dependent shift in the chemical potential, i.e., µ(m̄, T ) =
F + δµ(m̄, T ), such that δµ(0, T ) = 0. Together, when expanded to lowest order both in m
and in β −1 , this yields
µ 2
π2 0
Z 
1 U m̄ 0
n̄ = dν() + ν (F ) + 2 ν (F )
−∞ 2 2 6β
Z F 2
π2

1 U m̄
= dν() + δµν(F ) + ν 0 (F ) + 2 ν 0 (F ) (534)
−∞ 2 2 6β
2
π2

1 U m̄
= n̄ + δµν(F ) + ν 0 (F ) + 2 ν 0 (F ).
2 2 6β

We can now solve for δµ as


" 2 #
ν 0 (F ) π 2

1 U m̄
δµ(m̄, T ) = − + . (535)
ν(F ) 6β 2 2 2

The analogous calculation for m̄ reads


Z "  3 #
U m̄ 1 U m̄
m̄ = dn ν 0 () + ν 000 ()
2 3! 2
" 2 #
π 2 00

U m̄ 0 1 U m̄ 00
= ν(F ) + δµν (F ) + 2 ν (F ) + ν (F )
2 6β 3! 2
" " 2 # 2 # (536)
ν 02 π 2 π 2 ν 00 1 U m̄ ν 00
 
U m̄ 1 U m̄
= ν 1− 2 + + 2 +
2 ν 6β 2 2 2 6β ν 3! 2 ν
" #
3
π 2 ν 02 π 2 ν 00 1 ν 00 1 ν 02
     
U m̄ U m̄
=ν 1− 2 2 + 2 + − ,
2 6β ν 6β ν 2 3! ν 2 ν2

where we dropped the F argument in the last line for notational compactness and inserted
δµ from Eq. (535).

135
The structure of this equation is m̄ = am̄ + bm̄3 . If b is negative, we have two possible
solutions
m̄ = 0, a < 1,
1−a (537)
m̄ = , a ≥ 1.
b
Obviously, the second solution is the one in the symmetry broken phase and hence a = 1
the critical value. Via the temperature dependence of a we can determine the critical
temperature (the Curie temperature)
π 2 ν 02 ν 00
  

1= 1− 2 − (538)
2 6β ν2 ν
yielding √ r r
6 2 Uc
kB Tc = q 1− ∝ 1− , (539)
02
π νν 2 − ν 00 Uν U
ν

for U > Uc = 2/ν(F ). The latter condition

U ν(F ) > 2 (540)

on the minimal interaction strength is the instability condition for the paramagnetic Fermi
liquid to turn ferromagnetic at low temperatures, called the Stoner condition. For U ν(F ) <
2 there is no phase transition. In the symmetry-breaking phase (T < Tc ), the magnetization
has the characteristic mean-field temperature-dependence
p
m̄(T ) ∝ Tc − T . (541)

This dependence is valid sufficiently close to the transition T − Tc  Tc . Since the density of
states can also be changed by applying pressure, say, the phase transition can also be driven
a zero temperature if system parameters are changed such that the U ν(F ) = 2 is crossed.
A phase transition at zero temperature is called a quantum phase transition.

2. Susceptibility on the paramagnetic side

To compute the susceptibility, we apply a small symmetry-breaking field B along the


spin-quantization direction by adding the following Zeeman term to the Hamiltonian (527)
(setting the g-factor equal to 2)
Z
b Z = −µB
H d3 rB [b
ρ↑ (r) − ρb↓ (r)] . (542)

136
The self-consistency equations (536) are now modified by replacing U m̄/2 with U m̄/2+µB B
on the right-hand side. We only keep the m̄-linear terms this time
π 2 ν 02 π 2 ν 00
  
U m̄
m̄ = ν + µB B 1− 2 2 + 2 (543)
2 6β ν 6β ν
We can solve this equation for m̄ and then compute the susceptibility χ via χ = M/B with
M = µB m̄. The result is given by
M χ0 (T )
χ(T ) = = , (544)
B 1 − U χ2µ0 (T
2
)
B

where
π2 ν 02 ν 00
  
χ0 (T ) = µ2B ν 1− 2 − . (545)
6β ν2 ν
Notice that the denominator of χ(T ) vanishes exactly when Eq. (538) is satisfied, i.e., at
the phase transition at finite temperature. From the paramagnetic side, the susceptibility
diverges as
χ0 (Tc )
χ(T ) ≈ Tc2
∝ |T − Tc |−1 , (546)
T2
−1
which is the characteristic mean-field behavior with critical exponent 1. Notice also, that in
passing we have determined the Landau parameter
U χ0
F0a = − , (547)
2µ2B
which, as expected, is proportional to the interaction strength in the Fermi liquid.

3. Magnons

We have already encountered magnons as elementary excitations on top of the ferro-


magnetic ground state of the Heisenberg model using the spin-wave approximation. Here
we study how magnons arise in the itinerant ferromagnet that has undergone a Stoner-
instability. To that end, we consider particle-hole excitations with momentum q by making
the ansatz
X
|Ψq i = c†k+q,↓b
ak b ck,↑ |0i, (548)
k

where |0i is the (symmetry-broken) ground state and ak are coefficients. To find potential
eigenstates of this form, we solve the Schrödinger equation

b q i = (E0 + ωq )|Ψq i,
H|Ψ (549)

137
where H|0i
b = E0 |0i. We can recast this as
X h i
ωq |Ψq i = ak H, c†k+q,↓b
b b ck,↑ |0i (550)
k

As before, we are going to employ a Hamiltonian with on-site interaction


X U X †
H
b= c†k,σ b
k b ck,σ + b c†k0 −q,↓b
ck,↑b
ck+q,↑b ck0 ,↓ . (551)
k,σ
Ω k,k0 ,q

with the help of which we can compute the commutator (550) as

U X X † 
ωq |Ψq i = (k+q −k )|Ψq i+ ak ck0 +q0 ,↑b
b c†k+q−q0 ,↓b
ck0 ,↑b c†k+q,↓b
ck,↑ − b c†k0 −q0 ,↓b
ck−q0 ,↑b ck0 ,↓ |0i.
Ω k k0 ,q 0
(552)
We now make an approximation by which we replace pairs of b
c operators by their expectation
values nk,σ according to the Fermi-Dirac distribution

c†k0 +q0 ,↑b


b c†k+q−q0 ,↓b
ck0 ,↑b ck,↑ → c†k+q,↓b
δq0 ,0 nk0 ,↑b c†k+q−q0 ,↓b
ck,↑ − δk,k0 +q0 nk,↑b ck−q0 ,↑ ,
(553)
c†k+q,↓b
−b c†k0 −q0 ,↓b
ck−q0 ,↑b ck0 ,↓ → c†k+q,↓b
−δq0 ,0 nk0 ,↓b c†k−q0 +q,↓b
ck,↑ + nk+q,↓ δk0 ,k+q b ck−q0 ,↑ ,

which together give

c†k0 +q0 ,↑b


b c†k+q−q0 ,↓b
ck0 ,↑b c†k+q,↓b
ck,↑ − b c†k0 −q0 ,↓b
ck−q0 ,↑b ck0 ,↓
(554)
c†k+q,↓b
→ δq0 ,0 (nk0 ,↑ − nk0 ,↓ )b c†k−q0 +q,↓b
ck,↑ + (nk+q,↓ δk0 ,k+q − nk,↑ δk,k0 +q0 ) b ck−q0 ,↑ .

This approximation yields an iteration of Eq. (552), that when projected on the state
c†k̃↑b
h0|b ck̃+q↓ reads

  UX
ak̃ ωq − k̃+q,↓ + k̃,↑ = ak (nk+q,↓ − nk,↑ ) ≡ Rq , (555)
Ω k

where we defined
k,σ = k + U n−σ (556)

in terms of the density of spin σ


1X
nσ = nk0 ,σ (557)
Ω k0

and
UX
Rq = ak (nk+q,↓ − nk,↑ ). (558)
Ω k

138
The goal is now to find the eigenvalues that solve Eq. (555) (we are not necessarily interested
in the eigenstates ak̃ ). To that end, we rewrite Eq. (555) as
Rq
ak̃ = . (559)
ωq − k̃+q,↓ + k̃,↑

Now, we use this expression for ak̃ and insert it in Eq. (558) that defines Rq to obtain
U X nk̃+q↓ − nk̃↑
Rq = Rq . (560)
Ω ωq − k̃+q,↓ + k̃,↑

This equation has two types of solutions. One is Rq = 0, which yields ωq = k+q,↓ − k,↑ =
k+q − k + U (n↑ − n↓ ) ≡ k+q − k + ∆ and defines a particle-hole continuum at finite energy
(for small q). Aside from this solution, we have
U X nk+q↓ − nk↑
1= . (561)
Ω k ωq − k+q,↓ + k,↑

Setting q = 0 we find in addition zero energy excitations because


n↓ − n↑
1=U (562)
ω0 − U (n↑ − n↓ )
is solved by ω0 = 0. This is the beginning of a magnon branch. We can follow it by
expanding Eq. (561) to quadratic order in q, assuming that ωq is small compared to the gap

U X nk+q↓ − nk↑
1=
Ω k ωq − k+q,↓ + k,↑
U X nk+q↓ − nk↑ U X nk+q↓ − nk↑
≈− − ωq
Ω k k+q,↓ − k,↑ Ω k (k+q,↓ − k,↑ )2
U X nk+q↓ − nk↑
 
1
≈− − ωq −
Ω k k+q − k + ∆ ∆
" 2 # (563)
U X nk+q↓ − nk↑ k+q − k k+q − k

ωq
≈− 1− + +
Ω k ∆ ∆ ∆ ∆
U X nk↓ + nk↑ q 2 nk↓ − nk↑ (2k · q)2
 
ωq
≈1 − 2
− 2 2
+
Ω k ∆ 2m ∆ 2m ∆ ∆
q2
 
4F ωq
≈1 + U n 0 − + .
2m∆2 3 ∆
Solving for ωq , this gives the dispersion
q 2 4F
ωq = [U N (F ) − 2] + O(q 4 ) (564)
2m∆ 3

139
which we can rewrite as r
2 U
ωq ∝ q − 1. (565)
Uc
This form shows how the magnon dispersion is related to the Stoner instability and the
dispersion becomes flat (the magnon heavy) near the transitions. That is, the magnons are
very ‘soft’ or easy to excite near right after the symmetry-breaking transition. We recall
that the magnon is the Goldstone mode of the continuously broken rotational symmetry.

VI. ADVANCED TOPICS

A. Relativistic effects in solids: spin-orbit coupling

When we considered noninteracting electronic band structures, we did not pay much
attention to the spin degree of freedom. It was always a mute variable the simply accounted
for a two-fold degeneracy of all energy bands, i.e., they could be occupied with a spin up and
a spin down electron. If however, we derive the Schroedinger equation from the relativistic
Dirac equation, we know that spin-orbit coupling appears and takes the form L · S in a
Lorentz invariant system. However, since Lorentz invariance is strongly broken in a crystal,
and crystal fields (which can be thought of as spatially varying electric fields) influence the
electron motion, we take a different route to understand spin-orbit coupling. We simply
assume that all terms in a Hamiltonian that are not forbidden by the lattice symmetry will
be present (with some material-specific coefficients).
Let us consider a two-dimensional electronic system k = (kx , ky ) in a k · p approximation.
The general Hamiltonian without any orbital degrees of freedom
X
H
b = c †k,σ hk,σσ0 b
b c kσ0 (566)
k,σ,σ 0

is determined by the 2 × 2 Bloch matrix hk,σσ0 . It is convenient to expand hk,σσ0 in terms of


the Pauli matrices
X
hk = dµ (k)σµ , (567)
µ=0,x,y,z

where σ0 is the identity matrix. Time-reversal symmetry implies

T hk T −1 = h∗−k ⇒ d0 (k) = d0 (−k), di (k) = −di (−k), i = x, y, z (568)

140
with T = K iσy . Would we also also ask for inversion symmetry to be present, i.e., Ihk I −1 =
h−k with I = σ0 , then di (k) ≡ 0 and there would be no room for spin-orbit coupling terms
in this one-band model. In general multi-band models, spin-orbit coupling can appear also
with inversion symmetry. However, with I and T , all bands are always doubly degenerate,
because IT enforcers a Kramers degeneracy at every k due to (IT )2 = −1.
There are, however, enough crystals without inversion symmetry (so-called noncentrosym-
metric ones) and as far as two dimensional systems are concerned, we can also think of
interfaces or surfaces, where inversion is certainly broken. Let us instead impose mirror
symmetry in the x and y direction, respectively,

Mx hkx ,ky M−1


x = h−kx ,ky ⇒ d0 (kx , ky ) = d0 (−kx , ky ),
dx (kx , ky ) = dx (−kx , ky ),
(569)
dy (kx , ky ) = −dy (−kx , ky ),

dz (kx , ky ) = −dz (−kx , ky )

and similarly for My . We used Mx/y = iσx/y . Further, we want the system to be invariant
under (at least C4 ) rotations that are simulationsly carried out in momentum and spin space.
That is, for the case of C4 , (kx , ky ) → (ky , −kx ) and (σx , σy , σz ) → (σy , −σx , σz ). Together,
these symmetries enforce, up to quadratic order in k the Hamiltonian to take the form

k2
 
hk = − µ σ0 + α(kx σy − ky σx ). (570)
2m

The second term is the relativistic Rashba spin-orbit coupling. Notice that we derived this
form solely from symmetries, but now we can give a physical interpretation to the result.
The coefficient α is proportional to the electric (crystal) field perpendicular to the plane
that breaks inversion symmetry. A relativistic electrons moving in the x direction (kx ) in
an electric field in z direction experiences, in its rest frame, a magnetic field in y direction
(in both cases the Lorentz force should point in the z direction). This magnetic field can be
thought of coupling via the Zeeman effect and produces the term kx σy in the Hamiltonian.
The consequences of the Rashba term are nondegenerate electronic bands

k2
± (k) = − µ ± α|k|, (571)
2m

where the eigenstates of each band are such that the spin polarization at any given point in

141
k space is  
sin ϕ
 
h±, k|σ|±, ki = ± − cos ϕ , (572)
 
 
0
with ϕ = arctan(kx /ky ). Thus, the Fermi surface splits up into two concentric rings. At
energy µ one of the Fermi surfaces shrinks to zero in a Dirac-like band touching at k =
0. Rashba spin-orbit coupling has a range of consequences when it comes to transport
properties, as well as superconductivity and other instabilities.
In multi-band systems, or in systems with even lower symmetries more general forms of
spin-orbit coupling are possible.

B. Effective relativistic dispersions in band structures: Topological insulators and


Weyl semimetals

We have seen that in some cases, the effective dispersion resulting from a k · p around
high-symmetry points in momentum space take the same form as the Dirac equation. This
is the case in graphene and at small momenta in the case of Rashba spin-orbit coupling.
But it can also appear in three-dimensional band structures. This fact motivates us to first
recall some basic aspects of the Dirac equation in a language useful for our purposes and
then apply this to a tight-binding simple model.
Dirac invented the equation named after him to link the dispersion relation E(k) =

k2 + m2 of relativistic particles with mass m (recall that the speed of light and ~ are set
to 1) with the theory of quantum mechanics. This led to a matrix-valued Hamiltonian that
is in general written in the form
X
H(k) = Γi ki + mΓ0 , (573)
i

where i can run over x, y, z (here for notational convenience also named 1, 2, 3) if we consider
particles in three dimensions or over x, y only if instead (quasi) two-dimensional space is
considered. The matrices Γµ , µ = 0, 1, 2, 3 are Hermitian and satisfy the algebra

{Γµ , Γν } = 2δµ,ν 11. (574)

With this algebra, one verifies that H(k)2 = E(k)2 11, i.e., the desired dispersion is obtained.
There is, however, one caveat: The eigenvalues of H(k) are actually ±E(k). The negative

142
energy states essentially consisted the theoretical discovery of anti-particles. In three dimen-
sions, where µ, ν = 0, 1, 2, 3 the smallest matrix dimension that satisfies the algebra (574) is
4. In two dimensions, where µ, ν = 0, 1, 2, the smallest matrix size to satisfy this algebra is
2. The Pauli matrices
     
1 0 0 1 0 −i
Γ0 ≡ σz =  , Γ1 ≡ σx =  , Γ2 ≡ σy =   (575)
0 −1 1 0 i 0

do the job, for instance.

1. Topological insulators in three dimensions

We now turn to a tight-binding model which will display some interesting physics, related
to effective Dirac electron dispersions. It is defined on the cubic lattice. On each lattice
site we envision an atom with two orbitals, labelled a and b that are occupied by electrons.
This time, we are going to include the electron spin into the discussion. The Hamiltonian
contains two terms: Hb = H b1+ H b 2 . The first term models nearest neighbor hopping that

is the same for both spin orientations, but has the opposite sign for the two orbitals
t X X † †

H1 =−
b b c Rl ,a,s − b
c Rj ,a,s b c Rj ,b,s b
c Rl ,b,s
2
hRj ,Rl i s=↑,↓
XX †  (576)

+M b c Rj ,a,s − b
c Rj ,a,s b c Rj ,b,s b
c Rj ,b,s .
Rj s=↑,↓

The parameter M quantifies a relativ on-site energy shift between the two orbitals. The
c †Rj ,α,s creates an electron on site Rj in orbital α ∈ {a, b} and with spin s ∈ {↑, ↓}.
operator b
The second term in the Hamiltonian
t0 X X X  †

H2 =−
b iσi;s,s b
0 c Rj ,a,s b
c Rj +ai ,b,s + h.c.
0 (577)
2 R i=x,y,z s,s0 =↑,↓
j

is a spin-dependent hopping term (we accept that such a term should exist – it is a form
of spin-orbit coupling compatible with time-reversal symmetry). In momentum space, the
entire Hamiltonian reads
 
c k,a,↑
b
 
X  bc
 k,a,↓ 

H
b = c †k,a,↑ , b
b c †k,a,↓ , b
c †k,b,↑ , b
c †k,b,↓ H(k)   (578)
 c k,b,↑ 
b 
k

c k,b,↓
b

143
with
 
ξ(k) 0 t0 sin kz t0 sin kx − it0 sin ky
 
 0 ξ(k) t0 sin kx + it0 sin ky −t0 sin kz 
H(k) = 
 


 t0 sin kz t0 sin kx − it0 sin ky ξ(k) 0 

t0 sin kx + it0 sin ky −t0 sin kz 0 ξ(k)
(579)
where ξ(k) = M + t (cos kx + cos ky + cos kz ). It is now straight forward to derive a k · p
Hamiltonian around the Γ point yielding to linear order in k

HDirac (k) = (M + 3t)σ0 ⊗ σz + t0 kx σx ⊗ σx + t0 ky σy ⊗ σx + t0 kz σz ⊗ σx (580)

where σ0 is the 2 × 2 identity matrix and σµ ⊗ σν stands for the direct product of two Pauli
matrices with index µ and ν, yielding a 4 × 4 matrix. One verifies that the four matrices
occurring in Eq. (580) satisfy the algebra (574) and hence we have a Dirac equation with
mass m = M + 3t. The eigenvalues of this effective Bloch Hamilton with energy bands
E± (k)

E± (k) = ± m2 + k2 , (581)

each of which is doubly degenerate. Clearly, the system is gapless (described by gapless
Dirac electrons) at the critical parameter value M = −3t. At other M 6= −3t the system is
an insulator (there might be gap closings at other momenta if M deviates much from −3t).
It turns out that this critical point actually discriminates two band structures which are
“topologically” distinct. For M < −3t we the Hamiltonian describes a normal insulator,
while for M > −3t, we speak of a topological insulator. While the study of topological band
structures is subject on its own, we want to point out the most important property of the
topological insulator, that is, its surface states. A topological insulator in three dimensional
crystals is characterized by gapless topological states on its surface, that are protected (i.e.,
gapless) as long as time-reversal symmetry is preserved.
Let’s solve explicitly for these states. To do so, we consider a geometry where the topo-
logical and trivial insulator are interfacing in the z = 0 plane. In this geometry, kx and ky
in Eq. (580) are still good quantum numbers, while kz has to be replaced by the momentum
operator, which is −i∂z in the position basis. The interface is then modeled by a z-dependent
mass term m(z) = m0 sign(z). Let us first study the Hamiltonian at kx = ky = 0, which

144
reads
 
m0 sign(z) 0 −it0 ∂z 0
 
 0 m0 sign(z) 0 it0 ∂z

HDirac (kx = 0, ky = 0) =  (582)
 

 −it0 ∂z 0 −m0 sign(z) 0 
 
0 it0 ∂z 0 −m0 sign(z)

We observe that it takes a form of two disconnected 2 × 2 blocks. We are interested in


eigenstates that are localized near the surface and energetically reside inside the energy gap
of the bulk material. As for the second property, a spectral symmetry of the problem we
are studying (which is not generic for topological insulators) allows to look for eigenstates
with exactly zero energy. One verifies that the following states are of this type
   
1 0
   
0  0  0  1 
   
v± = e±m0 |z|/t   , u± = e±m0 |z|/t   . (583)
∓i 0
   
0 ±i

Les us assume m0 /t0 > 0. Then only v− and u− are localized at the boundary, i.e., admissible
normalizable solutions of our differential equation. (For m0 /t0 < 0 only v+ and u+ would
be admissible.) Thus, we have two degenerate surface states at kx = ky = 0, both residing
at E = 0. Now, we would like to extend the dispersion of these states to finite momenta kx
and ky . For this, observe
 
hv− |HDirac (kx , ky )|v− i hv− |HDirac (kx , ky )|u− i
Hsurf (kx , ky ) =  
hu− |HDirac (kx , ky )|v− i hu− |HDirac (kx , ky )|u− i
 
0 −i(kx − iky ) (584)
= 
+i(kx + iky ) 0

=kx σy − ky σx .

This again takes the form of a Dirac Hamiltonian, of the massless kind, with energy eigenval-
p
ues Esurf,± = ± kx2 + ky2 . Thus, the surface states of the topological insulator form a ’surface
Dirac cone’, a gapless two-dimensional state that lives at the intersection of a topological
insulator and a trivial insulator (which we can also call vacuum). As opposed to the bulk
states this Dirac cone has spin-polarized energy bands, i.e., they are not doubly degenerate
as the bulk bands. Note that the Hamiltonian looks similar to the Rashba Hamiltonian

145
studied before in purely two dimensions at first sight. There is, however, one important
difference: The Rashba Hamiltonian had two circles as Fermi surface, while this surface
Hamiltonian has only one such Fermi surface sheet. Fundamentally, with time-reversal sym-
metry, it is not possible to have only a single, non-spin degenerate Fermi surface in purely
two dimensions. The reason it is possible for the Dirac surface Hamiltonian lies in the fact
that it is ’secretly connected’ to a three-dimensional bulk band structure and that it has a
partner on the opposite side of the crystal.
In principle, we could imagine introducing a perturbation to the surface that gaps out
the Dirac cone degeneracy and makes also these surface states insulating. The unique term
that does this job in Hsurf (kx , ky ) is given by msurf σz , because σz is the only non-commuting
anticommuting 2 × 2 matrix not already used in the surface Hamiltonian. However, this
perturbation breaks time-reversal symmetry. To understand that, we first have to find (or
define) the representation of the time-reversal transformation for the bulk Hamiltonian.
Noting that time-reversal reverts the momentum, we can define T = K iσy ⊗ σ0 , which
squares to −1 as desired for spin-1/2 fermions. We check that

T HDirac (k)T −1 = HDirac (−k). (585)

Furthermore, T |u− i = −|v− i and T |v− i = +|u− i, i.e., the two surface states degenerate at
kx = ky = 0 are Kramers partners. Kramers theorem then forbids any hybridization that
lifts their degeneracy (at kx = ky = 0), as long as time-reversal symmetry is obeyed. We
conclude that the gapless nature of the topological insulator surface states is protected by
time-reversal symmetry.
Finally, we note that the band representations can be used to determine whether an
insulator is topological, i.e., to predict whether it will have such a gapless Dirac surface
state. If the insulator has inversion symmetry, Fu and Kane found the formula (which we
just state without proof)
Y Y
(−1)ν = ξi,k , (586)
k∈TRIM i∈occupied bands

where ξi,k is the inversion eigenvalue of the i-th band at momentum k. The product runs
only over the momenta in the three-dimensional Brillouin zone whose little group contains
inversion (i.e., which are invariant under inversion k → −k up to a reciprocal lattice vector).
Since these are also the time-reversal invariant momenta (TRIM), bands are doubly degen-
erate by Kramers theorem at these points and only one eigenvalue per Kramers pair enters

146
the formula (586). The integer ν is defined modulo 2 and is called a topological invariant.
If ν = 0, the insulator is trivial, if ν = 1, the insulator is topological. We can check that for
the model at hand.

2. Weyl semimetals

Let us go back to the three-dimensional Dirac model (580) and introduce a time-reversal
breaking perturbation in the form of a Zeeman field

H(k) = (M + 3t)σ0 ⊗ σz + t0 kx σx ⊗ σx + t0 ky σy ⊗ σx + t0 kz σz ⊗ σx + Bσz ⊗ σ0 (587)

If we tune the model to the formerly critical point M = −3t, we obtain the spectrum
q
Eλ,λ0 = λ t02 (kx2 + ky2 ) + (B + λ0 t0 kz )2 , λ, λ0 = ±. (588)

We observe that the fourfold degenerate gapless point at k = 0 (the touching of four bands)
has split into two gapless points at k = (0, 0, ±B/t0 ) (each being the touching of two bands
only). We can derive an effective k · p Hamiltonian involving only the two touching bands
near k = (0, 0, B/t0 ), for instance: Only the σ0 ⊗ σx eigenstates with eigenvalue −1 will
play a role there, the other eigenstates are at high energy. We can thus replace in Eq. (587)
σµ ⊗ σx by −σµ while again setting M = −3t. This yields the effective Hamiltonian

HWeyl (k) = −t0 (kx σx + ky σy + δkz σz ), (589)

where δkz = kz − B/t0 and this expansion is valid for small δkz . The 2 × 2 Hamiltonian (589)
is what (in condensed matter) is called the Weyl Hamiltonian. Remarkably, while the
Dirac point that we started with was a degeneracy that required parameter fine-tuning
(M = −3t), the Weyl point degeneracy cannot be removed by a small perturbation or
parameter change: Any perturbation (i.e., any constant matrix W that can be added to
HWeyl (k) can be decomposed into W = w · σ resulting in

H̃Weyl (k) = −t0 [(kx − wx /t0 )σx + (ky − wy /t0 )σy + (δkz − wz /t0 )σz ] . (590)

This most general (small) perturbation simply moved the Weyl point to (kx , ky , δkz ) = w/t0
but is not able to gap it out. The reason for this is that for a touching of two bands we have
exhausted the anticommuting 2 × 2 matrices already for the kinetic part of the Hamiltonian
so that none are left to be added as masses.

147
Weyl semimetals also have topological surface states, so-called Fermi arcs, with rather
exotic properties. Together with topological insulators they are an active field of study in
contemporary condensed matter physics. Time-reversal symmetry breaking is not needed
to generate Weyl nodes in band structures, as long as inversion symmetry is broken.

C. Density functional theory

Density functional theory (DFT) has been introduced in the 1960s and has since become
the most successful and widely employed tool to compute the electronic structure of matter.
It has a high predictive power in its range of applicability. Here, we are most interested in the
computation of band structures of crystals, but the same concepts also apply to molecules.
DFT works best for weakly correlated materials that have s, p and at most d orbitals neat the
Fermi surface. It is by construction not able to describe strongly correlated states like Mott
insulators, fractional quantum Hall states and the like. More recently, extensions have been
developed to include superconducting instabilities, but with much less predictive power.
Routinely available are by now calculations that include spin-orbit coupling or magnetic
states.

1. Variational principle and Hartree-Fock approximation

Starting point for our considerations is the Born-Oppenheimer approximation to the


Hamiltonian (1) in our lecture notes, which assumes that the nuclei remain at their static
positions. We also reduce the problem here to the Schroedinger equation, noting that a
similar analysis would apply for a fully relativistic formulation. The many-body electronic
Hamiltonian is then written as
N N M N N
1 X 2 X X Zl X X 1
Hel = − ∇ − + , (591)
2 j=1 j j=1 l=1 r̃jl j=1 k>j rjk

where r̃jl is the distance between nucleus l and electron j, while rlk is the distance between
electrons j and k.
The variational principle is based on the fact that a guessed wave function Ψ can be used
to compute the energy expectation value
hΨ|Hel |Ψi
E[Ψ] = , (592)
hΨ|Ψi

148
and since the true ground state has the lowest energy E0 , E[Ψ] will always be an upper bound
for E0 . Here E[Ψ] should be viewed as a functional. Full minimization of the functional
E[Ψ] with respect to all allowed N -electron states will yield the true ground state Ψ0 and
its energy. This is, of course, an exponentially hard problem.
To make progress, the key simplification is to only use single Slater determinant wave
functions as an ansatz. This is the Hartree-Fock approximation. It greatly reduces the space
of wavefunctions (and therefore types of quantum states) that can be captured. Thus, with

ψ1 (r1 ) · · · ψ1 (rN )

1 .. ..

Ψ0 ≈ ΨHF,N = √ (593)

. .
N
ψN (r1 ) · · · ψN (rN )

the problem reduces to finding an orthogonal basis of ψj that minimizes

EHF,N = minΨHF,N E[ΨHF,N ]. (594)

The energy can be written more explicitly as

EHF,N = hΨHF,N |Hel |ΨHF,N i


N N
X 1 X
= Hj + (Jjk − Kjk )
j=1
2 j,k=1
Z  
3 ∗ 1 2 (595)
Hj = d rψj (r) − ∇ − Vext (r) ψj (r)
2
Z Z
1
Kjk = d3 r1 d3 r2 ψj∗ (r1 )ψk (r1 ) ψj (r2 )ψk∗ (r2 )
r12
Z Z
1
Jjk = d3 r1 d3 r2 ψj (r1 )ψj∗ (r1 ) ψk∗ (r2 )ψk (r2 ).
r12
Here, Vext (r) is the potential felt by the electrons due to the nuclei, Jjk are the so-called
Coulomb integrals, and Kjk are the so-called exchange integrals. They obey Jjk ≥ Kjk ≥ 0
and Jjj = Kjj .
The Hartree-Fock equations are the equations resulting from varying EHF,N with respect
to the single electron wave functions ψj . They are non-linear and non-local (thanks to
Jjk and Kjk , and thus have to be solved self-consistently. Thus, albeit the Hartree-Fock
approximation has led us to simplify the interacting many-body problem to an effective one-
electron problem, it is still difficult to solve. It is important to stress that the problem we
want to solve is determined by only the external potential Vext (r) and the particle number

149
N . Everything else is the completely universal Schroedinger equation which does not contain
an further information specific to a particular problem.

2. Hohenberg-Kohn theorems

The key step that led to the development of density-functional theory is to not go down
the Hartree-Fock road and consider ψj but instead the electron density
Z Z
ρ(r) = N d r2 · · · d3 rN |Ψ(r, r2 , · · · , rN )|
3
(596)

as the central object of the theory. Physically, ρ(r) is the probability of finding an electrons
in the volume element d3 r. Different from the wave functions, this is an experimentally
observable quantity (measurable, e.g., via X-ray diffraction). It satisfies
Z
ρ(r) ≥ 0, lim ρ(r) = 0, d3 r ρ(r) = N. (597)
r→∞

Further, at any position of an ion rl , the derivative of ρ(r) has a discontinuity determined
by the ion’s charge
lim (∂r + 2Zl ) ρ̄(r) = 0, (598)
r→rl

where ρ̄(r) is the angular average of ρ(r) with respect to the position of the ion. It is not
a priori obvious that considering the electron density instead of the wave function itself is
a sufficiently general way forward. However, the Hohenberg-Kohn theorems tell us exactly
that.
a. First Hohenberg-Kohn theorem The external potential Vext (r) is (up to a constant)
a unique functional of ρ(r). Since, in turn, Vext (r) fixes the Hamiltonian Hel , the full many-
particle ground state is a unique functional of ρ(r).
Proof : We prove this by contradiction. Assume there were two external potentials Vext (r)
0
and Vext (r) that differ by more than a constant, but have the same ρ(r) in the ground state.
Then, we have two corresponding Hamiltonians Hel and Hel0 with different ground state wave
functions Ψ0 and Ψ00 that have the same density. Take Ψ00 as a variational wave function for
Hel

E0 < hΨ00 |Hel |Ψ00 i

= hΨ00 |Hel0 |Ψ00 i + hΨ00 |Hel − Hel0 |Ψ00 i (599)


Z
0 0
= E0 + d3 rρ(r) [Vext (r) − Vext (r)] .

150
Exchanging the role of the primed and unprimed quantities we obtain
Z
0 0
E0 < E0 + d3 rρ(r) [Vext (r) − Vext (r)] . (600)

Adding Eqs. (599) and (600) gives the contradiction E0 + E00 < E00 + E0 . This means that
there cannot be two different Vext (r) with the same ground state density.
Thus, ρ(r) determines N and Vext (r) and with this all ground state properties. Thus,
we can formally write any ground state property as a functional of ρ(r). In particular, this
is true for the energy
Z
E[ρ] = d3 rρ(r)Vext (r) + FHK [ρ],

FHK [ρ] = T [ρ] + J[ρ] + Encl [ρ] (601)


Z
1 ρ(r1 )ρ(r2 )
J[ρ] = d3 r1 d3 r2 .
2 r12
The functional FHK [ρ] is here represented as a sum over the kinetic part T [ρ], the classical
electrostatic contribution and a quantum-mechanical contribution Encl [ρ] which contains
quantum mechanical corrections. Within Hartree-Fock, it contains the exchange and the
Coulomb correlations. The key advantage of this representation is that it separated the
energy in a problem-specific part involving Vext (r) that is know and easy to compute and a
completely universal part FHK [ρ]. The challenge is that FHK [ρ] is not known. Knowing it
amounts to solving the many-body Schroedinger equation exactly (not within Hartree-Fock).
Providing an explicit from of the functional FHK [ρ] that is in some sense a well-controlled
approximation is the central problem in density functional theory.
b. Second Hohenberg-Kohn theorem FHK [ρ], the functional that delivers the ground
state energy of the system, delivers the lowest energy if and only if the input density is the
true ground state density.
This basically guarantees uniqueness of the solution and allows us to employ the varia-
tional principle in the search for the ground state density and energy. Thus we can obtain
the ground state energy
E0 ≤ E[ρ̃] (602)

by minimizing the functional E[ρ̃] over all trial densities ρ̃.


Proof : We can use the variational principle for wave functions together with the first
Hohenberg-Kohn theorem. The latter states that there is a one-to-one connection between

151
a potential ρ̃ and a Hamiltonian H̃el for which this is the ground state density (in particular,
ρ̃ determines a Ṽext (r) in this way). The Hamiltonian further has a unique ground state Ψ̃0 .
We can use Ψ̃0 as a trial wave function for the Hamiltonian Hel generated from the true
external potential Vext (r)
D E
ψ̃0 |Hel |ψ̃0 ≡ E[ρ̃] ≥ E0 ≡ E[ρ] = hψ0 |Hel |ψ0 i . (603)

We emphasize that this result is only true for the ground state, as association of the
Hamiltonian with a unique ground state is key. The approach cannot be used for excited
states.

3. Kohn-Sham equation

The Kohn-Sham equation provides an explicit approximate expression for T [ρ] and Encl [ρ]
and thereby defines a density functional theory. (Historically, the first relevant density
functional theory of relevance was the Thomas-Fermi theory which is still an important
subject of more mathematical research. For practical numerical computations it is however
not used anymore, because of the availability of better performing methods.)
Kohn and Sham suggested to calculate the exact kinetic energy of a non-interacting
reference system with the same density as the real, interacting one
N N
1 X
X !
ψj |∇2 |ψj , |ψj (r)|2 = ρ(r),

TS = − ρS (r) (604)
2 j=1 j=1

where ψj are the single-particle orbitals of the noninteracting system. Since TS will not be
equal to the actual kinetic energy T [ρ], one rewrites

FHK [ρ] = TS [ρ] + J[ρ] + EXC [ρ] (605)

where all the unknown functionals are subsumed in the exchange-correlation energy

EXC [ρ] = T [ρ] − TS [ρ] + Encl [ρ]. (606)

To go further with the approximation by a non-interacting system, we search for a potential


VS that provides as a solution a slater determinant that has the same density as the real
system that we are interested in. If we succeed in finding this, we have a single slater
determinant solution to a problem defined by TS and VS (i.e., a different problem) which

152
has the same solution in terms of electron density, as the problem we are interested in. We
recall the energy functional
Z Z Z
1 3 3ρ(r1 )ρ(r2 )
E[ρ] = TS [ρ] + d r1 d r2 + EXC [ρ] + d3 rVext (r)ρ(r). (607)
2 r12

All terms except for EXC [ρ] can be straightforwardly written in terms of the ψj . We ignore
this issue for the moment and apply the variational principle, i.e., minimize with respect to
the ψj subject to hψj |ψk i = δjk . This yields the Kohn-Sham equations
 Z 
1 2 3 ρ(r2 )
− ∇ + d r2 + VXC (r1 ) + Vext (r1 ) ψj (r1 ) = j ψj (r1 ) (608)
2 r12

with
Z
ρ(r2 )
VS (r1 ) = d3 r2 + VXC (r1 ) + Vext (r1 ). (609)
r12
The problem is now to find VS . This only depends on the density, i.e., the Kohn-Sham
equations have to be solved iteratively/self-consistently. Up to this, the problem is reduced
to find VXC = δEXC /δρ. Would we succeed in that, we would have the exact ground state
energy of the interacting system. The orbitals ψj that we would find this way have – strictly
speaking – no physical meaning. Still, for weakly correlated systems, they provide a good
approximation to the true states of the system.

4. Local density approximation and beyond

The local density approximation (LDA) is the basis for all practically used approximations
to the exchange-correlation functional. The central idea is that of a uniform electron gas.
The form
Z
LDA
EXC [ρ] = d3 r ρ(r) {X (ρ(r)) + C (ρ(r))} (610)

is postulated, where X (ρ(r)) + C (ρ(r)) is the exchange-correlation energy per particle of a


uniform electron gas of density ρ(r). This energy per particle is weighted with the probability
of finding an electron at this position. Of the two parts X (ρ(r)) and C (ρ(r)) the exchange
part is known exactly for the case of an electron in a uniform electron case of a particular
density
 1/3
3 3ρ(r)
X (ρ(r)) = − . (611)
4 π

153
The correlation part C (ρ(r)) is only known from numerical simulations for the same case,
but with high accuracy.
For weakly correlated systems, LDA gives EXC [ρ] with an error of 10%–20%, but predicts
other ground state properties with much higher accuracy. LDA largely fails for systems
dominated by electron-electron interactions, such as heavy fermion compounds.
Several methods to go beyond LDA are known. One natural step to include more non-
local aspects of the electron density in EXC [ρ] is the generalized gradient approximation
(GGA) Z
GGA
EXC [ρ] = d3 rf (ρ(r), ∇ρ(r)). (612)

Besides the form of the exchange functional, another important property of a density
functional code is the basis set in which the ψj are expanded. One of many options is the
linear combination of atomic orbitals, i.e., orbitals of atoms in free space.

154

You might also like