Optics and Quantum Electronics - Research Laboratory of Electronics PDF
Optics and Quantum Electronics - Research Laboratory of Electronics PDF
Optics and Quantum Electronics - Research Laboratory of Electronics PDF
Quantum Electronics
Editors
Maria Chekhova
Lomonosov Moscow State University, Russia
Sergey Kulik
Lomonosov Moscow State University, Russia
World Scientific
NEW JERSEY LONDON SINGAPORE BEIJING SHANGHAI HONG KONG TA I P E I CHENNAI
For photocopying of material in this volume, please pay a copying fee through the Copyright
Clearance Center, Inc., 222 Rosewood Drive, Danvers, MA 01923, USA. In this case permission to
photocopy is not required from the publisher.
ISBN-13 978-981-4324-50-2
ISBN-10 981-4324-50-7
Printed in Singapore.
Preface
This book belongs to the series of textbooks in electronics and radiophysics writ-
ten at the Physics Department of Lomonosov Moscow State University. Similarly
to the other books of this series [Migulin (1978); Vinogradova (1979)], it is writ-
ten for undergraduate Physics students and aims at introducing the readers to the
most general concepts, rules, and theoretical methods. The main focus is on the
three directions in physical optics that appeared after the advent of lasers: nonsta-
tionary interactions between light and matter (Chapter 5), optical anharmonicity
of matter (Chapter 6) and quantum properties of light (Chapter 7). The first four
chapters describe the theoretical base of more traditional parts of quantum elec-
tronics. The book starts with a short review of the history of quantum electronics
with its main concepts, ideas, and terms. Further, basic methods for describing
the interaction of optical radiation with matter are considered, based on quantum
transition probabilities (Chapter 2), the density matrix formalism (Chapter 3), and
the linear dielectric susceptibility of matter (Chapter 4).
The author tried to combine a systematic approach with a more detailed in-
sight into several interesting ideas and effects, such as, for instance, superradiance
(Sec. 5.3), phase conjugation (Sec. 6.5), and photon antibunching (Sec. 7.6).
The reader is expected to know the basics of quantum mechanics and statistical
physics; however, much attention is paid to explaining the notations used in the
book. The author tried to gradually increase the presentation complexity within
each section as well as within the whole book. Each section or chapter starts with
a simplified qualitative picture of the phenomenon considered. More complicated
sections providing additional information are marked by circles.
The book uses the Gaussian system of units, which is most common in quan-
tum electronics; however, in the numerical estimates, energy and power are given
in Joules and Watts.
v
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
D. N. Klyshko
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
Foreword
Maria Chekhova
Sergey Kulik
The Editors
vii
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
ix
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
Contents
Preface v
Foreword vii
1. Introduction 1
1.1 Basic notions of quantum electronics . . . . . . . . . . . . . . . 2
1.1.1 Stimulated emission . . . . . . . . . . . . . . . . . . . 2
1.1.2 Population inversion . . . . . . . . . . . . . . . . . . . 2
1.1.3 Feedback and the lasing condition . . . . . . . . . . . 3
1.1.4 Saturation and relaxation . . . . . . . . . . . . . . . . 4
1.2 History of quantum electronics . . . . . . . . . . . . . . . . . . 5
1.2.1 First steps . . . . . . . . . . . . . . . . . . . . . . . . 6
1.2.2 Radio spectroscopy . . . . . . . . . . . . . . . . . . . 6
1.2.3 Masers . . . . . . . . . . . . . . . . . . . . . . . . . . 7
1.2.4 Lasers . . . . . . . . . . . . . . . . . . . . . . . . . . 8
1.3 Recent progress in quantum electronics (added by the Editors) . 9
1.3.1 Physics of lasers . . . . . . . . . . . . . . . . . . . . . 9
1.3.2 Laser physics . . . . . . . . . . . . . . . . . . . . . . 10
1.3.3 New trends in nonlinear optics . . . . . . . . . . . . . 10
1.3.4 Atom optics . . . . . . . . . . . . . . . . . . . . . . . 11
1.3.5 Optics of nonclassical light . . . . . . . . . . . . . . . 11
xiii
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
Contents xv
Contents xvii
Contents xix
Bibliography 337
Index 343
This page intentionally left blank
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
Chapter 1
Introduction
a Editorsnote: This opinion was quite common in the laser physics community at the time when the
book was written. However, further investigations reduced the optimism in this field, and we are now
still witnessing new attempts towards laser thermonuclear fusion (inertial confinement fusion).
1
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
The operation of lasers and masers rests on the three whales, basic notions of
quantum electronics namely, stimulated emission, population inversion, and
feedback.
Fig. 1.1 Amplification of light under stimulated transitions. A resonant photon hits an excited atom,
which then gives its stored energy to the field. As a result, the field contains two indistinguishable
photons.
Introduction 3
(a) (b)
Fig. 1.2 Obtaining population inversion through optical pumping: (a) initial Boltzmanns population
distribution; (b) strong resonant radiation balances the populations of levels 1 and 3, so that N2 > N1 .
equal to the probability of stimulated emission, W21 = W12 , and therefore the
overall effect depends on the difference of numbers of atoms at the levels 1 and 2,
N N1 N2 . Usually, populations Nm of the levels are defined per unit volume.
If the matter is at thermodynamic equilibrium with a temperature T , then, ac-
cording to Boltzmanns distribution, Nm exp(Em /T ), with being the Boltz-
mann constant. Therefore, if E2 > E1 , then N2 < N1 (Fig. 1.2(a)). As a result,
stimulated up transitions occur more frequently than stimulated down transi-
tions, and external electromagnetic radiation in equilibrium medium is attenuated.
Thus, in order to amplify field, the medium should be in a non-equilibrium state,
with N2 > N1 . One says that such a state has population inversion, or negative
temperature.
A lot of methods have been developed for achieving population inversion.
The most important ones are pumping the medium (Fig. 1.2(b)) with auxiliary
radiation (used for solid and liquid doped dielectrics), electric discharge in gases
and injection in semiconductors.
Introduction 5
Introduction 7
investigations with beam radio spectroscopes, which had been started as early as
in the 1930s. Probably, it was also important that radio spectroscopists, in contrast
to opticians, understood very well the operating principles of MW generators and
amplifiers based on free-electron beams (klystrons, magnetrons, traveling-wave
and backward-wave tubes), they were familiar with the notions of negative resis-
tance and positive feedback, and had practical experience with high-quality MW
cavities.
Among the works directly preceding the advent of masers, one should mention
the ones by Kastler (France), who developed in 1950 the optical pumping method
for increasing the population inversion of close levels in gases. Besides gas and
beam radio spectroscopy, an important role was also played by magnetic radio
spectroscopy, a direction that was started in the 1940s and studied the interac-
tion of radio waves with ferromagnetics and nuclear or electronic paramagnetics
(E. K. Zavoisky, 1944). These are namely the achievements in the theory and
technique of magnetic resonance that led to the development of paramagnetic am-
plifiers, which have an extremely small level of noise. Population inversion has
been first obtained in a system of nuclear spins placed into magnetic field (Parcell
and Pound, 1951).
1.2.3 Masers
The idea of using stimulated emission in a medium with population inversion for
the amplification and generation of MW electromagnetic waves was suggested
at several different conferences at the beginning of the 1950s by N. G. Basov
and A. M. Prokhorov (Lebedev Physics Institute, Academy of Sciences, USSR),
C. H. Townes (Columbia University, USA), and J. Weber (University of Mary-
land, USA). The first quantitative theory of a quantum generator was published
by Basov and Prokhorov in 1954. They have found the threshold population dif-
ference necessary for self-excitation and suggested a method for obtaining popula-
tion inversion in a molecular beam using inhomogeneous electrostatic field. Later,
Basov, Prokhorov, and Townes were awarded a Nobel Prize for their contributions
to the development of quantum electronics.
In 1954, description of the first operating maser was published by Gordon,
Zeiger, and Townes. The active medium was ammonium molecular beam, focused
with the help of electric field. Nowadays, beam masers are used in the national
standards of frequency and time.
The second basic maser type, paramagnetic amplifier, was created in 1957
by Scovill, Feher, and Seidel who followed a suggestion by Bloembergen. In
paramagnetic amplifiers, population inversion is created with the help of auxiliary
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
radiation, the pump, which saturates the populations of levels 1 and 3 (Fig. 1.2).
As a result, levels 1 and 2 (or 2 and 3) get population inversion. The idea of pump-
ing a three-level system, which was later widely used in solid-state and liquid
lasers, belongs to Basov and Prokhorov (1955). The active medium of paramag-
netic amplifiers, which is a diamagnetic crystal doped with a small amount (on the
order of 103 ) of paramagnetic atoms, i.e., atoms with odd electron numbers, is
cooled down to helium temperatures. Cooling is necessary for reducing the noise
and slowing down the relaxation processes, which counteract the population in-
version. (In paramagnetics, relaxation of populations is caused by the interaction
between crystal lattice vibrations and the magnetic moments of non-compensated
electrons.)
1.2.4 Lasers
Transition from radio to the optical frequencies took about five years: the first
operating laser emitting coherent red light was described by Maiman in 1960. As
the active medium, the laser used a pink ruby crystal (aluminium oxide doped
with chrome) and population inversion was achieved using blue and green light
from a pulsed flash lamp. An important step was realizing that a Fabry-Perot
interferometer, i.e., two parallel plane mirrors, is a high-quality resonator, i.e., an
oscillation system for light waves (Prokhorov, Dicke, 1958).
The laser era of physics started. Soon after the appearance of solid-state lasers
with optical pumping, a number of other laser types was developed: gas discharge
lasers (1961), semiconductor lasers based on p n transitions (1962), liquid lasers
based on the solutions of organic dyes (1966). Rather quickly, the wavelength
range from far infrared (IR) to far ultraviolet (UV) was covered. The parameters
of the lasers (power, monochromaticity, directivity, stability, tunability) were con-
tinuously improving; their field of application rapidly broadened. An important
role was played by the invention of methods to shorten the duration of laser light
pulses (q-switching and mode locking).
First experiments on light frequency doubling (Franken et al., 1961) started the
explosive development of nonlinear optics, which studies and uses the nonlinear-
ity of the matter at optical frequencies. Holography and optical spectroscopy had
their second birth; new fields appeared, such as optoelectronics, coherent spec-
troscopy, and quantum optics. X-ray and gamma-ray lasers are to arrive soon.c
It should be stressed once again that the rapid development of quantum elec-
tronics was provided by a large amount of ideas and information stored by the
c Editors note: While X-ray lasers have been indeed constructed in the end of the 20th century [Svelto
Introduction 9
beginning of the 1950s in the fields of radio and optical spectroscopy. Such di-
rections of physics as magnetic resonance or molecular-beam spectroscopy, seem-
ingly far from practical applications, led to a laser revolution in many fields of
science and technology.
This textbook was published in 1987, almost a quarter century ago. At that time, it
was a very modern book; it reflected the latest events in quantum electronics and
provided a complete picture of its directions and tendencies. Since then, many
changes took place in this field. New technologies appeared, new laser sources
were developed, and new effects were discovered. In this section, we will try to
briefly review the advances in quantum electronics that happened after the book
had been published.
Apparently, lasers became widely used devices which penetrate into all fields
of human activity starting from toys up to the high technologies and medicine.
Introduction 11
a chip. Optical fibres are now used not only for light transmission, but also for
beam splitting, polarization transformations, as nonlinear elements and as active
elements [Agrawal (2007)]. Nonlinear waveguides, based on KTP and lithium
niobate crystals, and sometimes on semiconductor layers, are used as extremely
efficient and compact elements for frequency conversion, requiring very low pump
powers and allowing for relatively easy control. Integrated optics also uses plas-
monic structures, which form convenient interfaces between free space or di-
electrics and metal surfaces.
We now witness a certain shift of interest to novel frequency ranges. Among
them, attention is drawn to the terahertz (1012 Hz) range of frequencies, which is
important for spectroscopic studies in biology, for astronomy, and for the security
applications (detection of explosive materials and weapons). For more details
on the recent developments in nonlinear optics, one can see, for instance, [Boyd
(2008)].
Introduction 13
d This is true provided that the unavoidably introduced error rate exceeds some critical level, depending
Chapter 2
The most important notion in quantum electronics is the probability for an electron
in an atom or a molecule to make a quantum transition from one level to another.
In this chapter, we will first give the general expression for the probability of a
quantum transition in the first order of the perturbation theory (Sec. 2.1), then
calculate the probability of a transition due to monochromatic radiation (Sec. 2.2)
and find the interaction cross-section and the absorption coefficient (Sec. 2.3).
Further, we will consider stimulated transitions under fluctuating (noise) radiation
with a broad spectrum (Sec. 2.4). Noise radiation surrounding an atom can play
the role of a thermostat and cause relaxation (Sec. 2.5).
A consistent theory of electromagnetic processes should describe both the
matter and the field based on the principles of quantum mechanics. However,
most part of quantum electronics effects are sufficiently well described by the so-
called semiclassical theory of radiation, in which only the motion of particles
is quantized while the electromagnetic field is considered in terms of classical
Maxwells equations. By avoiding quantum electrodynamics, one gets the the-
ory considerably simplified but, at the same time, loses the chance to consistently
describe fluctuations of the electromagnetic field and, in particular, spontaneous
emission and the noise of quantum amplifiers. The present book mainly considers
stimulated effects in a classical deterministic field and therefore uses the semi-
classical theory of radiation. Quantization of the field and spontaneous effects are
considered in Chapter 7.
15
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
ternating field E(t). (Usually the effect of the magnetic field is much weaker than
the one of the electric field.) At the second stage, we will find the back action of
the atoms on the field. The self-consistent solution to the two systems of equa-
tions describing the response of the matter to the field and the response of the field
to a given motion of charges, under certain simplifying conditions, is the main
problem in the theory of interaction between radiation and matter.
The behavior of material particles in given external fields is described by the
Schrodinger equation,
(i~/t H)(r, t) = 0. (2.1)
Here, is the wave function, whose arguments are the set of coordinates r =
{r1 , r2 , ...} and the time; H is the energy operator consisting of the non-perturbed
part, H0 , and the alternating energy of the particles in the external field, V(t),
H(t) = H0 + V(t). (2.2)
The non-perturbed energy, in its turn, includes the kinetic energy of the particles
and the energy of their interaction V0 . (The latter also includes the energy of the
particles in external static fields).
so that
Z X
dr||2 = |cn |2 = 1. (2.7)
n
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
The cn coefficient in the expansion (2.3) gives the relative population |cn |2 of
the level n, i.e., the probability to measure the energy En or, as one says, the
probability to find the system at the level n. Indeed, according to the rule of
calculating mean values in quantum mechanics, the mean energy of the system,
with an account for Eqs. (2.3)(2.6), is
Z X
E hH0 i dr H0 = |cn |2 En . (2.8)
n
Note that, according to (2.3), in the general case the atom is not necessarily in
a stationary state with a definite energy En (even in the absence of the alternating
force, V(t) = 0). For instance,let only two coefficients cn of the superposition
(2.3) be nonzero: c1 = c2 = 1/ 2; then the mean ensemble energy of the atom is
(E1 + E2 )/2 but single energy measurements will give either E1 or E2 . Then the
electron cloud, i.e., the probability density to find the electron at point (r, t), will
oscillate with the Bohr frequency, 21 2 1 (E2 E1 )/~ (Fig. 2.1):
P(r, t) = |(r, t)|2 = |1 (r) + 2 (r)exp(i21 t)|2 /2
= 21 /2 + 22 /2 + 1 2 cos(21 t). (2.9)
(We assume that n = n .) Such nonstationary states are called coherent ones.
This term is often used in the case where many identical atoms are in a non-
(a) (b)
Fig. 2.1 Electron cloud of an atom that is in a coherent (non-stationary) state given by a superposition
of two stationary states 1 and 2 with different symmetries oscillates with the transition frequency
21 : (a) dependencies of the wave functions on one of the space coordinates; (b) corresponding con-
figurations of the electron cloud.
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
stationary state with the same phase. Then, electrons oscillate synchronously
and the system of atoms has a macroscopic dipole moment emitting intense light
with the frequency 21 . This effect, called superradiance, will be considered in
Sec. 5.3.
In the presence of external alternating field E(t), eigenoscillations of the elec-
tron cloud with the frequencies mn will be accompanied by stimulated oscilla-
tions with the frequency of the field .
As a result, we find the system of equations for the coefficients cn (t), which is
equivalent to the initial Schrodinger equation:
X
0
i~dcm /dt = Vmn cn . (2.14)
n
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
In order to pass to the energy representation, let us left-multiply (2.16) by the m-th
eigenvector of H0 ,
d X
i~ hm|ti = hm|H|ti = hm|H|nihn|ti. (2.17)
dt n
P
The last equation was obtained using the expansion of the unity, I = |nihn|. Let
us denote hm|V|ni Vmn and use (2.2) and (2.5), then
d X
i~ hm|ti = Em hm|ti + Vmn hn|ti.
dt n
Thus, we look for the solution to (2.14) in the form of the sum
X
cn = c(0)
n + c (1)
n + = c(s)
n , (2.18)
s
in which the s-th term is proportional to the s-th power of the external force,
c(s) s
n V . Substituting (2.18) in (2.14) and setting equality between the terms of
the same order in V, we find
c(0)
m = 0, (2.19)
X
0
i~c(1)
m = Vmn c(0)
n , (2.20)
n
X
0
i~c(s)
m = Vmn c(s1)
n . (2.21)
n
cn (t0 ) = c(0)
n = n1 . (2.22)
In this case, the system of equations (2.20) for the first-order coefficients gets
decoupled,
0
c(1)
m = Vm1 /i~. (2.23)
Thus, at large time delays after the perturbation has been switched on (t t0
), the c2 coefficient, which determines the perturbed population of the level with
the energy E2 , scales as the Fourier transform of the external force at the Bohr
frequency 21 = (E2 E1 )/~. In other words, in the first-order perturbation
theory, a quantum system behaves as a set of linear oscillators and responds only
to the resonant harmonics of the external force. If there is no 21 harmonic in the
perturbation spectrum, or its amplitude is small, then level 2 will not be populated.
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
Fig. 2.2 Probability Pmn of a stimulated transition as a function of the field frequency and the
interaction time t.
Thus, in the first-order perturbation theory, the probability to find the atom at
level m at time t, under the initial condition cn (0) = 1, is
#2
dmn E0 2 sin(t/2)
"
Pmn = , (2.34)
2~ /2
where |mn |. According to this formula, the dependence of the transition
probability on the external field frequency is resonant, and the sharpness of the
resonance increases with time (Fig. 2.2). As a result, in the limit t , the
transition probability is given by Diracs delta function,
" #2
sin(t/2)
lim = 2t(). (2.35)
t /2
The factor by the delta function is verified by integrating both parts of (2.35) in .
Because the transition probability scales as the time duration t of the pertur-
bation, one can introduce the transition probability per unit time, also called the
transition rate,
Wmn Pmn /t = 2|dmn E0 /2~|2(). (2.36)
Thus, the transition rate scales as the square of the field, i.e., the intensity of the
wave. The presence in (2.36) of the delta function, which differs from zero only at
exact resonance, can be understood from the photon viewpoint: according to the
energy conservation law, the change in the atom energy by the value Em En
~mn should be accompanied by the absorption or emission of a photon with the
energy ~.
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
Fig. 2.3 Spectral line shapes. Broadening due to collisions or spontaneous radiation leads to a
Lorentzian line shape (1); the Doppler effect results in a Gaussian line shape (2), while limited in-
teraction time t causes a sinc2 (t/2) shape (3).
where sinc x (sin x/x) and is the full line width at level 1/2. It is clear from
Fig. 2.3 that the particular mechanism of broadening has a noticeable effect only
on the tails of the lines. Note that the amplitude ratio of the area-normalized form
factors (2.39), (2.41), (2.42) at exact resonance and with the same is
!1/2
2 4 ln 2 5.56
: : 1 : 1.47 : 1.39. (2.43)
2
When deriving (2.37), we have replaced the function (2/)2 sin2 (t/2), which
was obtained from the perturbation theory, by 2t g(). (Here, is the frequency
mismatch between the field and the transition.) Let us explain this procedure using
the strong collision model. According to this model, every collision instantly
brings the atom back to the initial level, after which the interaction between the
field and the atom starts anew. Then, t in (2.34) should be replaced by t t0 t,
where t0 is the time instance of the last collision. In gas, the time interval t
between the last collision of an atom and some fixed time t is a random variable
with exponential distribution (see [Rytov (1976)]):
Here, we have used expression (2.34) for the transition probability P. The mean
transition rate is given by averaging (2.45) using (2.44),
Z
2 /2
W d(t)P(t)W(t) = . (2.46)
0 1 + 2 2
The last expression is in agreement with (2.37) and (2.39) if is replaced by the
inverse half-width of the line, 2/. The integral in (2.46) can be easily done by
replacing sin x with Im[exp(ix)].
Here, e is the unity polarization vector, k = kn/c is the wave vector, k is the
unity vector in the propagation direction, n = is the refractive index. After
substituting (2.48) in (2.47) and time averaging, we get the following relationship
between the time-averaged energy and the amplitude of the plane wave:
Intensity of the wave is obviously given by the product of the energy density,
E/V, and the velocity of the wave, c/n,
Note that this formula was derived without considering the frequency dispersion
of the medium, (). The dispersion can be taken into account by replacing in
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
(2.47) by d()/d (see, for instance, Refs. [Landau (1982); Silin (1961)]). As a
result, (2.49), instead of n2 , will contain the expression
(1/2)[d()/d + ] = c2 /uv, (2.51)
where v = c/n is the phase velocity and u = d/dk, the group velocity. Now, the
energy density should be multiplied by u, which again leads to relationship (2.50).
Nm |cm |2 N0 , (2.56)
where N0 is the total number of atoms. Thus, from considering a single particle
we pass to studying a system of N0 identical non-interacting particles. The n m
transition rate is defined as (see Eq. (2.36))
Here, so far, the relaxation is ignored; in the general case, the rates Wmn should
also include contributions from chaotic fields created by the surrounding particles.
Such equations are studied in non-equilibrium thermodynamics; they are called
kinetic equations or population balance equations. If only two levels take part in
the exchange, then
N1 + N2 = N0 , (2.59)
and only one of equations (2.58) is sufficient. Because the perturbation operator
is Hermitian, it follows that W12 = W21 W (see (2.26)), hence
N1 = N2 = W(N2 N1 ), (2.60)
where N(r, t) is the photon concentration and uN = F is the photon flux density
vector. Hence, in the stationary one-dimensional case, where N/t = 0 and
F = Fz (z),
with the concentration of chrome ions 1019 cm3 , the line width is on the or-
der of 50 MHz. Substituting for d in (2.53), for = 1 cm we get =
82 /~ f = 5 1020 cm2 . A realistic number of active particles available in
paramagnetics is approximately equal to the equilibrium population difference,
N (0) = ~N0 /T g N0 /10 = 1018 cm3 , where g = 2S + 1 = 4 is the degener-
acy of the chrome ion ground level, which is lifted by a constant magnetic field,
and S is the spin number. Hence, 0 = 0.05 cm1 , and for obtaining amplification
G = 100 one needs the length of the crystal l = ln G/0 1 m. In order to reduce
l, the crystal is placed into a bulk resonator, where radiation can many times pass
through the matter, or into a slowing-down system. In the latter case, the above-
given formulas for and will be still valid, with the speed of light in the vacuum
replaced by the group velocity of waves in the slowing down medium, u = dK/dz,
where K is the propagation constant.
Fig. 2.4 Observed shape of the resonance in the case of a Lorentzian line with width at different
optical densities y = 0 l at the center of the line (x 2( 0 )/).
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
2.3.7 Degeneracy of the levels
Expression for the amplification coefficient and the inversion condition N2 > N1
have been obtained above under the assumption that the atom energy levels were
not degenerate. Let now g1 different (with respect to some parameters) states have
the same energy E1 and g2 states have energy E2 ,
(H0 E1 )1i = 0 (i = 1, . . . , g1 ),
(2.69)
(H0 E2 )2 j = 0 ( j = 1, . . . , g2 ).
Note that the conclusions given below will be also valid in the case where the
degeneracy is lifted due to sufficiently small perturbations (Fig. 2.5). Now, the
Fig. 2.5 Degeneracy of the levels: g1 different states have the same energy E1 while g2 other states
have the energy E2 . In the right-hand side of the figure the degeneracy is lifted due to the external con-
stant field H0 , which breaks the symmetry of the system. Alternating field causes transitions between
a certain pair of states i, j.
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
Now, assume that the saturation effect is absent and relaxation or inversion
lead to a uniform distribution of sublevel populations,
N1 = W 0 N 0 , (2.73)
where
X
W0 W1i,2 j , N 0 N1 /g1 N2 /g2 . (2.74)
ij
Thus, degeneracy of the levels can be taken into account if W in Eq. (2.66)
is understood as W 0 and N, as the difference of state populations Nm /gm . The
inversion condition then takes the form
Let, for instance, g1 = 1 and g2 = 3, then, one needs N2 > 3N1 for amplification.
Recall that, according to the Boltzmann distribution,
From (2.80) and the definition (2.79), the symmetry property follows,
G (t) = G (t); (2.81)
in particular, G (t) should be an even function of time.
Thus, according to (2.78) and (2.79), the probability of a transition due to
stationary random radiation is determined by the field correlation tensor,
X " t
P21 = ~2
d21 d21 dt0 dt00 exp[i21 (t0 t00 )]G (t0 t00 ). (2.82)
t0
Here, as usual, we omit infinite integration limits and denote the function and its
Fourier transform by the same letter. The second integration (in t2 ) simply yields
the observation time t t0 , so that one can define a time-independent transition
rate W P/(t t0 ). Let the dipole moment of the transition be parallel to the x
axis, then we finally find a simple expression for the transition rate,
W21 = 2~2 |d21 |2G xx (21 ). (2.85)
Thus, the rate of a stimulated transition due to a random (noise or incoher-
ent) perturbation scales as the spectral density G() of the perturbation at the
transition frequency. It is useful to compare (2.85) with formula (2.37), which
defines the transition rate in the case of a monochromatic field. The two formulas
coincide after the substitution |E0 |2 g() G().
In this consideration we did not take into account the broadening of the
levels due to relaxation processes. However, intuitively it is clear that the conclu-
sion should be still valid in the case where is much less than the width of the
perturbation spectrum, E 1/E . In this case, the field is called incoherent. In
the opposite case, the field can be obviously considered as monochromatic, i.e.,
coherent.
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
where the last equality was obtained using relation (2.84) with t = 0 and taking
into account that, according to (2.83), G() is an even function. From the com-
parison of (2.86) and (2.87) we get the relation between the spectral densities of
field amplitude and energy,
The proportionality coefficient B between the transition rate and the energy
density is called the Einstein coefficient for a stimulated transition. In the next
section, using the Planck function (0) () for equilibrium radiation, we will find
the second Einstein coefficient, A, giving the rate of spontaneous transitions.
2.4.4 Spectral field density
Concluding this section, let us clarify the physical meaning of the field spectral
density G(). In order to do this, we formally represent E(t) as a Fourier integral
(the subscript is omitted),
Z
E(t) = deit E(), (2.91)
Z
E() = dteit E(t)/2. (2.92)
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
As a result,
Consider population kinetics for atoms in an equilibrium field with the energy
spectral density (0) (). From the kinetic equation (2.60) for two nondegenerate
levels, it follows that
N2 = N1 = B(N1 N2 ), (2.98)
where B B12 = B21 and (21 ). Thus, a noise broadband field, similarly
to a monochromatic one, tends to equalize the populations of the levels, so that
in the stationary regime N1 = N2 . However, equilibrium radiation of temperature
T should heat or cool the matter to the same temperature, hence the population
distribution is given by the Boltzmann formula, according to which N1 > N2 .
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
with
N() = [exp(~/T ) 1]1 , (2.102)
where we assumed the value 1/137 for the fine structure constant e2 /~c (recall
that this number also defines the ratio of the velocity of an electron in a hydrogen
atom to the speed of light). Assuming = 1/R 4 137a0 0.1, with R being
the Rydberg constant, we obtain
Displacement of atomic levels due to the interaction with the electromagnetic vac-
uum (the Lamb shift) is on the same order of magnitude or smaller. Thus, relative
perturbation of an atom by the vacuum is extremely small.
where
g 2 /2 c3 , (2.110)
has the meaning of the spectral density of field modes in a unit volume (recall that
is the energy spectral density per unit volume). A mode, or an oscillation type, is,
roughly speaking, an oscillation degree of freedom (or a spatial harmonic) of the
field (Sec. 7.3). The inverse value, 1/g , is equal to the frequency interval between
the neighboring modes. According to definition (2.109), N has the meaning of the
field energy per one mode, in ~ units. In other words, N is the number of photons
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
per mode. This value is also called the degeneracy factor of photon gas. Note that
both energy and the number of photons fluctuate; here, and N are mean values.f
The value N is the most important parameter of incoherent radiation. Let us
show that it scales as the basic photometry characteristic, the spectral brightness
I . The latter is defined as the radiation intensity within a unit spectral interval
and unit solid angle and has dimensionality [W/(cm2 Hzsr)]. Radiation intensity
per unit frequency is equal to half the spectral energy density, /2, multiplied by
the speed of light. By adding the factor 1/4, we pass to the spectral brightness
I . Hence, with the help of (2.109), we find
it was first mentioned by Wooters and Zurek and independently by Milonni and Hardies in 1982, if the
ratio between the induced and spontaneous transitions tends to unity, then it leads to the impossibility to
clone the polarization state of a single photon. Indeed, if Win = Wsp , then the mean number of photons
per field mode equals unity. Then, amplification of a single photon through stimulated transitions in an
atom (which was initially prepared in excited state) will be accompanied by the spontaneous emission
of a photon that has random polarization with respect to the initial one. If the mean number of photons
per mode grows, then the contribution of spontaneous transitions goes down. According to (2.109),
cloning becomes possible in the limit of high N, which is often associated with classical field, therefore
this fact does not contradict to the non-cloning theorem. However, we would like to stress that this
conclusion has to be applied with caution because there is no criteria of non-classicality of light based
on the mean photon number! For instance, squeezed states of light or bright squeezed vacuum states
(Sec.7.5) are nonclassical despite having large photon numbers [Bachor (2004)].
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
2.5.4 Relaxation time
With the help of (2.99), let us now define the rate of population variation with an
account for both spontaneous and stimulated transitions. For this, we replace N1
by N0 N2 and use relation (2.112):
N2 = A[NN0 (2N + 1)N2 ]. (2.113)
Here N is the number of photons per mode and N0 = N1 + N2 is the total number
of atoms at two levels. Hence, in a stabilized regime,
N1(0) /N0 = (N + 1)/(2N + 1) = /(1 + ),
(2.114)
N2(0) /N0 = N/(2N + 1) = 1/(1 + ),
where exp(~21 /T ). The solution to (2.113) has the form
N2 (t) = N2(0) + [N2 (t0 ) N2(0) ]et/T 1 ,
(2.115)
1/T 1 A(2N + 1) = 2B + A = w12 + w21 .
Thus, the time T 1 of heating (or cooling) of the atomic internal degrees of freedom
due to the interaction with incoherent radiation at small N equals 1/A, the life time
of an atom due to spontaneous transitions, while at large N it reduces 2N + 1 =
coth(~21 /2T ) times.
In fact, here we have considered a simple model of relaxation where the ther-
mostat is formed by incoherent electromagnetic radiation surrounding the atom
(Sec. 7.7).
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
Chapter 3
The probability method used above allowed us to describe the energy exchange
between radiation and atoms. At the same time, another known manifestation of
the interaction between field and matter, namely, the slowing down of the waves
propagation, was not considered. Another, and more important, drawback of the
probability approach is that it does not provide a sufficiently rigorous account
for relaxation processes, whose consistent consideration should be performed in
the framework of statistical physics and kinetics. A more complete theory of
the interaction of atoms with the external field and the thermostat is based on the
density matrix formalism, which combines quantum and statistical considerations.
Below, in Sec. 3.1, we discuss the definition and the general properties of the
density matrix. Itn Sec. 3.2, its diagonal elements are considered, which give the
populations of the levels, and the notion of negative temperature is introduced.
Section 3.3 describes the time evolution of the density matrix and the relaxation
processes.
3.1.1 Observables
In Chapter 2, we have defined the transition probability in terms of the amplitudes
of energy states cn . Let us now write an arbitrary observable f of a quantum
system (further, f will be understood as the dipole moment of an atom, f d ) in
terms of similar coefficients. We will start from the basic measurement postulate
of the quantum mechanics: multiple measurements of a value f performed on an
ensemble of identical systems, i.e., systems prepared in the same state (r, t),
will yield, on the average, the value
Z
h f (t)i = dr (r, t) f(r, t) ht| f |ti, (3.1)
43
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
where f is the operator corresponding to f and r is the set of the system coordi-
nates. (In what follows, we will often omit the hats of the operators.)
It is important that f in (3.1) can be also understood as a product of operators,
f g2 or f gh. This enables one to determine not only the mean values, h f i,
called first-order moments, but also higher-order moments, hg2 i, hgn i, hghi, . . . ,
which characterize the quantum fluctuations of g and the quantum correlation of
g and h. Of course, Eq. (3.1) is written in the Schrodinger picture and is therefore
applicable only to the case where f is a single-time operator, for instance, f (t) =
g(t) h(t). In order to define the correlation functions hg(t1 ) h(t2)i, one has to switch
to the Heisenberg picture where the time dependence is attributed to the operators
and not to the wave functions.
Knowing the wave function, one can find not only the moments h f n (t)i of an
observable f but also its distribution at time t, P( f, t). This function is given by
Eq. (3.1) with the operator f replaced by the diade operator | f ih f | (Sec. 7.5).
is assumed to be known, and the problem is reduced to the one of finding pairwise
products bn bm , which also form a matrix, called the density matrix or the statistical
matrix,
mn bm bn . (3.4)
Thus, the state vector of the system is put into correspondence with a matrix.
One can also define an operator corresponding to :
Z
drm n bm bn , (3.5)
In terms of the density matrix and operator, the mean (3.3) can be written in a
more compact manner,
X
hfi = mn fnm = Tr( f ), (3.6)
mn
P
where Tr f means the sum of the diagonal elements, fnn , called the trace, or
Spur, of the matrix. The trace of a matrix is one of its invariants, since it does not
change under the matrix transformations of the form f 0 = U f U 1 . Such operator
transformations describe changes of the representation in quantum mechanics, and
the invariance of the trace provides that the observable quantities are independent
of the choice of representation. The property Tr f 0 = Tr f immediately follows
from another property, Tr(gh) = Tr(hg), which can be easily verified from the
definitions of the Tr and multiplication operations.
where
Z
Pi |i |2 , fi j dri f j .
In a similar mixed state, the last term in (3.7) is absent. There is an analogy
with the superposition of two light fields: coherent fields interfere, while a non-
coherent mixture simply yields double intensity.
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
Due to its linearity, the averaging operation could be included into the definition
of the density matrix. Equation (3.6) maintains its form in this case. In future, the
bar denoting additional averaging will be omitted.
3.1.4 More general definition of the density matrix
Often, one defines a mixed state and the density matrix in a different, more general
way. In this case, the term mixed state is applied not to the whole system but to
a part of it.
Let a system consist of two parts, A and B. Its state is given by a wave function
(rA , rB), which, in the general case, is not factorable, (rA , rB) , A (rA )B (rB ).
Therefore, the wave function A of a subsystem does not exist. Indeed, factora-
bility means independence of the subsystems, hence it is impossible if A and B
interact or have interacted in the past. There is an analogy with classical statis-
tical physics: for interacting particles, the joint probability distribution P(rA , rB)
cannot be represented as P(rA )P(rB).
However, in classical statistical physics we can separately define the probabil-
ity distribution function for subsystem A by summing P over the variables that are
of no interest for us,
Z
PA (rA ) = drB P(rA , rB). (3.11)
Tr = 1, 0 nn 1, + = . (3.16)
In most cases, one uses the energy representation, in which the diagonal elements
of the density matrix nn n have the meaning of relative occupation numbers
of the levels. The first property in (3.16) means that the probability to find the
system on some level is equal to unity, the second one provides the non-negativity
of the probability, and the third one (Hermiticity), that the observable quantities
are real,
X X
h f i = mn fnm
= nm fmn = h f i.
state amplitudes of various systems of the ensemble contain random phase fac-
tors, b(i) (i)
n exp(in ), then, for m , n,
3.1.6 Density matrix and entropy
Let a closed system be in a pure energy state, = 1 exp(iE1 t/~). Then, accord-
ing to definition (3.4), there is only one nonzero element of the density matrix,
mn = mn n1 . Such a trivial matrix satisfies the matrix equation
2 = . (3.19)
This is a property of all pure states. It follows from Eq. (3.4), the matrix multipli-
cation rule, and the normalization condition Tr = 1.
Violation of equality (3.19), or its corollary Tr2 = 1, can be a sign of a
mixed state. However, there exists a more convenient quantitative measure of
statistical indeterminacy of quantum systems, the entropy (see, for instance, [Fain
(1972); Landau (1964)]). Let us define the entropy operator in terms of the density
operator in the following way: S ln . Then the entropy S is equal to hS i, i.e.,
S = hln i = Tr( ln ). (3.20)
In the representation where is diagonal, (3.20) takes the form
X
S = n ln n . (3.21)
n
(This follows from the fact that in the diagonal representation, [F( f)]nn = F( fnn ).)
In a pure state, n equals 0 or 1, therefore S = 0, the indeterminacy (chaoticity)
is minimal. The opposite limiting case of a maximal indeterminacy is realized for
a uniform mixture of states, n = const = 1/g, where g is the number of states with
a given energy (the Gibbs microcanonical ensemble). Then, = I/g, 2 = I/g 2
and, according to (3.21)
g
X
S = (1/g) ln(1/g) = ln g. (3.22)
n=1
Thus, 0 S ln g.
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
3.1.7 Density matrix of an atom
In statistical physics, one usually considers macroscopic objects consisting of
N 1022 identical particles. The terms state, energy level, density matrix
relate in this case to the matter as a whole. In principle, one can speak about the
wave function of a 1 cm monocrystal, which depends on about 1022 space argu-
ments ri and time. Correspondingly, the number of possible states and, hence, the
dimensionality of the matrices fmn , mn are also extremely high. Furthermore, in
order to realize an ensemble one has to have, say, 103 similar crystals.
On the other hand, active media in quantum electronics, such as gases,
doped crystals or dye solutions, as a rule, consist of weakly interacting atoms
or molecules. Then, it is sufficient to consider the state of a single atom, or, to be
precise, of a single external electron. The rest of the particles are then considered
as a thermostat, which has a weak influence on the wave function of the atom.
This transition from about 1022 degrees of freedom to a few ones leads to a
crucial simplification of the theory, i.e., to the ideal gas model. The theory is fur-
ther simplified by excluding from considerations all states that are not populated
and not resonant with respect to the external field. This transition to the two-level
system is valid in the case of a quasi-monochromatic external field and the ab-
sence of degeneracy. Note that the density matrix of an n-level non-degenerate
system consists of n2 elements, n(n 1) of them being complex. However, the
normalization and Hermiticity conditions (3.16) reduce the number of indepen-
dent elements, so that the state of the system is described by n2 1 m real
numbers. For a two-level system, m = 3, and its state can be represented as a
point in a three-dimensional phase space, with the coordinates 2021 , 20021 , and
1 2 (Sec. 4.4). In the case of a pure state, condition 2 = reduces the
number of independent parameters to two, and the state can be shown by a point
on a unit sphere.
Since the atoms are identical and independent, the additive macroscopic pa-
rameters of the matter, such as polarization P, are calculated by simply multiply-
ing single-atom mean values by N, P = Nhdi. Note that if all gas atoms are under
the same conditions, the gas as a whole can be considered as an ensemble (quan-
tum or quantum-statistical) containing approximately 1022 systems. Summation
over atoms is then equivalent to ensemble averaging, and a diagonal element of
the density matrix, n , defines the average relative population Nn /N of a level En
in a real gas rather than in a hypothetical ensemble of 103 similar gas volumes.
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
partly) due to the spin-orbit interaction (the fine structure) or due to static fields
(the Stark and Zeeman effects). In this case, particles are distributed over sub-
levels, and if the splitting is much less than T , then the populations of the
sublevels are approximately equal to N/g1 , and the population differences are,
according to (3.25), on the order of
N (~/g1 T )N N. (3.27)
Transitions between such sublevels in doped crystals are used in paramagnetic
amplifiers, and relation (3.27) explains why it is necessary to cool the active media
of amplifiers down to helium temperatures.
In the case of molecular gases or solutions of organic dyes, the ground elec-
tronic level has a rich rotational-vibrational structure, which covers the microwave
and the middle-IR spectral ranges. Therefore, the molecules are distributed over
many levels, and the population differences are small as well.
N1 N2
= = tanh x. (3.29)
N
The active medium of a laser, in principle, is in a strongly non-equilibrium
state, and the Boltzmann distribution (3.25) is not applicable to it, as is, strictly
speaking, any notion related to temperature. However, in the case of non-
equilibrium systems it is convenient to keep the equations in the form (3.28),
(3.29) but to understand T as some effective parameter. Effective, or spin, temper-
ature for a given pair of non-degenerate levels is defined through the population
ratio as follows:
Nm /Nn exp(~nm /T e f ), (3.30)
i.e., the effective temperature is simply a logarithmic measure of the population
ratio. It follows from (3.30) that in the case of population inversion, T e f < 0.
It is easy to see that Eqs. (3.28), (3.29) maintain their form even for non-
equilibrium systems, provided that T is understood as the effective temperature.
Figure 3.1 shows the relative population difference as a function of the effective
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
Fig. 3.1 Relative population difference and the entropy S of a two-level system as functions of the
parameter x = ~0 /2T .
temperature. The dependence is plotted according to Eq. (3.29) for all tempera-
tures, both positive and negative. Full inversion (1 2 = 1) corresponds
to T e f = 0, full saturation (1 = 2 = 1/2, = 0) corresponds to T e f = ,
at = 1 T e f = +0. Note that a two-level system with a negative temperature has
more energy stored than a system with a positive temperature:
E = 1 E1 + 2 E2 = (~/2) = (~/2) tanh x, (3.31)
where x ~/2T e f .
The entropy of a non-equilibrium two-level system can be also defined in
terms of T e f . According to definition (3.21) and to (3.28),
S = 1 ln 1 2 ln 2 = ln(2 cosh x) x tanh x. (3.32)
Thus, entropy is an even function of the temperature, with the maximum S 0 = ln 2
at T e f = (Fig. 3.1).
Further, we will show that the intensity of thermal radiation from a two-level
system can be also written in terms of the effective temperature (Sec. 7.1). At
T e f < 0, it is this radiation that causes the noise of quantum amplifiers (the Kirch-
hoff law for negative temperatures). In particular, at ~ |T e f | the noise temper-
ature of an amplifier has the same absolute value as the effective one, T n = |T e f |.
3.2.3 Populations in semiconductors
Boltzmanns distribution (3.25) is not valid for calculating the number of active
particles in the case of inter-band transitions in semiconductors. (Such transi-
tions are used in semiconductor lasers.) In contrast to bound electrons in gases
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
where the factor 2 takes into account spin saturation, is the Fermi level, defined
by the normalization condition Nm(0) = N, with N being the total number of elec-
P
trons, and Em are allowed energy values for one electron. The spectrum is discrete
due to periodic boundary conditions for the electron wave function. According to
(3.33), mean population of any level cannot exceed two electrons, in agreement
with the Pauli principle.
The energy levels Em of electrons in semiconductors have almost continuous
distribution within the allowed bands. As a result, population Nm can be consid-
ered as a function of a continuous argument E, and the normalization condition
P
Nm = N, which indirectly defines the Fermi level, takes the form
Z
dEg(E)N(E) = N, (3.34)
with the integration running over the valence and conduction bands and g(E) being
the energy density of states.
For pure semiconductors, the Fermi level is approximately at the centre of
the energy gap. If there were dopant levels, each of them would contain one
electron; the Fermi level can be formally defined as the one that is half-occupied.
At low temperatures, the boundary between full and empty levels is very sharp
(Fig. 3.2(b)).
In the case of sufficiently high levels, for which E T , one can neglect
the unity in the denominator of Eq. (3.33), and the equation takes the form of the
Boltzmann distribution (3.25),
Nm = 2Z 1 exp(Em /T ) 2, (3.35)
where Z = exp(/T ).
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
Fig. 3.2 Population inversion in a semiconductor: (a) relation between the momentum p and the
energy E, i.e., the dispersion relation, for electrons and holes, Eg being the energy gap; light with the
frequency and wavevector k transfers electrons from the E1 level to the E2 level (or vice versa); (b)
populations of energy levels in an equilibrium semiconductor (the Fermi-Dirac distribution); (c) due
to the injection of carriers, the Fermi level splits in quasi-levels v , c , and for some pairs of levels,
inversion takes place: f1 < f2
3.2.4 Inversion in semiconductors
Consider the condition for quantum amplification through inter-band transitions in
semiconductors. The incident field, with the frequency exceeding the gap width
Eg /~, leads to almost vertical transitions of electrons from level 1 of the valence
band to level 2 of the conduction band (Fig. 3.2(a)). The levels 1,2 within the
bands are unambigously defined by the conservation laws of energy, ~ = E2 E1 ,
and momentum (or, to be precise, quasi-momentum), ~k = p2 p1 .
The number of stimulated transitions up scales as the probability of filling
the ground level, N1 /2 = f (E1 ) f1 , multiplied, in accordance with the Pauli
principle, by the probability 1 f2 of a hole being on the excited level. Similarly,
the number of transitions down scales as f2 (1 f1 ), with the same proportionality
factor (see (2.26)). The overall effect of field energy amplification or absorption
scales as the difference,
Thus, the contribution of a single pair of resonant levels into absorption scales
as the difference of their populations, similarly to the case of localized electrons,
and the inversion condition has the form
kinetics studies not the time dependencies of coordinates and momenta of separate
particles, q(t), p(t), or of the wave function, (q, t), but the behavior of the means,
h f (p, q, t)i, distribution functions, P(q, p, t), or the density matrix, mn (t), for sys-
tems interacting with the thermostat and (or) with external alternating fields.
The dipole moment of an isolated atom can be calculated from Eq. (3.46),
X
hd(t)i = Tr{d(t)} = dnm mn (0) exp(imn t). (3.47)
mn
At the same time, it follows from Maxwells equations that an oscillating dipole,
similarly to an aerial, emits electromagnetic waves into free space; therefore,
within a certain time an atom should lose all its energy and get into the ground
state, i.e., mn () = mn n0 . Thus, an atom cannot be isolated from electromag-
netic vacuum, which plays the role of the thermostat with T = 0. This example
reminds us that isolated systems do not exist, and hence (3.45) should be com-
pleted by relaxation terms describing the evolution into equilibrium, () (0) .
dm X
= (wmn n wnm m ), m mm . (3.49)
dt n
In rarefied gases, relaxation is only caused by the interaction of atoms with elec-
tromagnetic vacuum. This interaction leads to the spontaneous emission, with
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
the probability A12 = 212 , and the corresponding broadening of the upper level
E2 = ~A12 , as well as the spectral broadening, called the natural broadening,
nat = 212 = A12 . (3.52)
If the lower level of the transition under consideration is not the ground one, its
P
broadening has to be taken into account as well. Let 2n m<n Amn be the total
probability of a spontaneous transition from level n to all lower levels, then
mn = m + n . (3.53)
In reality, fnat is on the order of MHz for the case of visible-range allowed tran-
sitions, and T 2 106 s.
For sufficiently dense gases, natural broadening is masked by the collision
one, and T 2 coincides, within an order of magnitude, with the mean time be-
tween the collisions of atoms with each other. As a result, 2/, and the
line width scales as the pressure p, provided, of course, that the Doppler broad-
ening is smaller than the collision one. For rough estimates, one can assume that
at p = 1 mmHg, f 10...100 MHz. Note, however, that under certain con-
ditions, the increase of the pressure leads to line narrowing, 1/p, called
collision, or dynamical, narrowing. One of the models of this effect is considered
in Ref. [Akhmanov (1981)].
Interaction of atoms with the thermostat leads not only to the damping of the
states, but also to a certain shift of the transition frequency. In the case where
the thermostat is a vacuum, this shift is called the Lamb shift. Both effects can
be formally taken into account by substituting a complex value for the transition
frequency mn ,
mn = mn + mn imn . (3.54)
It is important that transverse relaxation is not always related to the energy
transfer to the thermostat. For instance, elastic collisions in a gas lead to random
changes in the phases of complex state amplitudes for separate atoms, b(i) m , and
(i)
their pair products, b(i)
m n b . If these phases are initially equal, mn , 0, then, after
a certain time interval T 2 = 1/mn , which is on the same order of magnitude as
the mean time interval between collisions, the phase will be uniformly distributed
within the interval 0 2, so that mn 0. A similar effect is caused by the
dipole-dipole interaction of neighboring dopant atoms in crystals. Perturbations
of this kind, which do not change populations, are called adiabatic. Certainly,
non-adiabatic perturbations, such as non-elastic collisions, also contribute to the
relaxation of non-diagonal elements, as they change both the amplitudes and the
phases of the coefficients b(i) m.
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
Let us now consider the relaxation of diagonal density matrix elements, i.e.,
populations. Kinetic equations (3.49) contain a set of phenomenological coeffi-
cients wmn with the dimensionality 1/s. The w21 coefficient defines the rate of
transition from state 1 into state 2 due to the effect of the thermostat. (Recall
that in quantum mechanics, transition indices are read from right to left.) The
role of the thermostat can be played, for instance, by lattice vibrations in crystals,
translational degrees of freedom of atoms in gases, and electromagnetic radiation.
In the case of two-level systems, one denotes
T 1 determines the relaxation time of populations, i.e., of the mean energy, and is
called the time of spin-lattice, or longitudinal, relaxation. The time of longitudinal
relaxation depends on the thermostat temperature and varies within broad limits,
from 1012 s in the case of nonradiative optical transitions in condensed matter, to
hours and days in the case of nuclear magnetic resonance. (In this case, interac-
tion with lattice is weak due to the small value of the nuclei magnetic moment,
1023 CGS.) Note that adiabatic perturbations, like dipole-dipole interaction,
do not change the populations; therefore, usually T 1 > T 2 . In experiment, longi-
tudinal relaxation manifests itself in the saturation effect (Sec. 4.3).
Equations (3.48), (3.49) should also involve the case of thermodynamic equi-
librium, where = (0) and (0) = 0; hence, the following relation should hold
true:
X
(wmn n wnm m ) = 0. (3.56)
n
Hence, taking into account the Boltzmann distribution, we find the relation be-
tween the probabilities of relaxation transitions and thus reduce the number of
independent parameters in Eq. (3.49) by a factor of two,
with T being the temperature of the thermostat. This condition provides dy-
namical equilibrium for the populations. Thus, w12 > w21 , in contrast to the
case of stimulated transitions in a classical field, where, according to (2.26),
W12 = W21 . When the thermostat is at low temperature, it has no excitations
(photons, phonons, etc.) with high energy, ~ > T . Therefore, it can only absorb
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
energy from the system under consideration, and transitions up are practically
absent. An extreme example is realized for transitions between nuclei levels in
the range. Even in condensed matter, such transitions usually occur only due to
spontaneous emission, with the probability w12 = A = 1/T 1. In the case of nuclear
isomers, A is extremely small because the transitions are forbidden, and T 1 can be
as large as days, similarly to the NMR case.
The parameters wmn can be calculated, in principle, using some model of the
thermostat. An example where the role of the thermostat is played by the field has
been considered in Sec. 2.5. Then,
w21 = B, w12 = B + A, (3.59)
depend on time only due to a varying external force (as, for instance, the energy
operator V(t) = d E(t) in dipole interaction). On the contrary, in the Heisen-
berg picture all time dependence is transferred to the operators and their matrix
elements, except the density matrix operator, and the state vectors are constant.
The interaction picture is intermediate, and it has all values time-dependent.
However, it is important that the observables do not depend on the choice of
the picture,
h f i = Tr( f ) = Tr( f 0 0 ). (3.71)
This can be proven using definition (3.63), the unitarity, U0 U0+ = I, and the in-
variance of the trace to cyclic permutations, Tr(abc) = Tr(bca).
3.3.7 Perturbation theory
It is not difficult to find the formal solution to the von Neumann equation (3.69)
using the iteration method. For this, let us represent the density operator as a series
expansion (the primes will be temporarily omitted),
(t) = (0) + (1) (t) + (2) (t) + . . . (3.72)
and substitute it in (3.69). Here, (0) = (t0 ) is the initial condition. By setting
equalities between the terms of the same order in the perturbation V, we find the
relation
i~(k) = [V, (k1) ]. (3.73)
Integration yields
Z t Z t2
(k) k
(t) = (i~) dtk . . . dt1 [V(tk ), . . . [V(t1 ), (0) ] . . . ]. (3.74)
t0 t0
Chapter 4
67
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
susceptibility is a tensor,
X
P () = ()E (), (4.1)
4.1.1 Symmetry
Here, E() denotes the Fourier component of the field E(t). As it is common in
physics, a function of time f (t) and its Fourier transform f () are denoted by the
same symbol and only their arguments differ,
Z Z
f (t) = deit f (), f () = dteit f (t)/2. (4.3)
The absence of the integration limits means that they are . Note that if f (t) is
real, then (4.3) leads to the following symmetry property for f ():
f () = f (). (4.4)
Thus, the real part, f 0 (), is an even function while the imaginary part, f 00 (),
is an odd function, so it is sufficient to know f () for positive frequencies. From
the definition (4.2), it follows that all components of the susceptibility tensor also
satisfy (4.4),
() = (). (4.5)
The () tensor and its inverse Fourier transform, (t), called the Green func-
tion or the response function, have one more general property, typical for arbitrary
physical systems: they are symmetric, = , or
= . (4.6)
Apparently, the system cannot respond before the external force is turned on;
hence, (t) should turn to zero at t < 0,
Z
deit () (t), (4.7)
where (t) is the Heaviside step function, which is unity at t > 0 and zero at t < 0.
The causality principle, according to (4.7), restricts considerably the allowed
class of () functions. Indeed, it follows that (), considered as a function of a
complex frequency = 0 + 00 , should be analytical in the upper semiplane. Let
us calculate the integral in (4.7) using the residue theory. The integrand contains
00
the factor e t ; therefore, at t > 0, the integral should run along a contour in
the lower semiplane (see Fig. 4.1), while at t < 0, the contour should be in the
upper semiplane. However, due to the causality principle, at t < 0 the integral
should turn into zero. Therefore, the () function cannot have poles in the upper
semiplane (see, for instance, [Vinogradova (1979); Landau (1982, 1964)]).
Further, according to the integral Cauchy formula, the real and imaginary parts
of an analytical function are related via the Hilbert transformations,
00 (1 ) 0 (1 )
Z Z
0 () = PV d1 , 00 () = PV d1 , (4.8)
1 1
where PV denotes the principal value of an integral. These integral equations
are called the Kramers-Kronig relations. They allow, for instance, the real part of
susceptibility to be calculated from the measured imaginary part. The above-given
derivation can be extended to the case of an anisotropic medium. Then, equations
(4.8) will be valid for all components of the tensor.
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
Fig. 4.1 Proof that the susceptibility of a dielectric () is an analytical function of complex fre-
quency in the upper semiplane: at t < 0, the response function (t) turns to zero due to the causality
principle. At the same time, it is equal to the integral of ()eit along the C contour. Hence,
according to the Cauchy theorem, C should not contain poles of ().
4.1.4 Susceptibility of the vacuum
Further, let us find the field generated in a homogeneous medium by external
sources, i.e., a given polarization with a harmonic variation in time and space,
P = (1/2)P0eikrit + c.c. (4.16)
Here, k and are independent variables. In a homogeneous medium, P induces
a plane monochromatic wave with amplitudes E0 , H0 . Let us substitute (4.16) in
(4.9, 4.10) and take into account that j = P. We get a system of algebraic equations
for E0 , H0 (n ck/),
n H0 + E0 = 4P0 , (4.17)
n E0 H0 = 0. (4.18)
Excluding H0 , we get
n (n E0 ) + E0 = 4P0 . (4.19)
Double vector product in (4.19) projects the E0 vector onto the plane orthog-
onal to the propagation direction k. Let us denote this projection operation by .
Apparently, the tensor has the components
= k k /k2 . (4.20)
As a result, Eq. (4.19) takes the form
(n2 ) E0 = 4P0 . (4.21)
Thus, the problem is reduced to solving a system of two linear non-
homogeneous algebraic equations. The solution can be expressed, in a standard
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
way, in terms of the minors and determinant of the matrix (n2 ) . Instead
of solving the system directly, we express E0 in terms of P0 using the formalism
of the inverse matrix or tensor. By definition, A A1 = A1 A = I; therefore, it
follows from (4.18) and (4.21) that
The G (k, ) tensor is called the spectral Green function for Maxwells equa-
tions. It determines a macroscopic field induced by polarization (4.16), i.e., the
response of the electromagnetic vacuum in matter to an external excitation. The
tensor function G(k, ), similarly to or , satisfies the Kramers-Kronig relations.
Its Fourier transform, G(r, t), determines the fields emerging in a homogeneous
medium due to an arbitrary distribution of the polarization P(r, t) or a current.
Consider the case of a homogeneous medium. Let the z axis be along k, then
from (4.20) and (4.22) we find
recall that here n ck/. The last equation for the field E0z , which is longitudinal
with respect to the propagation direction, shows that it is independent of k: we
usually exclude the effects of spatial dispersion where = (k, ). According
to (4.24), longitudinal field created by this polarization is maximal at frequencies
where |()| is minimal; these frequencies are given by the condition 0 () 0.
Note that the Green function for a longitudinal field can be also obtained from
Eq. (4.11) by assuming that = divP.
Transverse components of the field, E0x,y , considered
as functions of k, ac-
cording to (4.23), have a wave resonance at k = 0 /c, i.e., at n2 = 0 . Then,
G xx = Gyy = i/00 , and the radiation power is (c.w. (4.15))
4.1.5 Thermodynamic approach
Between the ranges of strong absorption in matter, there are transparency win-
dows where one can neglect the energy dissipation, i.e., assume |00 /0 | 1.
(We consider the matter to be non-gyrotropic.) In the absence of dissipation, the
vibrational energy of the particles caused by the external field is conserved; hence,
the work of polarization can be defined as a function of the field amplitude, A(E0 ).
This work consists of displacing the charges and is performed by the sources of
alternating fields. Note that for A to be defined, a finite time is necessary for a
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
d E2 + H2 1
!
P(t) = (EE + HH)/4 + E E = + EE . (4.27)
dt 8 2
In the last equation, we have used the symmetry of the tensor. The expres-
sion in brackets is obviously the energy density of the macroscopic field, the first
term being energy of the field in the vacuum, at the same E, H, and the second
one having the meaning of additional work A performed by a field source in the
presence of the matter. Additional energy of the matter in a given field has the
opposite sign,
v = E E/2. (4.28)
Strictly speaking, the macroscopic field E inside the matter should be replaced
here by the external field E0 in the absence of the matter (see Ref. [Landau (1982)],
Sec. 11), but for the sake of simplicity we ignore the difference between E and E0 .
Equations (4.26)(4.28) assume a linear relation between P and E, which is
valid only for a sufficiently weak field. An evident generalization of (4.28) is
or
Z E
v= P(S , , E) dE. (4.30)
0
Polarization P and, hence, the elementary work of polarization, dv, certainly
depend not only on E but on the other parameters defining the state of the matter.
Therefore, the integral in (4.30) is along a curve, and in order to find v, this curve
should be specified. The work of polarization can be defined at constant entropy S
and density , i.e., for a thermally isolated material with a given concentration of
molecules N = /m. In this case, polarization will not change the internal energy
of the matter per unit volume in the absence of the field, U0 (S , ) (by definition,
dU0 = T dS + d, being the chemical potential). Therefore, the internal energy
of the matter in the presence of the field is
U(S , , E) = U0 (S , ) + v(S , , E), (4.31)
where E plays the role of an external thermodynamical parameter.
Now, one can define polarization and susceptibility thermodynamically as
functions of the state of the matter,
P (S , , E) U/E , (4.32)
Only the constant, or slowly varying, part of the potential is of practical interest,
v = E0 () E0 /4. (4.35)
This expression for the effective potential of the matter in a monochromatic
field describes, according to the known thermodynamic equations, the effect of
light on the state of the matter. Variation of the state (temperature, density, etc.) of
the matter, in its turn, influences and the transmitted light, i.e., causes a nonlinear
optical effect (Sec. 6.2).
Note that, according to (4.35), one can define the polarization amplitude and
the susceptibility in terms of the effective potential,
P0 = 4v/E0 , = 42 v/E0 E0 . (4.36)
Let the density of the molecules be N, then in the approximation of non-
interacting molecules, polarizability of a single molecule is = /N, and from
(4.35) one can find the effective potential of a molecule in an alternating field,
1
V = E0 () E0 . (4.37)
4
This potential defines the mean force of light pressure acting on a molecule in a
monochromatic field in terms of the molecule polarizability,
F = V = (E0 () E0 )/4. (4.38)
This expression can be transformed as
1 1 E0
F = (E0 E0 )= d + c.c.
4 x 4 x 0
1
= d E0 (r) + c.c. = d(t) E(r, t). (4.39)
4 x 0 x
The factor 1/2 is absent here since we assume that the operator does not act on
the dipole moment d = E of the molecule. The force (4.39) corresponds to the
potential V(r) = d E(r). Light pressure will be considered in more detail in
Sec. 6.2.
or
!1/2
|| + 0
k= Re = , (4.42)
c c 2
!1/2
|| 0
/2 = Im = . (4.43)
c c 2
The sign by the square root is chosen from physical considerations. These equa-
tions define the positions of the two poles of the Green functions G(k, ) in the k
plane.
In the case of weak absorbtion, 2 << k2 , Eqs. (4.42), (4.43) take the form
k = 0 /c, (4.44)
= k 00 / 0 . (4.45)
It should be stressed again that the dispersion law (k) and a fixed polariza-
tion e only take place for free waves, i.e., waves generated by distant sources. In
the presence of given sources, the spatial and temporal dependencies of stimu-
lated field are determined by the distribution of the currents and can be arbitrary.
In particular, thermal fluctuation field inside the matter is created by the chaotic
motion of charged particles and the field at a given frequency is a superposition of
plane waves of various lengths. Note that waves with maximal amplitudes do not
always satisfy (4.44) (see Eq. (4.79)).
Thus, the macroscopic theory enables all basic observable rules of waves emis-
sion, propagation, and absorption through the phenomenological function ().
The next step is calculating () in the framework of the microscopic theory.
This is a traditional problem of non-equilibrium thermodynamics, and its com-
plete solution is still absent.
the typical size of a molecule, a0 108 cm. The product of d and the con-
centration of molecules N is equal to the dipole moment per unit volume, i.e.,
polarization, P = N d = E.
Thus, the problem of calculating the susceptibility is reduced to calculating
the dipole moment of a molecule induced by an external field.
Thermal motion of charges can be taken into account in the framework of the
kinetic theory. In the quantum theory, ri and, hence, d, are operators; therefore,
one should do both quantum and statistical averaging, i.e., use the density matrix
formalism.
Consider first the classical Lorentz model, which represents a molecule as an
oscillator. The equation of motion of a linear isotropic oscillator has the form
r + 2r + 20 r = eEloc /m, (4.46)
where m, 0 and e are the effective mass, frequency, and charge of the oscillator,
respectively, is the phenomenological damping constant, and Eloc is the field at
the centre of the molecule, known as the local field. After multiplying (4.46) by
eN, we find the equation of motion for the polarization,
P + 2 P + 20 P = 2p Eloc /4, (4.47)
where p = (4e2 N/m)1/2 is the so-called plasma frequency.
The field Eloc at the centre of a motionless molecule differs from the space-
averaged macroscopic field E. According to Lorentz,
4 +2
Eloc = E + P= E, (4.48)
3 3
so that (4.47) takes the form
P + 2 P + 20 P = 2p E/4, (4.49)
20 20 2p /3. (4.50)
Hence, assuming the field to be monochromatic, we find
2p /4
= . (4.51)
20 2 2i
In what follows, we will assume that the eigenfrequency shifts (4.50) due to the
Lorentz correction are included into the definition of 0 .
Suppose now that there are several types of independent oscillators with eigen-
P
frequencies j and concentrations f j N, f j = 1, then
2p X fj
= . (4.52)
4 j
2j 2 2i
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
2p
1 .
( + 2i)
where
In the zeroth order of the perturbation theory, the density matrix is diagonal, (0)
mn =
(0)
m mn , so that, after substituting (4.53) and (4.55) in (3.48), (3.49), we find for
m,n
mn (0)
nm /2
(1)
mn () = , (4.56)
mn imn
where nm n m is the relative population difference for levels n and m.
The diagonal elements (1)
nn , according to (3.48), (3.49), will scale as the inverse
frequency of the perturbation; if one is only interested in resonance effects,
under the condition / 1 one can assume (1) nn = 0. Thus, the amplitude of
the response to a harmonic perturbation scales as the population difference and
reaches its maximum at resonance, = mn .
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
After substituting (4.56) in (3.6), we find the dipole moment of the molecule
and the polarization,
1
P = Nhd(t)i = P0 eit + c.c.
2
N X (0) nm d nm (dmn E0 )
P0 = .
~ mn mn imn
Hence, according to definition (4.1),
() ()
N X (0)nm dnm dmn
= , (4.57)
~ mn mn imn
where d() d is the projection of the molecule dipole moment onto the axis
= x, y, z.
One can easily verify that the obtained expression has the necessary symme-
try (4.5) and satisfies the causality principle (Fig. 4.1). Note that (4.57) can be
represented in a somewhat different form,
X
= Nn (n)
(),
n
() () () () (4.58)
1 X dnm dmn dmn dnm
(n)
() + .
~ m mn imn mn + + imn
Here, (n) has the meaning of the polarizability tensor of a molecule in state n.
In the absence of a static magnetic field, unperturbed wave functions and, hence,
the matrix elements dmn = dnm , can be considered to be real (see Ref. [Landau
(1964)]). Then, according to (4.6), (4.58) is invariant to the permutation of ,
indices,
() ()
2X mn dmn dmn
(n)
= (n)
= . (4.59)
~ m mn ( + imn )2
2
Using (4.57) and (4.14), one can easily show that the contribution of each pair
of levels (m,n) into the field energy is positive or negative depending on the sign
of mn nm , i.e., amplification of the field requires population inversion, see also
(4.60).
In the case of a gas, (4.57) should be averaged over random orientations and
velocities of the molecules. Due to orientation averaging, non-diagonal elements
() 2
of the d d turn into zero, and the diagonal ones become |dmn | = |dmn |2 /3. As a
result, the susceptibility tensor (4.57) becomes a scalar,
2N X mn (0) nm |dmn |
2
= , (4.60)
3~ m>n 2mn ( + imn )2
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
In the last equation, we have taken into account that the double sum contains twice
each term with m , n,
X X X
amn = ann + (amn + anm ), (4.61)
mn n m>n
and the diagonal terms in (4.60) are zero since nn = 0.
4.2.5 Oscillator strength
In order to compare (4.60) with the classical expression (4.52), let us define di-
mensionless oscillator strengths,
fmn 2mnm |dmn |2 /3~e2 . (4.62)
Note that the oscillator strength can be also defined phenomenologically, in terms
of 00 , see Ref. [Landau (1982)].
Let us number possible pairs of states (m, n) by a single index j {m, n}, as-
suming m > n. If we neglect the 2 terms in the denominator of (4.60) and put
f j fnm (0)
nm , Eq. (4.60) takes the form of (4.52). Therefore, quantum calcula-
tion confirms the Lorentz model: in the first approximation in the amplitude of
the external field, matter behaves like a set of linear oscillators with damping.
However, f j may now take negative values, which manifests itself in the effects of
quantum amplification (00 < 0) and negative dispersion (0 / < 0 outside of
the resonance).
Recall that, according to (4.7), a field -pulse causes a pulse of polarization
shaped as the Fourier transform of (). According to (4.60), the poles of (),
understood as the function of a complex frequency, are at points j = j i j
in the lower semi-plane; therefore, the polarization pulse is a sum of damped
harmonic oscillations,
2p X f j
(t) = (t) exp ( j t) sin ( j t). (4.63)
4 j j
This expression defines the Green function for the polarization of the matter in
terms of the eigenfrequencies and the oscillator strengths of the transitions.
The oscillator strengths satisfy the sum rules. For instance, for single-electron
transitions,
X
fnm = 1. (4.64)
m
This equation can be obtained from the commutation rule [x, p] = i~, p p x .
Let H0 = p2 /2m + V(r), then
[x, H0 ] = i~p/m, pmn = immn xmn ; (4.65)
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
hence,
x ( 0 )/, 0 > 0.
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
Fig. 4.2 Dispersion dependence of susceptibility () in the vicinity of an isolated resonance, for
the Q-factor 0 /2 = 5 and = 0: solid lines correspond to (4.67), dashed lines, to approximate
formula (4.68), and dash-dotted lines, to approximate formula (4.69).
This equation yields an even dependence for 00 (x) and an odd dependence for
0 (x) (see Fig. 4.2). It is clear from Fig. 4.2 that even for a resonance with
a low Q-factor, this formula provides a good approximation for 00 and somewhat
worse one for 0 .
Note that far from the resonance, |0 | decays much slower than |00 |;
therefore, in the transparency windows, where 00 1, the refractive index n
can still noticeably differ from unity. At a sufficient distance from the resonance,
absorption can be neglected, and (4.67) takes another asymptotic form (Fig. 4.2),
f 2p /4 0
= , (4.69)
20 2 1 2 /20
where 0 (0).
In the description of optical experiments, instead of or = 1 + 4, one uses
parameters that are more close to experiment, the refractive index and the index
of absorption,
n kc/ = Re , c/2 = Im . (4.70)
The value 1 has the meaning of the length of the wave penetration into the
matter, in o/2 c/2 units. Figure 4.3 shows the dispersion dependence of these
parameters, according to (4.68), (4.69) and (4.42)(4.45), in the vicinity of an
isolated resonance.
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
Fig. 4.3 Dispersion dependence of the refractive index n and the absorption index in the vicinity of
an isolated resonance, for the resonance amplitude being 1 (a) and 5 (b): solid lines correspond
to (4.42), (4.43), and (4.68); dashed lines, to (4.44), (4.45) (i.e., weak absorption approximation),
and
dash-dotted lines, to (4.69) (i.e., approximation of 00 = 0, when n = at > 0 and = at
< 0); shading denotes the energy gap, where 0 < 0.
4.2.7 Polaritons
In the case of low absorption, macroscopic field in the matter can be treated quan-
tum mechanically (Sec. 7.4). In this case, the notion of a photon in matter, or
a polariton, emerges. (It should not be confused with a polaron, an electron in a
dielectric considered together with the polarization it induces.) A polariton (some-
times also called a light exciton) is an elementary excitation of a macroscopic field
and the molecules interacting with it, having an energy ~ and propagating with a
velocity u = d/dk. As approaches 0 , more and more of the polariton energy
is contained in the internal energy of the molecules.
d Here, the scaling factor 2c between and frequency in cm1 has been omitted.
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
Fig. 4.4 Spectral density of equilibrium field, hE2 ik , normalized to ~N/23 , as a function of the
frequency and wave vector in the vicinity of a dielectric function resonance for 0 / = 20 and = 5.
Solid lines refer to transverse (with respect to k) oscillations , dashed lines, to longitudinal ones. It
is clear from the figure that the frequency spectrum of thermal field fluctuations is described by the
dispersion dependence (dash-dot) without the anomalous part.
Thus, the dispersion law for field equilibrium fluctuations (4.79) differs from
the dispersion law for free waves (4.42) ck/ = Re () by the absence of the
damping parameter .
The dispersion law (4.79) corresponds to the condition G00 /x = 0. At the
same time, the condition for the G00x maximum at a fixed frequency, G00x /y = 0,
leads, according to (4.77), to the known dispersion law (4.44), Re() = (ck/)2 .
Hence, it follows that the dispersion law observed near a resonance depends on
the experimental conditions.
Dispersion properties of a medium can be qualitatively represented as a graph
showing the relation between and k, instead of the () or n() dependencies.
Figure 4.5 shows such a relation in different approximations and provides the
commonly used names of the corresponding quasi-particles.
If the coupling between the transverse field and the oscillations of charges is
negligible, which is possible at k 0 /c, the elementary excitation, i.e., the en-
ergy quantum of the matter, in the case of polar oscillations of ions in a crystal
lattice, is called an exciton, or an optical phonon. Excitons, similarly to photons,
depending on the wave packet describing them, can be either localized within a
certain area in the crystal, or spread over the whole space. In the = 0 ap-
proximation, dispersion dependencies of photons and excitons overlap without
interaction, which means that the incident field does not excite the excitons.
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
With the perturbation switched off, V = 0, populations should take their equilib-
rium values (0)
nn ; therefore, the relaxation rates w12 and w21 are related by (3.58),
which leads to
With an account for (4.83) and (4.84), Eq. (4.82) takes the form
Let the field be quasi-monochromatic and have the mean frequency > 0
close to 0 ,
where E0 (t) is the slowly varying amplitude of the field. Then, in the dipole and
resonance approximations, one can assume in (4.81) that
This system of three equations for three real functions 0 (t), 00 (t), and (t) deter-
mines the evolution of a two-level system due to the thermostat and the external
field. According to (4.89), the effect of the field on 0 scales as the population
difference , which, in its turn, is related to the field through 0 , according to
(4.90). It is this relation that leads to the nonlinearity of the two-level system
response. Under stationary conditions, this nonlinearity manifests itself in the sat-
uration effect, i.e., tending to zero provided that 2 1/T 1 T 2 . Non-stationary
effects caused by the anharmonicity of a two-level system will be considered in
Chapter 5.
4.3.3 Saturation
Consider now a stationary response of a two-level system to a monochromatic
field, with , 0 , , and being constant. Then it follows from (4.89) (compare
with (4.56)) that
/2
0 = ei , (4.91)
0 i/T 2
Substituting this expression in (4.90) allows one to find the stationary population
difference,
where
2 T 2 /2
W . (4.93)
1 + (0 )2 T 22
e We also neglect higher-order harmonics of the density matrix, which oscillate with frequencies n
It is clear from (4.94) that saturation results from the competition between the
transition rates due to the noise field of the thermostat, (w12 + w21 )/2, and the
external monochromatic field, W.
According to (4.93), saturation is most pronounced at exact resonance, where
the saturation factor takes its maximal value,
s0 = 2 T 1 T 2 = 20 FT 1 F/F s . (4.95)
Here, we have introduced the transition cross section W/F, photon flux den-
sity F, and the photon flux density corresponding to a two-fold decrease in the
population difference,
F s 1/(20 T 1 ) = ~c/(80|d(e) 2
12 | T 1 T 2 ). (4.96)
4.3.4 Lineshape in the presence of saturation
By substituting (4.92) and (4.93) into (4.91), one finds the value of 0 with an
account for population saturation. The dipole moment of a two-level system (we
consider a non-polar molecule, dnn 0) is
= 0 i(1 + s0 )1/2 /T 2 .
Thus, at strong saturation the power absorbed by the matter is no more de-
pendent on the intensity and frequency of the field and is only determined by the
Fig. 4.6 The absorbed power P as a function of the frequency at different saturation factors s0 ,
according to (4.100).
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
rate of energy transfer from the molecules to the thermostat. In the case of strong
saturation, the susceptibility real part, according to (4.99), scales linearly with the
frequency,
0 (0 )/s0 . (4.103)
Let us make a numerical estimate. For a wave with the intensity I = 1 W/cm2
and wavelength = 1, ~ = 2 1012 erg, F = 5 1018 photon/(s cm2 ), and
E0 = 8I/c = 0.1G= 30 V/cm. If d12 = 1D, then = 108 s1 , so that s0 = 1 at
T 1 = T 2 = 108 s. The transition rate is W = 5 107 s1 , and if (0) N = 1019 cm3 ,
as it is the case for a doped crystal or a gas at atmospheric pressure, then it follows
from (4.100) that P = 50 MW/cm3 . This estimate shows that in the optical range,
saturation is accompanied by strong heating of the matter. On the other hand, for
electronic paramagnetic resonance in the = 1 cm range and at T 1 = 103 s it
follows from (4.102) that P = 0.1 W/cm3 , so that stationary saturation is possible.
This is used in paramagnetic amplifiers.
Saturation is very important in quantum electronics. It is used for creating
population inversion by means of auxiliary radiation (pump) in lasers based on
doped solids and in paramagnetic amplifiers. It is also applied for producing short
strong light pulses via Q-switching and mode locking. Saturation stabilizes the
amplitude of quantum oscillators and limits the dynamical range of quantum am-
plifiers.
In the case of inhomogeneous broadening, for instance, due to the Doppler ef-
fect, saturation affects not all the line but only its part, with the width on the order
of collision or natural bandwidth. This effect, leading to the Bennett hole burn-
ing in the velocity distribution of the molecules (Fig. 4.7), is used for frequency
stabilization of lasers and in saturation spectroscopy (Sec. 6.4).
Fig. 4.7 Bennett hole burning. For a line inhomogeneously broadened due to the Doppler effect,
saturation only affects those molecules whose velocities have given projections vz () = (0 )/k on
the direction k of the wave propagation. N(vz ) is the velocity distribution of active particles.
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
4.4 Bloch equations
In the general case, two arbitrary vectors |ai and |bi can always compose a di-
adic tensor, with the matrix elements given by the products of the corresponding
components of the vectors,
(ab) |aihb|, (ab)
mn = hm|aihb|ni.
A symmetric diadic operator (n) |nihn| is called a projector, since its action on
a vector consists of projecting it onto the |ni direction,
(n) |ai = |nihn|ai = const |ni.
Usually, |ni is a unit vector, hn|ni = 1. Note that the mean value of a diadic
operator (mn) coincides with the corresponding element of the transposed density
matrix,
X
h(mn) i = Tr{|mihn|} = kl hl|mihn|ki = nm .
kl
Let the basic vectors for the representation (4.106) be the energy states of the
system. Then the operator H0 is diagonal and, for f H0 , it follows from (4.109)
that a = b = c = 0, d = ~0 /2. Therefore, the Hamiltonian of the system scales
as the z operator,
H0 = ~0 z /2, (4.111)
where ~0 H022 H011 . The relative population difference is in this case equal
to the mean value of z ,
hz i = 11 22 . (4.112)
Let the eigenfunctions |mi of the H0 operator and, hence, the matrix elements
of the electric dipole moment dmn be real, d12 = d21 = d0 . Let us also assume
that the diagonal elements are absent (the molecule is non-polar), dnn = 0, then
the operator d = er scales as x ,
d = d0 x , (4.113)
V = (d0 E) x . (4.114)
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
Fig. 4.8 The state of a two-level system represented geometrically using the Bloch vector R whose
components determine the dipole moment hdi = d0 R x and the population difference = Rz . The
z axis is directed downwards, so that points denoting the excited states of the system are above the
point denoting the ground state. Under free evolution of the system, R undergoes precession with the
frequency 0 , the angle of precession being determined by the initial conditions.
i.e., the depicting point moves along the equator of a unity sphere, and the pre-
cession angle is arctan(R /Rz ) = /2. For an energy state, the point rests at
one of the poles (cn = 0 or 1, = 0 or ). A weak resonant perturbation causes
a slow variation of the precession angle with the Rabi frequency, the so-called
nutation (Sec. 5.1).
2k+1
= , 2k
= I.
2 h2 i h i2 = 1 R2 . (4.119)
For an energy state, where Rz = 1 and R x,y = 0, the energy variance is equal
to zero, while the variance of the dipole moment is a unity (in d0 units). In the
case of a coherent state, according to (4.118), the energy fluctuates with the unity
variance (in ~0 /2 units), while the variance of the transverse components, x,y ,
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
where (t) 2d0 E(t)/~ is the instantaneous Rabi frequency. Let us introduce
the vector A(t) {(t), 0, 0 }, then (4.124) can be represented as a vector product,
= A. (4.125)
From (4.125), we immediately find a similar equation for the Bloch vector,
R = R A. (4.126)
Fig. 4.9 Relaxation of the Bloch vector R. After the perturbation is switched off, the transverse
component goes down to zero at a rate of T 21 , while the longitudinal component takes its equilibrium
value (0) at a rate of T 11 .
where, this time, (t) = H . We have chosen H0z to be negative for the subscript
1 to denote the lower level. Once again, the Heisenberg equations take the form
(4.124).
Let us average these equations over the initial density matrix, pass to magne-
tization M = 0 Nhi, and add relaxation. We get
M x = M x /T 2 + 0 My ,
My = My /T 2 0 M x + Mz H , (4.139)
Mz = (Mz(0) Mz )/T 1 My H ,
magnetic resonance (NMR), and ferromagnetic resonance. One can easily verify
that the Bloch equations are equivalent to the density-matrix equations in the case
of a two-level system; therefore, all above-made conclusions are also valid for the
case of magnetic resonance, with d E(t) replaced by 0 H (t).
As we have already mentioned, often T 1 > T 2 , since the relaxation of the
population energy hz i is only caused by non-adiabatic interactions of the particle
with the environment, for instance, non-elastic collisions in a gas or spin-lattice
(spin-phonon) interaction in a crystal. At the same time, variation of the transverse
components, h i (or 21 ), does not require energy transfer, and hence the life
time of h i reduces as a result of both adiabatic (spin-spin) and non-adiabatic
perturbations. One can say that perturbations disturb the precession phase 1 2
in (4.118), which is the argument of 21 , and hence the time and ensemble mean
values tend to zero, h i 0.
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
Chapter 5
Non-Stationary Optics
a In this book, instead of the ambiguous term coherent we will use the term non-stationary, which
107
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
R = R A. (5.1)
Here, A {2d0 E(t)/~, 0, 0} is the effective field vector, d0 d21 = d12 is the
dipole moment of the transition, which is assumed to be real and parallel to the
field E(t), and 0 is the transition frequency. Equation (5.1) preserves the length
of the R vector, which is unity in the case of a pure state. The end of the R vector
moves then along the surface of a unit sphere.
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
Let us pass to a reference frame rotating with the field frequency around the
z axis,
21 = 0 eit ,
R x = R0x cos t + R0y sin t, (5.2)
Ry = R0x sin t + R0y cos t.
R0 = R0 , (5.3)
where
R0 {200 , 2000 , },
{ cos 0 , sin 0 , 0 }, (5.4)
11 22 , d0 E0 /~,
0 is the initial phase of the field at the center of the atom; in the case of a plane
wave, 0 = kz + 1 . Recall that for simplicity, we assume the matrix element of
the transition to be real. In the general case, 0 = arg(d21 E0 ).
Thus, in a harmonic field the R0 vector, which represents the state of the sys-
tem, rotates with the rate = [2 + (0 )2 ]1/2 around the effective field
direction (Fig. 5.1). This slow rotation is called nutation; it adds to the fast
stimulated precession with the rate equal to the field frequency . In other words,
the angle of precession (t) (see (4.128)) slowly varies within the pulse duration,
so that the depicting point moves along a spiral on a sphere with the fixed radius
R = (2 + 4|21 |2 )1/2 .
In the resonant case, where |0 | , the nutation axis is in the equa-
torial plane. If, in addition, the initial state is an energy one, i.e., R(0) is at one of
the poles and (0) = 0 or , then, due to the effect of the field, R moves along a
meridian with the longitude 0 /2.
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
(a) (b)
Fig. 5.1 Nutation of the Bloch vector R. Due to a monochromatic field, R, in addition to precession,
rotates around the direction of the effective field with the nutation rate : (a) the frame of reference
rotates with the field frequency ; (b) a fixed frame of reference; solid line corresponds to the exact
resonance, dashed line, to , 0 .
The phase 0 of the field is assumed to be constant, i.e., only amplitude modula-
tion is considered.
Note that equation (5.3) provides a recipe for preparing any arbitrary pure state
of the system (with given latitude and longitude) with the help of a coherent
field: one should first cool the system, i.e., move the depicting point to the South
pole, and then apply a resonant pulse with a given area and phase.
Fig. 5.2 Transition probability as a function of time, according to the Rabi formula, in the case of
exact resonance (solid line) and with 0 = 3 (dashed line).
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
This picture should be compared with the idea of quantum jumps, commonly
used at the early stage of quantum mechanics.
Let a transition with d0 = 1D be caused by a strong plane wave with the peak
intensity 1 GW/cm2 ; then, E0 = 1 MW/cm, and (5.9) leads to t = 1ps. This time
is in many cases much smaller than the relaxation time, hence our approximation
T 1,2 is valid.
But for usual optical experiments, t T 1,2 , and the R vector, depicting the
state of a two-level particle on the sphere, does not have enough time to move far
from the South pole before the next collision of the particle with the neighboring
ones, which returns R into the equilibrium position with R = 0. As a result, on
the average, dynamical equilibrium takes place, with a certain stationary angle of
precession (4.128).
Note that the amplitude of the field required for stationary saturation, E0
~/d0 T 1 T 2 (see (4.95)), does not exceed, on the order of magnitude, the pulse
amplitude required for observing coherent effects,
E0 ~/d0 T 2 . (5.10)
In other words, a field that can cause a transition within a time much shorter than
the relaxation time, will cause, on a long-time scale, strong saturation.
Below, we will consider several experimental methods of observing non-
stationary effects.
5.1.3 Nutation
Let an equilibrium two-level system interact with an electromagnetic wave with
the amplitude E0 and the resonant frequency = 0 . At short time intervals,
t T 2 , the response of the system is described by equation (5.3). According to
this equation, the Bloch vector R, which denotes the instant state of the system,
performs, like a spin top, nutational motion, i.e., periodic variation of the preces-
sion angle with the Rabi frequency = d0 E0 /~. At exact resonance, R moves
along a meridian from the South pole (the equilibrium state) through the equa-
tor (a coherent state) to the North pole (population inversion) and back, with the
angular rate .
Evidently, when the R vector moves upwards, towards the North, the
ensemble-averaged energy of the system increases due to the energy of the wave,
and at the time instance when = t = , the wave, on the average, loses exactly
one energy quantum ~. Further, when R goes downwards, towards the South,
the system gives the stored energy back. As a result, the wave becomes amplitude
modulated with the nutation frequency (Fig. 5.3). Provided that there are suf-
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
Fig. 5.3 Effects of transient optical nutation and free polarization decay. The front slope of a resonant
pulse (dashed line) creates in the matter a transient process, nutation, which leads to the intensity
modulation I(t) of the transmitted light (solid line) with the Rabi frequency. Here, is the absorption
coefficient with an account for saturation, l is the layer thickness, T 2 is the transverse relaxation time.
The back slope of the observed pulse gets a tail caused by the free polarization precession.
ficiently many atoms, the wave transmitted through the matter can have a 100%
modulation. This periodic variation of instantaneous optical density of the matter
is called optical nutation.
Let us find the power absorbed by the matter. According to (4.98), polarization
of a medium with the density N of two-level atoms is
This result agrees with the above-given qualitative consideration assuming that
the field and the atoms periodically exchange energy quanta. Note that, because at
t T 1,2 relaxation is too slow to manifest itself, real absorption, i.e., irreversible
energy dissipation into the thermostat, is absent here.
Further, at t > T 1,2 , the amplitude of atoms nutation reduces due to relaxation,
and a stationary angle of precession is established (4.128). Absorption of the field
b Editors note: the terms come from radio spectroscopy; in nonlinear and quantum optics they would
occurs then in a usual way and is described, according to Eq. (4.15), by the imag-
inary part of the susceptibility. Modulation of the transmitted wave disappears,
and its amplitude is given by the Bouguer law with an account for saturation (see
Fig. 5.3 and Sec. 6.4).
If then the field is suddenly switched off, the matter will continue to emit light
for some time (Fig. 5.3) due to the free precession of the R vector; the matter
is then in a coherent superradiance state (Sec. 5.3). This effect is called free
polarization decay.
In the presence of inhomogeneous, for instance, Dopplers, broadening, these
formulas refer to only a small group of atoms with a certain projection of the
velocity. In order to find the total polarization, they should be integrated over
Maxwells distribution.
Optical nutation can be used for finding the transition dipole moment by mea-
suring the modulation frequency and for the study of relaxation processes by
observing the modulation decay rate.
the field. Note that irradiating the matter with short pulses is an example of a
non-stationary way to create population inversion in two-level atoms.
For the quantitative description of SIT, it is necessary to consider the inhomo-
geneous wave equation in combination with the equation of matter, i.e., Maxwells
and Blochs equations (see, for instance, Ref. [Allen (1975)]). Such a consider-
ation shows that the propagation velocity u of a 2 pulse is determined by its
length, cE l, and the usual absorption coefficient 0 ,
Thus, the longer the pulse, the more delayed it gets. For instance, at E = 1ns
(l = 30 cm) and 0 = 102 cm1 the pulse is slowed down by three orders of
magnitude.
An interesting result of the wave theory in nonlinear materials is the prediction
of solitons, stable pulses whose shape and amplitude do not change in the course
of propagation [Rabinovich (1989)]. In the case of two-level anharmonicity,
solitons have a hyperbolic secant shape,
According to (5.6), after the /2 pulse is over, the density matrix of the atom
has the following elements:
21 (t) = i(0)ei0 t /2, (t) 11 22 = 0. (5.17)
Hence, the mean dipole moment of the atom oscillates (see Fig. 2.1) with the
transition frequency 0 = 21 , its amplitude being equal to the transition matrix
element d0 d21 (we assume that d21 = d12 and (0) = 1),
hd(t)i = 2d0 021 (t) = d0 sin 0 t (5.18)
the first one, with a certain probability, scaling as r2 , can get into a coherent state,
with the precession phase shifted by 0 r/c.
therefore hd2 i = d02 regardless of the state of the atom; this also follows from
(4.107). For instance, in this model the atom should emit even in the ground
state, which contradicts the energy conservation law. Instead of a stable excited
state we have obtained an unstable state. The same problem arises in the well-
spread interpretation of spontaneous transitions as caused by zero-point vacuum
fluctuations.
A consistent theory of the spontaneous emission, as well as statistical optics
in general, should be based on the quantum description of the field (Chapter 7).
5.2.3 Normally ordered emission
However, spontaneous emission can still be described correctly in the frame-
work of semiclassical theory, provided that the squared dipole moment in (5.19)
is replaced by the mean value of the normally ordered squared dipole moment
(Sec. 7.7),
The colons denote normal ordering, i.e., placing positive-frequency operators f (+)
on the right of negative-frequency operators g() . By definition, an f (+) operator
in the Heisenberg picture contains only positive-frequency harmonics,
X
f (+) (t) = fn exp(in t), n > 0.
n
Similarly, the Fourier transform of a g() (t) is only nonzero for negative
frequencies.
Any operator can be represented as a sum of positive- and negative-frequency
parts (disregarding the constant component), d = d(+) + d() ; for Hermitian opera-
tors, d(+) = (d() )+ . Hence,
vector, by substituting, for the external field, the emission field in the zeroth-order
approximation, i.e., the reaction field (for more detail, see Ref. [Allen (1975)]).
Such equations describe the effect of radiation-induced damping, well known
from classical electrodynamics [Landau (1973)]. For z , they lead to an equa-
tion of the form (5.34), while for () , to harmonic-oscillator equations with the
eigenfrequency 0 + iA/2, where is the correction to the transition fre-
quency, called the Lamb shift, A. For instance, for the resonant line L of
a hydrogen atom, /2 109 Hz. The imaginary correction to the transition
frequency has the meaning of the inverse time 1/T 2 of transverse relaxation due
to the radiation reaction. Thus, for an isolated atom, T 2nat = 2T 1nat = 2/A.
the gas emits equilibrium Plancks radiation. In Sec. 7.1 we will consider a simple
qualitative model of this process, also covering the case of amplified spontaneous
emission at < 0.
5.2.6 Quantum beats
In the case of a multi-level system, the natural bandwidth can be explained by the
finite lifetimes of all excited levels,
X
tm = 1/Am 1/ Anm , (5.38)
n<m
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
due to spontaneous transitions to all lower levels. The probabilities Anm are de-
termined by (5.24) with 30 d02 3mn |dnm |2 . Due to the uncertainty relation
Et = ~, the lifetime tm of the system on a certain level corresponds to the
broadening of this level, which is equal, in circular frequency units, to the inverse
lifetime,
X
m Em /~ = 1/tm = Anm . (5.39)
n<m
Finally, the linewidth of emission or absorption corresponding to a certain pair
of levels is obviously given by the sum of the widths of the two levels,
mk = m + k . (5.40)
Hence, it follows that a weak line with d23 0 can have a large natural width due
to large d12 and d13 .
Spontaneous emission of a multilevel system excited by coherent radiation
has an interesting feature: its power oscillates in time. Let us consider the time
dependence of the dipole emission power.
According to the upgraded semiclassical formula (5.26), the power of radi-
ation emitted by an atom at a given time moment scales as the mean value of the
normally ordered square of the dipole moment second derivative,
4 X () (+)
P(t) = 3 kl dlm (t)dmk (t). (5.41)
3c klm
(For simplifying the notation, we assume that the dipole moments of all transitions
are parallel.) Here, kl is the density matrix of the atom at the initial time moment
t0 0. It is determined by the excitation, which should be pulsed for observing
quantum beats, i.e., the duration of the excitation pulse should be much smaller
than the beat period, 2/32 . In the case of a gas, this is achieved using a pulsed
laser or discharge while in the case of an atomic beam the latter should be passed
through a thin foil. As a result, a considerable number of atoms are in the same
pure non-energy state (0), so that the beat signal has a definite phase and a large
power.
After the excitation pulse is over, the dynamics of a given atom is determined
by its unperturbed Hamiltonian H0 . (Relaxation can be neglected provided that
the beat frequency is sufficiently large, 32 T 2 1.) The matrix elements in the
energy representation will then depend on time harmonically,
dmn (t) = dmn exp(imn t), dmn (t) = 2mn dmn (t). (5.42)
Let us number the levels according to increasing energy. Then the matrices of
positive- and negative-frequency operators have triangular shapes,
(+) ()
dmn (t) = dmn (t)nm , dmn (t) = dmn (t)mn , (5.43)
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
(a) (b)
Fig. 5.4 Quantum beats: (a) dipole moment of a three-level system in a coherent state with indefinite
energy (dashed line) oscillates with the frequencies 21 and 31 ; (b) as a result, the power P of the
spontaneous emission is modulated with the beat frequency 32 = 31 21 .
where mn 1 at m > n and 0 at m < n (see (5.30) for the case of a two-level
system). As a result, the only nonzero terms in the sum (5.41) are the ones with
m < k, m < l,
4 X 2 2
P(t) = 3 kl dlm dmk exp(ilk t). (5.44)
3c m<k,m<l lm mk
In particular, the power of radiation emitted by a three-level system contains
four terms,
P(t) = ~(31 A13 + 32 A23 )33 + ~21 A12 22
+ 2~(31 21 A13 A12 )1/2 |32 | cos(32 t + ), (5.45)
where
Anm 43mn |dmn |2 /3~c3, arg(23 d31 d12 ).
The first three terms in Eq. (5.45) scale as the populations of the excited levels and
correspond to usual spontaneous emission with the power constant in the frame-
work of our approximation. To take into account radiative energy losses, one
should add factors exp(Amn t) to these terms. The last term in (5.45) describes,
at Amn 32 21 31 , quantum beats, the periodic modulation of the total
power of spontaneous emission from two close levels 2 and 3, with the difference
frequency 32 = 31 21 (Fig. 5.4).e
Let us represent (5.45) in the form
P(t) = P0 [1 + m cos(32 t + )],
P0 ~21 A12 22 + ~31 A13 33 , (5.46)
1/2
m 2~(31 21 A13 A12 ) |32 |/P0 ,
e Note that if superscripts are omitted in (5.41), then (5.45) will contain beats with the frequency
21 , not observed in experiment.
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
where we have neglected the weak radiation at the difference frequency 32 , which
is usually in the microwave range. If the atom is in a pure state, then mn = cm cn ,
and
Thus, the modulation coefficient m reaches a unity at A31 A21 , provided that
the atom is excited into a coherent state, |t0 i = (|2i + |3i)/ 2. From the classical
viewpoint, the atom emits two sine waves, which interfere, so that at some time
moments the radiation is completely canceled while at other ones it is doubled.
However, one should keep in mind that it is impossible to observe quantum beats
by measuring the field energy in the case of a single atom: multiple measurements
are required, with different t. In practice, one uses many atoms in the same state.
Usually, levels 2, 3 correspond to the fine or superfine structure, or Zeemans
sublevels, so that the beat frequency is in the microwave range and the modula-
tion can be discovered by radio electronic methods. (For instance, PMTs enable
observing modulation up to frequencies on the order of 100 MHz.) It is important
that quantum beats are a single-atom phenomenon; therefore, the Doppler effect
has nearly no influence on the beat frequency, which allows one to measure small
splitting of levels [Aleksandrov (1972)].
Note that there is another method, developed in the 1960-s, which also uses
the spectral analysis of the photocurrent rather than the direct analysis of light,
but in a stationary regime. This method, called spectroscopy of optical mixing
or spectroscopy of intensity fluctuations, allows one to observe extremely small
frequency splittings on the order of 1 Hz [Cummins (1974)].
5.2.7 Resonance fluorescence
Consider now the case of a cold gas, where T ~0 , and the atoms are excited
by external directed radiation. The total secondary field of a macroscopic sample
can be divided in two parts, the one that is coherent with the incident field, and
the scattered one.
The coherent part is determined by the space-averaged atom density, it inter-
feres with the incident primary field, and a joint resulting field propagates through
the medium in the same direction. Homogeneous matter, i.e., the constant compo-
nent in the space Fourier distribution of the matter mass, only slows the wave down
and, if the energy dissipation is taken into account, reduces its amplitude. These
effects are described by the macroscopic susceptibility of matter (Chapter 4).
The secondary radiation that is scattered sideways is caused by the atomic in-
homogeneity of the matter. The fields of different atoms are not coherent with
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
each other; therefore, the total power of the scattered radiation is additive, i.e.,
equal to the number of atoms times the emission power P of a single atom (pro-
vided that multiple scattering is negligible the Born approximation). Due to the
optical anharmonicity of the matter (Sec. 6.2), the spectra of the secondary radi-
ation contain, in addition to the elastic (unshifted) component, new components,
providing valuable information about the structure of the matter (fluorescence,
Raman scattering, etc.).
Here, using the gyroscopic model, we will consider resonance fluorescence
(resonance scattering), which is emission of radiation with a frequency close to
the transition frequency 0 , provided that the excitation frequency is close to
0 as well. This phenomenon, discovered by R. Wood in Na vapor as early as at
the beginning of the 20th century, attracts much attention nowadays in connection
with its two interesting features.
First, resonance fluorescence of single atoms brings the field into states with
unusual statistics, which cannot be described in terms of classical statistical optics.
These are states with photon anti-bunching and squeezed states (for more details,
see Chapter 7).
Second, in the case of a large intensity of the incident light (the pump), the
spectrum of the scattered light contains, near the elastic (Rayleigh) component,
two satellites with the frequencies and (Fig. 5.5). Here, [2 + (0
)2 ]1/2 is the generalized Rabi frequency and d0 E0 /~. Besides, amplification
of weak probe light at these frequencies due to the pump energy is observed.
Fig. 5.5 Resonance fluorescence. A monochromatic external field modulates the wave function of
a two-level system with the frequencies and . Therefore, the spectrum of the scattered field
consists of three components. The following cases are shown: (a) small mismatch and strong field; (b)
large mismatch; (c) in the case of a large mismatch, satellites appear due to the four-photon process in
which two pump photons (thick lines) become two photons with frequencies 0 and 2 0 .
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
see Chapter 7.
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
Above, we did not take into account relaxation. It is intuitively clear that it
should lead to the broadening of discrete spectral components up to a width of
about 2/T 2 (Fig. 5.5). Hence, the condition for observing the satellites in the case
= 0 is > 2/T 2 , i.e., strong saturation should take place, which is only
possible with the help of lasers. (The effect was first observed in 1974 [Delone
(1975)].) Note that according to the Bloch equations with an account for relax-
ation (Sec. 4.4), nutation of the R vector disappears under stationary conditions,
so that the above-given explanation of the effect seems to be not valid. However,
the kinetic Bloch equations do not describe the fluctuations of R caused by the
thermostat and containing the frequency .
If we put aside the rotating-wave approximation, the two-level model will de-
scribe other nonlinear effects, for instance, emission at the frequency 3. In the
case of a polarized atom (or molecule) with nonzero diagonal matrix elements of
the dipole moment, dnn , 0, or in the case of the magnetic resonance, the model
will describe two-photon absorption and the Raman effect, i.e., emission at fre-
quencies 0 . Thus, according to the simplest two-level model of an atom, the
matter can scatter the incident light with changing its spectrum, which is a mani-
festation of the nonlinear properties of the matter, i.e., the optical anharmonicity.
Another simple nonlinear effect, saturation, has been considered in Sec. 4.3.
5.3.1 Superradiance
Let at the initial time moment there be N identical atoms in independent coherent
states,
j = {|1i j + |2i j exp[i(0 t + j )]}/ 2. (5.50)
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
where the transition dipole moments d0 d12 are considered as real and oriented
along a certain axis. Here, the phase j of the j-th atom coherent state has a clear
meaning: this is the phase of the Bloch vector precession. If the state is prepared
by means of a /2 pulse, this phase is determined by the phase of the field at the
point r j of the atom. If N = 2, then, according to (5.52), the oscillation amplitude
of the total dipole moment is doubled at 1 = 2 and turns into zero at 1 2 = .
In the case of N atoms with the same j ,
hdi = Nd0 cos(0 t + ).
Suppose then that the linear size of the system is much smaller than the wave-
length, l 0 = 2c/0 , so that formula (5.19) for the dipole emission power is
valid. Replacing there dcl by hdi, in the case of equal phases we get Pcoh N 2 , i.e.,
the power emitted by dipoles oscillating with equal phases scales as the square
of their number. This effect of coherent (collective, cooperative) spontaneous
emission of a multi-atomic system is called superradiance. Although its classical
interpretation is trivial, the quantum theory has been first considered only in 1954
by Dicke [Fain (1972); Allen (1975)].
A close phenomenon had been observed earlier in the microwave range, in
nuclear magnetic resonance (NMR) experiments. In such experiments, a para-
magnetic sample is placed between two crossed inductance coils (the magnetic
induction method). A resonance current in one of the coils excites the stimulated
precession of the macroscopic magnetic moment M of the sample around the di-
rection of a constant magnetic field (see (4.139)). The rotating moment M induces
the induction electromotive force in the other coil, the receiver one. As a result,
the power of the observed signal scales as the squared number of nuclei in the
sample.
Coherent emission of phased dipoles is observed in many nonlinear optical
effects such as generation of optical harmonics. However, here, similarly to the
effect of nuclear induction, emission is at the frequency of the external force,
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
which in optics can be very different from the eigenfrequencies of the transitions.
It is important that the condition l is not necessary: at an appropriate phase
of stimulated or free precession along some direction, so that j = kz j , where
k is the wave vector and z j is the j-th atom coordinate (spatial phase matching
condition), the emission along this direction will be coherent. Thus, superradiance
of extended systems is not isotropic; it has a maximum along the phase matching
direction z.g
The semiclassical approach, based on the formula (5.19) for the power of co-
herent radiation with dcl replaced by hdi, yields an exact result only at N 1.
Recall that at N = 1, we had to introduce an additional factor of 2 into formula
(5.20). As we have mentioned above, in the general case we have to use, instead
of dcl2 , the normally-ordered square 2hd() d(+) i, which leads to the replacement of
N 2 by N(N + 1),
X
hd() d(+) i = hd() (+)
j dk i
jk
X X
= h j |d() (+)
j d j | j i + h j |d() (+)
j | j ihk |dk |k i
j j,k
X
= d02 N + cos( j k ) /2 = d02 N(N + 1)/4. (5.53)
j<k
and gradually pass from the energy state into coherent ones, with some precession
angles j and phases j .
However, one can expect that the atoms influence each other through the trans-
verse field, and therefore the precession phases should mutually synchronize, pro-
vided that the atom density is sufficiently high [Andreev (1980)]. As a result, after
a certain delay time t0 the system will pass by itself into a superradiance coherent
state with phased dipoles. At this moment, slow spontaneous emission will turn
into a short strong superradiance pulse with a definite phase of the field (Fig. 5.6).
Fig. 5.6 Time dependence of the emission power for atoms excited at time t = 0: 1, usual (incoherent)
spontaneous emission, observed at low atom density; 2, superradiance of atoms that are initially in a
coherent state; 3, superradiance in the case of an energy initial state occurs with a typical delay t0 .
Fig. 5.7 Photon echo. After being excited by two short light pulses (dashed lines, top), the matter
emits a flash of light (solid line) delayed by a time t0 equal to the interval between the pulses. The
effect is explained by the fact that at time 2t0 , different molecules have the same phase of charge
oscillations (bottom).
Here, N is the particle number density, d0 d(12j) the transition dipole moment,
R( j) the Bloch vector of the jth atom, with the components 2(21j)0 , 2(21j)00 , (11j) (22j) .
At the initial stage, when t T 2 , the difference between the eigenfrequencies
( j)
21 can be neglected, so that
P(t) = Nd0 sin(0 t), (5.59)
( j)
with 0 21 being the mean transition frequency. The macroscopic polarization
(5.59) is accompanied by strong superradiance. However, in a short time T 2 , the
dipoles in (5.58) get out of phase, P(t) becomes close to zero, and only slow
spontaneous emission remains, with a typical decay time T 1 = 1/A.
The dephasing process can be illustrated with the help of the vector model
(Sec. 5.1). In the frame of reference rotating with the frequency 0 , the R(0j) vec-
tors will undergo precession clockwise or anti-clockwise, with the angular rates
j (21j) 0 , which are within the interval +. Prior to the /2 pulse,
( j)
all R0 are directed down along the z axis, while immediately after the pulse, they
are directed along the y axis, R(0yj) (0) = 1. (We assume the pulse to be very short.)
Let T 2 be much smaller than the superradiance time coh and t T 1 , then the
radiative energy losses can be neglected and each Bloch vector is precessing with
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
its own velocity in the equatorial plane. The Bloch vectors will be unfolding like
a fan (Fig. 5.7), and at times t > T 2 they will uniformly fill the equatorial plane.
The precession of the jth atom can be described analytically in the rotating
frame of reference by the formula
where (0j) (21j) exp(i0 t) is the slow amplitude of the density-matrix non-
diagonal element.
At first sight, this decay of the macroscopic order is irreversible. However,
there exists a smart way to restore the coherent state of the system provided that
t < T 2 , where T 2 is the typical time of irreversible precession dephasing caused by
collisions. The idea is that a second resonant pulse, with the area , is incident
on the system at a time t0 .
According to the gyroscopic Bloch equation (5.3), the pulse turns an R(0j)
vector by 180 around the x axis (5.7). Since R(0j) is in the equatorial plane, the
rotation is equivalent to the mirror reflection with respect to the xz plane or to the
change of the R(0yj) sign, i.e., complex conjugation of (0j) (t0 ).
One can easily see that after this operation, the fan or R(0j) vectors starts
folding, and at the time t = 2t0 all vectors will be parallel once again (5.7), this
( j) ( j)
time along y : R0y (2t0 ) = 1, 0 (2t0 ) = i/2. Again, we consider the pulse to
be very short.
At this time moment, a superradiance pulse is emitted again, its duration about
T 2 (Fig. 5.7). It is this pulse of emission from the sample, appearing in a time t0
after the second external pulse, that is called the photon, or spin, echo. As the
interval t0 increases, the amplitude of the echo signal reduces as exp(2t0 /T 2 ).
The appearance of echo at t = 2t0 can be illustrated by an analogy with runners
at a stadium, who start running at t = 0 with different speeds. At t = t0 they
simultaneously turn round and run back with the same speeds. Clearly, at t = 2t0
they will cross the start line simultaneously.
The coherence recovery under a pulse can be also described analytically.
Before the pulse,
Further, at t > t0 , free precession takes place again, with the initial amplitude
given by (5.62),
(0j) (t) = (0j) (t0 + 0) exp[i j (t t0 )] = (i/2) exp[i j (t 2t0 )]. (5.63)
Chapter 6
Nonlinear Optics
Some nonlinear optical phenomena have been already considered above (satura-
tion, resonance fluorescence). This chapter will present a more systematic de-
scription of these effects.
Although most nonlinear optical phenomena are well described by the semi-
classical radiation theory and do not require the quantization of the field, many
effects have a clear interpretation and classification in terms of photons. For in-
stance, frequency doubling can be represented as resulting from elementary three-
photon processes where, due to the interaction between the field and the matter,
two photons of the incident light (pump) are destroyed and a photon with the
double energy 2~ is created (Fig. 6.1(a)). It is clear from the figure that, since
the process does not change the energy of the matter (such processes are called
parametric ones), the energy of the created photon is exactly twice as large as the
energy of the pump photon.
In the case of a biharmonic pump with the frequencies 1 , 2 , the matter
emits photons with the combination frequencies, 0 = 1 2 . These are sum-
and difference-frequency generation effects, see Figs. 6.1(b) and 6.1(c). A simi-
lar description can be suggested for four-photon and higher-order processes. The
processes that take place in a parametric oscillator and in the corresponding spon-
taneous effect, spontaneous parametric down-conversion, are inverse to the sum
frequency generation. Namely, a pump photon is split in two photons with frac-
tional frequencies, 0 1 + 2 (Fig. 6.1(d)).
The efficiency of parametric processes in a macroscopic material is dramati-
cally increased under the condition of photon momentum conservation, k1 + k2 =
k0 . This equality is called the spatial phase-matching condition.
In non-parametric processes, the matter changes its energy and passes to other
energy levels. For instance, in two-photon absorption, two pump photons are
annihilated and an atom goes to an excited level (6.1(e)). Such effects lead to
135
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
nonlinear absorption scaling as the square of the light intensity. In a Raman two-
photon transition, a photon is annihilated and another one, with a different energy,
is created (6.1(g) and 6.1(h)).
All these effects, both parametric and non-parametric, are widely used in non-
linear spectroscopy, as well as for the variation of laser radiation frequency. An
important role is played by nonlinear optical effects in laser thermonuclear synthe-
sis, laser separation of isotopes, laser chemistry, and many other fields of quantum
electronics. Also, note that the possibility of quantum amplification is princi-
pally related to the nonlinearity of the material, since in a linear system the lev-
els are equidistant and the stimulated emission is always compensated for by the
absorption.
be incident on the matter. Equations of motion for the charged particles of the
matter are nonlinear. As a result, charge displacements induced by the field (6.1)
and, hence, the polarization P(t) will contain not only the Fourier components
with the frequencies of the external force n but also the ones with the combina-
tions of these frequencies, n m , including multiple frequencies 2n and the
zero frequency, n n = 0.
Let us first reduce the consideration to the case of the nonlinearity quadratic
in the external field. Then, the phenomenological relation between the spectral
components of the polarization and the electric field has the form
P(2) (2)
0 = (1 , 2 )E1 E2 , (6.2)
where P0 is the complex amplitude of polarization oscillations with the frequency
0 = 1 + 2 , and we are so far assuming 1 , 2 . The quadratic susceptibility
(or quadratic polarizability) (2) defined this way sets a relation between three
vectors and is therefore a third-rank tensor. Notation (6.2) is not based on a certain
frame of reference (often, the relation is written with a colon, P0 = (2) : E1 E2 ).
If some Cartesian frame of reference is chosen, (6.2) takes the form
X
P(2)
0 = (2)
(1 , 2 )E 1 E 2 . (6.3)
In what follows, as usual, we will omit the summation over dumb indices , .
Each of the 27 components (2) 0
(, ) of the
(2)
tensor is a function of two
0
independent arguments , taking values from to +. Since the Fourier
components of the field and the polarization are complex, (2) is also complex,
and in total, there are 54 real functions of these variables. However, as we will
show below, there are many relations between these functions, and the number of
independent variables is reduced.
By analogy with (6.3), the nonlinear susceptibility of an arbitrary order is
defined as
P(m) (m)
0 = 1 ...m (1 , . . . , m )E 11 . . . E mm ,
(6.4)
0 = 1 + 2 + + m .
In the spectral representation, this relation is an algebraic one, while in the time
representation, (m) should be understood as integral operators. Their kernels,
(m) (t1 , . . . , tm ), called multi-time Greens functions or response functions of the
matter, are defined in terms of the spectral susceptibility (m) (1 , . . . , m ) using
the m-fold Fourier transformation (for m = 1, see (4.7)). By analogy with the
linear case (Sec. 4.1), the causality principle leads to integral relations between
the real and imaginary parts of (m) , similar to the Kramers-Kronig relations.
The effect of the magnetic field can be taken into account by doing a double
power-series expansion in (6.5), (mn) Em Hn . Some nonlinear optical effects
P
reveal spatial dispersion, which can be described as the dependence of (m) not
only on 1 , . . . , m but also on the wavevectors k1 , . . . , km .
6.1.2 Various definitions
Often, one uses the definition of spectral amplitudes En that differs from (6.4) by
the absence of the 1/2 factor,
X
E(t) = En exp(in t) + c.c., En En /2. (6.6)
n
case where the field frequencies and their combinations are in the transparency
windows of the matter. In this case, the polarization follows the field instanta-
neously, and for m = 2,
P(2)
(t) = E (t)E (t), (6.9)
where is some real constant tensor. In particular,
P(2) 2
x (t) = xxx E x (t) + xxy E x (t)E y (t)
6.1.3 Permutative symmetry
Definitions (6.3), (6.4) lead to the invariance of the (m) tensors to permutations
of their frequency arguments together with the corresponding Cartesian indices,
(1 , 2 ) (2 , 1 ), (6.14)
(1 , 2 , 3 ) (1 , 3 , 2 ) . . . (6.15)
Indeed, (6.3) can be rewritten in other equivalent forms,
P(2)
0 = (2 , 1 )E 2 E 1 = (2 , 1 )E 2 E 1 .
6.1.4 Transparent matter
Note that spatio-frequency permutative relations (6.14), (6.15) do not concern the
first index. Using (2) as an example, let us show that its permutation is possible
in the non-resonant case where all frequencies are away from the resonances of
the matter. For the sake of symmetry, we introduce the third argument into the
notation of the susceptibility,
(1 , 2 ) (0 ; 1 , 2 ) 012 = 012 , (6.17)
where the minus sign by the combination frequency 0 = 1 + 2 provides that
the sum of all three arguments of the susceptibility is zero. In the last equality, we
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
took into account relation (6.16). Note that sometimes, other notation can be used
as well,
(0 ; 1 , 2 ) (0 = 1 + 2 ).
Let us find the power absorbed in a unit volume of the matter due to the
quadratic nonlinearity in the case of a three-frequency field. According to (4.13),
2
(2)
X
P(2) = E P = (1/2) n ImEn P(2)
n P0 + P1 + P2 . (6.18)
n=0
with the tilde denoting the components in the new frame of reference. Here, a is
the matrix describing the transformation from the old components to the new ones,
and we assume that rotation also includes the sign changes in some or all coordi-
nates, i.e., inversion and mirror reflection. For instance, in the case of inversion,
a0 = 0 , and
(m)... = (1)m+1 (m)
... , (6.30)
where m + 1 is the rank of the tensor. Thus, as a result of the reference frame in-
version, the components of odd-rank tensors (in particular, vectors and quadratic
susceptibility tensors) change their signs, E = E , (2) (2)
= , . . . .
For tensors describing the physical properties of a material, there is a special
frame of reference where the tensor has its simplest form, with the maximal num-
ber of zero or equal components. In crystals, this natural frame of reference
coincides with the crystallographic one. For instance, real tensors of rank two are
diagonal in the natural frame of reference, (1)
= .
Any unbounded medium, either amorphous or crystalline, has a certain sym-
metry of particles positions averaged over their thermal motion. Formally, this
symmetry of the medium is determined by a set (group) of a certain number of
symmetry elements. In particular, the elements of a pointc group of symmetry are
c The term point is due to the fact that rotations leave one point (the origin) fixed, unlike translational
all rotations a of the reference frame, including mirror reflections and inversion,
which leave the structure of the medium unchanged. For instance, many crys-
tals, as well as optically non-active liquids and gases, are invariant with respect to
inversion. Such media are called centrally symmetric.
Similarly, any macroscopic property of a medium, which is characterized by a
certain tensor, has its own group of symmetry elements. The symmetry elements
of a tensor are all rotations a(i) that act according to rule (6.29) and do not change
the components of the tensor. For instance, according to (6.30), all even-rank
tensors are invariant with respect to inversion.
It seems obvious that the symmetry of a macroscopic property of a medium
cannot be lower than the symmetry of its structure (the von Neumann principle,
Ref. [Nye (1957)]). In other words, the symmetry group of a property should in-
clude all symmetry groups of the structure, i.e., the latter is a subgroup of the sym-
metry group of the property. Hence, if a is an element of a point symmetry group
of a medium, then the tilde in (6.29) can be omitted. Then, (6.29) becomes a rela-
tion between different components of the (m) tensor. By substituting into (6.30)
all symmetry elements a(i) of the medium one by one, we obtain a homogeneous
system of equations for (m) . In isotropic media and in crystals, such equations
greatly reduce the number of susceptibility nonzero components, as well as make
many components equal, sometimes up to the sign.
The most bright example follows from (6.30) in the case of centrally sym-
metric media. According to the von Neumann principle, all tensors describing
the physical properties of such media should be also centrally symmetric, i.e., the
tilde in (6.30) can be omitted. Hence, for even m, it should be (m) (m)
... = ... ,
which leads to (m) = 0. Thus, in centrally symmetric media, all even-order sus-
ceptibilities are equal to zero.
Note that this conclusion is not valid in the case of susceptibilities describing
magnetic effects. This is because the magnetic field and the magnetization are
pseudovectors (axial vectors, as they do not change their signs under the inversion
of the coordinates). As a result, the corresponding susceptibilities are pseudo-
tensors and do not transform according to (6.30) under inversion. In particular,
the Faraday effect, described by the relation P1 = (1 = 1 + 0)E1 H0 , is also
possible in centrally symmetric media.
Depending on the features of the matter, its state, the frequencies of the incident
fields, and other experimental conditions, various mechanisms can contribute in
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
the observable effects. Below we will consider several particular classical models
of optical anharmonicity and then present the general quantum scheme of calcu-
lating nonlinear susceptibilities.
e2
= = re o2 , (6.36)
m2
where re e2 /mc2 3 1013 cm is the classical radius of the electron. Note that
re is related to other typical length parameters through the fine structure constant
e2 /~c2 1/137,
i.e., polarisability of a free electron in the UV range is, on the order of magnitude,
equal to the hydrogen atom volume, i.e., to the polarisability of a bound electron
in the absence of a resonance.
In order to find the second approximation, one should replace R by R(2) in
the left-hand sides of (6.31) and by R(1) in the right-hand sides. Then the Lorentz
force will have components with both zero and double frequency,
e2 E12 1
Z2(2) = zxx E12 . (6.40)
8im2 c( + i)( + 2i) e
In the last equation, we have introduced the quadratic polarisability tensor of a
free particle, which provides the relation between the amplitudes of the field and
the dipole moment eZ2(2) at the double frequency. Thus, the quadratic polarisability
of a free electron at is
f ree
|| e3 /m2 c3 = (eo/mc2 ) /E NL . (6.41)
Here, E NL is a typical parameter equal to the field amplitude at which the linear
and quadratic responses are equal, Z (2) = X (1) . At = 0 ,
f ree
E NL = e/re o0 109 G, (6.42)
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
Fig. 6.2 Due to a plane monochromatic wave, an electron moves along a Lissajous pattern.
which corresponds to the intensity 1020 W/cm2 .d Below, we will show that the
anharmonicity of a bound electron is two orders of magnitude as large as that of a
free electron even in the absence of a resonance,
bound f ree
E NL /E NL 0 /137,
where
bound
E NL e/2a20 107 G. (6.43)
Multiplying by the density of the electrons, we get the quadratic susceptibility
of the plasma,
f ree
|(2) | = e3 N/m2 c3 = |(1) |/E NL . (6.44)
Thus, one of the fundamental sources of the matter anharmonicity is the
Lorentz force. Note that the denominator of (6.44) contains the speed of light,
which is typical for magnetic effects.
According to (6.41), the ratio Z (2) /X (1) has an order of E1 /E NL , while kZ (2)
(E1 /E NL )2 . This justifies for using the dipole approximation in the calculation of
at E1 E NL .
If the static force of light pressure is not taken into account, then, accord-
ing to (6.34) and (6.40), an electron in a magnetic field moves in the xz plane
along a figure of eight trajectory (Fig. 6.2). Stimulated oscillations of the elec-
tron along the field, X (1) (t), are accompanied by dipole emission in all directions,
except the exact x one. This is what is called Thomson scattering or, taking into
account recoil, Compton scattering. At the same time, longitudinal oscillations of
d In the Gaussian system of units, E and H have the same dimensionality; therefore, E can be measured
the electron, Z (2) (t), lead to dipole emission at the frequency of the incident field
second harmonic, i.e., the effect of frequency doubling. The second-harmonic
emission is maximal in the transverse plane, xy, and absent along the primary
field propagation direction. This structure of the free-electron quadratic polaris-
ability tensor prevents the second-harmonic amplitudes in a macroscopic sample,
such as plasma or a semiconductor, from adding up coherently.
Thus, electrons in a plasma, metal, or semiconductor provide, in addition to
Thomson scattering, also non-coherent scattering with the double frequency and
the intensity scaling as 2 N. Bound electrons in atoms or molecules also have
quadratic polarizability, which leads to a non-coherent scattering at the double fre-
quency, termed hyper-Rayleigh scattering. From the quantum-mechanical view-
point, it is interpreted as the absorption of two incident photons and the emission
of a double-energy photon (Fig. 6.1(a)). If the phase velocities of the incident
wave and its second harmonic are equal, n() = n(2) (the so-called phase match-
ing condition), then, provided that the structure of the (2) tensor is appropriate,
weak nearly isotropic hyper-Rayleigh scattering is accompanied by a much more
intense longitudinal emission, scaling as 2 N 2 = (2)2 (Sec. 6.5).
6.2.2 Light pressure
The constant component of the Lorentz force (6.39) determines the static light
pressure force F0 acting on the electron due to the traveling wave. According to
(6.39), F0 scales as the imaginary part of the electron linear polarisability, 00 , i.e.,
the power of the scattered light, P = 00 |E1 |2 /2 (see (4.15)), or the interaction
cross-section,
where I = c|E1 |2 /8 is the intensity of the plane wave. Therefore, the force of
light pressure can be represented in the form
This force accelerates the electron; however, collisions in plasma will lead
to a constant speed of electron drift, Z0 = F0 /m, where = 1/ and is the
time between collisions. (Recall that is the oscillation damping rate, which can
exceed ). The constant current emerging along the beam, scaling as |E1 |2 , can be
interpreted in terms of optical rectification. (In dielectrics, the term rectification,
or dc-effect, means the appearance of a static field E0 |E1 |2 .)
Let , then, according to (6.35), 00 = 2e2 /m3 . Let us estimate F0 in
the case where the damping of electron oscillations is only due to radiation losses
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
(radiation friction). According to (6.31), the friction force, in the first order, is
2mX (1) . Multiplying it by the velocity X (1) and averaging over a single period,
we find the power of losses, P = m2 |X1(1) |2 . Setting equality between this
expression and the power of dipole emission (5.19), we obtain
The same result follows from (5.24) with a unity oscillator strength (4.62). Now,
(6.46) takes the form
From the photon viewpoint, light pressure is due to the fact that an electron ac-
quires the momenta of absorbed photons, which are then symmetrically re-emitted
in all directions via Thomsons (or Comptons) scattering. Let us stress that we
have only considered the average value of the force, which has quantum fluctua-
tions [Minogin (1986)].
We have found the force of light pressure acting on a free electron in the case
of a traveling wave. Similar analysis can be carried out for a more complicated
spatial configuration of light. For instance, an electron will be displaced in the
field of a standing wave, which is the Kapitsa-Dirac effect. It is important that
in a non-homogeneous field, the Lorentz force has a nonzero value averaged over
the period even at 00 = 0. In this case, F0 is determined by 0 , which is much
greater than 00 , so that the latter can be assumed to be zero. This force is caused
by the exchange between different plane waves and is called the stimulated one
(in contrast to the spontaneous force (6.46)).
Displacement of an electron R(t) due to a monochromatic field is
E(R0 , t)/e, where = me2 /2 is the polarisability and R0 is the non-perturbed
coordinate of the electron. Hence, we find the averaged Lorentz force,
where the last equality is also valid for the case of a complex polarisability. Note
that in a plane traveling wave, EH sin t cos t 0, and that in the general
case the force F0 does not scale as the averaged Poynting vector, S0 = cRe E
H /8.
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
Hn = kn En , kn = kn /c.
where k R0 + , k k1 k2 , 1 2 .
It is easy to see that the part of F0 scaling as k can be represented in the
gradient form with the effective potential E 2 /2 (from (4.28) it is clear why
the factor 1/2 appears),
F M = |E |2 /4. (6.52)
Note that this part of F0 , called the Miller force, disappears, according to (6.51),
in the case where the waves are orthogonally polarized. For counter-propagating
waves (k2 = k1 ) with the same linear polarization, F0 = F M :
F0 = k1 E1 E2 sin(2k1 R0 + ). (6.53)
Due to this force, charged particles tend to bunch in the nodes of a standing wave.
In the general case, a field consists of a continuum of plane waves, and the light
pressure force can be found by integrating (6.51) in k1 and k2 . Note that in this
case, the gradient part (6.52) maintains its form. It should be stressed that in the
presence of the electron recoil, the interacting waves have different frequencies.
The corresponding phenomenon is called the stimulated Compton effect.
Consider further the pressure of light acting on bound electrons in an atom or
a molecule, i.e., on neutral polarisable particles. As the starting point we take the
effective potential V = d E(r), where d is the induced dipole moment and r is
the coordinate of the particle center of mass. Hence (see(4.39)),
or F = d E /x . Assuming
Consider first resonance pressure. Let, as before, the damping be only due to
the emission by the particle, i.e., the resonance fluorescence (Sec. 5.2), then the
width of the resonance is minimal (the natural width), 2rad = A. Then, according
to (2.55), = 2o2 , and it follows from (6.45), (6.46) that 00 = o3 /2 and
F0 = o2 |E1 |2 /4. (6.55)
2
Thus, the resonance light pressure on a bound electron is o /re2
= 137 times6
as great as on a free one, provided that the damping is only caused by radiation.
This huge difference is due to the high ratio of the resonance and Thomsons cross
sections.
Estimate (6.55) relates to the case where only the lower level is populated. In
the general case,
00 = (1/2)o3 = o3 (0) /2(1 + 2W0 T 1 ), (6.56)
(0)
where and are relative population differences with an account for saturation
and without it, respectively (Sec. 4.3), W0 is the transition probability and T 1 is
the time of longitudinal relaxation. In the case of radiation relaxation, T 1 = 1/A.
According to (6.56), in the presence of population inversion the light pressure
force is directed oppositely to the light beam, which in the photon language can
be explained in terms of the recoil of photons emitted via stimulated transitions
forward.
At strong saturation, the power P absorbed by the atom, according to (4.102),
is ~(0) /2T 1 , so that (6.46) leads to
F0 = ~k(0) /2T 1 , (6.57)
the force scales as the momentum of the incident photon times the number of
photons scattered per unit time. Let = 1 and A = 106 s1 , then F0 = 3 1017
dyn, and at m = 3 1023 g the acceleration is as high as 106 cm/s2 . The intensity
of saturating light is in this case much greater than 104 W/cm2 .
Resonance pressure of laser light provides quite unusual applications. With its
help, one can accelerate, displace, and focus beams of neutral molecules, separate
isotopes, trap molecules within a small space domain, and reduce their thermal
velocities [Minogin (1986)].
Here, the bar denotes averaging over high-frequency components. Indeed, we are
only interested in the static part of the force acting on a molecule as a whole, or at
least in the part that is varying slowly compared with the frequencies of the field
and the molecule. For notation E () , see Sec. 7.2.
The gradient force (6.58) corresponds to the effective potential of the molecule
V = E 2 /2.
The additional energy density of the matter and the force density in an optical
field will be N times as high (see (4.35)),
v = E 2 /2 = (n2 1)E 2 /8, (6.59)
f = E 2 /2, (6.60)
where (1) N, N is the density of molecules, which is assumed to be
uniform, and n = is the refractive index.
Here, we have neglected the interaction between molecules, which is only
possible for a sufficiently small N. One can show that in the general case (see
Ref. [Landau (1982)]), one should make a substitution into (6.60),
! !
= , (6.61)
T 4 T
where is the density of the matter. For instance, from the Clausius-Mossotti
relation, which can be easily obtained from (4.48),
1 4
= , (6.62)
+2 3m
it follows that
! !
1 ln 3
= = ,
T T ( 1)( + 2)
so that in the case of a dense medium, (6.60) should be multiplied by the Lorentz
correction ( + 2)/3.
In a traveling plane wave, E 2 has no constant component, hence F = 0.
(Note that we are considering a range where 00 = 0 and the spontaneous force
(6.46) related to the dissipation is absent.) However, in a standing plane wave,
E = 2E1 cos(kz) cos(t), and from (6.58), it follows that (compare with (6.53))
Fz = kE12 sin(2kz). (6.63)
Thus, at > 0 particles gather in the anti-nodes of a wave. The force (6.63),
scaling as the polarisability, is called the stimulated force.
In limited beams of light, there is a static transverse gradient of the field
square, and at > 0 the particles tend to move towards the beam axis. Under
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
stationary conditions, the force density (6.60) should be compensated for due to
the increase in the pressure, p, and the density of particles, N, in the central
part of the beam,
p = v = E 2 /2, (6.64)
N/N = / = T p, (6.65)
where T is the isothermic compressability of the medium. These equations de-
scribe electrostriction in a light field.
The increased density of particles in the light beam will cause a change in the
susceptibility of matter,
= N = T 2 |E1 |2 /4. (6.66)
At the same time, by definition,
P(3) (3) 2
1 = ( = + )|E 1 | E 1 = E 1 ,
where e is the particle charge and is a small parameter defining how much the
shape of the potential well differs from a parabolic one. (For simplicity, we con-
sider a one-dimensional model.) From (6.69), we find the equation of motion
Dx x + 2 x + 20 x = eE/m + x2 . (6.70)
Let the external field E be a bi-harmonic one. We search the stationary solution
as a perturbative series,
X
x(t) = x(m) (t), x(m) m1 E m , (6.71)
m
scattering.
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
the frequencies 1 0 , which are absent in the incident field and appear
spontaneously.)
In order to describe stimulated nonlinear effects caused by the parametric cou-
pling between nuclei and electrons, it is useful to consider the model of two cou-
pled oscillators, with the potential
Here, x and 0 are the coordinate and the eigenfrequency of the electron, Q and
0 are the corresponding values for the nuclei, is the coupling parameter scaling
as /Q. It follows from (6.82) that
Q + 2Q + 20 Q = x2 /M. (6.84)
This model was proposed for the description of StRS by Platonenko and Khokhlov
in 1964. According to (6.84), the force acting on the the nuclei scales as the square
of the electron displacement; therefore, the nuclei oscillations will be enhanced in
the case where the difference between the two field frequencies is close to 0 . Let
the incident field be a biharmonic one, and 1 2 0 .
In the linear approximation in the field, Q(1) = 0, and
x(1) 2 2
n = eE n /mDn , Dn 0 n 2in , n = 1, 2. (6.85)
In the expression for the force, x2 , we only leave the resonance terms with
the frequency scaling as x(1) (1)
1 x2 , then the amplitude of stimulated molecular
oscillations with the frequency is
x(1)
1 x2
(1)
(/M)(e/m)2
Q = = E1 E2 . (6.86)
2M(20 2 2i) 2D0 D1 D2
By means of the two-oscillator model, one can also describe the spontaneous
Raman scattering. To this end, the right-hand side of (6.84) should be replaced by
the stochastic Langevin force f (t) causing thermal (and quantum) fluctuations of
Q(t). This force is delta-correlated, and one can find its spectral density by setting
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
Fig. 6.3 Raman susceptibility. Due to the effect of the pump with the frequency , the susceptibility
of matter acquires additional resonances at frequencies L 0 , where 0 are the eigenfrequencies of
the molecules. It is important that the Stokes resonance has negative losses (bottom left) and anoma-
lous dispersion of the refractive index (top left).
equality between the fluctuation energy Q and the equilibrium energy of the os-
cillator. Another method of describing SpRS is based on the Raman analogue of
the FDT (Sec. 7.7), stating that the polarization fluctuations of matter at frequency
are determined by the imaginary part of the cubic susceptibility [Fain (1972);
Klyshko (1980)],
where
Here, at < 0 (the anti-Stokes range), N has the meaning of the equilibrium
number of phonons N0 , while at > 0 (the Stokes range), N = (N0 + 1). In
the last expression, the unity describes the quantum fluctuations of the nucleus
coordinate Q, which cause the Stokes scattering even at T = 0, when the anti-
Stokes scattering is absent. The elementary process corresponding to the Stokes
scattering is a two-photon one. It includes the annihilation of a pump photon and
the creation of a Stokes photon and a phonon (Fig. 6.1(g)).
Certainly, the two-oscillator model, similarly to the anharmonic-oscillator one,
is only qualitative. A quantitative calculation of the susceptibility, even the linear
one, is very complicated and requires the knowledge of the wave functions and
eigenfrequencies of the system (see below).
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
Inelastic scattering can be also due to the excitation of other degrees of free-
dom of the medium, for instance, electronic ones. In this case, the radiation fre-
quency changes by a quantity equal to the frequency of some electron transition
in an atom or a molecule, 1 2 = mn (Em En )/~. If the incident field
contains two frequencies such that 1 + 2 = mn > 0 and the molecule is at the
ground level n, then two photons can be absorbed simultaneously. In the inverse
process, the excited molecule emits two photons, spontaneously or via stimulated
emission.
In macroscopic matter, light interacts not only with local internal oscillations
of the particles, but also with the collective excitations, for instance, with acous-
tic, temperature, spin, plasma waves, and with the oscillations of the molecules
orientations.
The equilibrium chaotic part of these waves modulate the refractive index (see
(6.66)), and the corresponding sidebands 2 = 1 appear in the spectrum of
the scattered light. The scattering can be viewed as resulting from the diffraction
of the incident light by a traveling grating formed by waves of pressure, tempera-
ture, and so on. From the quantum viewpoint, a photon ~1 of the incident light
(the pump) gets scattered with a simultaneous birth or annihilation of a matter
excitation quantum ~ (a phonon, a magnon, a plasmon, an exciton, a polariton
etc.). For the scattering by propagating excitations, it is typical that the modulation
frequency depends on the observation direction, i.e., on the angle of scattering,
= (). This dependence follows from the diffraction Braggs condition or,
in other words, the phase-matching condition (the momentum conservation law),
k1 k2 q = 0, and the dispersion dependence for the scattering wave, q = q(),
where q is the wavevector of the matter excitation. The effect of phase matching
on the Raman anharmonicity can be formally taken into account by assuming the
cubic susceptibility to depend not only on the frequencies but also on the wave
vectors (spatial dispersion).
For describing stimulated scattering by acoustic waves (MBS, Mandelshtam-
Brillouin scattering) and other collective excitations, one should take into account
the non-equilibrium coherent part of these excitations caused by the biharmonic
pump. It is clear from (6.64) how sound can be excited by light: due to elec-
trostriction, there appears a source of alternating pressure in the matter, with the
difference frequency ,
2
X
(1)
p(r, t) = (1/2) (En /2) exp(in t) + c.c.
n
This source creates density waves , propagating with the speed of sound
v. If the pump waves are plane ones, then exp[i(k1 k2 ) r], and the
stimulated sound wave will have the maximal intensity at |k1 k2 | = q = /v.
This phase-matching condition chooses from the continuous spectrum of acous-
tic excitations, spanning a range from zero to approximately 1011 Hz, one (or
two, considering the difference between v for transverse and longitudinal waves
in amorphous solids) discrete component, with the frequency
The width of this spectral component is determined by the sound absorption coef-
ficient.
T a2 T = P/c p , (6.94)
variation of pressure, p, due to heating; here, for simplicity, we ignore the rela-
tion between temperature and pressure waves.)
At P = 0, (6.94) describes spontaneous temperature scattering, and its solution
can be represented as a sum of plane waves exponentially decaying in time,
X
T (r, t) = T q eiqrt , (6.95)
q
with aq2 . Similarly to the case of scattering by acoustic waves, the scattering
of light with the wavevector k1 in the direction k2 is caused by the temperature
grating with q = (k1 k2 ). However, according to (6.95), this grating is not
moving, and diffraction by it results in an elastic (unshifted) line in the spectrum
of the scattered light, with the bandwidth
Its amplitude can be found by substituting (6.97) and (6.98) into (6.94),
00 /c p
T = E1 E2 . (6.99)
i
Here, 1 2 || and 00 00 ().
The traveling coherent wave of temperature (6.98) modulates the susceptibility
(mainly due to the density decrease caused by temperature expansion),g
!
T
= T , (6.100)
T p T
g In an ideal gas, N = p/T , therefore, = p/T and, if the dependence of on T is neglected,
(/T ) p = /T .
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
effect, and their populations do not correspond any more to the temperature of
the thermostat. (The role of the thermostat is usually played by the translational
and rotational degrees of freedom of the molecules, or vibrations of the crystal
lattice.) During the relaxation time T 1 , populations are re-distributed, which is
accompanied by the change of the thermostat energy.h As a result, the thermostat
temperature is changed. Note that an analogous magnetocalorical effect is used
for obtaining ultra-low temperatures (adiabatic demagnetization). A more rigid
explanation follows from the definition of the temperature for a closed system
(microcanonical ensemble),
where S is the entropy, U the internal energy and g(U) the density of energy states.
The latter depends on the configuration of the levels and hence changes after the
field is turned on.
Let us estimate the contribution of the electrocaloric effect in the temperature
anharmonicity. According to (4.35), when a dielectric is polarized, its thermody-
namic potentials get an increase v = |E1 |2 /4 per unit volume. Let us choose T
and as independent parameters, then the entropy is determined in terms of the
temperature derivative of the Gibbs potential (T, p); therefore, the variation of
S due to the polarization is
! !
1
S = = |E1 |2 . (6.107)
T p 4 T p
After multiplying S by T , we obtain the increase of the heat Q, and after mul-
tiplying it by T/c p , we find the temperature increase,
!
T
T = |E1 |2 . (6.108)
4c p T p
Comparing this with (6.94), we see that the role of the absorbed power in a
transparent material is played by
T d|E1 |2
!
dQ
Pequiv = = . (6.109)
dt 4 T p dt
Comparing (6.110) and (6.97), we find the ratio of the electrocaloric anharmonic-
ity and the anharmonicity (6.103) due to the dissipation,
(3)
! !
ec T 2T
= . (6.111)
(3) 200 T p
ab
cn T p
Fig. 6.4 Main types of scattering, the corresponding nonlinearities and typical frequencies: 1, tem-
perature (entropy) scattering by temperature fluctuations is related to the electrocaloric anharmonicity
( 108 s1 ); 2, Mandelshtam-Brillouin scattering ( 1010 s1 ) by pressure fluctuations is
related to striction anharmonicity; 3, the Rayleigh wing is caused by anisotropy fluctuations and is re-
lated to the orientation anharmonicity ( 1011 s1 ); 4, Raman scattering by the internal vibrations
of molecules ( 1014 s1 ). The first three types are called molecular, or Rayleigh, scattering.
Hence, an anisotropic molecule tends to turn with respect to the field to max-
imize its polarisability. However, in equilibrium matter, rotation of molecules by
the field is hindered by their interaction with the neighbors: relaxation processes
restore the equilibrium state with chaotic orientations of the molecules, which the
field tends to order. Competition between the field and the thermal motion results
in a dynamic equilibrium, with the degree of orientation being on the order of
V/T . In this case, the liquid becomes birefringent, similarly to a uniaxial crystal
with the axis parallel to E (if the field is linearly polarized). This phenomenon is
called the optical Kerr effect.
Let the polarisability anisotropy be (). For instance, for a linear molecule,
= || . Then the susceptibility variation (2 ) at a frequency 2 induced
by the field at a frequency 1 will be equal, on the order of magnitude, to (2 )N
times the degree of orientation,
Let a30 1024 cm3 and T = 300 K, then E NL = 2 105 G, and if (1) = 0.1,
then (3) = 1012 cm3 /erg (compare with (6.42), (6.43), (6.68)).
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
If a molecule has a constant dipole moment d0 and the orienting field is a static
one or it varies slowly compared with the orientation relaxation time ( 1012 s),
then the effective energy is V = d0 E. (The induced moment can be neglected
in this case.) The degree of orientation will then scale as (V/T )2 since linear
electrooptic effect is forbidden in liquids (Sec. 6.1). As a result,
(Here, we assumed d0 = 1 D.) Thus, the Kerr effect in polar liquids is much
stronger than in non-polar ones.
Suppose now that the orienting field is an optical biharmonic one, with
1 2 < 1/, then V and, correspondingly, the degree of orientation
V/T will contain an alternating component with the frequency . As a result,
the polarization at the probe field frequency 3 (which, in principle, can coin-
cide with 1 or 2 ) will be modulated, i.e., the matter will emit coherent field
with the frequencies 3 . Thus, orientation anharmonicity provides a reso-
nance contribution with the width 2/ to the cubic susceptibility (3) (3 , 2 , 1 )
at (1 2 ) 1/. If the orientation relaxation is taken into account, this con-
tribution becomes complex. Its imaginary part corresponds to the amplification
of the 2 field, with the maximum at 2 = 1 1/, and leads to the stimulated
Rayleigh-wing scattering.
Orientation anharmonicity can be quantitatively described in terms of the
molecular orientation distribution function, which is stationary at 1:
!2
V/T
V 1 V
P() = Ce = C 1
+ . . . , (6.119)
T 2 T
where
Z Z
hVi(0) d3 P(0) V, P(0) 1/ d3 = 1/82. (6.121)
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
Here, (0) is the linear polarisability averaged over the equilibrium distribution
function. Hence, we find the correction to polarisability,
Z
(0) = d3 [() (0) ] Re E() () E(+) /82 T. (6.125)
6.2.9 Quantum theory of nonlinear polarization
Nonlinear polarisabilities , , . . . of molecules and the susceptibility of matter
(2) N, (3) N, . . . can be calculated similarly to the linear polarisability
(Sec. 4.2), using the density-matrix equation with the phenomenological damping
constants. However, it is easier to use the general formula (3.75) for the response
h f (t)i of a quantum system to an external perturbation with the energy V(t); the
damping should be added from general considerations at the final stage of the
calculation.
We are interested in the stationary response of the system to a periodic pertur-
bation; therefore, the lower integration limits in (3.75) should be chosen as .
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
The upper limits can be chosen as + provided that the causality is taken into
account by introducing the step functions (t t1 ), . . . , (tk1 tk ) into the inte-
grand. Let us introduce the damping, which is necessary for achieving a stationary
regime and practically inevitable in any system (see Fig. 6.5),
Tr {[. . . [d0 (t), d0 1 (t1 )], . . . , d0 k (tk )]}E1 (t1 ) . . . Ek (tk ). (6.128)
Here, is the equilibrium density operator and the operators d0 (t) are considered
in the interaction picture. The integrand in (6.128) depends on k + 1 time argu-
ments, only k of them being independent, as one can easily verify. This tensor
function is called the response function of the system or its Greens function. One
also says that it is a causal Greens function and, at k > 1, a multi-time one.
Let the field have a discrete spectrum,
X
E(t) = E p exp(i p t)/2, p = 1, 2, . . . , (6.129)
p
Here, summation over the state indices m, n and over the Cartesian indices , =
x, y, z (which are sometimes written as superscripts, to make the notation more
compact), as well as over the frequency index p, is implied. If the damping is
taken into account by changing to the damping rate mn of the density matrix
non-diagonal element, then the linear susceptibility (1)
defined by (6.130) will
coincide with (4.57).
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
Fig. 6.5 The Heaviside step function multiplied by et and its Fourier transform are used for taking
into account the effects of causality and damping on the response of a system.
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
() () () () () ()
dnm dml dln dnm dml dln
, (6.133)
D(0) (1)
lm Dln
(0) (1)
Dlm Dnm
where the superscripts 0 and 0 relate to the frequencies 0 .
It is easy to see that expression (6.133) has all symmetry features described in
Q
Sec. 6.1. The operator provides the spatio-frequency symmetry with respect to
the last two indices (6.14), (6.15). The property (6.16) follows from the relations
(p) ( p)
Dmn = Dnm and dmn = dnm , so that changing the signs of all frequencies and the
imaginary units only interchanges the terms in (6.133): the first one is swapped
with the second one and the third one, with the fourth one.
The fact that in centrally symmetric media (2) = 0 also follows from (6.133).
The eigenstates of a system with a center of symmetry possess a certain parity:
n (r) = n (r); therefore, dmn = 0 if m and n have the same parity. Hence, at
least one of the three moments relating the states l, m, n is equal to zero.
The symmetry property in the case of transparent matter, (6.21), also follows
from (6.133) at mn = 0. In this case, the last terms in (6.133) can be combined in
pairs,
1 1 1
+ = .
D(0) (1)
lm Dln D(0) (2)
lm Dnm D(1) (2)
ln Dnm
(2) = 108 G1 , (3) = 1015 G2 . Even this crude estimate gives a correct idea
about the orders of magnitude of the susceptibilities. At resonance, (k) increases.
6.2.10 Probability of multi-photon transitions
If we are only interested in energies, then the nonlinear interaction between ra-
diation and matter can be described in terms of probabilities or cross sections of
multi-photon transitions, as it was done in Chapter 2 for the case of linear single-
photon interaction. Then, as a rule, the field does not have to be quantized, i.e., one
can use the semi-classical theory, but the results of calculations can be interpreted
in the photon language.
As an example, let us find the probability of a two-photon elementary process
describing Raman scattering and two-photon (induced) absorption or emission.
Substituting the transition amplitude in the first-order approximation (2.24) into
(2.21), for the amplitude of the two-photon transition from level a to level b we
obtain
Z t Z t2
c(2)
ba (t) = (i~) 2
dt V 0
(t
2 bn 2 ) 0
dt1 Vna (t1 ). (6.136)
t0 0
Here, the n index, in which summation is implied, numerates all intermediate (vir-
tual) non-perturbed states through which the transition can occur. This expression
reflects an important feature of quantum dynamics: all possible virtual states par-
ticipate in a transition, even those seemingly violating the conservation laws, with
the only restriction, following from the causality principle: t > t2 > t1 > t0 .
Let us substitute into (6.136) the dipole perturbation V0 = d0 E and the
biharmonic field (6.129), assuming that t0 , t = ,
X
c(2)
ba = (2~)
2
dbn Eq dna E p
nqp
Z Z t2
dt2 dt1 exp[i(bn q )t2 + i(na p )t1 ]
X dbn Eq dna E p Z
= dt2 exp[i(ba q p )t2 ]. (6.137)
nqp
4~2 i(na p )
The lower integration limit in the integral over t1 makes no contribution due to the
adiabatic start of the perturbation or due to the damping (see (6.127) atR +0).
The integral left in (6.137) is one of the delta-function representations: dt2 =
2(ba q p ). Thus, in the second order of the perturbation theory, the
field excites a molecule only provided that the algebraic sum of the two field
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
c(2) 12
ba = i(2/~)(E1 M E2 )(ba 1 2 ),
() () () ()
12 21 1 X dbn dna dbn dna
M = M + . (6.138)
4~ n na 2 na 1
If ba > 0, i.e., the initial state of the molecule is the lower one, then
(6.138) gives the probability amplitude for the molecule to absorb two photons
(at 1 , 2 > 0) or to absorb one photon (1 > 0) and emit another photon, the
Stokes one (2 < 0). In the last case, (6.138) is the amplitude of a Stokes Raman
process. Similarly, at ba < 0, (6.138) describes two-photon emission or anti-
Stokes scattering. One can say that the two terms in (6.138) differ in the sequence
the 1 and 2 photons are absorbed (or emitted).
Note that there are four types of two-photon emission: stimu-
lated, spontaneous-stimulated, stimulated-spontaneous and purely spontaneous
(Fig. 6.1(f)). They correspond to the four terms in the expression (N1 +1)(N2 +1) =
N1 N2 + N1 + N2 + 1, which follows from (6.138) after replacing E p with operators.
(Here, N p are the initial photon numbers.)
It follows from (6.138) that the maximal contribution to the transition ampli-
tude is provided by virtual states with the minimal energy deficit ~(na p ).
Note that the various transition paths (contributions of various virtual states)
may differ in signs and cancel each other (quantum interference of states).
In order to find the transition probability from (6.138), it is necessary to define
the square of the delta function:
Z T/2
|()|2 = () lim dteit /2 = ()T/2. (6.139)
T T/2
(2)
Kba E1 M12 E2 . (6.141)
For taking into account the finite transition linewidth, we replace the delta
function in (6.140) by the normalized form factor g (Sec. 2.2),
(2) 3 12 2
stim = 128 k1 k2 |e1 M e2 | g(ba 2 1 ), (6.143)
where kn n /c.
The two-photon absorption coefficient for the field with the frequency 2 in
the presence of the second field, with the frequency 1 , is (compare with (1.4))
(2) (2 ) = (2)
stim NF 1 , (6.144)
where N Na Nb is the population difference per unit volume.
The power absorbed from the field 2 by a unit volume of matter is
P2 = 2 ImP(3) 2121
2 E2 /2 = 2 Im E 2 E 1 E 2 E 1 /2. (6.145)
On the other hand, P2 = (2) I2 . Comparing (6.145) and (6.144), for the case of a
real transition dipole moment dmn we find that
Im2
121
= 4~1N M
21 21
M g(ba 1 2 ). (6.146)
Finally, the complete expression for the cubic nonlinearity follows from (6.146)
after replacing g by 1/(ba 1 2 i). It is not difficult to verify that a
straightforward calculation of (3) from (6.128) in a single-resonance approxima-
tion yields the same result.
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
k2 4 2
Z X
d2 (d(2)
2 /d) sp.st , (6.147)
4 2
where 2 is the polarization index. After substituting here (6.143), we obtain the
differential cross-section per unit solid angle and a single polarization type of the
field 2 ,
(d(2) 3 12 2
2 /d) sp.st = 16k1 k2 |e1 M e2 | . (6.148)
Further, substituting F1 by c/L3 and summing over all modes 1 of the field,
from 0 up to the transition frequency ab , we find the rate of a purely spontaneous
transition,
X Z ba Z X
(2) 3 (2) 3
W sp = (c/L ) sp.st = (2/ ) d1 d1 d2 k13 k23 |e1 M12 e2 |2 .
k1 0 4 1 2
(6.149)
For a rough estimate of the two-photon decay probability in the optical range,
assume that all frequencies in (6.149) are equal to 0 = e2 /2~a0 = c/2a0 , and
the polarisability M of the transition is equal to a30 , with = 1/137. Neglecting
all other numerical factors, we obtain
(2)
W sp 0 (a/2)6 50 s1 .
An accurate calculation for the 2s 1s in a hydrogen atom yields 8 s1 .
(2) (1)
Thus, W sp is 1373 106 times as small as W sp . Despite the small probability,
two-photon decay can be easily observed using two PMTs and a coincidence cir-
cuit. Note that spontaneous (as well as thermal) two-photon emission, in contrast
to the single-photon one, has a continuous spectrum, which has no relation to the
discrete spectrum of an unperturbed atom. For the emission from a heated body,
the statistics of two-photon radiation also differs from the one of single-photon
radiation, which is caused by the fact that photons are emitted in pairs. Thus,
the anharmonicity of matter leads, in principle, to the deviation of the thermal
radiation statistics from a Gaussian one [Klyshko (1980)].
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
6.2.11 Conclusions
Thus, a large number of mechanisms contributes to the optical anharmonicity of
macroscopic matter. Quadratic susceptibility (2) , as a rule, is related to the non-
linearity of bonded electrons. It differs from zero only in piezoelectric crystals
and is on the order of 107 109 G1 provided that all frequencies belong to the
optical transparency window. The cubic susceptibility (3) in condensed transpar-
ent matter is also caused by the electronic nonlinearity ((3) 1015 G2 ) if all
frequencies belong to the optical range. However, if the difference of two fre-
quencies coincides with the frequency of a molecular vibration, then (3) becomes
as high as 1012 1013 due to the combined electron-nuclear (Placzek or Raman)
nonlinearity. At 0, the main contribution in the case of solids is that of elec-
trostriction ((3) 1013 ); in liquids, the orientation (Kerrs) nonlinearity is added
((3) 1012 ). Electrocaloric anharmonicity, usually, results in (3) . 1013 . Ex-
tremely strong nonlinear optical effects can be observed in liquid crystals and in
plasma. Note that the nonlinear electrodynamics of plasma is well described by
the Landau-Vlasov kinetic equations (see [Silin (1961)]).
where
m
X
ni = 0. (6.153)
i=1
where = x, y, z, then both the sign and the absolute value of P(m) will depend
on (m) (r) n1 1 + + nm m , i.e, on the relation between the field spectral
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
components. Such interactions are called parametric. It is clear from (6.154) that
the parametric interaction of m harmonics provides an effect that is accumulated
in space only under the condition (m) (r) = const, which is possible only for
plane waves, such that n (r) = kn r, kn kn . Then, the wave vectors of the
interacting waves should form a closed polygon,
m
X
k(m) kni = 0. (6.156)
i=1
This condition, with an account for (6.153), is called the spatial phase-
matching condition.
Often, it is only a single low-order term in (6.154) that is important (m = 3
or 4, i.e., three- or four-frequency interaction), with a certain combination of fre-
quency signs sn . This combination can be put into correspondence to an elemen-
tary multi-photon process involving m photons; then (6.153) and (6.156) can be
P
interpreted as the conservation laws of the field energy, ~n , and momentum,
P
~kn . In a parametric process, the energy of matter does not change, i.e., the
initial level Ea and the final level Eb coincide, ba = 0 (Fig. 6.1(a)).
Further, one can consider resonance parametric processes, for which one or
more virtual levels coincides with the real ones. Then, non-parametric effects
become also important (see below), i.e., there appears linear (or multi-photon) ab-
sorption (or emission), , 0, and the energy of the field is not conserved. How-
ever, provided that the absorption is low, the phase-matching condition (6.156)
is still valid. Note that, for instance, at m = 4, single, double, and triple reso-
nances are possible. The efficiency of resonance interactions strongly depends on
the frequencies n (even without taking phase matching into account), and the
corresponding susceptibilities take complex values.
Outside of resonances, 0, susceptibilities are real, and their dispersion
is weak; therefore, the efficiency of non-resonance parametric processes has only
indirect frequency dependence, through the phase-matching condition, which at
fixed directions of kn is satisfied only for a particular set of frequencies and at
fixed frequencies, only for a certain set of directions. Therefore, the efficiency of
parametric interactions manifests joint frequency-angular dispersion.
Let us return to the sum (6.154). Among the terms of even orders m, there
are degenerate ones, containing subscript pairs ni = n j , for which (m) 0. The
corresponding interactions, depending only on the intensities of the field harmon-
ics, |En |2 , are called non-parametric. The phase-matching condition is satisfied
for them automatically. The power absorbed at the frequency 1 due to the non-
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
It has a constant sign for a given set of frequencies and slowly (compared to eikr )
varies in space for any configuration of the field, since the phases of the harmonics
do not influence Eq. (6.157).
In non-parametric elementary processes, such as, for instance, two-photon ab-
sorption or a Raman transition, the final energy of the matter differs from the
initial one (Fig. 6.1), and therefore, the field energy is not preserved.i For a non-
parametric process, in addition to the trivial (in this case) condition (6.153) there
is a resonance at
l
X
n = ba . (6.158)
n=1
(the group matching condition); otherwise, the pulses will separate in the course
of propagation.
6.3.4 One-dimensional approximation
Further, as a rule, we will apply the one-dimensional approximation, which is
most useful in nonlinear (as well as linear) optics. Also known as plane-waves
approximation, it allows one to pass from partial-derivatives equations (6.151) to
usual equations. It reflects correctly the main features of many effects involving
pump beams not too much convergent or divergent and samples with the length l
not too large. The divergence can be often taken into account in the final formulas,
j Editors note: At present, mode locking provides pulse durations as small as tens of femtoseconds.
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
and similarly for H. The factor (L/2)3 is added from dimensionality considera-
tions, see (7.100). This is the so-called k, -representation. Here, k and are in-
dependent real variables. In the case of a homogeneous transparent linear medium
without internal sources, k and are not independent any more (Sec. 4.2). Then,
XX
E(k, ) = (1/2) e (k)E(s) (sk)( s (k)), (6.160)
=1,2 s=
where
s (k) s[k r (k)t], s = , = 1, 2.
For instance, in the case of a plane monochromatic polarized wave,
E(s) (k) = (E1 s+ + E1 s ) (3) (k k1 ).
Unit polarization vectors e (k) and the dispersion law (k) are determined by the
() tensor,
c2 k [k e ] + 2 ( ) e = 0. (6.163)
Hence, if is real,
(k) = (k), e (k) = e (k) = e (k).
The wave vector k and the type of polarization define a plane monochromatic
wave, or a mode. It is convenient to make the set of modes countable using a cube
with periodicity L3 (Sec. 7.3); then the Fourier integral turns into a series, in which
a mode is labeled by a single subscript k {k, , s} (which also includes the sign
s of the frequency),
X
E(r, t) = (1/2) ek Ek exp(ik ), Ek E(s) (k). (6.164)
k
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
In a transparent linear medium without sources, the mode amplitudes Ek are inde-
pendent complex numbers defined by the boundary conditions.
Consider now a nonlinear absorbing plane layer (0 < z < l) in the medium,
which has the same 0 , and a stationary field incident on it from both sides. Then,
since the model has no dependence on x, y, t, the amplitudes Ek are only func-
tions of z. They are called the slowly varying amplitudes (SVA). Actually, the
transition from the real field E(r, t) to the z-dependent mode amplitudes Ek (z) is a
three-dimensional Fourier transform in the variables x, y, t (a non-complete trans-
formation, unlike (6.159)) or the transition to the , k , z-representation.
Note that a mode can be defined by fixing, instead of k, the frequency,
= (k), k , and the sign k of the longitudinal component kz . Then, kz is deter-
mined by the dispersion law (for an isotropic medium, kz = k ( 0 2 /c2 k2 )1/2 ).
In experiment, mode expansion is achieved by placing a spectral device into the
far-field zone of the emitter; in this case, the spherical angles k , k of k are also
fixed (taking into account refraction). Thus, the subscript k stands for one of the
following sets of values:
k {k, , s} = {, k , , , s} = {, , , , s}. (6.165)
As we will show in what follows, in the case of sufficiently small nonlinearity and
absorption, Fourier transformation of the Helmholtz equations (6.151) w.r.t. x, y
leads to a system of ordinary equations for SVA (for k > 0),
!
d k 2ik NL
+ Ek (z) = P (z) exp(ikz z). (6.166)
dz 2 cnk k
Here, the following notation was introduced:
k k ek 00 ek /cnk , nk nk cos k cos k ckz /k , (6.167)
where k is the angle between the ray (Poynting) vector sk of the mode and the z
axis, k is the anisotropy angle, i.e., the angle between k and sk , and PkNL is the
Fourier component of the nonlinear polarization with the frequency k > 0 and
the transverse wave vector k {k x , ky }, which is parallel to the ek vector. It can
be written as
PkNL (4|ukz|/L)PNL (k , q, z)
Z
= (2|ukz|/L3 ) dx dy dt ek PNL (r, t) exp[i(k t k x x ky y)], (6.168)
modes. Let us stress that in a one-dimensional model, a mode k can be only ex-
cited by a component of the source field having the same frequency and transverse
wavevector. The source field can have any dependence on z but the most efficient
component is the one that has the propagation constant Kz close to kz , due to the
phase matching condition (compare with (4.23)).
The boundary conditions for system (6.166) are given by the amplitudes Ek (z0 )
of the plane waves incident from the left and from the right: z0 = 0 for = + and
z0 = l for = . The solution to (6.166) yields the rule of mode transformation
by the nonlinear layer, i.e., the scattering matrix of the layer. Certainly, for de-
scribing real experiments the model should include reflection and refraction at the
boundaries (with an account for the nonlinearity, see Ref. [Bloembergen (1965)]),
as well as consider the limited interaction cross-section in x and y. In this case,
even in the linear approximation one gets the diffraction coupling between the
modes having close directions. This effect is more conveniently described in the
, r representation (6.151) (see the self-focusing section in 6.3).
Let us find, with the help of (6.166), the variation rate of the energy flux lon-
gitudinal component carried by the mode k, in ~k units:
where z is a unit vector along the z axis. For calculating Fkz , we multiply (6.166)
by Ek and sum with the complex conjugated expression,
The right-hand side has, according to (6.154), a simple meaning: this is the energy
(in ~k units) absorbed by a unit volume of the matter per unit time due to the
nonlinear polarization, i.e., this is the rate of the photon density decrease, Nk , in
mode k. In other words, (6.171) is the transfer equation (or continuity equation)
for the photons of the macroscopic field,
Fk + Nk = 0, Fk = uk Nk . (6.172)
where
m
X
k si kiz , Ei ei E(si i ) (ki ), (6.174)
i=1
and integration w.r.t. x, y, t in (6.168) yields the following restrictions for the
interacting modes and the sign indices:
m
X
k si ki = 0, (6.175)
i=1
m
X
si i = 0. (6.176)
i=1
Let us re-normalize the mode amplitudes so that their squares are equal
to the longitudinal photon flux density. (To simplify the notation, we assume
Fi Fki z > 0.)
|ai |2 = Fi . (6.179)
where
!m/2 !1/2
1 8~ 1 . . . m .
1...m (m1) ..e1 . . . em . (6.182)
2~ c n1 . . . nm
These equations will be analyzed for several typical cases in Secs. 6.4, 6.5.
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
Here, all amplitudes and phases of the field harmonics can be varied indepen-
dently; therefore, the expression in round brackets is identically equal to zero,
12... 21...
... = ... . (6.188)
This permutation relation is more general than (6.21) where the medium was as-
sumed to be transparent at all frequencies.
In the case of a non-parametric interaction, the phases are irrelevant, and we
obtain from (6.157) and (6.183) that
. .
Im(1122.....E1 E1 E2 E2 2211... ..E2 E2 E1 E1 . . . ) = 0, (6.189)
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
X
|E1 E1 E2 E2 . . . | Im(1122... 2211...
... ... ) = 0. (6.190)
...
1122... 2211...
... = ... . (6.191)
We have already observed such symmetry for Raman transitions and, in gen-
eral, two-photon transitions (Sec. 6.2). Assuming in (6.191) 1 L > 0, 2
S < 0, we obtain, with the help of (6.16),
LLS S
S S LL
(L = L + S S ) = (S = S + L L ) . (6.192)
Strictly speaking, (6.192) leads to the symmetry of only the imaginary parts
of the susceptibilities; however, using relations of Kramers-Kronig type (4.8), one
can show that the real and imaginary parts of (m) have the same symmetry. Re-
lation (6.191) also follows from (6.188), if one considers a resonance parametric
process with the frequencies 3 = 1 , 4 = 2 :
1212... 2112...
... = ... . (6.193)
where the sign factors si = 1 in the case kiz > 0 are determined by the en-
ergy conservation in the corresponding elementary process. Hence, for systems
(6.180), (6.181) we find m 1 integrals of motion, which allow the intensities in
m 1 modes to be expressed in terms of the intensity in a single mode,
easily see that the total energy flux is expressed in terms of Ck constants. To this
end, let us multiply relations (6.195) by ~k and add them,
m
X m
X
Iz (z) = ~k Fk (z) = ~k Ck . (6.196)
k=1 k=1
6.3.6 Derivation of one-dimensional equations
In order to derive equations (6.166) for SVA in the , q, z-representation, let us
write the frequency components of the field En (r),
E(, r) = (1/2)[En (r)( n ) + En (r)( + n )],
which enter the Helmholtz equations (6.151), in terms of the mode amplitudes
Ek (z) = 4|ukz|E(, q, z)/L defined in (6.164). Let > 0 and the polarization type
be fixed, then
Z
dt it
Z L 3
2E(, r) = e d3 k Ek (z) exp(ik r i(k)t)
2 2
L 3 Z
= d3 kEk (z) exp(ik r)( (k))
2
L 3 Z X
= d2 q |ukz |1 Ek (z) exp[iq ikz (, q)z], (6.199)
2
Further, assume that only a single mode with a certain sign of kz is excited effi-
ciently, then
L3
Z
E(, r) = d2 q|ukz|1 Ek (z) exp(ik r),
163 (6.201)
k {q x , qy , kz (, q)}.
While substituting (6.201) into (6.151), let us take into account that usually
both absorption and nonlinearity are low. Then, Ek (z) vary very little at a distance
of a wavelength, and the second derivatives can be neglected (for more detail on
the SVA method, see [Akhmanov (1964)]),
As a result,
where the k index is so far omitted. The first term here will be further canceled
due to (6.163). Let us take the inner products of (6.151) and (6.203) with e and
use the vector identity
iL3
!Z
d2 q|ukz|1 eikr {2c2 [e (e k)] 2k e 00 e}Ek (z)
163
Thus, the real part of 1 determines the local variation of the propagation con-
stant (i.e., of the wavelength or the phase velocity) of mode k1 , which scales as the
local intensities of other modes (induced dispersion effect), while the imaginary
part of 1 determines additional absorption or amplification due to the energy of
other modes (nonlinear or induced absorption effect).
Multiplying (6.207) by E1 or directly using (6.181), we find the system of
equations for the intensities (Fi Fiz ),
[d/dz + 1 + (m)
1 (z)]F 1 (z) = 0, (6.209)
where
!m
1 8~ 1 m 00
(m) (m) (m)
1 1 F 2 F m , 1 .
2~ c n1 nm 1
Recall that the signs of Fi and ni coincide and are determined by the sign of the ki
projection onto the z axis; therefore, F1 can increase in the course of propagation
only if 001 < 0.
Below, we will consider the basic types of non-parametric effects: nonlinear
absorption (including the saturation effect), Raman amplification and absorption,
spontaneous and stimulated scattering, self-focusing and self-modulation, as well
as their role in optics applications and in spectroscopy.
Fig. 6.6 Variation of the intensity of light in the course of propagation and the limitation effect under
two-photon absorption: F is the photon flux density times l, where is the two-photon absorption
coefficient, and l is the thickness of the layer.
leads to the violation of the exponential Bougers law for the intensity variation in
matter. This effect can be related to two-photon absorption (see (6.146)), in which
(3)00 > 0, and to the saturation of a single-photon resonance, where, according to
(4.99),
(3)00 = (1)00 dab
2
T 1 T 2 /~2 [1 + (0 )2 T 22 ].
Let a plane monochromatic wave be incident on a transparent (in the linear ap-
proximation) medium orthogonally to the boundary. Then, according to (6.209),
dF/dz + F 2 = 0, 322~2 (3)00 /c2 n2 > 0. (6.210)
The solution to this equation can be easily found,
F(z) = F(0)/[1 + zF(0)] 1/z. (6.211)
The last equation takes place at F(0) 1/z and describes the limitation effect
(Fig. 6.6), in which the intensity F(z) of the transmitted light does not depend on
the incident light intensity F(0).
If F(0) has fluctuations (in particular, due to the photon structure of the field),
the two-photon absorption will smear them, i.e., at the output of the layer photons
will be distributed in time and space more uniformly than at the input (the anti-
bunching effect, see Sec. 7.6).
For two-photon inter-band transitions in semiconductors like CdS, (3)00
1013 cm3 /erg at 0 0.7 and n 2. Then, /~ 1 cm/GW, i.e., the limi-
tation level at l = 1 cm for I ~F is on the order of 1 GW/cm2 . Note that at
such intensities, the structure of the matter can be changed, there can be a phase
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
transition and plasma can be created. Such optical breakdown in a transparent gas
or condensed matter can start from multi-photon absorption or ionization.
Ionization of an inert-gas atom requires the absorption of about 10 photons
from the neodimium laser (~ 1 eV). According to (6.209), the number of such
transitions per unit volume and time is
with E NL ~0 /d0 . Although this value is extremely small, the effect can be ob-
served by focusing a Q-switched laser [Delone (1978)]. Note that the high power
of the field in (6.212) increases the effect of its fluctuations. Indeed, according to
(7.41), radiation with Gaussian statistics is hI 10 i/hIi10 = 10! 3 106 times as
efficient as a non-fluctuating radiation with the same hIi.
As it was shown by Keldysh (see Ref. [Elyutin (1982)], Sec. 29), at very large
field values E1 or small frequencies , so that the condition d0 E1 < ~ is violated,
the dramatic power dependence of (6.212) on E1 gets slower due to the factor
exp(~0 /d0 E1 ), which is typical for the effect of tunnel ionization in a constant
field.
Power dependence of the form (6.212) can be also violated due to the dis-
sociation of multi-atom molecules in the field of IR-lasers under a single-photon
resonance with one of the vibrational frequencies of the molecule, v . In the
case of a CO2 laser ( 10 , ~ 0.1 eV), dissociation requires several tens
of photons. However, experiments with short pulses, where collision relaxation is
too slow to be revealed, manifest a weak dependence on E1 . This effect is proba-
bly caused by the fast transfer of the absorbed energy to a large number of other
vibrations (intra-molecular relaxation). Such effects are studied by a new field
in quantum electronics, IR photochemistry, and used for the laser separation of
isotopes.
In photochrome materials [Barachevsky (1977)], the limitation (darkening) ef-
fect is observed at much lower intensities, even in sunlight, where I 0.1 W/cm2 ,
due to the photochemical anharmonicity, which is very strong but inertial.
The lowest nonlinearity threshold is observed for the photographic process,
where the photochemical reaction (creation of blackening centers, silver particles
in AgBr microcrystallites) sometimes requires an energy density (exposure) as
low as It 108 J/cm2 . It is noteworthy that for the creation of a single center,
several photons are required (m > 1), i.e., this is a highly nonlinear process. At
small exposures, the degree of blackening D of a developed negative scales as the
number of centers; therefore, D = I m , where is the film sensitivity, and we
ignore the statistical variance of m. The relative intensity decrease for the probe
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
dF 0 F
+ = 0. (6.214)
dz 1 + F/F s
where F F(z), F0 F(0). The two terms in the right-hand side of (6.215) cor-
respond to the exponential and linear F(z) dependencies, respectively (Fig. 6.7).
At strong saturation, the exponential Bougers law becomes linear,
!
0 z
F 1 F0 , (6.216)
1 + F0 /F s
and in the limit F0 F s the matter is completely bleached (compare with the
nonstationary SIT effect described in Sec. 5.1).
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
Fig. 6.7 Intensity F of a wave versus the distance z in an amplifying or absorbing medium in the
presence of saturation. At strong saturation, the exponential variation of F is replaced by a linear one
(the dashed line), 0 is the amplification coefficient for a weak signal, F s is the signal intensity at
which 0 is reduced twice.
(a) (b)
(c) (d)
Fig. 6.8 Doppler-free two-photon spectroscopy: (a) the levels ( is the scanned laser frequency and
0 is the frequency of the observed fluorescence); (b) the scheme of the spectroscope; (c) the idea of
Doppler-shift compensation; (d) the signal as a function of the laser frequency.
(a) (b)
Fig. 6.9 Doppler-free saturation spectroscopy: (a) the scheme of the saturation spectroscope; (b) the
observed signal as a function of the laser frequency.
(a) (b)
(c) (d)
Fig. 6.10 Raman interaction of two waves at various initial intensities. The horizontal axis is the
distance z in 1/C units and the vertical one shows the intensities Fi in units C = F10 + F20 , i.e.,
the functions F1 = (1 + ez )1 and F2 = 1 F1 , where = F20 /F10 is the relative initial intensity
of the Stokes wave. (a) = 1/25, the dashed line shows the asymptotics describing the exponential
amplification of the Stokes wave in the undepleted-pump approximation; (b) = 1/5; (c) = 1; (d)
= 5. The anti-Stokes wave, roughly, decays exponentially.
amplification without the population inversion. Raman amplification (or, for anti-
Stokes frequencies, attenuation) is used in spectroscopy, as well as for shifting
the frequency of lasers. If the feedback is added, with the help of a mirror, an
amplifier becomes an oscillator (a Raman laser).
where
( " # )1
Nb ~(1 2 )
G exp(2 l), N = exp 1 , (6.226)
Na Nb T
T is the temperature of the matter, and we assume that the transition a b is not
saturated.
For passing from N to F, one has to multiply the photon flux density in one
mode, Nc/L3 (L3 is the quantization volume), by the effective number of modes
for the amplifier or the detector,
Nc Nc3 k
F = g = =N . (6.227)
L3 (2)3 22
Here, we have considered a single polarization type, assumed the refractive index
to be unity, and denoted the effective frequency and solid-angle bandwidths of the
amplifier or the detector as and , respectively.
If the scattering volume has the shape of a thread, with a 1 and G 1,
only longitudinal scattering ( 0) or backscattering ( 180) into the solid
angle A/l2 is considerable.k (Here, A a2 is the cross-section area of the
scattering volume and l is its length.) Then, from (6.227) we find the relation be-
tween the noise power per unit frequency band, P = ~FA/, and the number
of photons per mode,
where the dimensionless number nF A/l is called the wave parameter or the
Fresnel number. Substitution of (6.225) into (6.228) yields ( 2 , 2 )
of the Raman resonance, i.e., by the decay rate of molecular vibrations (usually,
/2c 1 10 cm1 ). At high amplification, becomes (l)1/2 times as
narrow (Sec. 2.3)
Thus, according to (6.229), the scattered power depends on the pump inten-
sity as exp(lF10 ) 1. At lF10 1, the exponential dependence turns into a
linear one, and then the scattering is called spontaneous (SpRS). Replacing A/l2
in (6.229) by , we find the power of spontaneous Stokes scattering per unit
frequency and solid angle,
P = (~c/3 )(N + 1)VF10 , (6.230)
where V = Al is the volume of the scattering area. It is not difficult to verify
that this expression agrees with the above-calculated cross section (6.148) of a
spontaneous-stimulated transition. In contrast to StRS, SpRS has a broad direction
diagram, determined by the convolution of (3) with the polarization unit vectors
(6.220).
At F2 F1 , the role of the pump is played by the lower-frequency field,
and the external (anti-Stokes) field F1 (z), according to (6.217), (6.218), decays
exponentially. This effect is used in the spectroscopic method of inverse Raman
scattering. The noise field is then described by (6.225), (6.229) or (6.230) with
the permutation of indices 1, 2 and the replacements N + 1 N, .
Therefore, spontaneous anti-Stokes scattering with a given frequency is
N/(N + 1) = Nb /Na = exp(~0 /T )
times as weak as the Stokes scattering, provided that all other conditions are the
same. With a sufficiently strong pumping, the power of the anti-Stokes component
is saturated. The number of photons per mode tends then, according to (6.225), to
N, i.e., the anti-Stokes field acquires the brightness temperature
T e f = T 1 /0 T. (6.231)
6.4.5 Self-focusing
This single-frequency effect is connected with the self-action of a quasi-plane
quasi-monochromatic wave due to the real part of the cubic susceptibility (3)
Re(3) (, , ). The main contribution in (3) is from the electro-striction and
orientational anharmonicity (then, (3) > 0). In an absorptive medium, tempera-
ture anharmonicity ((3) < 0) and saturation are added.
With an account for linear susceptibility, the polarization amplitude is (we
assume that (1)00 = (3)00 = 0)
P = ((1) + (3) |E|2 )E (|E|2 )E,
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
where E E1 (r, t) is the slowly varying amplitude, i.e., the envelope. As a result,
the refractive index depends on the local intensity of the field,
Fig. 6.11 Self-focusing and defocusing (dashed line): 1, a beam with low power, P P0 , slowly
diverges due to diffraction; 2, a beam with a critical power, P0 , becomes channeled; 3, a beam with
P > P0 collapses.
6.4.6 Self-focusing length
Clearly, at P > P0 the beam will compress (self-focus), and under the opposite
condition it will diverge due to diffraction. The length of self-focusing will appar-
ently depend on the value (P P0 )/P0 .
In order to find this dependence, consider the solution to the nonlinear wave
equation (6.151) for a monochromatic field Re E(r)eit in quasi-optical and
aberration-free approximations (see Ref. [Vinogradova (1979)]). In a homoge-
neous isotropic medium without external sources, the field is transverse, so that
(6.151) leads to the nonlinear Helmholtz equation,
2 E + k2 (1 + (2 /0 )|E|2 ) E = 0, (6.237)
(3)
where k = n0 /c, n0 = 0 , 2 4 = 2n0 n2 .
Let a polarized light beam with a limited cross-section but with a narrow di-
rection diagram (narrow angular spectrum) be incident on a nonlinear transparent
medium occupying a semispace z > 0. Then the field in the medium can be
searched in the form
E(r) = eE(r) exp(ikz), (6.238)
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
i.e, the nonlinearity only changes the phase velocity /(k + k) = c/(n0 + n) of
the plane wave. For CS2 , at I = 1 GW/cm2 , n = 2 105 .
Now, let the incident wave have a Gaussian profile and a plane wave front
at the boundary: E(x, y, 0) = E0 exp(2 /a20 ), where 2 x2 + y2 and a0 is the
initial radius of the beam. Let us try to find the solution to (6.239) in the form of
a Gaussian beam with a variable radius, a(z) a0 f (z), and a parabolic wave front
with a variable curvature (z) on the axis,
Here, (z) is the additional to kz phase delay on the axis, and the factor 1/ f pro-
vides the dependence of the wave power on z,
Z
P = (cn0 /8) dx dy |E|2 = (cn0 /16)E02a20 . (6.243)
At small , the wave (6.241) has approximately spherical wavefront with the ra-
dius 1/(z). Below, it will be shown that the curvature of the front and the width
of the beam at a given z are related through Eq. (6.247).
From (6.241), we find
E02 22 E02
22
|E|2 = exp 1 .
f2
2 2
a0 f f2 a20 f 2
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
In the last equation, we have passed from a Gaussian profile to a parabolic one,
which is possible for the paraxial part of the beam (the aberration-free approxi-
mation). By substituting (6.244) into (6.239), one can find the functions f, , .
Indeed, setting to zero the sum of the coefficients by 2 , we find
2ikF 0 + 4F 2 k2 /z2NL f 4 = 0, (6.245)
where
z2NL n0 a20 /4n0 (6.246)
and n0 is the refractive index variation at point r = 0. The imaginary part of
(6.245) yields
= d(ln f )/dz, (6.247)
and the real part,
f 00 = 1/z20 f 3 . (6.248)
Here, the following notation has been introduced:
z20 (z2 2 1
d zNL ) = z2d /(1 P/P0 ) = z2NL /(P0 /P 1),
(6.249)
zd ka20 /2, z2d /z2NL = P/P0 = (n0 /n0 )(ka0 )2 .
The z0 parameter determines the length at which the beam collapses, and the zd
parameter has the meaning of the diffraction length; here, P0 and P are given
by Eqs. (6.236) and (6.243).
Hence, we finally obtain the z dependencies for the width and the curvature
radius R(z) of a parabolic beam due to its self-action,
a(z)/a0 = f = (1 + z2 )1/2 , (6.250)
Fig. 6.12 Dependence of z0 (in ka20 /2 units) on the beam power P (in the units of critical power P0 ),
given by the expression |1 P|1/2 . At P < 1, z0 has the meaning of the beam diffraction length, and
at P > 1, it is the collapse length. The latter, at P 1, is approximately equal to the nonlinear
length zNL (dashed line).
(Note that 1 + iz = (1 + z2 )1/2 exp(i arctan z), i.e., here, (z) = arctan(z/zd ).)
The diffraction length zd determines the distance at which the cross-section of the
beam is doubled and the front radius achieves its minimal value 2zd = ka20 b (the
so-called confocal parameter of the beam). At P P0 , the role of zd is played by
z0 , and diffraction slows down. Finally, at P > P0 , the parameter z20 and, hence, z2 ,
becomes negative, the beam is narrowed, and its width and the wavefront radius
turn into zero at z = |z0 |; at P P0 , |z0 | zNL (Fig. 6.12). Certainly, quasi-
optical aberration-free approximation describes correctly only the initial stage of
self-focusing; however, the typical self-focusing length should be on the same
order of magnitude as |z0 |.
In the vicinity of the focus, an important role can be played by higher-order
nonlinearities (5) , . . . and multi-frequency nonlinear effects, for instance, stimu-
lated scattering. Note that in the case of pulsed fields, the focus position |z0 (t)| is
time-dependent: as the intensity increases, the focus moves from infinity to some
minimal distance, and then it goes to infinity again (the moving focus).
Next, let a wave with a plane wavefront and a complicated profile
|E(x, y, 0)|2 I(x, y) (a wave with transverse amplitude modulation) be inci-
dent on a nonlinear medium. In the course of propagation, diffraction tends to
smoothen the profile inhomogeneities, while the nonlinear refraction, in contrast,
makes them more pronounced (self-modulation). Qualitatively, these effects are
described by (6.250) if a(z) is understood as the width of an inhomogeneity. As a
result, a wave can split in numerous waveguided filaments, each of them carrying
a power of P0 . This splitting is an example of dynamic transverse instability for
waves in a nonlinear medium and is similar to the formation of solitons in the case
of longitudinal instability (see below).
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
Clearly, at (3) < 0, self-defocusing should take place (Fig. 6.11). It is de-
scribed by the above-given theory with negative P0 and z2NL . This effect can
be easily observed in continuous-wave laser beams by adding absorbing dyes to
transparent solutions.
In two-frequency (and/or two-wave) experiments, mutual focusing or defocus-
ing is observed due to Re(3) (2 = 2 1 + 1 ): an intense light beam with a
frequency 1 turns a plane-parallel plate into a collecting or dispersing lens for
the wave 2 . The same frequency component of the cubic susceptibility describes
induced dispersion and the optical Kerr effect: a plane pump with the frequency
1 turns an isotropic medium into a birefringent one for the second wave, with
the frequency 2 . The optical Kerr effect and mode-locked lasers enable the man-
ufacturing of optical shutters with switching times on the order of 1 ps.
Consider now a plane quasi-monochromatic wave with the time-modulated
envelope E(t). In a linear medium, within the first approximation of the disper-
sion theory, E(t) propagates without changing its shape, with the group velocity
u = (k). In the framework of the second approximation, d2 /dk2 , 0, and
the intensity spikes get spread; this effect is similar to the diffraction smooth-
ing of transverse inhomogeneities. In a nonlinear medium, self-action can lead
to an inverse effect: at (3) d2 /dk2 < 0, pulses with sufficient energy are com-
pressed (self-compression, see Ref. [Vinogradova (1979)], Sec. 9.5). A quasi-
monochromatic wave with a small initial modulation splits, as a result of self-
modulation, in separate pulses with fixed energy and shape, the solitons,l , which
are analogues of waveguided filaments in the case of self-focusing. In practice,
such effects are observed for picosecond pulses, so that inertial nonlinearity mech-
anisms (for instance, the temperature one) are not manifested.
servation law,
k x = 0, ky = 0, (6.255)
P
where k si ki . Formally, one obtains (6.254) and (6.255) by doing the three-
dimensional Fourier transformation of Maxwells equations (Sec. 6.3). In experi-
ment, the interaction time and cross-section A a2 are always limited; therefore,
(6.254) and (6.255) are valid only up to an accuracy of 1/, 1/a, respectively,
nonlinearity on the pump, i.e, assume that the nonlinear polarization P1 (z) at fre-
quency 1 in the right-hand side of (6.257) is a given function of the coordinates,
determined by the spatial distribution of free incident fields.
For a given polarization, taking into account definition (6.168), from (6.257)
we find the expression for the amplitude of the mode (k, ) excited in a homoge-
neous transparent linear medium by a Fourier component of the polarization,
82 ikuk
Z
E (k) = 3 2 d3 r eikr e P(k , r), (6.260)
L nk cos k V
where uk is the group velocity. Note that (6.260) determines the radiation field
outside of the radiating area V regardless of its shape.
In the case of a plane-parallel layer with the thickness z, it follows from (6.257)
that
!m
21 |(m1) |2 z2
!
8 2 z
I1z (z) = sinc I2z (0) . . . Imz (0), (6.262)
c 16n1 . . . nm 2
where Ikz = cnk |Ek |2 /8 = ~k Fkz is the longitudinal component of the energy
flux density in mode k and sinc(x) (sin x)/x. Eqs. (6.261), (6.262) give the
efficiency of parametric frequency conversion.
Note that the new mode (k1 , 1 ) k1 is determined through the pump modes
k2 , . . . , km by conditions (6.254), (6.255) only up to the polarization type 1 and
the sign 1 of the longitudinal component k1z (see (6.165)). In other words, each
P
combination of polarized pump waves si ki , i = 2, . . . excites, in principle, four
waves with the frequency 1 , which differ by indices 1 = 1, 2 and 1 = 1.
However, the factor sinc2 (z/2) in (6.262) usually strongly suppresses the wave
with k1z < 0 (about (k1z z)2 times, see Fig. 6.13).
Thus, in the undepleted-pump approximation, excitation of new waves scales
as the product of the incident intensities, the squared nonlinear susceptibility,
and, provided that phase matching is satisfied (||z 1), as the square of
the interaction length. In the case of the opposite inequality, z2 is replaced by
(/2)2 sin2 (z/2) 2/2 , i.e., if the phase velocities are not matched, the ampli-
tude of the new wave periodically turns to zero. The distance /||, at which the
intensity of the wave varies monotonically, is called the coherence length.
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
Fig. 6.13 In the case of parametric frequency conversion, 2 + 3 4 1 , there are two waves,
k1() , satisfying the energy and transverse momentum conservation laws. However, the coherence
length /() for the k1() wave is extremely small, on the order of the wavelength.
where k kr and r connects some point inside the V area containing the sources
and the observation point. Thus, at large V the far field scales as the Fourier
transform of the external polarization.
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
Let the tensor (m1) be nonzero within a rectangular box with dimensions a, b, c,
then, at s1 = 1, (6.264) yields
21 ik1 r .
E (1 , r) = 2
e ( (m1) ..E2 . . . Em )V f (r), (6.266)
cr
Z
0 0
f (r) d3 r0 eik r /V = sinc(k x a/2)sinc(ky b/2)sinc(kz c/2), (6.267)
k n1 1 r/c K. (6.268)
Fig. 6.14 To the definition of the direction diagram I() for the field emitted by a polarization with
the frequency and the wave vector K from a domain with the dimensions a, b, c: the radiation has a
noticeable intensity only if the observed wave vector k belongs to a domain of dimensions 1/a, 1/b, 1/c
near the point K.
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
Fig. 6.15 Angular dispersion of the refractive index ne () for the extraordinary wave enables the
frequency dispersion to be compensated and the phase matching condition no (1 ) = ne (21 , 1 ) to be
satisfied for second-harmonic generation, as well as for other three-wave interactions.
where
(a) (b)
Fig. 6.16 Parametric interaction in the case of a low-frequency undepleted pump. (a) The intensity
versus the distance z; (b) two versions of the radiation spectrum (arrows show the direction of the
energy transfer).
(a) (b)
Fig. 6.19 Parametric amplification and difference-frequency generation in the case of backward in-
teraction. (a) Intensity (in F10 units) as a function of the coordinate z (in 1/ units) at F2l = 0 and
l = 1.5; the parametric amplification coefficient is 1/ cos2 (1.5) 200; (b) the phase-matching trian-
gle for backward interaction.
the OPO can be smoothly tuned by varying the orientation or temperature of the
crystal. The existing pulse-pumped OPOs cover the range 0.420 , which
conveniently complements the range of tunable dye lasers. By optimizing the
parameters of the crystal, the focused pump beam, and the cavity, one can obtain
oscillation even in the cw regime.
Let the layer have a thickness l, then the boundary conditions have the form
a1 (0) = a10 , a2 (l) = a2l . It is easy to check that the solutions to (6.286) are
(Fig. 6.19)
where
The factor 2 in the last expression was added according to relation (6.13).
One can easily see that these equations satisfy the condition of the field energy
conservation, which in this case coincides with the Manley-Rowe equation,
and the factor 2 is due to the fact that each second-harmonic photon has an en-
ergy twice as large as the pump photon energy. The second integral of equations
(6.288), (6.289), according to (6.198), has the form
From the second equation, it follows that both 1 and 2 are independent of z, and
from the first one, that the energy transfer direction (from the first harmonic into
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
Fig. 6.20 Second harmonic generation. The horizontal axis shows the distance (in 1/ units), the
vertical axis corresponds to photon fluxes for the pump and the harmonic (in F10 units). At z 1,
every two pump photons turn into a single double-frequency photon.
the second one or vice versa) depends on the signs of and . In the case consid-
ered, b2 is increasing (from zero); therefore, sin > 0, i.e., = (/2) sign.
Substituting (6.292) with C1 = b210 into (6.294), we easily find the solution
db2 /dz = ||(b210 2b22 )/2, (6.296)
Z
1 db2 2 2b2
z= 2 2
= arctanh . (6.297)
|| b10 /2 b2 ||b 10 b 10
Hence,
b2 = (1/ 2)b10 tanh z, b1 = b10 / cosh z, (6.298)
where
1/2
||b10 / 2 = 2(2/cn1)3/2 1 ||I1z . (6.299)
Thus, the intensity of phase-matched second harmonic I2z (z) increases mono-
tonically as tanh2 z, and at z 1 it achieves the initial intensity of the pump
I1z (0) (Fig. 6.20). The pump intensity goes in this case down to zero as 1/ cosh2 z,
with its phase being constant. At z = 1/, the SHG efficiency is 58% in the en-
ergy and 29% in the photon number. At , 0 or a2 (0) , 0, monotonic solutions
are replaced by periodic ones, and reduces.
In practice, powerful pulsed lasers provide as high as several tens per cent.
Note that phase-matched SHG in piezoelectric crystals has important applications,
such as shifting laser frequencies up by as much as an octave.
Let us also mention much weaker effects that do not require phase match-
ing: SHG under the reflection from a medium without the center of symme-
try [Bloembergen (1965)] and incoherent scattering of light at frequency 2 by
non-centrosymmetric molecules [Kielich (1980)], as well as by any molecules,
atoms, and free electrons, due to the magnetic anharmonicity (Sec. 6.2).
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
Note that the condition E2 (+0) = 0 accepted above does not fully correspond
to an ordinary experiment where it is the incident second-harmonic field that is
zero, while the transmitted and reflected fields E2 (+0), E2 (0) are non-zero due
to the requirement that the tangent components of the fields E2 , H2 should be
continuous at the boundary [Bloembergen (1965)]. However, in practice E2 (0)
are indeed very small.
Fig. 6.21 Spontaneous multi-photon processes can be taken into account in the semiclassical the-
ory, by adding a single extra photon into the output modes (the corresponding arrows are directed
downwards). The figure shows the direct and inverse three-photon parametric processes.
6.5.9 Parametric down-conversion
The quantum theory, in principle, allows a three-photon transition in which trans-
parent matter (an atom or crystal) absorbs a photon from mode k3 and emits a
pair of photons into modes k1 and k2 , returning into its initial state (Fig. 6.21).
The probability of such a transition scales as (N1 + 1)(N2 + 1)N3 , where Ni are
the initial photon numbers per mode and the unities appear due to the quantum
fluctuations of the mode amplitudes in the ground state (Chapter 7). The prob-
ability of the inverse process, where photons in modes k1 , k2 are absorbed and a
photon is emitted into mode k3 , scales as N1 N2 (N3 + 1). As a result, the rate of
pair generation scales as
Thus, due to the effect of the radiation at frequency 3 , the matter generates pairs
of photons with frequencies 1 and 2 = 3 1 distributed within a broad
range, from zero to the incident radiation frequency. In the case of macroscopic
matter, the frequencies and directions of emission are related through the phase
matching condition k3 = k1 + k2 . This effect, observed in birefringent crystals, is
called parametric down-conversion (PDC)n or parametric fluorescence [Zhabotin-
sky (1969); Klyshko (1980); Akhmanov (1971)]. PDC can be interpreted as the
quantum noise of an optical parametric amplifier.
n Editorsnote: in the original text, D. N. Klyshko used the term parametric scattering, suggested
in his pioneering work (1967) on the theory of PDC [Klyshko (1980)]. Later, the term parametric
down-conversion became widely accepted.
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
where p+q = 1, 0 6 p 6 1. In particular, one can add half a photon to all numbers
of photons per mode.
According to (6.306), quantum noise yields G1 photons in each output mode
of a parametric amplifier, with G being the amplification coefficient for this mode.
A similar result (Kirchhoffs law) is valid for quantum and Raman amplifiers (see
(7.7) and (6.225)).
Note that at N2 = N3 = 0, it follows from (6.304) that three-frequency up-
conversion with low-frequency pumping has no contribution from quantum fluc-
tuations. In the case of four-frequency interaction, it is possible that two pump
photons decay in a pair of photons with the frequencies and 23 , occupying
the range 0 2 (hyper-parametric scattering or light scattering by light).p
Let us pass from photon numbers per mode to the spectral brightness. Using
(6.306) and (6.227), we find that at Ni0 = 0 (we assume that 1 = 0)
where F dF/dd is the flux of photons in the k1 direction with the fre-
quency (k1 ) per unit solid angle, unit area (orthogonal to k1 ) and unit circular
frequency. The value I (k, r) is called the spectral brightness (sometimes, sim-
ply the intensity) of incoherent radiation. Recall that the brightness, in the case of
a transparent medium, does not vary in the direction of light propagation.
o An idler mode, with respect to the considered signal mode with the frequency , is the mode
1
with the frequency 2 = 3 1 .
p Editors note: now this process is commonly called spontaneous four-wave mixing. Starting from the
beginning of the 2000 s, it is applied for the generation of nonclassical light, both faint one (photon
pairs) and bright one (squeezed light).
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
Fig. 6.22 Frequency-angular spectrum of parametric down-conversion in lithium niobate for different
pump wavelengths 3 : 1 is the angle of the cone along which the wave 1 is emitted. The angle 3
between the pump beam and the crystal axis is 90 . When 3 is reduced, the gap in the spectrum (for
3 < 0.53 ) disappears.
continuously turns into the Raman scattering by polaritons and optical phonons.
Then, has a resonance growth due to the contribution from electron-nuclear an-
harmonicity (Sec. 6.2), but this growth is accompanied by an increase in the linear
absorption 2 at the idler frequency, so that the brightness integrated over the fre-
quency, I , does not change much. Certainly, when the idler frequency is small
and 2 l 1, (6.312) has to be multiplied by N(2 /T ) + 1, where T is the lattice
temperature and N is the Planck function.
By analogy with the scattering by polaritons, PDC can be defined as the scat-
tering of the pump radiation by the field fluctuations in the idler modes, i.e., as
light scattering by light, similarly to the way Mandelshtam-Brillouin scattering
can be called light scattering by sound.
The important features of PDC, distinguishing it from other types of light scat-
tering in matter, are, first, a broad continuous spectrum, not related directly to the
eigenfrequencies of the matter, and, second, the two-photon structure of the emit-
ted radiation: at small pumping (l 1), signal and idler photons are emitted
only in pairs, practically simultaneously.
Note that, in addition to coherent (forward) emission, there is incoherent PDC
by separate non-centrosymmetric molecules (to be precise, by the density and
orientation fluctuations of such molecules). Then, the phase matching condition
plays no important role, since the momentum deficit of the field is provided by the
molecule.
Let l 1 and n 1, then from (6.312) we find the intensity of spontaneous
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
photons are orthogonally polarized and therefore have different group velocities.
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
6.5.10 Light scattering by polaritons
This process is described by the system (6.269)(6.271) with linear absorption
introduced into one of the equations. In the approximation of undepleted high-
frequency pump, the interaction between the Stokes (signal) and the polariton
(idler) waves is determined by the equations
where 2 /2, = a30 (see (6.282)), and the lower signs correspond to the
case of the backward idler wave. In the polariton range of the PDC spectrum, i.e.,
in the vicinity of lattice eigenfrequencies, the idler wave is strongly absorbed, so
that usually ||, l 1, and the amplitude of the idler wave is determined by
the local amplitude of the signal wave.
For this reason, let us search the solution to (6.317) in the form a2 (z) =
b2 exp(iz),
Here, the derivative db2 /dz does not much exceed |b2 |; therefore, if || | + i|,
it can be neglected. Then,
In this approximation, the incident idler radiation does not influence the output
field, due to the high absorption, which is much stronger than the parametric am-
plification. Substituting (6.319) into (6.316), we obtain
where
Thus, strong absorption of one of the two interacting waves leads to the ex-
ponential increase, similarly to the case of usual Raman interaction due to (3)
(Sec. 6.4), with the only difference that here, phase matching condition || is
important.
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
The same relation between the resonance susceptibilities of the first, second, and
third order follows from the microscopic theory [Klyshko (1980)].
The spectral brightness of the Stokes field I (k1 ) in spontaneous or stimu-
lated light scattering by polaritons can be found through the Kirchhoff law,
vac
I (k1 ) = I [N(2 /T ) + 1]{exp[g0 (k1 )z] 1}, (6.322)
or by using the results of Sec. 6.4. Note that if a2 is understood as the amplitude
of the sound wave, the above analysis will describe the Mandelshtam-Brillouin
scattering (spontaneous and stimulated).
(a) (b)
(c) (d)
Fig. 6.24 Phase matching conditions in four-wave interactions: (a) generation of UV radiation is
usually performed under collinear phase matching; (b) in CARS, all frequencies are close and phase
matching is satisfied at small angles of scattering ; (c) in the general case of frequency conversion of
the form 1 + 2 3 4 , the phase matching has the form k1 + k2 = k3 + k4 . The wave vectors
in this case can be non-planar: the phase matching quadrangle can be folded along the dashed lines;
(d) in OPC, all four frequencies are equal and the phase matching condition is satisfied for any two
standing waves.
Fig. 6.25 Active spectroscopy. The matter is excited by two lasers with frequencies 1 and 2 such
that 1 2 equals the frequency 0 of molecular vibrations. The resonance can be detected by an
increase in the intensity Ii , polarization, or phase of any one of the four frequencies observed while
scanning 1 or 2 .
metric scattering,r or light scattering by light, and spontaneous CARS, are only
described in terms of nonlinear quantum optics and, in some cases, equations like
the Kirchhoff law [Klyshko (1980)].
In the case of active spectroscopy, the resonance manifests itself in the in-
tensity variation at the new frequencies A or S . (It is convenient to use the
anti-Stokes range, A , where there is less stray light caused by the fluorescence
of the sample and the optical elements.) According to (6.261), (6.262), in the
undepleted-pump and plane-wave approximations, the intensity of CARS is
!4
4 2A l2 I12 I2 (3)
IA (2 ) = 2
|NR + (3) 2
R (A = 21 2 )| , (6.323)
c n1 n2 n A
Fig. 6.26 Phase conjugation using degenerate four-wave interaction. A nonlinear medium excited by
a standing pump wave reflects back all waves of the same frequency along the way of their incidence.
As a result, a divergent spherical wave becomes a wave convergent to the source (dashed line).
incident on the medium, the fourth wave will appear, with the frequency and
the wavevector k4 = k1 + k2 k3 = k3 . Thus, the medium excited by the pump
acts as a mirror reflecting all plane waves back along the path of their arrival, in
contrast to a usual mirror, which performs the transformation kz kz .
In the case of an arbitrary spatial distribution of the signal (object) field E3 (r)
it will contain many Fourier components {k3 }, each of them creating its own con-
jugate component. As a result, the initial object wave will be restored around the
medium, RE3 (r), with the same wave front shape, but propagating in the opposite
direction (from the medium) and, of course, having a different energy (R , 1). It
is important that due to the backward parametric amplification (see (6.287)), |R|
can considerably exceed the unity. (Usually, this is only achieved in the pulsed
regime.)
The possibility of the OPC effect for an arbitrary optical field, which, in a
sense, provides the time inversion,s looks striking from the viewpoint of nonlinear
optics. In fact, this effect has been discovered in the optical range as early as in
1949, long before the birth of lasers and nonlinear optics, by Gabor, the author
of holography. In holography, OPC manifests itself in the appearance of twin
images, which were considered by Gabor only as sources of noise. The perspec-
tives of OPC applications were understood only much later, mainly in the sev-
enties, when the practical methods of dynamical holography have been invented,
which enabled OPC to work in real time, without the delay for photographic
film developing. These methods, in addition to four-wave parametric interaction,
s OPC is equivalent to the t t transformation only for a strictly monochromatic field. In the case
of a quasi-monochromatic field, OPC does not change the shape of the envelope E0 (t).
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
(a)
(b)
Fig. 6.27 Applications of the OPC effect: (a) correcting the wave front distortions. The plane front
of the wave incident from the left becomes distorted, but after being reflected by a phase conjugating
mirror and passing the same medium again it restores its initial shape. (Reflection from a usual mirror
would double the distortions.) (b) Focusing of strong laser radiation on small targets: light from a
weak laser (top) is scattered by a target, part of the scattered field is amplified by a strong amplifier,
gets reflected by the mirror, is amplified once again and returns to the target.
use three-wave degenerate interaction effects, 180 light scattering, and superflu-
orescence [Zeldovich (1985); Bespalov (1979)]. OPC methods have been also
developed for acoustic waves.
OPC is an example of adaptive optics, aiming for the automatic correction
of optical systems. OPC allows one to correct the distortions of the wave front
(phase distortions) appearing due to the signal wave passing through an optically
inhomogeneous medium, like an opaque glass or a quantum amplifier. To correct
for this, it is sufficient to reflect the wave back at the output of the inhomogeneous
medium by means of a phase-conjugating mirror and this way to make it travel
along the same path in the opposite direction. Then, all distortions of the wave
front formed on the way will be straightened and the front will restore its origi-
nal shape (Fig. 6.27(a)). (Of course, amplitude distortions caused by irreversible
absorption or amplification will not be compensated but accumulated.) This ef-
fect allows powerful but inhomogeneous amplifiers to be used for increasing the
energy of weak lasers providing single-mode beams with the minimal (diffrac-
tion) divergence and natural bandwidth. This technique provides record values
of spectral brightness.
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
Another application of OPC, important for solving the problem of laser ther-
monuclear synthesis, is automatic focusing of strong laser radiation onto small
targets (Fig. 6.27(b)).
Using a resonance susceptibility (3) (in particular, enhanced by two-photon
resonances in the vapors of alkali metals) allows one to reduce the pump power in
four-wave interactions (down to 1 W/cm2 , with the reflection coefficient |R| 1).
In OPC via stimulated scattering, a sufficiently strong monochromatic wave
ReE(r) exp(it) serves as a pump causing 180 stimulated Raman scattering
or stimulated Mandelshtam-Brillouin scattering in a nonlinear medium. If the
wave front of the pump is rather non-uniform, the back-reflected Stokes radiation
ReES (r) exp(iS t) (with S = 0 ) has approximately the same wave front
shape, ES (r) RE (r), where |R| is close to unity.
At first stages of backscattering (z . l) the Stokes field is chaotic, it has all
modes with different wavevectors k excited independently and homogeneously.
(Since the wavevector length k = n(S )S /c is fixed, the modes are determined
by the transverse component k q). In the case of a multi-mode pump, var-
ious Stokes modes (q, S ) have different Raman amplification coefficients (q).
Moreover, one can show [Zeldovich (1985)] that if some mode (q, ) is present in
the pump spectrum, then the Stokes mode (q, S ) has, on the average, an ampli-
fication coefficient twice as large: (q) 2. Because at l 1 this difference
is amplified exponentially, it is sufficient that the part of the Stokes field repeating
the pump spectrum is considerably more intense than the noise part.
The advantages of Brillouin or Raman mirrors are the absence of the pump
(they are analogues of reference-free holography) and almost 100% efficiency.
The disadvantages are the existence of a threshold, the necessity to have multiple
modes in the signal field, and the frequency shift of the reflected radiation. The
latter restricts the accuracy of the reconstruction. Of much interest are lasers in
which one of the mirrors is phase conjugating (a Brillouin one) and the other
one, a usual mirror, plane or concave. This scheme simultaneously provides Q-
switching (due to the threshold behavior of the stimulated scattering) and correc-
tion for the optical inhomogeneity of the active medium.
Let us consider OPC via four-wave interaction in a little more detail. An
arbitrary signal field (for simplicity, considered as scalar) can be represented as
The phase-conjugated field describing monochromatic waves with the inverse di-
rections of wavefront propagation, by definition, differs by the sign of the time,
The amplitude and phase of the phase-conjugated field will be defined by analogy
with (6.325),
Hence, the wave surfaces of the monochromatic fields determined by the equa-
tions (r) = const and (r) = const coincide. Thus, the OPC effect reverses not
the wave fronts but the propagation directions.t Note that reflection by a usual
mirror, plane, spherical, or of a more complicated shape, also leads to the trans-
formation E E, but only in trivial cases where the mirror surface coincides
with the wave surface.
The reversed field E copies the signal one in all space outside of the nonlinear
medium, including any transparent or scattering bodies of arbitrary shapes (pro-
vided of course that the scattered field fits the mirror aperture, see Fig. 6.26).
However, as we have already mentioned, if real, irreversible absorption (or ampli-
fication) is present, these are only phase surfaces that are restored; the amplitude
of the reversed field will be further reduced (or increased) on the way back through
the absorber (amplifier).
Let a signal field, ReE3 exp(i3 t), and a pump field, ReE1 exp(i1 t), be
excited in a nonlinear medium, which for the sake of simplicity will be considered
isotropic. An electromagnetic field present in a medium is accompanied by other
fields, for instance, pressure p(r, t), temperature, vibrations of molecules, excited
electrons, and so on (Sec. 6.2). In photographic materials, there are fields of
metallic silver or other products of photochemical reactions. In the simplest cases,
the amplitudes of these fields scale as the constant or slowly varying part of the
squared local field E 2 (r, t). For instance, due to the optical electrostriction,
the case of monochromatic fields with the same frequencies, p(r) is a static field
materializing the spatial distribution of the interference field E1 (r)E3 (r). This
simple model illustrates the idea of dynamical holography, which is used for the
study of fast processes.
The record (6.329) can be read out with the help of a second reference wave
ReE2 exp(i2 t), which will be scattered by the field n(r, t) of the refractive
index induced by the pressure field. In other words, the readout field E2 induces
polarization scaling as pE2 , and the field it emits is
E4 (r) P4 (r) = (3) E1 (r)E2 (r)E3 (r), (6.330)
where 4 = 1 + 2 3 . The relation between E4 and P4 in the Born approxima-
tion is given by Eqs. (6.260), (6.264). One can see from (6.330) that if the product
E1 E2 has a weak dependence on r, the OPC effect takes place, E4 E3 = E3 .
In particular, this is the case for a plane monochromatic standing-wave pump, for
which 2 = 1 and k2 = k1 . Note that in usual, static holography, since the
nonlinearity is inertial, it is necessary that 1 = 3 and 2 = 4 .u
Note that OPC can be also obtained via three-wave interaction,
P4 (r) = (2) E1 (r)E3 (r). (6.331)
However, in this case, only a part of the signal field E3 is reversed, whose fre-
quency and angular spectrum is within the phase matching band |k|l < 1,
k k1 k3 k4 , which leads to a loss of fine spatial and temporal details
of the signal. In addition, the phase-conjugated field E4 propagates from left to
right, similarly to the signal field, and an additional mirror is required for reflect-
ing it back. A huge advantage of four-wave interaction is the possibility to obtain
phase matching automatically using standing plane waves.
Let us estimate the efficiency |R| of OPC via degenerate (i ) interaction
using one-dimensional (diffraction-free) approximation in the undepleted-pump
regime. To this end, let us use Eqs. (6.286) for backward interaction, with an
obvious replacement of frequency subscripts. From (6.287) at z = 0, we find the
ratio of the incident and reflected amplitudes in modes k and k,
|R| |E4 (k)/E3 (k)| = tan l, (6.332)
where, according to (6.278), with the replacement (2) (3) E2 and E1 = E2 ,
2 (3) 323
= | E1 E2 | = 2 |(3) |I1 . (6.333)
cn cn 0
Let (3) = 1012 cm3 /erg, l = 1 cm, 0 = 1 , n = 1.5, then l = 1 at
I1 1 GW/cm2 .
u In the case of a bulk hologram (kl 1), both a static or a dynamic one, the phase matching condition
a2 P0 /I = l/4, (6.334)
i.e., in the case where the pump has considerable diffraction divergence in the
layer. (In this case, the validity of the one-dimensional approximation is also
violated.)
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
Chapter 7
Statistical Optics
237
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
the one of the sunlight by many orders of magnitude. In connection with this, let
us mention one of the paradoxes in the history of physics: quantum optics started
its rapid development only during the laser age, when light fields with hNi 1
appeared (although the general principles of quantum electrodynamics were de-
veloped much earlier).
In quantum optics and electrodynamics, an ensemble of fields is given by a
wavefunction or a density operator . The angular brackets in the definition
of the correlation function denote now quantum averaging over or ; then, Ei
are operators acting on according to certain rules. It is important that, in the
general case, the fields at the neighbouring points of space-time do not commute,
which leads to the quantum fluctuations of the field, to the moments hEi E j i being
nonzero even for the vacuum, to the spontaneous emission of excited atoms, and
to the noise of quantum amplifiers and generators.
The statistical theory of laser radiation studies the most important parameters
of lasers, such as the maximal possible monochromaticity of oscillators and the
sensitivity of amplifiers. Similarly to a consistent theory of thermal light emission
by heated bodies, this theory should be based on quantum electrodynamics and
non-equilibrium thermodynamics [Zubarev (1971); Klimontovich (1982)]. A spe-
cial difficulty in the quantum-statistical analysis of a laser is that the nonlinearity
plays a principle role, as it determines, due to the saturation effect, the stationary
amplitude of oscillations (the limiting cycle of a classical auto-oscillator).
The most important achievement in the nonlinear quantum theory of
lasers [Loudon (2000); Klimontovich (1980); Lax (1968); Arecchi (1974);
Klauder (1968)] is the conclusion that in the cavity of a laser operating well above
threshold, the field is in a coherent state, a concept introduced into quantum optics
by Glauber [Glauber (1965)]. There is a close analogy between a field in a coher-
ent state with a large amplitude and a classical harmonic oscillation and, since the
saturation effect manifests itself only in the case of high amplitudes, the nonlinear
regime of a laser is rather accurately described by the semi-classical theory. Non-
linear theories predict all statistical characteristics of laser radiation: the intensity,
the spectral width, the coherence radius, and the higher moments.
Even more crude, but still useful is the approximation considered in the linear
theory of noise in quantum amplifiers and oscillators (Sec. 7.1), which ignores
the saturation effect and is therefore valid only below the oscillation threshold.
The most important results of this theory are the Kirchhoff law, giving the noise
intensity of an amplifier in terms of its amplification coefficient, and the Townes
formula, relating the natural bandwidth of an oscillator to its power. According
to the linear theory, the radiation has Gaussian statistics, and therefore these two
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
parameters, the intensity and the spectral width, fully describe the statistics of the
field.
This chapter is devoted to the foundations of quantum optics. The consid-
eration starts with the linear theory of noise in quantum amplifiers, which does
not require the quantization of the field (Sec. 7.1). In Sec. 7.2, basic notions of
classical statistical optics are considered. The next section (Sec. 7.3) describes
the initial stage of the field quantization, i.e., writing the Maxwell equations in
the canonical form, which allows the quantization to be performed in an easy way
(Sec. 7.4). Section 7.5 considers the basic classes of quantum states of the field,
and Sec. 7.6 describes the statistics of photons and photoelectrons for these states.
Finally, in Sec. 7.7 we once again return to the question about the probability
of a transition due to a noise field (Sec. 2.4), but this time we consider it in the
framework of the quantum theory.
The intensity of the noise radiation of a laser or a maser operating in a linear sta-
tionary regime can be calculated without using quantum electrodynamics, from
the general considerations based on certain rules of non-equilibrium thermody-
namics like the Nyquist formula (or the fluctuation-dissipation theorem, FDT)
and the Kirchhoff law for the thermal radiation.
Fig. 7.1 Waveguide Kirchhoff law. The noise of the waveguide is determined by the competition
between the absorption and the spontaneous emission j = 0 N, where N = N2 /(N1 N2 ) is the
Planck function and Ni are the level populations. As a result, the noise can be expressed in terms of
the transfer coefficient G = ez and the temperature T .
= 2/k but much less than the distance at which P f (z) changes noticeably; E f
is the spectral energy density of the macroscopic field within L; N f E f /h f is
the number of photons within L per 1 Hz, which is equal to N times the spectral
density of the longitudinal modes g f = Ld1 /d = L/u. The inverse value, u/L,
is equal to the interval between the eigenfrequencies of the neighboring modes
(for the notion of a mode in more detail, see Sec. 7.3).
Let us start from the linear kinetic equation for N(, z) of the form
dN/dz = N + j. (7.2)
where N0 is the signal at the input of the amplifier and G is the transfer coefficient
of the waveguide. Here, the second term, which is independent of the input signal,
is the noise of the waveguide.
a Suchequations are called transfer equations (for photons, neutrons, etc.), they imply that a space
coordinate z, defined up to an accuracy of L/2, can be attributed to a photon. In the general case, a
function N(k, r, t) is used, which has the meaning of local spectral brightness in ~c/3 units.
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
Let us express the ratio j/ in terms of the temperature of the matter. Let
z 1, then, according to (7.3), N = j. On the other hand, in a sufficiently long
waveguide with damping, equilibrium radiation should be formed, with the mean
number of photons per mode given by the Planck function N. Hence,
j/ = N [exp(~/T ) 1]1
= (1/2)[coth(~/2T ) 1] = N2(0) /(N1(0) N2(0) ). (7.4)
This conclusion is rigorous only for a completely equilibrium system where the
populations Ni and the photon number N obey the Boltzmann and Planck distri-
butions, respectively.
However, it is reasonable to assume that (7.4) is approximately valid even
in the absence of equilibrium radiation, for instance, at z . 1 and N0 = 0.
Then, the parameter T relates only to the matter, whose temperature is maintained
constant and uniform, despite the radiation cooling (quasi-equilibrium). For
this, it is necessary that the degrees of freedom that emit and absorb radiation at
frequency interact with the thermostat much stronger than with the field.b In this
approximation, Eq. (7.3) at N0 = 0 describes the spatial spreading of equilibrium
Plancks field (with a single propagation direction) in a waveguide of a layer with
the length z, temperature T and absorption coefficient (),
N = AN, A 1 G. (7.5)
Here, A is the absorptivity of the waveguide or the layer (in the general case, with
an account for reflections at the input and output). It can be measured, according
to (7.3), using an external signal: A = 1 dN/dN0 .
Equation (7.5) is the Kirchhoff law relating the thermal radiation of a heated
body to its thermodynamic parameter T and kinetic parameter A. It can be easily
derived in the form (7.5) without using the kinetic equation [Landau (1964)]. Note
that this law is used in the construction of photometry reference sources for light
intensity, although, in principle, it is only applicable within the approximations of
strong coupling with the thermostat, linear optics, and geometric optics. For its
generalizations, see Ref. [Klyshko (1980)].
Fig. 7.2 The Planck function determining the mean number of photons with the frequency > 0
in a single mode of an equilibrium field, as well as the number of excited atoms N2 divided by the
population difference N1 N2 . Here, T is the effective temperature; in the case of a field, T > 0.
Fig. 7.3 The spectral density of the thermal radiation emitted by a single-mode attenuator (T > 0) or
amplifier (T < 0) as a function of its length z and temperature T , according to the Kirchhoff law. 0 is
the absorption coefficient at T = +0.
which in the two limiting cases of small absorption and large amplification yields
T n = zT e f , |z| 1, (7.10)
T n = T e f , z 1. (7.11)
Thus, the spin temperature determines not only the ratio of the populations but
also the limiting noise temperature of a quantum amplifier.
In paramagnetic amplifiers, |T e f | is on the order of the lattice temperature of
the active crystal, which is cooled down to helium temperatures for the sake of
lowering T n , reducing spin-lattice relaxation, and increasing the equilibrium pop-
ulation difference. As a result, the noise temperature of such amplifiers is as low
as several Kelvins. At the same time, in ordinary electronic amplifiers, T n 103
K, due to the high temperature of the cathode; in parametric microwave amplifiers
based on semiconductor diodes, T n 100 K.
In practice, T n in paramagnetic amplifiers is determined by the thermal emis-
sion not from the active medium, according to (7.11), but from the elements of the
input channel (the aerial, waveguides, and ferrite devices), according to (7.10).
Indeed, an input waveguide with the walls at room temperature and the losses of
only 1% makes a noticeable contribution, T n 3 K.
It is not difficult to generalize the Kirchhoff law (7.5) to the case where the
waveguide has several sources of homogeneous linear loss, i , and noise, ji , with
different effective temperatures T i (i = 1, 2, . . . ). Assuming again ji = i Ni , we
obtain the kinetic equation
X
dN/dz = i (N N), (7.12)
i
P P
which only differs from (7.2) by the replacements i and N i Ni .
As a result, (7.5) takes the form
X
N = (1 ez ) (i /)Ni , (7.13)
i
P
with i . Thus, the contribution of each element to the total thermal emis-
sion scales as its contribution i / to the total absorption coefficient.
Consider now the shape of the thermal emission spectrum in the vicinity of
a single narrow resonance at various optical densities z (Fig. 2.4). When ||z is
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
small, the emission spectrum repeats the absorption spectrum, N() = ()zN
(the dependence N() is too slow to be taken into account here.) At large positive
z, the line gets flattened and broadened, tending eventually to the equilibrium
spectrum N(), and at large negative z the linewidth
is reduced (similarly to the
amplification band, see (2.68)) and tends to zero as ln G.
g = (a1 1 2
x )(by ) = A/ , (7.14)
where = 2k, we consider only a single polarization type and assume that the
observation direction is orthogonal to A. According to (7.14), the angular interval
between the neighboring transverse modes is on the order of the diffraction angle
/a.
Multiplying (7.5) by the photon energy ~, the detector band f = /2,
and the number of transverse modes g , we find the power of multimode thermal
radiation with a single polarization type,
where we assume that the transfer coefficient G = ez and the effective tempera-
ture T are the same for the modes having close directions.
The ratio P/A is called the spectral brightness; this is the main
energy characteristic of incoherent multi-mode radiation. In the general case,
I I(k, r) depends on the frequency, direction, and the observation point; how-
ever, in the absence of scattering, emission, and absorption between the points
r and r + a, the spectral brightness at these points is the same provided that the
argument k is parallel to a,
In photometry, there are special terms for the integrals of I w.r.t. various vari-
ables: the brightness, the radiance, and the luminous intensity,c
Z Z
B( k, r) dk I(k, r), E(r) dB( k, r) cos k ,
0
Z
P(z) dxdyE(r). (7.17)
Here, k k/k, k is the angle between k and the z axis. The spectral and volume
energy density is
Z
1
(, r) = dI(k, r), (7.18)
u() 4
where u is the group velocity, k . In the case of the thermal radiation from
a heated body, I(k, r) cos k is called the emittance of a body (at point r of its
surface).
According to (7.15), the Kirchhoff law for the multimode emission from a
matter layer with the thickness z and the effective temperature T has the form
(compare with (6.225), (6.307))
vac
I (k) = I N(k), (7.19)
where
vac
I ~/22 , N(k) = N(1 e(k)z ), (7.20)
vac
where we take into account only a single polarization type. The value I corre-
sponds to the spectral brightness of radiation having a single photon in each mode
vac
(i.e., I is the doubled brightness of zero-point vacuum fluctuations), it is a
natural unit for the measurement of I : the mean number of photons per mode
vac
N is equal to the spectral brightness in I units.
in light-engineering ones. The latter take into account the spectral characteristic of the human eye and
are based on the candela unit.
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
vac vac
I = I Nz = I N2 l/ cos k , (7.22)
where the optical density of the layer z for the mode k was assumed to be (N1
N2 )l/ cos k . Let us write the transition cross-section in terms of the transition
dipole element d0 according to (2.52) and find, using (7.17), the total power of
spontaneous emission. Let n = 1, then
Z Z Z Z
P= d dxdy cos k dI = (42/~c)I
vac
N2 V0 d(d0 e k )2 ,
4 A 0 4
(7.23)
where V = Al is the matter volume. The integral here is so that P/VN2 8d02 /3,
coincides with the expression found above for the power of the spontaneous emis-
sion per one molecule, 440 d02 /3c3 (see (5.32)).
3. Now, let z 1, then N = GN, and
P = N2 (N2 N1 )1 I
vac
G0 Ae f e f e f , (7.24)
vac
P = I G0 A2 /l2 . (7.25)
Fig. 7.4 Cavity quantum amplifier: (a) in the optical range; (b) in the microwave range; (c) an
equivalent circuit.
where
2
1
L = L 20 1 2L(0 ).
Rc R0 + Ra , X (7.27)
C
The relation between Rl and the other parameters depends on the coupling between
the cavity and the waveguide.
Let us pass from the equivalent parameters Ri to dimensionless variables that
have a direct physical meaning: the Q-factors Qi 0 L/Ri or the damping con-
stants di 1/Qi . The value Qa is called the (magnetic) quality factor of the active
medium. After dividing both the numerator and the denominator of (7.26) by 0 L,
we find
dl dc 2ix
K() = , (7.28)
dl + dc + 2ix
where x (0 )/0 . The gain (or attenuation) of the reflected wave power is
(dl dc )2 + 4x2
G() = |K()|2 = . (7.29)
(dl + dc )2 + 4x2
It is important that the coupling between the waveguide and the cavity can
be varied, for instance, by changing the orientation of the feedback loop or the
transmission of the mirror. Then, the normalized losses dl = Rl /0 L due to the
external circuit will also vary from zero (no feedback, a totally reflecting mirror)
to infinity (maximal feedback, no mirror). In the case of a resonance ( = 0 ) and
positive losses in the cavity (dc > 0), variation of the feedback strength from 1
to +1 leads to the variation of the amplitude reflection coefficient for the current
K0 . The dependence passes through zero at the point of load matching, dl = dc
(Fig. 7.5).
If dc = d0 + da < 0, then by decreasing the feedback strengths one makes K0
and G0 vary from 1 to . Thus, a cavity quantum amplifier with a sufficiently
high-quality cavity (Q0 > Qa ) enables a weak signal to be amplified as much as
possible due to the regeneration.
However, an increase in K0 is accompanied by a decrease in another parameter,
important for many applications, namely, the amplification bandwidth . As we
will show below, at K0 1 the product of the amplitude gain and the bandwidth
does not depend on the feedback,
K0 = 20 /|Qa |. (7.30)
Fig. 7.5 The gain coefficient K0 of a cavity amplifier as a function of the cavity quality factor 1/dc .
By reducing the feedback dl , at a fixed dc < 0 one can make K0 as high as desired. Here, dc is the
cavity damping constant with an account for the negative contribution from the active medium; dl is
the damping of the load, i.e., the relative losses due to the external circuit.
According to (7.29),
dc dl
G1= , (7.31)
d2 /4 + x2
where d dl + dc 1/Q is the total damping constant and Q is the quality factor
of the loaded cavity. It follows that at G 1 the frequency characteristic of the
amplifier has a Lorentzian shape and the amplification bandwidth is determined
by the quality factor of the loaded cavity with an account for all losses,
1 1 1
=d= + + . (7.32)
0 Ql Q0 Qa
For achieving G 1, a nearly full compensation of all losses is necessary,
da d0 + dl , in which case K0 2|d p |/d, so that the product of the gain and
the bandwidth is a constant, K0 = 20 |dc |. Usually, the quality of the cavity,
Q0 & 104 , is much greater than the quality of the active medium, |Qa | . 103 ,
hence, in practice, Qc = Qa , which yields (7.30).
Let us write Qa in terms of the parameters of the matter by using the general
definition of the quality factor as the ratio of the energy stored in an oscillation
contour,
Z Z
0 2
E= 3 2
d r( E + H )/8 = d3 r 0 u2 (r)/8, (7.33)
Vc Vc
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
Here, we have used the equalities Eel = Emag , q2 (t) = 1/2, and Eq. (4.15) for the
losses per unit volume. Hence,
Q1 00 0
a = / = /k, (7.35)
where V/Vc 0 1 is the factor showing the proportion of the cavity filled
by the active medium and determined by the ratio of the integrals in (7.33) and
(7.34); = 1 + 4, is the linear amplification factor for plane waves with the
wavevector k; in the case of a magnetic transition, 00 should be understood as the
magnetic permeability. At electron paramagnetic resonance, 5 102 cm1
(Sec. 2.3), which yields, with = 3 cm and 0 = = 1, Qa = 40. Let K0 = 100,
then it follows from (7.30) that f = 5 MHz. This bandwidth is not sufficient for
many applications (communications, radar, radio astronomy); for this reason, one
uses several coupled cavities and slowing-down systems.
7.1.6 The Kirchhoff law for a cavity amplifier. The Townes equation
In order to take into account the noise of the active medium, let us add to the
equivalent circuit (Fig. 7.4) a source of current with the spectral density given by
the Nyquist equation (see Ref. [Akhmanov (1981)]),
This leads to the Townes equation giving the spectral width of the emission of a
cavity amplifier in terms of the total radiation power,
~30 N2
= . (7.39)
PQ2a (N2 N1 )
Certainly, this calculation of is valid only for the linear regime of the
amplifier below its oscillation threshold (Ql < Qa ). However, one can expect
that even above the threshold, (7.39) provides a correct order of magnitude for
the bandwidth of a quantum oscillator (in the limiting case where the technical
noise, like mirror vibration, is absent). More accurate calculations, taking into ac-
count the nonlinearity due to the saturation [Loudon (2000); Klimontovich (1980);
Lax (1968); Arecchi (1974); Klauder (1968)], which restricts the noise amplitude,
lead to a factor 1/2 appearing in (7.39) for the regime high above the threshold.
This can be explained by the suppression of amplitude fluctuations, which give
the same contribution to the spectral width as the phase fluctuations. One should
keep in mind that the random slow phase variation (phase drift) of a harmonic sig-
nal leads to the fluctuations of its frequency and to a finite width of its spectrum.
The difference between the oscillation shapes of a quantum amplifier (oscillator)
below threshold and above it is explained in what follows (Fig. 7.7).
It is only the absence of amplitude fluctuations that qualitatively distinguishes
the radiation of a single-mode laser from a narrow-band noise. In the case of a
multi-mode laser with independent modes, this difference vanishes as well.
Above, we have found, in the form of the Kirchhoff law, the spectral brightness
of the radiation I(k) as a function of the frequency and the observation direction
in the far-field zone. Although the intensity is an important characteristic of the
radiation, clearly, it does not provide the complete statistical information about
the electromagnetic field.
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
By definition, the spectrum of the analytical signal E (+) (t) contains only pos-
itive frequencies ( > 0) while the spectrum of the complex conjugate (negative-
frequency) field E () = E (+) , only negative frequencies. From the definition, the
relation follows (Fig. 7.6)
(a) (b)
Fig. 7.6 The analytical signal E (+) (t) and the real field E(t) in the case of a monochromatic spectrum
(a) and in the general case (b).
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
In principle, one can measure the parameters (7.47)(7.49) by observing the mean
value and the variance of the readings of a broadband single-quantum detector (see
below). The readings of an n-quantum detector directly yield the n-th moment,
hI n i G(n) .
We will assume the field to be stationary, so that the mean values in (7.47)
(7.49) do not depend on the time, and ergodic. The angular brackets may then
denote the averaging either over time or over an ensemble (with some distribution
function P(I)).
After passing to the quantum theory (Sec. 7.4), E (+) is replaced by the operator
E , which is written in terms of the photon annihilation operators ak , while E ()
(+)
are only valid for classical fields for which the degeneracy factor hNi (see below)
is much greater than the unity. The general case will be considered in Sec. 7.6.
So far, let us only note that in the quantum theory, the continuous distribution
(7.50) is replaced by the discrete exponential one, while (7.52) is replaced by
the Poissonian distribution (Fig. 7.7(c)).
(Here we use the notation that is also valid in the quantum description.) Each
subscript denotes a set of arguments, for instance, E1 E1 (r1 , t1 ). For simplicity,
we only consider CFs with even numbers of fields, since in optics the moments
of the form hE1 E2 E3 i are usually equal to zerof and, in addition, are difficult to
measure.
f Exceptions are fields at the output of nonlinear media excited by external radiation [Perina (1972);
The CFs of a stationary field are invariant to the choice of the initial time
moment, i.e., to the replacement of time arguments
t1 , . . . , t2n t1 + t, . . . , t2n + t, (7.54)
where t is arbitrary. It is convenient to choose t t1 .
As a result, G(n) depends on 2n1 time arguments, while its Fourier transform,
the spectral CF G(n) , depends on 2n1 frequencies. For instance, the first-order CF
has the form G(r1 , r2 , ); it is called the mutual coherence function of the field at
points r1 , r2 , while its Fourier transform, G(r1 , r2 , ), is called the mutual spectral
density. At r1 = r2 , the first-order CF, G(r, t), is called the autocorrelation func-
tion, while G(r, ) is the spectral density. Traditional polarization characteristics
of directed radiation, such as degree of polarization or the Stokes parameters, are
also determined by the first-order CF with an account for tensor indices [Perina
(1972)]. With all arguments coinciding, CFs become intensity moments (we omit
the tensor indices),
G(n) (+) 2n n
1...1 = h|E (r1 , t1 )| i = hI (r1 )i. (7.55)
Among the various statistical models of the field, a special role belongs to
the Gaussian model, in which all CFs are expressed in terms of the first-order
CF [Glauber (1965)],
0
X
G(n)
1...n10 ...n0 = G(1) (1)
110 G nn0 , (7.56)
P0
where denotes the sum of all n! permutations of the primed indices. For in-
stance,
G(2) (1) (1) (1) (1)
1234 = G 13 G 24 + G 14 G 23 . (7.57)
All information about a Gaussian (chaotic, thermal) field is contained in the
first-order correlation function G (r1 , r2 , ) or the mutual spectral density
G (r1 , r2 , ).
For a rough characterization of a quasi-monochromatic directed non-polarized
Gaussian field, it is sufficient to fix at each point the intensity G (r) and the co-
herence parameters coh 1/, coh for both polarization types = 1, 2. The
meaning of these parameters will be explained in the next sections.
Fig. 7.8 The Michelson interferometer and the measurement of the longitudinal coherence length.
Fig. 7.9 Typical dependence of the mean intensity at the output of a Michelson interferometer, hI 0 i,
on the position of the moving mirror: g() is the normalized autocorrelation function of the field at the
input.
where E(t) is the field of the plane wave at the input of the interferometer.
According to (7.46), the intensity at the output is
I 0 (t) = [I(t) + I(t0 ) + 2ReE () (t)E (+) (t0 )]/4.
Hence, in the case of stationary radiation,
hI 0 i = [hIi + ReG()]/2 = hIi[1 + Reg()]/2, (7.58)
where
G() hE () (0)E (+) ()i = hE0 (0)E0 ()ieit /4 (7.59)
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
is the auto-correlation function of the field at the output, which is related, ac-
cording to the Wiener-Khinchine theorem, to the spectral density G() via the
Fourier-transformation, g() G()/hIi is the normalized CF, and hIi = G(0).
Note that in optics, CFs are usually normalized to the mean values and not to the
standard deviations I.
According to (7.58), the dependence of the intensity at the output of a Michel-
son interferometer on the delay time determines the real part of the first-order CF.
One can show that the function G(), similarly to E (+) (), is analytic in the lower
semi-plane. Therefore, its real and imaginary parts are related via the Hilbert
transformation, so that, in principle, from the interference pattern one can restore
the radiation spectrum. This method forms the basis for the Fourier spectroscopy.
Besides, the interference of the field under study with a reference coherent wave
is the source of information in holographic interferometry.
A typical interference pattern in the case of a single spectral line is shown in
Fig. 7.9. The relative amplitude of the oscillations in the output intensity is called
the interference visibility or the degree of coherence. According to (7.58), (7.59),
the visibility coincides with |g()|.
If we define the coherence time coh by the condition |g(coh )| = 1/2, then
it follows from the properties of the Fourier transformation that coh 2/.
Hence, the longitudinal coherence length ccoh is on the order of the inverse spec-
tral width in inverse centimeters,
For instance, for lines with natural broadening in the visible range,
2 106 s1 , lcoh 105 cm. This can be viewed as the length of the wave train
emitted by an atom during spontaneous emission. In the case of a single-mode
laser, estimation with the Townes equation (7.40) yields a coherence length on the
order of a light second.
Fig. 7.10 Youngs interferometer and the measurement of the transverse coherence radius.
while the pattern on the symmetry axis is determined by G12 (0). The coherence
radius coh is determined by the distance between the pinholes at which the visi-
bility is 50% reduced.
An ideal laser with a single transverse mode emits plane or spherical waves,
for which coh = . In the case of a multi-mode laser, or a chaotic (thermal) light
source, coh in the far-field zone (z a2 /) is determined by the transverse size a
of the source and the distance z from the source (Fig. 7.10),
where d is the diffraction angle and a is the angular size of the source. Due
to diffraction, the transverse coherence radius increases in the course of light
propagation. The shape of the phase front in this case increases and tends to
the spherical one. The van Cittert-Zernicke theorem [Akhmanov (1981); Klauder
(1968); Perina (1972)], which describes this effect quantitatively, states that the
dependence of the first-order CF on (r1 r2 ) is given by the Fourier transform
of the brightness distribution over the source cross-section.
Relation (7.62) enabled Michelson to measure, with the help of his stellar
interferometer, the angular diameters of several stars, for which a & 107 rad and
coh . 10 m. At smaller a , the coherence radius is determined by the atmospheric
distortions of the wave front, which hinder the operation of the interferometer. The
Hanbury BrownTwiss interferometer (see below) is free of this drawback.
Table 7.1 Basic distribution types for the photon number and the energy.
the probability theory, the energy distribution in a volume V that is much greater
than Vcoh will be Gaussian, with the variance inversely proportional to the number
of cells V/Vcoh . In the quantum case, where . 1, the Gaussian distribution for a
multi-mode field is replaced by the Poissonian one (Sec. 7.6)
Table 7.1 shows some types of distributions for the energy or the number of
photons. The states that are called K-photon ones have no classical analogues
and manifest photon anti-bunching and bunching (Sec 7.6).
Assume first that the intensity I is constant, i.e., it does not vary from sample
to sample. It is important that even in this case, the number of photoelectrons in
a single sample is, according to the quantum mechanics, random, unpredictable.
The very process of energy measurement necessarily introduces additional Poisso-
nian stochasticity into the detector readings. (We do not consider the non-realistic
case of a detector with a 100% quantum efficiency and a pure energy state of the
field with a fixed number of photons.) Using the semi-classical (Sec. 2.1) or purely
quantum (Sec. 7.7) perturbation theory, we can only find the ionization probabil-
ity W1 t I for a single atom of the photocathode during a small time interval
t.
In the case of a sufficiently small t, the probability for any on the N indepen-
dent atoms of the photocathode to be ionized is N times as large, and also scales
as I,
As a result, we obtain the semi-classical Mandel formula for the photocount dis-
tribution, i.e., for the probability of discovering m pulses at the PMT output,
Fig. 7.11 Relation between the statistics of the field and the photocount statistics from the semi-
classical viewpoint: E is the field, i is the PMT output current; (a) in the case of a field with a constant
amplitude E0 , the number of photoelectrons mi emerging during a certain time interval T has a Poisso-
nian distribution; (b) the amplitude fluctuations of the field at T coh cause additional fluctuations in
the number of the photoelectrons (the bunching effect); (c) a multi-mode (T coh ) detector averages
over the field fluctuations, and the bunching effect is not observed.
Using the generating functions method (Sec. 7.6), one can easily find the general
rule,
hm(m 1) . . . (m k + 1)i = k hI k i. (7.77)
The linear combination of moments in the left-hand side is called the kth-order
factorial moment. From this, in particular, it follows that hmi = hIi and
hm2 i = hmi + 2 hI 2 i, (7.78)
hI 2 i hI 2 i hIi2 , (7.79)
and similarly for hm2 i. Thus, fluctuations of the photoelectron number contain,
in addition to the usual Poissonian (shot-noise) part, a contribution from the fluc-
tuations of the light intensity. Only in the case of a single-mode stabilized laser
(I = 0) this contribution is absent. In other cases, fluctuations of the photo-
electron number, according to (7.78), should at first sight exceed the shot noise,
since from the definition (7.79) and from the condition P(I) > 0 it follows that
hI 2 i > 0.
The existence of these excess fluctuations has been called the photocount
bunching effect, since in a Poissonian sequence of pulses, by definition, the ap-
pearance of a single pulse has no effect on the appearance of the next one, and the
inequality hm2 i > hmi means that the pulses have a tendency to bunch. A similar
effect for the photon numbers, hN 2 i > hNi, is called photon bunching or photon
correlation. A close effect has been discovered by Hanbury Brown and Twiss in
1956 in the chaotic light of a mercury lamp.
In chaotic light, it follows from (7.51) and (7.78) that
hm2 iT = hmi(1 + hmi), (7.80)
i.e., the excess part of the variance is hmi times as large as the Poissonian one, so
that the photon bunching effect is more pronounced in classical fields. From the
classical viewpoint, strong fluctuations of the amplitude |E0 | of a wave formed by
many independent sources with random phases are quite obvious. More surpris-
ing for the classical theory is the anti-bunching of photons and, correspondingly,
photocounts, so that m2 < hmi, in contradiction with (7.78) and the initial Man-
del formula (7.71) if P(I 1 0) is assumed. (As we have already mentioned, the
last condition is violated in quantum theory.)
G(2) (1) 2
12 () = hI1 ihI2 i[1 + |g12 ()| ], (7.82)
where g(1)12 () is the normalized first-order CF. Thus, in a chaotic field, by mea-
suring the intensity correlation one also gains information about the amplitude
correlations. This relation between G(2) and G(1) forms the basis for the opti-
cal mixing spectroscopy [Cummins (1974)], also called spectroscopy of intensity
fluctuations or the method of photon correlations.
In the general case, G(2) is not related to G(1) , and the second-order coherence
parameters may differ from (1) (1) (2)
coh , coh . For instance, in two-photon light, coh
(1) i
coh .
The time dependence of G(2) () at = 0 is measured by means of a single
detector with a delay line and an electronic correlation circuit.j One can also use
the spectral analysis of photocurrent fluctuations. The first experiment of this type
has been performed by Forrester, Gudmundsen and Johnson as early as in 1955,
before the advent of lasers.k Modern technique enables one to achieve the spectral
resolution much less than 1 Hz.
In order to study the spatial second-order coherence G(2) (), one has to use
two detectors with a variable distance between them.l According to (7.82), when
is varied from to 0, G(2) is increased, in an ideal case, by a factor of two
(Fig. 7.12). This effect has been discovered by Hanbury Brown and Twiss in 1956
and used for the measurement of the angular diameters of stars [Hanbury Brown
i Editors note: This is the case in one of the possible definitions of second-order coherence parame-
Fig. 7.12 Correlation and anti-correlation of light intensities: i1 , i2 are the photocurrents of the de-
tectors; is the displacement of one of the detectors from the symmetric position; 1, thermal radiation;
2, laser radiation; 3, two-photon radiation.
(1971)] whose intensities are correlated within distances on the order of hundreds
of meters.
The Hanbury BrownTwiss experiment on the measurement of G(2) () for the
light from a mercury lamp is shown in Fig. 7.13. Splitting of the beam with the
help of a semi-transparent mirror allows one to measure the correlation at points
that are arbitrarily close to each other. Let the PMTs operate in the photon count-
ing regime, then the correlation between the photocounts in the two channels,
hm1 m2 i, is 1 2 hI1 I2 i. Hence, with the help of (7.82), we find
hm1 m2 i = hm1 ihm2 i(1 + |g(1) 2
12 (0)| ). (7.83)
This result is only valid in the case of single-mode detectors, where the time
constant of the detector T is much less than coh and the detector aperture A is
much less than Acoh . If, for instance, T coh , then a small factor on the order of
coh /T appears by the second term of (7.83), which reduces the observed effect.
Fig. 7.13 Hanbury BrownTwiss experiment on the observation of light intensity correlation.
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
The intensity correlation effect is closely connected with the intensity fluctua-
tions of light at the input of Hanbury BrownTwiss interferometer. Indeed, let I1
and I2 be random intensities in the arms of the interferometer and I1 and I2 be
their fluctuations (In In hIn i). The output signal hi1 i2 i scales as
The second term here characterizes the mutual correlation of intensities. From the
condition I1 + I2 = I, we find the relation
Assume first that the incident light has a constant intensity (radiation of a
single-mode laser), then In = 0 and, according to (7.86), the correlation is equal
to zero. Then, hI1 I2 i = hI1 ihI2 i. Now, suppose that usual light from a thermal or
luminescent source be incident on the interferometer; then, according to (7.51),
hI 2 i = hIi2 . Suppose that similar relations hold for the secondary beams as well,
hIn2 i = hIn i2 . Hence, with the help of (7.86) we find (Fig. 7.12) that
In fact, the result is the same: in thermal light, there is photon correlation caused
by photon bunching or, in other words, by the presence of excess noise in addition
to shot noise.
Further, consider field with a fixed number N of photons. These N photons
will be randomly split by the semi-transparent mirror between the two channels
with the probabilities p and q = 1 p. This picture corresponds to the well-
known Bernoullis probability model [Rytov (1976)], which gives the binomial
distribution for the probability of N1 photons going to channel 1,
g(2)
12 = 1 1/N. (7.92)
where h: N 2 :i hN(N 1)i is the normalized (factorial) moment and the an-
gular brackets denote averaging over the distribution P(N). Thus, the relative
correlation of photon numbers at two field points, g(2)
12 1, is determined by the
normalized factorial moment of the field g(2) . The same result follows from the
rigorous quantum-theory approach (see (7.331)).
corr
t corr
tcorr
rcorr
(a) (b)
Fig. 7.14 Measurement of the temporal (a) and spatial (b) widths of the second-order correlation
function.
(2)
t
coh
corr
(2)
t corr
t r r
corr
coh
corr
(a) (b)
Fig. 7.15 Intensity interference observed using (a) Michelsons interferometer and (b) Youngs in-
terferometer. Coincidences of photocounts are observed for two detectors registering photon pairs
emitted by sources separated in time (a) or space (b).
The same reasoning will be valid for spatial coherence parameters, (1,2)
coh .
In this section, we will show that Maxwells equations for the transverse part of the
field, i.e., for the radiation field, can be reduced to a system of independent equa-
tions for harmonic oscillators. These equations can be easily represented in the
form of classical Hamiltons equations. This enables one to use the quantization
algorithm which defines the commutator of two operators in terms of Poissons
brackets for the corresponding classical values.
Fig. 7.16 To the definition of the quantization length L: E(z) is the real field and E(z) is the fictitious
field that is periodic in space.
o This condition is assumed to be well satisfied, so that the field at the boundaries stays equal to zero
Fig. 7.17 The sinc(x) function in the cases of a discrete (points) and continuous (dashed line)
arguments.
then
Z L/2 X
dz exp(ikn z)E(z) = Em Lsinc[(m n)] = En L. (7.95)
L/2 m
The last inequality follows from the fact that the function of an integer value,
Z L/2
dz exp[i(km kn )z] = Lsinc[(m n)] = Lmn , (7.96)
L/2
differs from zero only at a single point m = n, where it is equal to unity (Fig. 7.17).
Since E(z) is real, Em = Em .
At the final stages of the calculation, the quantization length L can be usually
assumed to be infinite; in this case, the Fourier series (7.94) becomes an integral,
Z
E(z) = (L/2) dkE(k)eikz , (7.97)
where
Z
1
E(k) = L dzE(z)eikz . (7.98)
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
In the derivation of (7.98), the following representation of the delta function was
used:
Z
dzeikz = lim Lsinc(kL/2) = 2(k). (7.99)
L
The factor L/2 is added to (7.97) for making the discrete Fourier components
coincide at L with the continuous ones, Em E(km ). Comparing (7.94) and
(7.97), we find the rule for passing from summation to integration,
X Z
(L/2) dk . . . (7.100)
m
The value L/2 is called the density of modes (in the one-dimensional case
with a single polarization state). The inverse value, 2/L, is equal to the distance
between the neighboring modes m, m + 1 on the k axis.
By repeating this procedure for other components of the field Ey , Ez and for the
dependencies on x, y, we obtain a three-dimensional Fourier series for the field,
X X
E(r, t) = Elmn (t)e2i(lx+my+nz)/L E k (t)eikr , (7.101)
lmn k
Z
E k (t) = L3 d3 rE(r, t)eikr = Ek (t), (7.102)
L3
and similar relations are valid for the magnetic field. By substituting (7.106) into
Maxwells equations (4.9)(4.12) and assuming = 1, we obtain
Ek = 4 jk , Hk = 0, (7.108)
E k ick H k = 4 j k , (7.110)
H k + ick E k = 0, (7.111)
E k + 2k E k = 4 j k . (7.112)
Here, k ck and
Z
3
j k (t) L d3 r k j(r, t)eikr = jk (t), (7.113)
L3
with k being the projection tensor (see (4.20)). Thus, Maxwells equations for
the transverse field in the k, t representation are reduced to a system on inhomo-
geneous equations for independent harmonic oscillators. Note that the harmonics
E k and Ek are always excited simultaneously since E k = Ek .
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
E = A/c. (7.117)
Hence, we find the relations between the spatial harmonics of a real field and its
potential,
A k = cE k , A k = ik H k /k2 , H k = ik A k . (7.118)
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
Instead of the magnetic field, one can use the vector potential. According to
(7.118) and (7.119),
Ak Ak ek = iHk /k = (Hk00 iHk0 )/k,
Ak = (Hk00 + iHk0 )/k,
where k denotes simultaneously the absolute value of the k vector and the mode
index.
Let us form linear combinations,
qk (L3 /42k )1/2 (Ek00 + Hk00 ),
(7.120)
pk (L3 /4)1/2(Ek0 + Hk0 ).
(the choice of the coefficients will be explained below, from (7.134).) With the
help of (7.119), we see that the variables qk , pk for the backward mode are inde-
pendent of qk , pk ,
Note that in the case of a standing plane wave, ak = ak , and therefore the variables
qk , pk scale as the magnetic and electric fields, respectively,
then every term in the sum (7.124) describes a plane wave propagating in the +k
direction (in contrast to the sum (7.101)).
In a free-space field, this approximation, according to (7.128), is at jk = 0
valid rigorously,
By comparing (7.114) and (7.122), we find the relation Ek0 = 2ick ak (0).
Hence, the positive-frequency part of the field is determined by ak (t) functions
while the negative-frequency part, by ak (t) functions,
X
E(+) (r, t) = i ck ak (t)eikr ,
k
X (7.131)
()
E (r, t) = i ck ak (t)eikr .
k
In the quantum theory, these functions become photon creation and annihilation
operators, ak ak , ak ak .
7.3.3 Hamiltonian of the field and the matter
From Maxwells equations, it follows (see, for instance, (4.27)) that the instanta-
neous energy of the field is
Z
E(t) = (1/8) d3 r(E 2 + H 2 ) H0 . (7.132)
L3
Let us accept this expression as the Hamilton function of the free-field transverse
part. By substituting here the plane-wave expansion (7.101) and taking into ac-
count the orthogonality condition (7.96), we obtain a diagonal quadratic form,
X
H0 = (L3 /8) (|Ek |2 + |Hk |2 ). (7.133)
k
According to (7.136), the Hamiltonian of the interaction between the particles and
the transverse field is
X ei e2i
2
V=
Pi A i + 2
Ai . (7.139)
i
mi c 2mi c
In the case of particles with internal magnetic moment i , one should add the
energy of the spin interaction, Hi .
Let us show that (7.135), (7.136) lead to the usual Newtons equations with
the Lorentz force for the particles and to Maxwells equations with the external
currents (7.107) for the field. In order to obtain Newtons equations, let us differ-
entiate (7.137) in time. Taking into account that, according to (7.135), (7.136),
Pi = (ei /c)Vi Ai /Ri , (7.140)
we find that
!
ei Ai Ai ei
mi Ri = Pi + Vi = ei Ei + [V i Hi ] . (7.141)
c t Ri c
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
Recall that the fields are taken at the point of the particle location, therefore,
d Ai /dt , Ai /t = cEi .
The field equations can be found by differentiating the total Hamiltonian
(7.136) w.r.t. the canonical variables pk , qk of the field. Then the forces act-
ing on the field from the particles are determined by the second term in (7.138).
Let us first differentiate it in Ai with the help of (7.137):
mi Vi2 Vi ei ei
= mi Vi = Vi = V i Ai . (7.142)
Ai 2 Ai c Ai c
It follows that the Hamiltonian of the interaction between the transverse field and
non-relativistic spin-free particles can be represented, instead of (7.139), as
Z
0
X ei 1
V V i Ai = d3 r j A, (7.143)
i
c c
where j(r, t) is the external current density determined by the coordinates and
velocities of the particles,
X
j(r, t) = ei V i (t)(3) (r Ri (t)), (7.144)
i
and the prime reminds that, according to (7.142), the Hamiltonian (7.143) pro-
vides an exact description only for the perturbation of the field by the particles
and not vice versa.
Hamiltons equations (7.135) immediately lead to the equations of motion for
an arbitrary function of canonical coordinates, f (qk , pk , t),
d f /dt = f /t + { f, H}, (7.145)
X f g !
g f
{ f, g} . (7.146)
k
qk pk qk pk
One can easily verify that after the linear transformation (7.121) from qk , pk to
the new independent variables ak , ak , the Poisson bracket (7.146) takes the form
!
1 X f g g f
{ f, g} = . (7.147)
i~ k ak ak ak ak
Assuming f ak , with the help of (7.134) and (7.143) we obtain
Z
1 H i Ai
ak = = ik ak + d3 r j . (7.148)
i~ ak ~c ak
With an account for (7.124) and (7.144), this equation coincides with Eq. (7.127),
which was obtained from Maxwells equations with external currents.
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
Thus, we have written the equations for the field and the matter in the canon-
ical form (7.135) with the Hamiltonian (7.136). Before using this result for the
quantization of the field equations, let us consider the interaction Hamiltonian in
the dipole approximation.
7.3.4 Dipole approximation
In quantum electronics, it is often possible to use approximations instead of the
exact expressions for the perturbation energy (7.139), (7.143). In the case of a
plane free monochromatic wave, H = E. Therefore, in the first order in Vi /c, one
can neglect the magnetic part of the Lorentz force in the Newton equation,
Further, let the particles occupy a restricted space area with the linear size a
much less than the scale of the field variation, o = c/. Then the field can be writ-
ten as a series expansion in Ri and only the first few terms should be taken into
account. In this case, (7.139) leads to the multi-field expansion of the perturbation
Hamiltonian for the particles in powers of Ri /o. In the zeroth (dipole) approxi-
mation, A(Ri ) A(r0 ) A0 , where r0 is some fixed point inside the system of
particles (for instance, the center of mass). Then, according to (7.140), Pi = 0,
and (7.141) takes the form (compare with (7.149))
mi Ri = ei E0 , (7.150)
where E0 E(r0 , t). This equation, according to (7.135), follows from the inter-
action Hamiltonian of the form
where
X
d(t) ei (Ri (t) r0 ), |Ri r0 | o. (7.152)
i
Here, in contrast to (7.139), the field is a given external parameter. Note also that
the dipole moment of a neutral system does not depend on the choice of r0 .
Let the matter consist of N separate motionless molecules with the dipole
moments d j and centers at r j . Then the energy of the matter in the external field,
according to (7.151) is (compare with (4.28))
N
X Z
Vdip = d j (t) E(r j , t) = d3 rP(r, t) E(r, t), (7.153)
j=1
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
N
X
P(r, t) = d j (t)(3) (r r j ). (7.154)
j=1
The interaction Hamiltonian for the field in the dipole approximation follows
from (7.143) after replacing Ai by A0 ,
0 1 1d
Vdip = d A0 = d E0 (d A0 ). (7.155)
c c dt
If we restrict the consideration to the case of quasi-monochromatic currents and
fields, then d A0 contains two components: a constant one and one oscillating
with the double frequency. As a result, only the first term in (7.155), coinciding
with (7.151), provides an accumulating interaction,
0
Vdip Vdip = d E0 . (7.156)
Thus, the dipole Hamiltonian Vdip can be also used for calculating the emitted
field in single-frequency problems. It follows from (7.156) that
1 Vdip ck
ak + ik ak = = dk exp(ik r0 ), (7.157)
i~ ak ~
where dk d ek . The same result can be obtained from the exact equation
(7.127) in the case of a neutral system, after taking into account (7.144), replacing
exp(ik Ri ) by exp(ik r0 ), and replacing V i by ik Ri .
Often, instead of (7.139), one uses the approximation
X
V ei Pi Ai /mi c, (7.158)
i
i.e., neglects the term that is quadratic in eA. (Note that in the case of a single
electron in a harmonic field, this term is on the order of Ei2 where = o2 re is the
polarisability of a free electron (6.36).) This approximation is only valid in the first
order of the perturbation theory, i.e., in the calculation of single-quantum effects.
Eq. (7.158) also follows from (7.143) if the canonical and kinetic momentums are
assumed to be the same. Eq. (7.158) leads to the following equation of motion for
the particle:
ei dAi
mi Ri = , (7.159)
c dt
which at a o coincides with (7.150).
For bonded electrons in atoms and small molecules, a 108 cm, and the con-
dition for the validity of the dipole approximation (7.151), (7.155) is satisfied up
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
to the X-ray range. Recall that magnetic moments related to spin and orbital mo-
tion are on the order of the Bohr magneton 0 , which is two orders of magnitude
as small as one Debye,
20 e~/mc = eoc ea0 /137. (7.160)
However, despite the relatively small value of multi-field effects, their mani-
festation in the optical range is important and can be easily observed: for instance,
in the effect of the optical activity (polarization rotation) and in the appearance of
forbidden lines in spectra.
A free electron in a harmonic field, according to (7.149), oscillates with the
amplitude a1 = eE1 /m2 and the velocity a1 ; therefore, the conditions a o
and V c have the same form,
E1 mc2 /eo 108 G. (7.161)
This estimate was made for = 1 and corresponds to a practically impossible
intensity 1018 W/cm2 . Nevertheless, by taking into account the magnetic-field
effect in a light wave (Sec. 6.2) one can calculate the quadratic polarisability of a
free electron and observable nonlinear effects.
Thus, we have represented the field equations in the form of Hamiltons equations
for the spatial harmonicsq Ek (t), Hk (t) (or their linear combinations qk , pk , ak ).
Now, we can pass to the main stage of the quantum description, which is finding
the commutation rules for the dynamical field variables.
For instance, assuming f ak and using (7.124), (7.134), (7.143) and (7.165), we
obtain the Heisenberg equation for the annihilation operator in the form (7.127).
Similarly, all other relations from Sec. 7.3 remain valid after changing the classical
dynamical variables to operators in the Heisenberg representation. It is important
that the operator products are written in the symmetrized form, for instance,
|a|2 (a a + aa )/2 = a a + 1/2 = aa 1/2, (7.172)
where we used (7.165) for obtaining the last equalities. Note that in the operator
identities similar to (7.172), 1/2 is understood as I/2, where I is the unity, or
= .
identity, operator, I
Spatial harmonics of a free field depend on time harmonically with the fre-
quency k = ck. Hence, we find the two-time commutators,
[ak (t), ak0 (t0 )] f ree = kk0 exp[ik (t t0 )], (7.173)
and similar relations for other field variables. In the presence of external currents,
(7.173) can be replaced by a more complicated dependence on t and t0 , but it
should turn into (7.164) at t = t0 . This conservation of commutation relations
follows from the unitarity of the operators time evolution.
7.5 States of the field and their properties
Next, we consider the various states of the field, both pure and mixed ones, and
their properties, as well as the mean values and distributions of the observables in
these states. It is convenient to use some basis set of wave functions, so that an
arbitrary state can be represented as an expansion over this basis. This procedure
is similar to expanding an arbitrary vector over the set of unit vectors of some
frame of reference in real space. We will consider the basis sets generated by
various operators: energy H, coordinate q, momentum p, photon annihilation
a q + ip, as well as relations between these bases. In this consideration,
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
we will use compact Diracs notation, which will be briefly described in the next
subsection.
Similarly,
X
h|= h |nihn|. (7.179)
n
In the case of a continuous variable, summation in (7.177) and other similar equa-
tions is replaced by integration,
Z
|i= dx|xihx| i. (7.180)
The vectors h | and | i are called, respectively, the bra- and ket-vectors (being parts
of a bracket).
q Recall that in classical mechanics, the state is given by the numbers q, p, while in quantum mechanics,
Projections of different unity vectors onto one another are equal to zero (in the
case of an orthogonal system),
hn|n0 i = nn0 , hx|x0 i = (x x0 ). (7.181)
One says that a frame of reference (a basis) is complete if any vector can be
represented in the form (7.177). The completeness can be expressed in the form
of a tensor equation
X
I = |nihn|. (7.182)
n
Here, I is a unit tensor (I| i = | i), and |aihb| denotes a dyadic tensor, or the outer
product of the vectors |ai and hb|. The action of a dyadic on the vectors is obvious
from its notation,
{|aihb|}| i |aihb| i = hb| i|ai,
(7.183)
h |{|aihb|} h |aihb| = hb|h |ai.
The tensor |aiha| Pa with ha|ai = 1 is called a projector since its action on a
vector | i selects the component of this vector along |ai: Pa | i = |aiha| i ca |ai.
The expansion of the unity (7.182) provides an easy way for forming various
representations of scalars ha|bi, vectors | i, tensors (operators) f :
X
ha|bi = ha|I|bi = ha|nihn|bi, (7.184)
n
X
| i = I| i = |nihn| i, (7.185)
n
X
f = IfI = fnn0 |nihn0 |, (7.186)
nn0
where N a a is the photon-number operator for the chosen mode. The k index
will be omitted, as a rule, whenever we consider only a single mode. By definition,
the eigenvalues and eigenfunctions satisfy the equality
H0 |Ni = EN |Ni. (7.194)
According to (7.193), the energy states |Ni or, in short, the N-states, also
known as Fock states, are also eigenstates for the a a operator, (N N)|Ni = 0,
where N E/~ 1/2.
The H0 operator is Hermitian; therefore, the vectors |Ni form a complete or-
thogonal normalized frame of reference,
X
hN|N 0 i = NN 0 , |NihN| = I, (7.195)
N
X X
|i= |NihN| i, f = fNN 0 |NihN 0 |. (7.196)
N NN 0
Using the commutation rule [a, a ] = I, it is not difficult to show (see [Klyshko
(1980)]) that N are integers,
EN = (N + 1/2)~, N = 0, 1, 2, . . . (7.197)
Thus, the energy of a single mode can only take a set of discrete equidistant
values differing by ~, the energy of a photon. A mode has the smallest possible
energy if N = 0, which corresponds to the vacuum state |0i, while the states with
N > 0 are called N-photon states.
It is also easy to find out how the a, a operators act on N states,
a|Ni N 1/2 |N 1i, (7.198)
state of a mode is an incoherent mixture of a few first N states, while the state of
an ideal laser is a coherent mixture of many N-states.
All considerations given above related to a fixed time moment. The time
dependence of the state vector of a single free-field mode is described in the
Schrodinger representation by the equation
and if at time t = 0 the mode was in an N-photon state, then, according to (7.194),
where = ck. Hence, in an N-state, the mean values, moments, and distributions
of all observables, including q, p, are stationary. An arbitrary pure state of a mode
depends on the time as
X
|ti = cN |N, ti = c0 |0i + c1 eit |1i + . . . , (7.203)
N
Thus, an energy state is fixed by giving the photon numbers {Nk } in all modes
(occupation numbers), while an arbitrary state can be represented as a superposi-
tion of states with all possible combinations {N},
X
|ti = c({Nk })|{Nk }, T i. (7.206)
{Nk }
In the presence of external currents, the state amplitudes c({Nk }) become time-
dependent, and can be only determined by applying the perturbation theory (com-
pare with Sec. 2.1).
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
The states |zi are called coherent states. Since a q + ip, one can expect that the
spectrum of a is continuous and complex, i.e., z = z0 + iz00 is an arbitrary complex
number.
From the definition (7.207), it follows that the action of an arbitrary operator
function f (a) on the vector |zi is reduced to a simple multiplication of this vector
by a usual (c-number) function f (z),
In the case of a coherent state, all factorial moments of the photon number are
also calculated in a simple way (see (7.170)),
G(m) m
z hz| : N : |zi = |z|
2m
= hNim
z . (7.211)
For instance,
haa iz = ha a + Iiz = |z|2 + 1,
N 2 a aa a = a (a a + I)a =: N 2 : +N, (7.214)
2 4 2
hN iz = |z| + |z| .
Using the last equality, one can write the second moment and the variance hN 2 i
hN 2 i hNi2 in terms of the first moment,
hN 2 iz = hNiz (hNiz + 1), hN 2 iz = hNiz . (7.215)
This relation is typical for a Poissonian random variable (see below).
In the general case, normal ordering of an arbitrary operator is not a simple
task (see examples in Ref. [Louisell (1964)]). Sometimes, the following operator
identity is helpful [Klauder (1968)]:
exp(a + a) = C exp(a ) exp(a) = C 1 exp(a) exp(a ), (7.216)
where C exp(/2). At = , the operator in (7.216) is called the displace-
ment operator and is denoted as D(),
D() exp(a a). (7.217)
From (7.200), (7.217), and the equality obtained below, (7.222), it follows that
D(z)|0i = |zi. (7.218)
One can show [Bloembergen (1965)] that a classical external current jk converts
a mode from a vacuum state, |0i, into a coherent one, |zi, i.e., its action can be
described by the displacement operator. The amplitude z coincides in this case
with the classical amplitude found from (7.128).
Let us show that an oscillator in a coherent state will indeed have a Poissonian
distribution of the energy, i.e., that in a coherent state photons behave, in a sense,
like a chaotic flow of sand grains. For this, we will find the transformation matrix
hN|zi relating the bases |Ni and |zi. Let us left-multiply (7.207) by hN|, which
yields
hN|a|zi = zhN|zi. (7.219)
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
In particular,
Z
1
|z1 i = d2 z|zihz|z1 i. (7.231)
This relation, together with (7.225), shows that the basis vectors |zi can be ex-
pressed in terms of each other, i.e., the z basis is over-complete. Roughly speak-
ing, it means that the number of coordinates exceeds the dimensionality of the
space.
The time dependence of a coherent state can be easily found by substituting
(7.202) into (7.222),
X
|z, ti = |NihN|zieiNt = |zeit i |z(t)i. (7.232)
Thus, free evolution does not turn a coherent state into some other type of
state, similarly to the case of an energy state (see (7.202)) and in contrast to the
cases of q- and p-states (see below).
If all modes are in coherent states, then the state vector of the field will be
|{zk }i = |z1 i1 |z2 i2 |z1 , z2 , . . . i. (7.233)
According to (7.131), this vector is the eigenvector for the positive-frequency
(+)
field operator E (r, t) with the eigenvalue
X
E(+) (r, t) i ck ek zk exp(ik r ik t). (7.234)
k
The mean value of the field in a coherent state is equal to the real part of this
expression,
h{zk }| E(r, t)|{zk }i = 2ReE(+) (r, t). (7.235)
Further, according to (7.212), all normally ordered field moments (correlation
functions),
G(n) () () (+) (+)
1...2n h E 1 . . . E n E n+1 . . . E 2n i, (7.236)
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
which usually determine the readings of optical detectors, are factorable in the
case of a coherent state, i.e., can be expressed in terms of the products of the first
moments,
G(n) () (+)
1...2n ({zk }) = E 1 . . . E 2n . (7.237)
Here, the eigenvalues of the fields Ei() Ei (ri , ti ) are determined by the set {zk }
according to Eq. (7.234).
hq|q0 i = (q q0 ), (7.240)
Z
I = dq|qihq|. (7.241)
The squared delta-function has a meaning provided that one of its represen-
tations with a finite width q is used. In this case, (0) = 1/q (see (6.139)),
and one can replace (q)2 by (q)/q. The width q is chosen from the physical
considerations: it should be much less than the interval on which the functions
that are multiplied by the delta function before the integration vary considerably.
In this example, however, q is canceled since C = q,
P(q|q1) = (q q1 ). (7.244)
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
Fig. 7.18 Distribution of the coordinate for an oscillator in a Fock state (top) and
a coherent state
(bottom) with the same mean photon numbers, equal to 5. The coordinate q is in ~/m units.
As a result, the distributions shift without changing their shapes (Fig. 7.18),
P(q|z, t) = 1/2 exp[(q q0 cos t)2 ],
(7.260)
P( p|z, t) = 1/2 exp[( p + q0 sin t)2 ].
Thus, the mean coordinate and momentum in the case of a coherent state de-
pend on time the same way as the corresponding values of a classical oscillator.
As |z0 | increases, the relative fluctuations are reduced and a quantum oscillator
becomes more and more similar to a classical one.
Note that the wave function (7.251) with the substitution zt z(t) = z0 eit
should satisfy the Schrodinger equation in the q representation,
2
!
2i 2
+ q z (q, t) = 0,
t q2 (7.261)
z (q, t) hq|zt i.
(Here and below, we use dimensionless variables q q, p p.) This condition
allows one to find the phase in (7.253). As a result, the coherent wave function
can be represented as
z (q, t) = 1/4 exp[(q 2zt )2 /2 z002 0 00
t + i(zt zt t/2)]
determining the coordinate distribution in Fock states and the photon-number dis-
tribution in q-states. These functions satisfy Eq. 7.261 with /t replaced by
iN; they are equal to the Hermite polynomials multiplied by the vacuum func-
tion h0|qi = exp(q2 /2). They are obtained if one multiplies (7.200) by hq| and
replaces hq|a with 21/2 (q d/dq)hq|,
the averaging runs over the initial state of the oscillator |t0 i. Thus, the variances
of the coordinate and momentum oscillate anti-phased with the frequency 2, and
their sum is an integral of motion,
According to (7.265), the variances are constant only under the condition that
Dq = D p and Dqp = 0. Using (7.254), one can verify that this is the case for Fock
states (Dq = D p = N + 1/2) and coherent states (Dq = D p = 1/2).
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
Fig. 7.19 Coherent (a) and squeezed (b,c) states of an oscillator. The figure shows the time depen-
dencies of the mean coordinate and the coordinate uncertainty.
The dependencies are calculated using
Eq. (7.265) with q0 = 5, Dqp = 0, qp = 1/2 and q = 1/ 2 (a), 0.2 (b), 2.5 (c).
experiment, first through four-wave mixing and then via parametric amplification, see [Bachor (2004);
Walls (1994)]. Amplitude squeezing of photocurrent was also observed using a negative feedback
loop [Yamamoto (1999)].
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
Fig. 7.20 Various states of a quantum oscillator shown on the phase plane. The horizontal and verti-
cal sizes of the figures correspond to the uncertainties in the coordinate and momentum, respectively;
the sizes along the radius and the azimuth correspond, respectively, to the amplitude and phase un-
certainties. 1, the vacuum state; 2, a coherent state; 3, a coordinate state; 4, a squeezed state; 5, a
momentum state; 6, a photon-number state; 7, a phase state.
vertical straight line; a p-state, by a thin horizontal straight line.u However, one
should keep in mind that these figures have only qualitative meaning; strictly, they
do not correspond to any joint distributions P(q, p), which do not exist in quantum
mechanics.
Similar to the states themselves, the figures depicting them on the phase plane
change in time due to the natural evolution (described by the Schrodinger equa-
tion) or due to the reduction as a result of the measurement back-action. For
instance, after an accurate measurement of q, a coherent circle in Fig. 7.20 will
turn into a vertical line. The evolution due to free oscillations is described by the
counter-clockwise rotation of the figure around the origin, with the angular rate
, or, alternatively, the clockwise rotation of the frame of reference q, p.
u Editors note: Probably one should add squeezed vacuum, a state generated at the output of an
unseeded optical parametric amplifier. In the diagram, it will be shown by an ellipse at the origin,
with the area being the same as for a coherent state [Bachor (2004); Walls (1994)].
v Editors note: Note that the description of a composite system consisting of two or several interacting
sub-systems (or sub-systems having interacted in the past) directly relates to the concept of entangled
states ([Peres (1993); Nielsen (2000); Bouwmeester (2000)], Sec. 7.5.7). This family of states was not
considered in the original book at all, partly due to the fact that in 1986, entangled states were not as
popular as nowadays. At the same time the author put a lot of efforts to avoid using vague terms while
describing physical phenomena ([Klyshko (1994)]).
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
hN||N 0 i, qq0 , zz0 and so on. In most cases, the density operator of the field is
written in the N-basis, but in some cases the z-basis is more convenient. For a
single mode, can be written in terms of N- and z-projectors as (the mode index
k is omitted)
X
= |NihN 0 | (7.267)
NN 0
Z
= d2 zP(z)|zihz|. (7.268)
Note that here, the diagonal z-representation is used, which is possible in many
cases and is provided by the over-completeness of the z-basis. Equation (7.268)
is called the Glauber-Sudarshan representation or the P-representation. The nor-
malization and Hermiticity conditions of are
X
NN = 1, NN 0 = N 0 N ,
N
Z
d2 z P(z) = 1, P(z) = P (z).
The mean value of any field operator f can be expressed in terms of accord-
ing to the relation h f i = Tr( f ) (Sec. 3.2), or according to (7.267), (7.268),
X Z
hfi = NN 0 fN 0 N = d2 zP(z)hz| f |zi. (7.269)
NN 0
Thus, the P(z) function provides an easy way to calculate the mean values of
normally ordered operators. In particular, the normally ordered moments can be
found as
Z
(n)
G = d2 zP(z)|z|2n . (7.271)
Equations (7.269), (7.270) show that the weighting function P(z) plays the
role
0of the probability
for the oscillator to have a complex amplitude z, i.e., q =
2z , p = 2z00 . However, P(z) can take negative values; besides, even with
P(z) = (2) (z z1 ), i.e., in the case of a pure coherent state, q and p have zero-point
fluctuations.w Therefore, P(z) is called a quasi-probability.
w Editors note: also called shot-noise fluctuations.
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
Moreover, the quasi-probability of a coherent state P(z) allows one to find the
probability distribution P( f ) of an arbitrary observable f . For this, one should
replace f in (7.269) by the projector P( f ) = | f ih f |,
Z Z
P( f ) = d2 zP(z)|hz| f i|2 = d2 zP( f |z)P(z). (7.272)
This definition yields a usual (not generalized) function for all states, in contrast
to P(z).x From the definition of , it follows that G(n) can be calculated by means
of differentiation, instead of more complicated integration,
!n
(n)
G = (, ) |=0 . (7.274)
Thus, a mixed state of a mode can be described by the NN 0 matrix, or one of
the functions P(z), (, ).
A mixed state of a multi-mode field is given by the density matrix
hN1 , N2 , . . . ||N10 , N20 , . . . i = h{Nk }||{Nk0 }i,
or by the quasi-probability P({zk }), or by its Fourier transform, ({k , k }). In
the case of independent modes, these values are factorable, reducible. It should
be stressed that with the help of the quasi-probability function, the operation of
quantum averaging of normally ordered operators (which are usually of interest)
takes the classical form (7.270), which is also maintained in the case of a multi-
mode field. For instance, the correlation functions (7.236) are found by averaging
their mean values in a coherent state (7.237) with the quasi-probability,
Z Z Y
G(n)
1...2n = . . . P({zk })G (n)
1...2n ({zk }) d2 zk . (7.275)
k
Recall that here, zk z(k, ) has the meaning of the amplitude (in 2ck units) of
the plane wave Ek0 propagating in the k direction and having polarization e .
Further, let us consider some examples of mixed states of the field.
In the case of a stationary field, the density operator does not depend on time,
i.e., [, H] = 0, which in the N-representation yields
X
h{Nk }||{Nk0 }i (Nk0 Nk )k = 0.
k
x For instance, P(z) for N states contains the 2Nth-order derivative of the delta function [Perina (1972)].
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
where
C = P(0) = 1 ex , x ~/T.
In a non-equilibrium field, the modes can also have exponential energy distri-
bution, provided that they are chaotically excited by many independent sources.
This is the case for thermal radiation, fluorescence, or superfluorescence (in the
linear regime, see Sec. 7.1). Then, the dependence of hNk i on |k| = /c de-
termines the frequency spectrum of the radiation, while the dependence on the
direction, k/k, determines the angular spectrum and the direction of incoherent
radiation. Recall that in non-laser light, usually all hNk i 1. For instance, for the
green part of the sunlight spectrum, hNk i 102 , hence the probabilities to find
0, 1, and 2 photons in one mode are approximately equal to 0.99, 102 , and 104 .
One can show [Glauber (1965)] that the quasi-probability of a chaotically ex-
cited mode is a two-dimensional Gaussian function with the variance hNi/2,
PT (z) = exp(|z|2 /hNi)/hNi. (7.280)
Hence, taking into account (7.273),
T (, ) = exp( hNi) (7.281)
and, according to (7.274), only even symmetric moments are nonzero,
G(m) m m
T h: N :iT = m!hNi . (7.282)
It follows from (7.282) and (7.211) that an m-quantum transition is m! times as
probable in a thermal field than in a coherent field with the same hNi (see (6.212)),
which is due to the long tail of the thermal distribution. At m = 2, (7.282)
describes photon bunching (Secs. 7.2, 7.6).
By substituting (7.280) into (7.272), one can see that the distribution of the
coordinate and momentum for a thermal state are Gaussian as well, with zero
mean values and the variances determined by the relation hNi + 1/2 = h p2 + q2 i/2,
i.e.,
hq2 iT = h p2 iT = hNiT + 1/2 = (1/2) coth(x/2). (7.283)
Note that additive multi-mode parameters of the field, such as, for instance, the
electric field amplitude E at point (r, t), will have Gaussian distribution regardless
of the states of separate modes (provided that they are independent), by virtue of
the central limit theorem.
As we have already mentioned, a pure coherent state does not belong to the
class of stationary states, since z(t) = exp(it+i). However, one can construct
a stationary coherent state by forming a mixture of coherent states with the same
amplitudes and random phases . Such a state, apparently, is described by the
quasi-probability of the form [Glauber (1965)]
P(z) = (|z| )/2 = (|z|2 2 )/, (7.284)
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
apparently, |12 i , |1 i |2 i.
However, definition (7.285) does not tell us whether a given state is more
entangled or less entangled. For quantifying entanglement, there is a simple pa-
rameter introduced by Wootters, the concurrence C, which indicates how much
entanglement is stored in a composite state of two qubits:
0 C 2|c1 c4 c2 c3 | 1. (7.287)
If the state is separable, then C = 0. An example of maximally entangled two-
qubit states (C = 1) are the Bell states, which form a complete basis for two-qubit
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
systems:
1
()
12
[|01 i|12 i |11 i|02 i], (7.288)
2
1
()
12 [|01 i|02 i |11 i|12 i]. (7.289)
2
The state ()
12 is called the singlet state and plays a special role due to its remark-
able symmetry properties. The other three states form the triplet. For instance
the singlet state is invariant with respect to choice of basis. The Bell states are an
important tool in modern quantum optics but probably they are most famous for
violating the Bell inequality [Klyshko (1998); Grynberg (2010)]
Another way to quantify entanglement is through the so-called Schmidt de-
composition, which exists for any bipartite composite system:
D
X D
X
p
AB = j |u j i|v j i, = 1, (7.290)
j=1 j=1
where |u j i and |v j i are bases for subsystems A and B, both having dimensionality
D. The degree of entanglement can be estimated with the Schmidt number,
D 1
X
1 K 2j D, (7.291)
j=1
which can be interpreted as the number of nonzero Schmidt modes in the expan-
sion (7.290). It is easy to calculate the Schmidt number for the Bell states (7.288),
(7.289): K = 2. A state is separable if and only if K = 1. The Schmidt decom-
position is a very useful approach from the physical viewpoint as it allows one to
interpret the natural eigenmodes |u j i and |v j i of the system under study (called the
Schmidt modes) in terms of entanglement [Mandel (2004)].
Another closely related measure of entanglement is the von Neumann entropy,
D
X
S j log2 j = S (A ) = S (B ), (7.292)
j=1
where S (A,B) TrB,AAB is the reduced density matrix for subsystem A (B).
This definition returns us to the initial meaning of entanglement introduced by
Schrodinger. Indeed, the notion of the entropy relates to the uncertainty in the
(sub)system. For a pure state the von Neumann entropy equals zero, while for
completely mixed states it takes the maximal value.
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
The definition (7.285) can be also formulated for mixed states. Namely, a
mixed state is called separable if and only if it can be written as
X
j j j j
= p j |A ihA i |B ihB i, (7.293)
j
P
where j p j = 1, 0 p j 1. Otherwise, the state is entangled.
As an example let us consider the so-called Werner states,
W x| ih |+{1 x} | ih | + |+ ih+ | + |+ ih+ | + |+ ih+ | . (7.294)
It turns out that the state (7.294) is separable for x < 1/3.
Let us mention that sometimes one falsely associates entanglement with the
violation of the Bell inequalities. In this connection, the Werner states (7.294)
represent an important example: at 1/3 < x < 1/ 2 they are not separable (and
hence are entangled) but do not violate the Bell inequalities.
It is worth noting that the concept of entangled states was actively developed
in quantum information science and formally relates to systems of quantum bits
(qubits) or quantum dits (qudits) which are rather abstract notions. Nowadays
there are several physical systems that play the role of qubits, such as polarization
states of single photons, two-level atoms or ions in traps, etc. Real physical qubits
need to be carefully specified to avoid a contradiction with the formal description.
For instance, the well-known permutation property of identical particles leads to
entanglement since their wave function should be symmetrized, hence the singlet
state of two spin-1/2 particles takes the form
1
= [| i| i | i| i]. (7.295)
2
However, this sort of states can not serve as a resource of quantum information
since it is impossible to perform local operations over the subsystems and there
is no possibility to change the coefficients in the coherent superposition (7.295).
This fact was discussed by Zanardi and Peres [Peres (1993)] but using systems of
identical particles in the protocols of quantum information and quantum commu-
nication seems to be still an open question.
Starting with the pioneering experiments performed by Fry and independently
by Aspect [Grynberg (2010)] with two-photon fluorescence of atoms, entangled
states were actively studied in quantum optics. In particular, the most popular
object was the two-photon state (biphoton) [Klyshko (1998)] created via sponta-
neous parametric down conversion (see Secs. 6.5 and 7.6). Several types of bipar-
tite entangled states were suggested depending on the available degrees of free-
dom under consideration: entanglement between polarization and momentum, po-
larization and frequency, energy and time, frequency and momentum. The partic-
ular type of entanglement is determined by the phase-matching conditions and/or
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
7.6 Statistics of photons and photoelectrons
Let us consider in more detail the statistics of the photon number N (see also
Ref. [Loudon (2000)]). For simplicity, we will mostly consider a single mode.
Below, we will show that the distribution P(N) NN or the moments hN m i
P m
N P(N) can be experimentally obtained from the statistics of the number of
photocounts, i.e., the number of electrons released by light from the photocathode
of a PMT during some sample time T . Such methods form the base for the optical
mixing spectroscopy [Cummins (1974)] where the traditional spectral analysis of
light is replaced by the statistical analysis of photocurrent at the PMT output.
Further, using (7.270), one can write P(N) as an infinite series of normally ordered
moments of orders m 1 N,
1 1 X (1)k (N+k)
P(N) = h: N N eN :i = G . (7.302)
N! N! k=0 k!
Thus, the photon-number distribution has the form of a Poissonian one, N e /N!,
with the random parameter N, which requires additional quantum averaging
with normal ordering.
Above, we have considered two examples of P(N) distributions: the Poisso-
nian one, (7.224), for a coherent state, and the geometric one, (7.279), for a chaotic
(thermal) state. These are single-parameter distributions: they are fully character-
ized by, for instance, the first moment, hNi. One can show (see, for instance,
Ref. [Loudon (2000)]) that for a single-mode laser much above the oscillation
threshold, P(N) is Poissonian and for a laser below the threshold, it is geometric.
Near the threshold, the distribution has a shape that is intermediate between these
two limiting cases. This distribution, in the simplest models, is determined by two
parameters, for instance, hNi and the excess inversion above the threshold.
It is useful to consider another type of states, namely, an incoherent mixture
of the vacuum, |0i, and the K-photon state, |Ki. Then, P(N) differs from zero only
at two points,
P(N) = P(0)N0 + P(K)NK . (7.303)
Hence, hNi = KP(K), and all values can be expressed in terms of a single param-
eter, the mean photon number,
P(K) = hNi/K, P(0) = 1 P(K), (7.304)
hN m i = K m P(K) = K m1 hNi, 0 0 N 0 K. (7.305)
In the limiting case hNi = K, we obtain a pure energy state with K photons;
however, it is interesting to consider the more realistic case hNi K. Such states
can be generated via a K-photon decay of a single excited atom into one mode.
Repetition of this process in time leads to the appearance of K-photon light,
radiation consisting of K-photon groups. Two-photon light can be also generated
via the spontaneous (hNi 1) parametric down-conversion of usual light, in
which photons have Poissonian or Bose-Einstein distribution.
Figure 7.21 shows the plots of all three distributions we have considered.
Below, it will be shown that the termination of the K-photon distribution at
N > K leads to the effect of photon anti-bunching for hNi > K 1, while at
K > 1, hNi K 1, it leads to the effect of super-bunching. (Conventionally, the
Poissonian distribution is considered as having no bunching.)
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
Fig. 7.21 Typical photon-number distributions: (a) geometric; (b) Poissonian; (c) two-photon mixed.
The dashed line shows a Gaussian function with the mean value 9.5 (see (7.320)).
The last equality follows from (7.302). Note that Q(x) differs from the he xN i
function, whose derivatives yield usual moments. If Q(x) is known, the factorial
moments and the distribution P(N) can be easily found by differentiating it at
points x = 0 and x = 1. Indeed, it follows from the definitions that
G(m) = Q(m) (0), (7.308)
An important property of the generating functions (GFs) follows from the def-
P
inition (7.307): the GF Q(x) of a sum N = Ni of independent integers Ni is
equal to the product of the GFs for these integers,
Y
Q(x) = Qi (x). (7.315)
i
For instance, let a set of M similar atoms independentlyy emit K-photon light;
then, using (7.313) and (7.315), we find
M
Y
Q(x) = {1 + hNi i[(1 + x)K 1]/K} exp{hNi[(1 + x)K 1]/K}, (7.316)
i=1
P
where we assumed M and hNi i 0 with a finite hNi hNi i. In the case
K = 1, (7.316) becomes (7.311), i.e., weak single-photon radiation from a large
number of independent atoms results in a Poissonian distribution (in the absence
of interference). At K > 1, (7.316) describes Poissonian distribution for groups of
K photons. For instance, in the case of two-photon radiation,
Q(x) = exp[(2x + x2 )], hNi/2,
(7.317)
P(2N) = N e /N!, P(2N + 1) = 0.
Apparently, this example also includes the case where M is the number of inde-
pendent modes and N is the total number of photons in these modes.
Consider another example of applying the composition rule (7.315). Let us
focus on the total photon number N in M independent modes with the geomet-
ric distributions and the same mean photon numbers, hNk i = hNi/M (the
y Here we neglect interference, which one can do in the case of multi-mode detectors with a large
Thus, the N distribution for a coherent quantum ensemble coincides with the
distribution of chaotically scattered classical particles over a set of cells, either in
space or in time. For an arbitrary state, the variance hN 2 i can certainly differ
from hNi; in this case, for hN 2 i > hNi one speaks of photon bunching, while
for hN 2 i < hNi, of photon anti-bunching. These effects can be quantitatively
described in terms of the normalized second factorial moment,
h: N 2 :i hN 2 i hNi hN 2 i hNi
g 2
= 2
= 1+ . (7.325)
hNi hNi hNi2
For a Poissonian distribution, g = 1, in the case of bunching g > 1, and in
the case of anti-bunching g < 1. Because the rate of a stimulated two-photon
transition scales as h: N 2 :i, g determines the ratio of two-photon efficiencies for
a given field and for a coherent field with the same mean energy. Note that if we
ignore the normal ordering in the definition of g (7.325), then g will be always
greater than a unity,
and gT = 2. The variance is above hNi due to the relatively slow decay of the ge-
ometric (exponential) distribution (Fig. 7.21), which makes groups of N pho-
tons with N , hNi occur more often than in the case of a Poissonian distribution.
As a result, relative fluctuations of the photon number in a thermal state tend at
hNi 1 to the unity (and not to zero, as for a coherent state),
Sometimes one says that the first and the second terms in (7.327) and (7.328)
correspond, respectively, to the corpuscular and wave sides in the wave-particle
duality of a photon, and that the bunching effect (i.e., the second term) confirms
that the photons have a tendency to joining in groups. This terminology masks
the fact that the variance of some observable f characterizes the state, and not
the properties of f . This approach also ignores the presence of states with anti-
bunching, i.e., with the relative fluctuations less than 1/ hNi and with two-photon
efficiency less than the one for a coherent state with the same mean energy.
An evident example of an anti-bunched state are photon-number states. In a
K-photon state, the fluctuations of N are absent, P(N) = NK , so that hN m i = hNim
and g = 1 K 1 < 1.
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
Fig. 7.22 A typical distribution of 10 photons in time in the cases of (a) chaotic light (bunching);
(b) coherent light; (c) single-photon light (anti-bunching); (d) two-photon light (super-bunching).
The above-considered mixture of the vacuum and the K-photon state (7.303)
also manifests anti-bunching under the condition hNi > K 1. Indeed, it follows
from (7.305) that
gK = (K 1)/hNi. (7.329)
Hence, single-photon decay of separate atoms turns the field into an anti-bunched
state, g = 0, which was observed in the resonance fluorescence (Sec. 5.2) of
sodium atomic beam [Paul (1977)]. Certainly, in experiment the averaging runs
not over the ensemble of experimental setups but over time.
This effect can be explicitly explained by the fact that the decay of a single
atom cannot create two photons; therefore, the emitted photons are always sepa-
rated by a certain time interval required for a second excitation and de-excitation
of the same atom or the next atom in the beam (Fig. 7.22(c)). Recall that our
single-mode theory relates only to time intervals T much less than ; therefore,
P(N) = 0 for N > 1.
Radiation with anti-bunching (single-photon light) can also emerge as a re-
sult of multi-photon absorption (Sec. 6.4) of usual (chaotic or coherent) radia-
tion. The reason is that K-photon absorption, apparently, influences only the tail
of the P(N) distribution at N 1 K, which leads to the rarefication of photon groups
and the reduction of fluctuations in the initial radiation. On the contrary, the sat-
uration effect in single-photon absorption (Sec. 4.3) makes fluctuations more pro-
nounced, i.e., leads to photon bunching. This forms the base for a method of
obtaining very short (picosecond)z pulses in mode-locked lasers.
In the case of weak multi-photon light, K > 1, hNi K 1, Eq. (7.329)
yields g 1 (Fig. 7.23). This effect can be called photon super-bunching. It
z Editors note: at present, tens of femtoseconds are achieved.
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
Fig. 7.23 The bunching parameter g as a function of the mean photon number hNi in the cases of
single-photon light (1), two-photon light (2), three-photon light (3), coherent light (4), and chaotic
light (5). In range I, there is photon anti-bunching, in range II, bunching, in range III, super-bunching.
In the case of a pure two-photon state, hNi = 2, and the so-called negative corre-
lation, or anti-correlation, takes place, with g(2)
12 < 1 (Fig. 7.12).
Above, for simplicity we discussed the statistics of a single mode; however,
many conclusions can be generalized to the case of a multi-mode field. Indeed, in
spontaneous two-photon emission (Sec. 6.2) and in spontaneous parametric down-
conversion (Sec. 6.5), photons in pairs usually belong to different modes, k1 and
k2 , which differ both in frequency and in direction. Then, a small contribution
from the state |1i1 |1i2 is added to the vacuum, which provides the equality between
the moments,
This inequality, which can also be interpreted as the super-bunching effect (com-
pare with (7.329) at K = 2), was experimentally confirmed by measuring the rate
of coincidences between two PMTs.
Fig. 7.24 Distributions of the photon number, P(N), and the photoelectron number, P(m), in the case
of the detector efficiency 0.3 and the field being in a four-photon Fock state.
Hence, in the case of an arbitrary state of the field, we find the relation between
the distribution of photons, P(N) NN , and the distribution of photoelectrons (or,
in other words, photocounts),
X
P(m) = P(m|N)P(N). (7.336)
N=m
This relation, after taking into account (7.270), can be represented in an invariant
form,
where
Here, the functions of m are averaged using the discrete distribution P(m),
X
h f (m)i f (m)P(m). (7.344)
m=0
The quantum mean value, h f (N)i, has a similar form in the N representation.
Equation (7.341) can be easily obtained using the formalism of generating
functions for photons, Q phot (x) (see (7.307)), and electrons,
X
Qel (x) (1 + x)m P(m). (7.345)
m=0
m2 = hmi(1 ) + 2 hN 2 i. (7.349)
Thus, to the usual shot noise of the photocurrent, hmi, photon noise, hN 2 i,
is added with the weight 2 (which seems natural), but simultaneously, the term
2 hNi is subtracted (which is surprising for semi-classical theory). In the case of
photon anti-bunching, hN 2 i < hNi, so that the photocurrent noise is less than the
Poissonian, i.e., electrons appear with some regularity, repulse each other.
Note that, according to (7.341), the bunching parameters (as well as all nor-
malized factorial moments) of photons and electrons coincide,
7.7 Interaction of an atom with quantized field
ordered correlation functions (CFs). These functions have more direct relation
to the energy exchange than the symmetrized CFs that are commonly used. We
will also consider the symmetry of CFs, the relations between different CFs, and
the dependence of CFs on the microscopic parameters of the systems and their
Greens functions, i.e., response to coherent perturbations.
If we are not interested in the final state of the system, (7.359) should be
summed over all possible states |n f i. These states form a complete set; therefore,
the total probability of the transition up will be (compare with (2.82))
Z t
2
P = ~ dt0 dt00 F () (t0 , t00 )G(+) (t0 , t00 ), (7.360)
t0
where
F () (t0 , t00 ) hd(+) (t0 )d() (t00 )i, G(+) (t0 , t00 ) hE () (t0 )E (+) (t00 )i (7.361)
are, respectively, the anti-normally ordered CF for the dipole moment of the atom
and the normally ordered CF for the field at the initial state (which, apparently,
can be mixed as well).
Similarly, the probability of the emission of a quantum is determined by the
square of (7.357) with only the second term in (7.358) taken into account,
Z t
P = ~2 dt0 dt00 F (+) (t0 , t00 )G() (t0 , t00 ). (7.362)
t0
Thus, the probability of an energy quantum transfer from one quantum sys-
tem to another one is determined by the product of the unperturbed correlation
functions, the normal one for the emitting system and the anti-normal one for the
absorbing system (provided that the initial states are independent and at least one
of them is not coherent). The last condition is to exclude coherent interactions,
which depend on the phases of the states.
The mean variation of the total photon number within time T , in the second
order of the perturbation theory, is equal to the difference between (7.362) and
(7.360),
Z t
N = ~2 dt0 dt00 (F (+)G() F ()G(+) ). (7.363)
t0
In the ground states, normal CFs are equal to zero; therefore, the spontaneous
emission of an atom is determined by F (+) (this conclusion was already used in
Chapter 5), while the probability of a cold detector registering a photon is deter-
mined by G(+) (Secs. 7.2, 7.6).
Note that the obtained result (7.363) is valid for any systems with the interac-
P
tion energy of the form fi gi .
where Gvac is the anti-normally ordered CF of the field in the vacuum state.
According to (7.131),
X X 0 00
Gvac (t0 , t00 ) = ck0 ck00 h0|ak0 (t0 )ak00 (t00 )|0i = c2k eik (t t ) , (7.365)
k0 k00 k
where c2k = 2~k /L . Note that regardless of the state of the field,
3
If T is much greater than the correlation time of the atom and the field, then it
follows from (7.374) that
Z
W = ~2 d(F(+)G() () (+)
F G ), (7.375)
where W (P P )/T and the sign of W determines the direction of the quantum
transfer (from the atom to the field at W > 0).
Our initial model describes the time evolution of the states of the atom and
the field. Angle brackets in (7.373) denote averaging over the ensemble of experi-
ments with the same initial conditions and different . If, however, we assume that
both systems are in contact with their thermostats (Fig. 7.25), which continuously
restore and maintain the initial stationary states, we will obtain an ergodic model.
Then, (7.375) describes a continuous flux of energy quanta transmitted from one
thermostat to the other one through the atom-field system.
Fig. 7.25 Stationary interaction between an atom and a field. The atom-field coupling is often the
bottleneck for the energy exchange between the thermostats T 1 , T 2 .
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
Let us now write the CF (7.373) in terms of the microscopic parameters of the
systems. Averaging with the help of stationary density matrices (not necessarily
equilibrium ones) yields
X
G(+)
= c2k hNk ieik ,
k
X
G()
= c2k (hNk i + 1)eik ,
k
X (7.376)
F(+) = |dmn |2 mm eimn ,
m>n
X
F() = |dmn |2 nn eimn ,
m>n
where hNik hak ak i are mean photon numbers in the modes and nn are relative
populations of the atomic levels. After substituting (7.376) into (7.375), the rate
of quantum exchange takes the form
X
W = 2~2 c2k |dmn |2 (k mn )[(hNk i + 1)mm hNk inn ]. (7.377)
k,m>n
The three terms of the last factor correspond to the three types of transitions be-
tween each pair of levels according to Einstein (Chapter 2).
By inserting into (7.377) the energy of a quantum ~k , apparently, we find the
power of emission or absorption,
X
P = (2)2 L3 2k |dmn |2 (k mn )(mm nm hNk i), (7.378)
k,m>n
Below, it will be shown (see (7.390), (7.394)) that in the case of a linear or equi-
librium system, the only difference between various types of CFs is the different
contributions of zero-point fluctuations into these functions.
properties:
(s) (a)
F(s) = F(s) = F , F(a) = F(a) = F . (7.386)
Comparing the last expression with Eq. (4.57) for the linear susceptibility (polaris-
ability) of a single atom at = 0, one can see that the anti-symmetric spectral
CF of a stationary system coincides (up to the factor 2/i~) with the imaginary
part of the susceptibility of the system with respect to a coherent excitation,
00 = (2/i~)F(). (7.391)
The response function and its Fourier transform are called the retarded part
of the Greens function of the system. Note that in order to determine the suscep-
tibility of matter, experimentally or theoretically, one requires the field to be in a
coherent quantum state (Sec. 7.5). And vice versa, the definition of the suscepti-
bility of the vacuum (Sec. 4.1) is based on the coherent state of the matter. From
the comparison between (7.390) and (7.365), it follows that G(a) vac
= (ImG ) , i.e.,
the role of in the case of the field is played by the anti-normally ordered CF of
the vacuum.
Recall that 00 , in its turn, unambiguously determines 0 (Sec. 4.1). Thus,
the kinetic parameters of a stationary system are in one-to-one correspondence
with its unperturbed fluctuation characteristics.aa In the case of equilibrium sys-
tems, this relation can be inverted, so that each one of the four CFs of the system
determines the other ones.
where equals unity at > 0 and zero at < 0. The last equation in (7.394)
is called the fluctuation-dissipation theorem (FDT) or the Nyquist-Callen-Welton
theorem.
Let the atom and the field be in equilibrium states with the temperatures T 1
and T 2 , respectively. Then, according to (7.394) and similar equations for field
CFs, (7.388), (7.389) take the form
Z
P = 4i d00 G(a)
(N1 N2 ), (7.395)
0
1
where Nn [exp(~/T n ) 1] . If the temperatures are close, the last factor
in (7.395) scales as T 1 T 2 , then the ratio P/|T 1 T 2 | determines, through 00 ,
the heat conductivity of the atom-field link, which provides the heat exchange
between the two thermostats.
Here, we only considered single-quantum transitions and linear susceptibility,
which are described in the first orders of the perturbation theory. A similar consid-
eration can be performed for multi-quantum transitions, nonlinear susceptibilities,
and higher-order CFs. The corresponding generalizations of the FDT, obtained by
Efremov and Stratonovich, are described in short in Ref. [Klyshko (1980)].
In conclusion, let us mention that the interaction between an atom and a field
leads not only to the energy exchange but also to a certain shift of the energy
levels. In the case of the vacuum state of the field, this shift leads to a variation of
the eigenfrequencies by a value about 109 Hz (for the hydrogen atom). This effect
is called the Lamb shift (see, for instance, Ref. [Allen (1975)]). If the field is in
an excited state, the shift is called the Stark effect in alternating field (5.48). This
phenomenon has to be taken into account in many problems of laser spectroscopy.
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
Bibliography
337
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
Bibliography 339
Agrawal, G. (2007) Nonlinear Fiber Optics, 4th ed. (Academic Press, San Diego).
Bachor, H.-A. and Ralf, T. (2004). A Guide to Experiments in Quantum Optics (Wiley-VCH
Verlag GmbH & Co. KGaA, Weinheim).
Bouwmeester, D., Ekert, A., Zeilinger, A. (Eds.) (2000). The Physics of Quantum
Information (Springer, Berlin).
Boyd, R. (2008) Nonlinear optics, 3rd ed. (Academic Press, San Diego).
Bruss, D. (2002) Characterizing entanglement, Journ. of Math. Phys., 43, p. 4237.
Goodman, J. (1985). Statistical Optics (McGraw-Hill, New York).
Grynberg, G., Aspect, A., Fabre, C. (2010). Introduction to quantum optics: From the
semiclassical approach to quantized fields (Cambridge University Press, New York).
Klyshko, D. N. (1994). Quantum optics: quantum, classical, and metaphysical aspects,
Phys. Usp. 37, pp. 10971122.
Klyshko, D. N. (1998). Basic quantum mechanical concepts from the operational view-
point, Phys. Usp. 41, pp. 885922.
Leonhardt, U. (1997). Measuring the quantum state of light (Cambridge University Press,
Cambridge).
Mandel, L. and Wolf, E. (1995). Optical Coherence and Quantum Optics (Cambridge
University Press, Cambridge).
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
Bibliography 341
Index
343
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
Diracs hyper-parametric
notation, 289 scattering, 12, 221, 228
dispersion, 26, 39, 140, 144
dependence, 12, 77, 81, 83, 8589, 159, intensity
161, 169, 180182, 216, 227 correlation, 266, 267, 269
frequency-angular, 179 interference, 271, 272
induced, 180, 191, 207 interaction
law, 7577, 86, 87, 184, 189 Hamiltonian, 283, 284
linear and nonlinear, 181 of stationary systems, 331
negative, 85 operator, 57, 328
spatial, 72, 76, 139 picture, 6264, 170, 328
theory, 5, 77, 154, 207 inversion
Doppler-free saturation, 196 in semiconductors, 55
Doppler-free spectroscopy, 195, 197 isolated resonance, 82, 83
Index 345
bandwidth, 38, 94, 121 amplification, 214, 215, 224, 225, 230,
broadening, 29, 38, 39, 60 303
non-equilibrium coefficient, 215
matter, 242 amplifier, 220222, 244
radiation, 41, 261, 309 approximation, 213
state, 52, 56 down-conversion, 12, 220, 227, 312,
system, 53 315, 321, 323
systems, 52, 56, 131 spontaneous, 135
thermodynamics, 239 fluorescence, 220
non-parametric oscillation, 214, 224, 225
processes, 135, 136, 179, 180, 182, oscillator, 135, 215
187, 191, 197 processes, 135, 136, 158, 179182,
nonclassical 187189, 199, 207209, 213,
light, 11 219, 226228, 230, 235
nonlinear scattering, 220
absorption, 136, 180, 191 super-fluorescence, 224
dispersion, 180 permutation symmetry, 187
optics, 10 permutative symmetry, 141, 142
spectroscopy, 136, 197, 228, 229 perturbation theory, 15, 19, 20, 23, 25, 32,
susceptibilities, 10, 138, 142, 146, 159, 33, 51, 58, 62, 64, 79, 89, 108, 111,
336 120, 172, 173, 263, 284, 293, 328, 329,
normally ordered 336
characteristic function, 307 photon
correlation functions, 328330, 332, anti-bunching, 262, 270, 315, 318320,
333 326
emission, 118 bunching, 262, 265, 266, 269, 309,
fluctuations, 40 318320
moments, 118, 119, 122, 286, 297, 306, echo, 131
315, 322 kinetics, 28
operators, 295, 306, 307 statistics, 314
number of photons, 39 physics of lasers, 9
nutation, 100, 109, 110, 112114, 126, polaritons, 85, 86, 197, 223, 226
127 population
inversion, 2, 3, 68, 37, 46, 52, 56, 80,
observables, 43, 46, 64, 95, 96, 102, 286, 84, 94, 112, 114, 131, 152,
288, 293, 333 194, 199
one-dimensional kinetics, 28, 36
approximation, 29, 182, 210, 217 populations
equations, 189, 191 in semiconductors, 53
model, 155, 185, 207, 212, 234, 235 of the levels, 3, 4, 8, 17, 18, 20, 2830,
oscillator 32, 36, 37, 43, 5052, 54, 55,
strength, 81, 150 6062, 67, 79, 9092, 96, 98,
oscillator strength, 79 104, 105, 107, 119, 120, 123,
152, 165, 172, 175, 197
paramagnetic resonance, 94, 105, 251
parametric quantum
March 23, 2011 16:14 World Scientific Book - 9in x 6in ws-book9x6
Index 347