Polarization in Spectral Lines
Polarization in Spectral Lines
ASTROPHYSICS AND
SPACE SCIENCE LIBRARY
VOLUME 307
EDITORIAL BOARD
Chairman
W.B. BURTON, National Radio Astronomy Observatory, Charlottesville, Virginia, U.S.A.
([email protected]); University of Leiden, The Netherlands ([email protected])
Executive Committee
J. M. E. KUIJPERS, Faculty of Science, Nijmegen, The Netherlands
E. P. J. VAN DEN HEUVEL, Astronomical Institute, University of Amsterdam,
The Netherlands
H. VAN DER LAAN, Astronomical Institute, University of Utrecht,
The Netherlands
MEMBERS
I. APPENZELLER, Landessternwarte Heidelberg-Königstuhl, Germany
J. N. BAHCALL, The Institute for Advanced Study, Princeton, U.S.A.
F. BERTOLA, Universitá di Padova, Italy
J. P. CASSINELLI, University of Wisconsin, Madison, U.S.A.
C. J. CESARSKY, Centre d'Etudes de Saclay, Gif-sur-Yvette Cedex, France
O. ENGVOLD, Institute of Theoretical Astrophysics, University of Oslo, Norway
R. McCRAY, University of Colorado, JILA, Boulder, U.S.A.
P. G. MURDIN, Institute of Astronomy, Cambridge, U.K.
F. PACINI, Istituto Astronomia Arcetri, Firenze, Italy
V. RADHAKRISHNAN, Raman Research Institute, Bangalore, India
K. SATO, School of Science, The University of Tokyo, Japan
F. H. SHU, University of California, Berkeley, U.S.A.
B. V. SOMOV, Astronomical Institute, Moscow State University, Russia
R. A. SUNYAEV, Space Research Institute, Moscow, Russia
Y. TANAKA, Institute of Space & Astronautical Science, Kanagawa, Japan
S. TREMAINE, CITA, Princeton University, U.S.A.
N. O. WEISS, University of Cambridge, U.K.
POLARIZATION
IN SPECTRAL LINES
by
and
MARCO LANDOLFI
Arcetri Observatory,
Firenze, Italy
No part of this eBook may be reproduced or transmitted in any form or by any means, electronic,
mechanical, recording, or otherwise, without written consent from the Publisher
Quod si tam celebris est apud omnes gloria Adamantis, atque varia ista
opum gaudia, gemmae unionesque, ad ostentationem tantum placent, ut
digitis colloque circumferantur; non minori afficiendos speraverim gaudio
eos, quibus curiositatis conscientia quam deliciarum est potior, novitate
corporis alicujus, instar crystalli translucidi, quod ex Islandia nuper ad
nos perlatum est; cujus tam mira est constitutio, ut haud sciam, num
alias magis naturae apparuerit gratia.
Erasmus Bartholinus, Experimenta crystalli islandici disdiaclastici
Apart from a few objects of our immediate neighborhood (the solar system), all
the information on the physical phenomena taking place in the Universe comes
from the radiation that the astronomical objects send into space and that is finally
collected on earth by telescopes or other instruments. Among the different kinds
of radiation, electromagnetic waves have by far played the most important role in
the history of Astronomy – probably, it is not unrealistic to say that more than
99% of our present knowledge of the Universe derives from the analysis of the
electromagnetic radiation.
Such radiation contains three different kinds of information, encoded into as
many physical characteristics typical of any oscillatory propagation phenomenon:
the propagation direction, the frequency and amplitude of the oscillation, and the
oscillation direction – or polarization.
The first one is the most direct and the easiest to measure: the human eye is
itself a suitable instrument, though of limited accuracy. As time passed, positional
astronomy became more and more accurate thanks to the invention of the opti-
cal telescope, the introduction of photographic and digital techniques and, more
recently, the development of technologies for producing images of a given region
at different wavelengths via radio, infrared, X-ray and γ-ray telescopes, often op-
erating on board of spacecrafts. Our present knowledge of the morphology and
dynamics of the Universe, and of the different objects of which it is composed
(from planets to stars, from nebulae to globular clusters, from galaxies to Active
Galactic Nuclei and to clusters of galaxies) is based on the huge number of such
observations that have been accumulating during several centuries.
However, even if very accurate, the measurement of the propagation direction
of the electromagnetic radiation is inadequate to study other fundamental aspects
of the physical Universe such as the composition, structure, and evolution of the
different objects. To this aim, a detailed analysis of the frequency (or wavelength)
distribution of the energy carried by the electromagnetic radiation is required,
which was made possible by the invention of the spectrograph. Only through the
systematic use of spectroscopic methods it has been possible to obtain a direct
comprehension of the physical mechanisms which govern the equilibrium of stars,
their birth, evolution and death, and the complicated processes taking place in
the interstellar medium and in the nuclei of galaxies. Spectroscopy, which is also
viii PREFACE
at the basis of the idea of an expanding Universe, has played such a key role in
the comprehension of the physical Universe that a new name, astrophysics, was
introduced in the scientific lexicon to denote the astronomical research based on
this technique.
The third, and often neglected, characteristic of the electromagnetic radiation
is polarization. The earliest studies on polarization were performed around 1670
by the Danish scientist Erasmus Bartholinus, who was strongly impressed by the
properties of a newly discovered crystal, the Iceland spar (‘. . . whose behavior is so
surprising that, as far as I know, never the grace of nature appeared more clearly’)
and who immediately realized that those properties could prove useful to improve
human knowledge.
Since polarization is mainly related to the geometrical aspects of the emission
process (rather than to its energetics), and since polarization measurements are
often difficult to perform because of the intrinsic weakness of the signals, the study
of polarization found its place in the astronomical research with some difficulty.
But eventually the prediction of Erasmus Bartholinus was fully confirmed: some of
the most important astronomical discoveries of the 20th century were made thanks
to polarimetry – or, more properly, spectropolarimetry. Suffice it to quote the
discovery of magnetic fields in the sun, stars, and the interstellar medium.
The first application of spectropolarimetry to the astronomical research dates
back to 1908 when, using a Nicol prism as a polarizer and a Fresnel rhomb as a
quarter-wave plate, George Ellery Hale succeeded in taking two spectra of the same
area of a sunspot in opposite directions of circular polarization. The comparison
of the spectra showed the presence of the typical signature induced by a strong
magnetic field, the Zeeman effect.
Since 1908, things have considerably evolved from the technological point of view.
Spectropolarimetric observations of the solar spectrum have now attained a sensi-
tivity level which goes beyond the most optimistic expectations of only two or three
decades ago. The first dedicated instrument for the measurement of Stokes parame-
ters profiles in Fraunhofer lines, the ‘mythic’ Stokes-I polarimeter, developed in the
mid 1970s at the High Altitude Observatory, hardly attained a sensitivity of 1%.
Nowadays, sensitivity in solar spectropolarimetry has reached the level of 10−5 for
spatially unresolved observations, and approximately one order of magnitude less
for observations at high spatial and temporal resolution. It has to be expected that
these limits will be rapidly overcome by the next generation polarimeters and that
the same technologies will be adapted to galactic and extragalactic observations.
The dramatic increase of polarimetric sensitivity in solar observations has raised
a serious challenge to the theoretical interpretation. Polarization in spectral lines
is indeed a subtle phenomenon since, in astrophysical plasmas, there are several
physical mechanisms that can generate polarization signatures in line profiles and
many others that can modify them during the propagation. Some of these mech-
anisms have been known for a long time from laboratory atomic physics. They
are – just to mention the most remarkable – the Zeeman effect, resonance po-
larization, and the Hanle effect. Other mechanisms are characteristic of optically
thick plasmas, and are related to the propagation of radiation in anisotropic media.
PREFACE ix
They are known under the general names of dichroism and anomalous dispersion,
though in special cases different denominations are often used (inverse Zeeman
effect, magneto-optical effects, Faraday rotation, Faraday pulsation, etc.).
These processes have mostly been studied in specific physical contexts, for differ-
ent purposes and at different levels of sophistication, and the scientific literature
on the subject is scattered across books and journals, spanning almost a century
of active research. For this reason we felt that a book capable of describing, in
a unique and self-consistent formalism, all the known physical phenomena that
may affect the polarization signatures of spectral lines, might be welcome to the
scientific community. The diagnostic content of spectropolarimetry is high, but
the correct interpretation of the observations requires a full understanding of the
physics underlying the generation and transfer of polarized radiation.
The redaction of this book required several years. We might try to say, like Huy-
gens in the preface to his Treatise on light, ‘The reason is that I wrote it rather
carelessly in the Language in which it appears, with the intention of translating it
into Latin, so doing in order to obtain greater attention to the thing’,1 but we feel
it would be hardly believed. The true reason is that the theory of spectropolarime-
try is complicated, because it implies some knowledge of several subjects: atomic
physics, quantum mechanics (with special emphasis on the theory of angular mo-
mentum and of the density matrix), quantum electrodynamics, radiative transfer
(both under LTE and Non-LTE conditions).
Moreover, spectropolarimetry is full of traps: among all the disciplines in as-
trophysics, there can hardly be found one where more attention has to be payed
to each single definition, each transformation, each physical application. Sign er-
rors are especially insidious, as most remarkably shown by the classical example of
circular polarization in a given wing of a spectral line formed in a magnetic atmo-
sphere. There are four operations which produce a sign change in such polarization,
and obviously, there are as many possibilities to make a sign error. To have a sign
switch, one can: a) invert the direction of the magnetic field; b) interchange the red
with the blue wing; c) use the opposite definition of positive and negative circular
polarization; d) consider an emission line instead of an absorption line. This is just
an example, but it shows very well the subtleties of the subject. We tried to make
the exposition as clear as possible by using everywhere the same definitions and
conventions, and by carefully describing all the mathematical developments.
We hope that this book may be useful to the next generations of scholars that
will like to enter the field of spectropolarimetry, solar and non-solar. And we hope
that it may contribute to make this research field more accessible and less hermetic,
thus attracting more and more scientists to the fascinanting world of the Stokes
parameters profiles and of their interpretation.
1
Christiaan Huygens, Treatise on light (1690), translated by S.P. Thompson, Dover
Publications, New York, 1962.
CONTENTS
APPENDIX
A1. A Fortran Code for Computing 3-j, 6-j, and 9-j Symbols. . . . . . . . . . . . 791
A2. Sample Evaluation of a Quantity Involving the Contraction
of 3-j Coefficients . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 794
A3. Momentum and Angular Momentum of the Electromagnetic Field . . . 796
A4. Multipole Components of Collisional Rates . . . . . . . . . . . . . . . . . . . . . . . . . . 799
A5. Explicit Expression for the Exponential of the Propagation Matrix . . 803
A6. Diagonalization of the Propagation Matrix . . . . . . . . . . . . . . . . . . . . . . . . . . 809
A7. Formulae for the Calculation of the Evolution Operator . . . . . . . . . . . . . 812
A8. The Feautrier Method: Numerical Details . . . . . . . . . . . . . . . . . . . . . . . . . . . 813
A9. The Diagonal Element Lambda-Operator (DELO) Method:
Numerical Details . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 818
A10. Equivalent Width in the Presence of Depth-Dependent Line Shifts. . . 820
A11. Net Circular Polarization in Blends . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 822
A12. Evolution Operator in Stochastic Media. . . . . . . . . . . . . . . . . . . . . . . . . . . . . 824
A13. Properties of the Generalized Profiles . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 829
A14. Properties of the Symbol [WKK Q (β L Sβu Lu ; B)]fs . . . . . . . . . . . . . . . . . 835
A15. A Property of the Hopf Function . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 838
A16. A Numerical Algorithm for the Solution of the Hopf Equation . . . . . . . 840
A17. Symmetry Properties of the Comoving-Frame Radiation Field
Tensor for a Cylindrically Symmetrical Atmosphere . . . . . . . . . . . . . . . . . 841
CONTENTS xvii
REFERENCES . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 869
According to Maxwell’s equations, the electric and magnetic fields associated with
a radiation beam propagating in vacuum are perpendicular to each other and to
the direction of propagation. The phenomena of polarization are connected with
the possibility that is left to the electric field vector (or, alternatively, to the asso-
ciated magnetic field vector) of spanning the plane perpendicular to the direction
of propagation.
The polarization properties of a radiation beam can be described in several dif-
ferent ways, each of them having its proper advantages and disadvantages. In the
following we will go through some of these descriptions showing how they are in-
terrelated and, most of all, how they can be translated into operational definitions
having a direct physical meaning in terms of measured quantities.
where n is the unit vector in the direction of propagation, and where E and B are
both measured in c.g.s. units (the Gauss-Hertz system, that will be used – with
some few exceptions – throughout the whole book). As far as the cross product
is concerned, we will follow the almost universally accepted convention that is
referred to as the right-hand (or screwdriver ) rule, and that is illustrated in Fig. 1.1.
However, we remind the reader that the sign in Eq. (1.1) is related to the sign
conventions which have been historically adopted to define the positive directions
for E and B. These are, in turn, connected with the historical convention of
adopting the positive sign for the proton charge and, in magnetostatic experiments,
the positive sign for the idealized monopole which is found at the North-end of a
dipole oriented in the earth magnetic field (Maxwell, 1873). Note that, according
to this definition, the earth presently shows a South magnetic pole close to the
North geographic pole.
To define the polarization properties of an electromagnetic wave we choose its
representation in terms of electric field vibrations. Again, there are historical rea-
2 CHAPTER 1
Fig.1.1. Mutual relationship among instantaneous electric field vector, instantaneous magnetic
field vector and direction of propagation of an ordinary electromagnetic wave. The cross product
is defined according to the right-hand convention.
sons behind this choice; the description in terms of electric vibrations is nowadays
preferred because polarization measurements are mostly performed, at least in the
optical spectrum, by means of materials whose interaction with the electromagnetic
radiation is dominated by the electric vector. However, a description in terms of
magnetic vibrations would be as appropriate as the former.
In the right-handed coordinate system introduced above, with its x-axis pointing
in an arbitrary direction in the plane perpendicular to the direction of propagation,
the electromagnetic wave is described by the following expressions
Ex (r, t) = E1 cos(kz − ωt + φ1 )
Ey (r, t) = E2 cos(kz − ωt + φ2 ) , (1.2)
where E1 , E2 , φ1 , and φ2 are four positive constants specifying the amplitudes and
phases of the electric oscillations, and where k and ω have the usual meaning of
wavenumber and angular frequency.
In a given plane that is held fixed in space, for instance in the plane z = 0, the
electric field vector oscillates according to the equations
Ex (t) = E1 cos(ωt − φ1 )
Ey (t) = E2 cos(ωt − φ2 ) . (1.3)
The tip of the electric field vector rotates in the x-y plane describing an ellipse that
is called the polarization ellipse and whose characteristic parameters can be found
with the following algebraic manipulations.
Let us consider the couple of axes (x y ) that are obtained by rotating the old
couple (xy) through an angle α measured positively from the x-axis to the y-axis
(see Fig. 1.2). In this new system, with the position
C = cos α , S = sin α
we have
Ex (t) = (E1 C cos φ1 + E2 S cos φ2 ) cos ωt
+ (E1 C sin φ1 + E2 S sin φ2 ) sin ωt
We now choose the angle α so as to make the x and y axes coincide with the
proper axes of the polarization ellipse, the long axis being directed along x . If a
and b are the semi-major and semi-minor axes, respectively, we have
where the phase φ0 must be determined in such a way that a > 0 and a ≥ |b|.
Comparing Eqs. (1.4) and (1.5) we obtain
a2 + b2 = E12 + E22
ab = −E1 E2 sin(φ1 − φ2 )
a2 − b2 = (E12 − E22 ) cos 2α + 2E1 E2 cos(φ1 − φ2 ) sin 2α
(E12 − E22 ) sin 2α = 2E1 E2 cos(φ1 − φ2 ) cos 2α . (1.6)
Equations (1.6) show that the geometrical properties of the polarization ellipse
depend on four bilinear combinations of the electric field components. Introducing
the notations
PI = E12 + E22
PQ = E12 − E22
PU = 2E1 E2 cos(φ1 − φ2 )
PV = 2E1 E2 sin(φ1 − φ2 ) , (1.7)
4 CHAPTER 1
PQ sin 2α = PU cos 2α
a2 − b2 = PQ cos 2α + PU sin 2α ≥ 0 .
where ⎧
⎪
⎨0 if PQ > 0 and PU ≥ 0
α0 = π if PQ > 0 and PU < 0 (1.8b)
⎪
⎩ π/2 if P < 0 ,
Q
and for PQ = 0
1
4π if PU > 0
α= 3
(1.8c)
4π if PU < 0 .
the values of a and b can be found as the solutions of the second degree equation
1
x2 − PI − PV x − PV = 0 ,
2
which gives
1
a= PI − PV + PI + PV
2
1
b= PI − PV − PI + PV . (1.9)
2
Note that the four quantities defined in Eqs. (1.7) are not independent, being
related by the expression
PI2 = PQ2 + PU2 + PV2 . (1.10)
DESCRIPTION OF POLARIZED RADIATION 5
For some special values of the parameters E1 , E2 , and (φ1 − φ2 ), the polarization
ellipse degenerates either into a segment or into a circle. The first case takes place
when PV = 0, which means that either E1 or E2 vanish, or, alternatively, that
φ1 = φ2 or φ1 = φ2 ± π. From Eqs. (1.9) we get
a = PI , b=0,
while the angle α can still be deduced from Eqs. (1.8). In this case the monochro-
matic wave is said to be linearly polarized, and its electric field oscillates in the
constant plane containing the direction of propagation and the direction, in the
normal plane, characterized by the angle α.
The other case corresponds to PQ = PU = 0, which means that E1 = E2 and that
(φ1 − φ2 ) = ± π/2; in other words the electric oscillations have the same amplitude
along the x and y axes and are in phase quadrature. For the fourth parameter we
then obtain
PV = ± PI ,
and correspondingly
a= PI /2 , b = ∓ PI /2 .
In both cases the polarization ellipse degenerates into a circle. When PV = +PI ,
the tip of the electric field vector rotates clockwise for an observer facing the ra-
diation source, as apparent from Eqs. (1.5) being a > 0, b < 0. Conversely, when
PV = −PI the rotation is counterclockwise as seen by the same observer. In this
book we will adopt the convention of referring to the first case as positive (or right-
handed ) circular polarization, and to the second case as negative (or left-handed )
circular polarization. In the first case a snapshot of the electromagnetic wave shows
that the end point of the electric field vector draws a helix that fits the thread of
a usual, right-handed screw; in the second case we have a left-handed screw.
Our convention, which is summarized in Fig. 1.3, agrees with those proposed in
the classical textbooks on polarized light by Shurcliff (1962) and by Clarke and
Grainger (1971). The same convention is also used, although with some few excep-
tions, by optical astronomers working in the field of polarimetry. Many radioas-
tronomers, on the other hand, use the opposite convention, so that the situation
in this field is still rather confusing (see for instance Clarke, 1974).
The sign of circular polarization is also connected with the helicity (or spin) of
photons. This connection is discussed in some detail in the following of this book
(see Sect. 4.4 and App. 3).
positive, negative,
right-handed left-handed
Fig.1.3. Conventions for defining the sign and handedness of circular polarization. The tip of the
electric field vector rotates, as a function of time, in a fixed plane perpendicular to the direction
of propagation; the observer, receiving the radiation into his/her eye, sees a clockwise (a) or
counterclockwise (b) rotation. The corresponding snapshots are shown in the lower half of the
figure.
vibration can be described using complex numbers, with the usual convention that
the physical quantity is represented by the real part of the complex number. The
first of Eqs. (1.2) can then be rewritten in either of the two equivalent forms (i is
the imaginary unit)
i(kz−ωt)
Ex (r, t) = Re E1 e (1.11)
where E1 = E1 eiφ1 , or
i(−kz+ωt)
Ex (r, t) = Re E1 e (1.12)
where E1 = E1 e−iφ1 is the complex conjugate of E1 . Here and in the following we
will use the further convention of describing oscillating quantities with a temporal
exponential of the form e−iωt (rather than eiωt ), so that the expression for Ex (r, t)
in terms of complex notations is that of Eq. (1.11). With the same convention, we
have for the electric oscillation along the y-axis
i(kz−ωt)
Ey (r, t) = Re E2 e (1.13)
with E2 = E2 eiφ2 .
Using the complex quantities E1 and E2 it is possible to introduce a 2×2 Hermitian
matrix J, called the polarization tensor , defined by
with
∗
Jij = Jji . (1.14)
The four parameters describing the geometrical properties of the polarization ellipse
(Eqs. (1.7)) are related to the components of the polarization tensor by the simple
DESCRIPTION OF POLARIZED RADIATION 7
expressions
PI = E1∗ E1 + E2∗ E2
PQ = E1∗ E1 − E2∗ E2
PU = E1∗ E2 + E2∗ E1
PV = i (E1∗ E2 − E2∗ E1 ) . (1.15)
Note that a minus sign would appear in the expression for PV if Eq. (1.12) were
used.
After inversion of Eqs. (1.15) the polarization tensor can be written as
1 PI + PQ PU − iPV
J= ,
2 PU + iPV PI − PQ
which can be cast into a more synthetic form. Defining the formal vector
⎛ ⎞ ⎛ ⎞
P0 PI
⎜ P1 ⎟ ⎜ PQ ⎟
⎝ ⎠=⎝ ⎠ ,
P2 PU
P3 PV
and introducing the 2 × 2 identity matrix σ0 and the Pauli spin matrices σ1 , σ2 ,
and σ3 , we can write1
1
3
J= P σ , (1.16)
2 i=0 i i
where
1 0 1 0 0 1 0 −i
σ0 = , σ1 = , σ2 = , σ3 = (1.17)
0 1 0 −1 1 0 i 0
with
σj σk = δjk σ0 + i jkl σl (j, k, l = 1, 2, 3) . (1.18)
l
In the last equation δjk is the Kronecker symbol – equal to 1 for j = k and to 0
for j = k – and jkl is the antisymmetric (or Levi-Civita) tensor – equal to +1 if
(j, k, l) is an even permutation of (1, 2, 3), to −1 if (j, k, l) is an odd permutation
of (1, 2, 3), and to 0 if two (or three) indices are equal.
Multiplying Eq. (1.16) by σj and using the relation
1
The representation that we have chosen here for the Pauli spin matrices differs from the
standard one, initially introduced by Pauli (1927) and then followed in classical textbooks on
Quantum Mechanics (Schiff, 1949; Messiah, 1961; etc.). The matrices introduced by Pauli (σx ,
σy , σz ) are connected with the ones defined here by the relations: σx = σ2 , σy = σ3 , σz = σ1 .
8 CHAPTER 1
Pj = Tr (σj J ) . (1.20)
det J = 0 . (1.21)
The pure monochromatic wave considered in Sect. 1.1 is nothing but a mathemati-
cal abstraction; in the physical world one always deals with radiation beams having
a non-zero angular spread and a finite frequency bandwidth. Such beams can be
described using wave packets of the form
r, t) = Re
E( k ) e i(k ·r−ω t) n(k ) d3k
E( ,
k ) · k = 0 ,
E(
and where n(k ) is the number density of waves in the three-dimensional wavenum-
ber space.
We now suppose the wave packet to be confined into a small range of wavenum-
bers ∆k centered around k. This means that our packet has an angular spread
in the solid angle ∆Ω ≈ |∆k|/|k| and a finite frequency bandwidth ∆ω ≈ c |∆k|.
Writing
k = k + δk , ω = ω + δω
we obtain
r, t) = Re e i(k·r−ωt) k + δk ) e i(δk·r−δω t) n(δk ) d3 (δk ) .
E( E( (1.22)
Σ ∆Ω
Fig.1.4. Coordinate system for defining the various components of the electric vibration of a
quasi-monochromatic wave. The vector k is directed along the z-axis.
where E1 (P, t) and E2 (P, t) are the x and y components, respectively, of the vector
resulting from the evaluation of the integral in Eq. (1.22). The third component
Ez (P, t) is much smaller than the x and y components (a typical ratio Ez /Ex being
of order ∆Ω) and will be neglected in the following.
If the linear dimensions L of the surface Σ are such that L|∆k| 1, the quan-
tities E1 (P, t) and E2 (P, t) are weakly varying functions of the point P. Moreover,
they are slowly varying functions of time, having temporal variations on typical
time-scales much larger than the period of the wave. As a result, the tip of the
electric field vector at the point P describes a polarization ellipse whose character-
istic parameters vary slowly with time; these parameters will also show a smooth
variation from point to point on the surface Σ.
Obviously, it is impossible to define an instantaneous polarization ellipse in this
situation. It is however possible to define appropriate average quantities by gener-
alizing Eqs. (1.15) in the form
PI = E1∗ (P, t) E1 (P, t) + E2∗ (P, t) E2 (P, t)
PQ = E1∗ (P, t) E1 (P, t) − E2∗ (P, t) E2 (P, t)
PU = E1∗ (P, t) E2 (P, t) + E2∗ (P, t) E1 (P, t)
PV = i E1∗ (P, t) E2 (P, t) − E2∗ (P, t) E1 (P, t) , (1.24)
where the symbol · · · means an average over a time interval much longer than
the wave period and an average over the surface Σ.
Similarly, one can generalize the definition of the polarization tensor by writing
Jij = Ei∗ (P, t) Ej (P, t) , (1.25)
For the average value of its square modulus we have, with easy transformations
|A|2 = E1∗ (P, t) E1 (P, t) E1∗ (P, t) E1 (P, t) E2∗ (P, t) E2 (P, t)
− E1∗ (P, t) E2 (P, t) E2∗ (P, t) E1 (P, t) .
From this equation, excluding the trivial case E1 (P, t) ≡ 0, we conclude that, being
∗ 2
E1 (P, t) E1 (P, t) > 0 and |A| ≥ 0 ,
we must have
∗
E1 (P, t) E1 (P, t) E2∗ (P, t) E2 (P, t) − E1∗ (P, t) E2 (P, t) E2∗ (P, t) E1 (P, t) ≥ 0 ,
that is
det J = PI2 − PQ2 − PU2 − PV2 ≥ 0 . (1.26)
this means either that one of the two components is identically zero, or that the
two components have, at any point P and any time t, the same amplitude ratio and
phase difference. In both cases the quasi-monochromatic wave is said to be totally
polarized. The opposite situation occurs when the two components E1 (P, t) and
E2 (P, t) have the same average amplitude and random phase difference, so that
E1∗ (P, t) E1 (P, t) = E2∗ (P, t) E2 (P, t)
∗
E1 (P, t) E2 (P, t) = E2∗ (P, t) E1 (P, t) = 0 .
and the quasi-monochromatic wave is said to be totally unpolarized (if the frequency
of the radiation beam is in the visible band of the spectrum, the beam is said to be
composed of natural light ). In intermediate cases the quasi-monochromatic wave
is said to be partially polarized with a polarization degree given by
p= PQ2 + PU2 + PV2 PI ,
with 0 ≤ p ≤ 1.
DESCRIPTION OF POLARIZED RADIATION 11
Fig.1.5. For a partially polarized quasi-monochromatic wave, the tip of the electric field vector
at a given point P describes, as a function of time, the trajectory shown in this graph. Note that
the characteristic shape of the polarization ellipse has a slow variation with time. The electric
field is given in arbitrary units.
In Figs. 1.5 and 1.6 the trajectory of the tip of the electric field vector is plotted
for two schematic situations corresponding to a partially polarized and a totally
polarized quasi-monochromatic wave, respectively.
Fig.1.6. Same as Fig.1.5 for a totally polarized quasi-monochromatic wave. Although the size of
the ellipse changes with time, its characteristic shape (i.e. the α angle defined in Fig.1.2 and the
ratio between the axes) remains fixed.
where Ea and Eb are the components, at the entrance of the polarizer, along the
transmission axis and the orthogonal axis, respectively; Ea and Eb are the same
components at the exit of the polarizer, and ψ is a phase difference that is totally
inessential in determining the polarization properties of the beam.
The simplest device whose performances approach in practice those of an ideal
polarizer (in the optical range of the electromagnetic spectrum) is the well-known
Polaroid which has been developed commercially in various forms by the Polaroid
Corporation, Cambridge, Massachusetts, U.S.A. This device is based on the prop-
erty shown by some organic, needle-shaped microcrystals of absorbing light prefer-
entially along the longest axis of the crystal. Roughly speaking, a Polaroid consists
of a large number of such microcrystals oriented through a procedure of unidirec-
tional stretching. Since the basic physical phenomenon involved in the operation
of a Polaroid is dichroism (the property of absorbing light to different extents de-
pending on the polarization of the incident beam), the Polaroid is often referred to
as a dichroic polarizer .
Linear polarizers of a different kind are the so-called birefringence polarizers
which consist of one or more birefringent crystals (generally calcite) suitably pre-
pared and arranged into a compact system. These polarizers operate according to
the following scheme: the incident beam is divided in two orthogonally polarized
components that propagate along different directions in the birefringent crystal;
one of the two components is physically removed by absorption or lateral deflec-
tion, so that a single, linearly polarized component is transmitted through the
whole system. Examples of birefringence polarizers are those bearing the names of
DESCRIPTION OF POLARIZED RADIATION 13
i(nf kz−ωt)
Ef (z, t) = Re Ef e
(0 ≤ z ≤ l)
i(ns kz−ωt)
Es (z, t) = Re Es e ,
where z is the coordinate along the optical axis of the retarder with the origin
(z = 0) at the entrance, Ef and Es are the complex amplitudes of the electric
vibration along the fast and slow axes, respectively, and k is the wavenumber in
vacuum.
Comparing the electric vibrations at the exit of the retarder with those at the
entrance, we have with obvious notations
Ef iψ 1 0 Ef iψ Ef
= e = e iδ , (1.27)
Es
iδ
0 e Es e Es
where ψ is a phase that does not affect the polarization properties of the wave,
while δ, the so-called retardance (or retardation), is given by
δ = (ns − nf ) kl = 2π (ns − nf ) l λ , (1.28)
λ being the wavelength in vacuum. When the thickness of the retarder is such that
δ = π/2, the retarder is called a quarter-wave plate, and, similarly, when δ = π the
retarder is called a half-wave plate.
It is interesting to note that an ideal quarter-wave plate followed by an ideal linear
polarizer whose transmission axis is directed at 45◦ from the fast axis in the coun-
terclockwise direction (as seen by an observer facing the radiation source) behaves
14 CHAPTER 1
fast axis
direction of
propagation
slow
axis
transmission
axis
quarter-wave linear
plate polarizer
Fig.1.7. Practical realization of a filter transparent to positive circular polarization and opaque
to negative circular polarization.
like an ideal filter that is totally transparent to positive (or right-handed) circular
polarization and totally opaque to negative (or left-handed) circular polarization.1
To prove this statement, let us consider a circularly polarized monochromatic
wave propagating along the direction shown in Fig. 1.7. If the wave is totally
circularly polarized in the positive direction, the components of the electric field
vibration along the unit vectors ex and ey before entering the retarder are (in
complex notations)
−iωt
Ex = Re A e
−iωt
Ey = Re −iA e , (1.29)
where A is a constant. At the exit of the retarder, we have from Eq. (1.27) (apart
from an inessential phase factor)
−iωt
Ex = Ey = Re A e , (1.30)
thus the component of the electric vibration along the transmission axis of the
polarizer is given by
1 √ −iωt
Ea = √ Ex + Ey = 2 Re A e .
2
Since this component is simply transmitted through the linear polarizer, the elec-
tric vibration at the exit is still described by Eq. (1.30). Therefore, although the
1
Obviously, when the transmission axis is inclined at 45 ◦ in the opposite direction, the optical
combination behaves in the opposite way with respect to circular polarization.
DESCRIPTION OF POLARIZED RADIATION 15
polarization has been transformed from circular to linear, the energy carried by
the wave has not been changed by the system (the overbar means a time average)
It follows that the device schematized in Fig. 1.7 is totally transparent to positive
circular polarization.
Applying the same argument to a monochromatic wave of negative circular po-
larization (Ey in Eq. (1.29) must be changed in sign) we obtain that the wave is
totally absorbed inside the polarizer.
Most of the retarders that are used in practice are of the birefringent type; they
often consist of a crystal, like quartz or calcite, cut parallel to the optical axis. More
recently piezo-optical birefringence modulators have also been used. In these devices
the birefringence is induced by compressing or distending a material (otherwise
isotropic and uniform) along a given direction. According to Kemp (1969) a very
modest force, that can even be attained by simple finger pressure, is enough to
produce a quarter-wave plate by stressing a block of glass having the size of a
match-box.
Other devices that have been often used to produce retarders with variable or
modulated retardance are the so-called Pockels cells. Here the birefringence is
induced by application of high electric fields (of the order of several kV/cm) in ma-
terials such as ammonium dihydrogen phosphate (ADP) or potassium dihydrogen
phosphate (KDP).
dW0 = I dS dt dΩ dν .
This is just the usual definition of the radiation specific intensity, so that the first
Stokes parameter is simply called the intensity of the beam.
Let’s now insert in the beam path an ideal linear polarizer with its transmission
axis directed in succession at 0◦ , 45◦ , 90◦ , and 135◦ from the reference axis, all the
16 CHAPTER 1
angles being reckoned counterclockwise for an observer looking at the beam from
the detector. If dW1 , dW2 , dW3 , and dW4 are the corresponding energies measured
by the detector in the time interval dt over the surface dS, the second and third
Stokes parameters, Q and U , are defined by
dW1 − dW3 = Q dS dt dΩ dν
dW2 − dW4 = U dS dt dΩ dν .
Finally, for the definition of the fourth parameter we introduce two ideal filters,
the first being totally opaque to negative (left-handed) circular polarization and
totally transparent to positive (right-handed) circular polarization, and the second
behaving in the opposite way (such filters can be realized as shown in the former
section). If dW5 and dW6 are the energies measured by the detector (in the time
interval dt over the surface dS) with the first and second filters – respectively –
interposed, the definition of the fourth parameter, V , is the following
dW5 − dW6 = V dS dt dΩ dν .
The symbols used in this book to represent the Stokes parameters (I, Q, U , V )
were introduced for the first time by Walker (1954). In his original note, Stokes
(1852) used the symbols (A, B, C, D), while Jones (1941) and Perrin (1942) used the
notation (I, M, C, S). Walker’s notation seems nowadays to be preferred especially
in the astrophysical literature.
The operational definitions given above are not universally adopted. In partic-
ular, the opposite sign is used in the definition of V by those authors who name
positive circular polarization the one that we have defined as negative and vice
versa (see the discussion at the end of Sect. 1.2). Less frequently a sign inversion is
found in the definition of U . Figure 1.8 summarizes the conventions that are used
in this book.
Now that the Stokes parameters have been defined, it is necessary to establish
their connection with the description, given in the previous sections, of the polar-
ization properties of an electromagnetic wave in terms of electric fields. Since the
energy flux carried by an electromagnetic wave is given by its Poynting vector, the
energy ∆W measured in the time interval ∆t by an ideal detector having cross-
sectional area ∆S is connected with the electric field components by the relation
c 2
∆W = Ex + Ey2 ∆S ∆t ,
4π
where Ex and Ey are the components of the electric field along two axes perpen-
dicular to each other and to the direction of propagation, and where the brackets
mean an average over the time interval ∆t and over the surface ∆S of the detector.
Using Eqs. (1.3), (1.11), and (1.13), we can also write
c 2 c ∗
∆W = E1 + E22 ∆S ∆t = E1 E1 + E2∗ E2 ∆S ∆t . (1.31)
8π 8π
DESCRIPTION OF POLARIZED RADIATION 17
reference
direction
Fig.1.8. Schematic representation of the definition of the Stokes parameters. The observer is
supposed to face the radiation source.
∗ c
∆W0 = E1 (P, t) E1 (P, t) + E2∗ (P, t) E2 (P, t) ∆S ∆t ;
8π
b) with a linear polarizer interposed, having its transmission axis at 0◦ , 45◦ , 90◦ ,
and 135◦ , respectively
c
∆W1 = E1∗ (P, t) E1 (P, t) ∆S ∆t
8π
1 ∗
∆W2 = E1 (P, t) E1 (P, t) + E2∗ (P, t) E2 (P, t)
2
c
+ E1∗ (P, t) E2 (P, t) + E2∗ (P, t) E1 (P, t) ∆S ∆t
8π
c
∆W3 = E2∗ (P, t) E2 (P, t) ∆S ∆t
8π
1
In the following, ea will be referred to as the reference direction unit vector, while eb will
be referred to as the associated unit vector. Note that the definition of the Stokes parameters
remains the same if we replace ea by −ea and eb by −eb .
18 CHAPTER 1
1 ∗
∆W4 = E1 (P, t) E1 (P, t) + E2∗ (P, t) E2 (P, t)
2
c
− E1∗ (P, t) E2 (P, t) − E2∗ (P, t) E1 (P, t) ∆S ∆t ;
8π
1 ∗
∆W5 = E1 (P, t) E1 (P, t) + E2∗ (P, t) E2 (P, t)
2
c
+ i E1∗ (P, t) E2 (P, t) − i E2∗ (P, t) E1 (P, t) ∆S ∆t
8π
1 ∗
∆W6 = E1 (P, t) E1 (P, t) + E2∗ (P, t) E2 (P, t)
2
c
− i E1∗ (P, t) E2 (P, t) + i E2∗ (P, t) E1 (P, t) ∆S ∆t .
8π
From these expressions, taking the limit for infinitesimal values of ∆S and ∆t, and
bearing in mind Eqs. (1.24) and the operational definitions of the Stokes parame-
ters, we obtain
I = kPI = k E1∗ (P, t) E1 (P, t) + E2∗ (P, t) E2 (P, t)
Q = kPQ = k E1∗ (P, t) E1 (P, t) − E2∗ (P, t) E2 (P, t)
U = kPU = k E1∗ (P, t) E2 (P, t) + E2∗ (P, t) E1 (P, t)
V = kPV = k i E1∗ (P, t) E2 (P, t) − E2∗ (P, t) E1 (P, t) , (1.32)
The Stokes parameters can be measured by several different techniques, and the
various instruments that have been devised for this purpose cannot be classified in
a unique scheme. Following Hauge (1976) it is however possible to describe a kind
of prototype instrument that summarizes the essential characteristics of most of
DESCRIPTION OF POLARIZED RADIATION 19
reference
retarder polarizer
direction
α detector
β
transmission
axis
fast axis
the devices that are used in practice, especially for the measurement of the Stokes
parameters in the visible or near-infrared range of the electromagnetic spectrum.
Referring to Fig. 1.9, if E1 and E2 are (in complex notations) the electric vibrations
along the unit vectors ea and eb (where ea is the reference direction), we have, after
the beam has crossed the retarder (inessential phase factors are omitted)
Ef = E1 cos α + E2 sin α
iδ
Es = e (−E1 sin α + E2 cos α) ,
where Ef and Es are the components of the electric vibrations along the fast and
slow axes, respectively, and δ is the retardance. The only component transmitted
by the polarizer is the one parallel to the transmission axis. This component is
given by
Ep = cos(β − α) Ef + sin(β − α) Es ,
so that for the signal D of the detector we obtain after some algebra
1
D(α, β, δ) = k Ep∗ Ep = I + (Q cos 2α + U sin 2α) cos 2(β − α)
2
− (Q sin 2α − U cos 2α) sin 2(β − α) cos δ
A first method for measuring the Stokes parameters is to set the various devices
at particular angles. If we are concerned with linear polarization only, the retarder
can be removed (δ = 0◦ ), or, alternatively, its fast axis can be set parallel to the
transmission axis of the polarizer (α = β). In both cases the signal D of the
detector takes the simpler form
1
D (β) = I + Q cos 2β + U sin 2β ,
2
20 CHAPTER 1
and the Stokes parameters I, Q, and U can be found by the combination of four
measurements
I = D (0◦ ) + D (90◦ )
Q = D (0◦ ) − D (90◦ )
U = D (45◦ ) − D (135◦ ) ,
2 ◦
I= D (0 ) + D (60◦ ) + D (120◦ )
3
2
Q= 2D (0◦ ) − D (60◦ ) − D (120◦ )
3
2
U = √ D (60◦ ) − D (120◦ ) .
3
1
D (β) = I + Q cos 2β + V sin 2β ,
2
For circular polarization, keeping the polarizer fixed at β = 0◦ , one gets (again
using a quarter-wave plate)
1
D (α) = I + Q cos2 2α + U sin 2α cos 2α − V sin 2α ,
2
and the Stokes parameters I, Q, U , and V can be found by the combination of five
measurements
and the various Stokes parameters can be measured (for a stationary source) by
extracting the Fourier components of the signal D(t).
A third method is to use a compensator with modulated retardance (like a Pockels
cell) while keeping the optical components at fixed orientations. To maximize
the modulation effect it is convenient to set (β − α) = ±45◦ , and two sets of
measurements are necessary to get the four Stokes parameters. For instance, setting
α = 0◦ and β = 45◦ we have
1
D(t) = I + U cos δ(t) + V sin δ(t) ,
2
1
D(t) = I + Q cos δ(t) + V sin δ(t) ,
2
and in both cases three of the four Stokes parameters can be obtained by an analysis
of the modulated signal.
We want to remark that the measurement of Stokes parameters is an art in itself
that can be pushed to a very high degree of sophistication. The few words that
have been spent here on this subject are not meant to be complete, but just to give
the reader an intuitive grasp on how this kind of measurements are performed in
practice.
22 CHAPTER 1
In Sect. 1.6 we have found the relation between the Stokes parameters and the
components of the electric vibration along two orthogonal unit vectors, ea and
eb , with ea the reference direction unit vector and eb the associated unit vector.
Equations (1.32) can be rewritten in terms of the polarization tensor defined in
Sects. 1.3 and 1.4. From Eq. (1.25) we have
I = k J11 + J22
Q = k J11 − J22
U = k J12 + J21
V = k i J12 − J21 . (1.34)
1
3
J= S σ
2k j=0 j j
Sj = k Tr σj J (j = 0, . . . , 3) , (1.35)
I = I11 + I22
Q = I11 − I22
U = I12 + I21
V = i I12 − I21 (1.39)
and
1
3
I= S σ
2 j=0 j j
Sj = Tr σj I (j = 0, . . . , 3) . (1.40)
If the unit vectors chosen to define the polarization tensor differ from (ea , eb ), the
relations between the Stokes parameters and the components Iij are not as simple
as in Eqs. (1.39). Let us consider, in particular, the (complex) unit vectors e+ and
e− defined by
iφ
e+ = cos θ ea + sin θ e eb
iφ
e− = − sin θ ea + cos θ e eb , (1.41)
with
E1 = cos θ E+ − sin θ E−
iφ
E2 = e sin θ E+ + cos θ E− ,
and simple relations can be established between the ‘old’ polarization tensor Iij =
k Ei∗ Ej (i, j = 1, 2) and the ‘new’ polarization tensor Iαβ = k Eα∗ Eβ (α, β =
+, −)
I11 = cos2 θ I++ + sin2 θ I−− − sin θ cos θ I+− + I−+
I22 = sin2 θ I++ + cos2 θ I−− + sin θ cos θ I+− + I−+
∗ iφ
I12 = I21 = e sin θ cos θ I++ − I−− + cos2 θ I+− − sin2 θ I−+ ,
24 CHAPTER 1
and conversely
−iφ iφ
I++ = cos2 θ I11 + sin2 θ I22 + sin θ cos θ e I12 + e I21
−iφ iφ
I−− = sin2 θ I11 + cos2 θ I22 − sin θ cos θ e I12 + e I21
∗
−iφ iφ
I+− = I−+ = − sin θ cos θ I11 − I22 + cos2 θ e I12 − sin2 θ e I21 .
Using these transformations we can express the Stokes parameters in the form
I = I++ + I−−
Q = cos 2θ I++ − I−− − sin 2θ I+− + I−+
U = sin 2θ cos φ I++ − I−− + cos 2θ cos φ I+− + I−+
+ i sin φ I+− − I−+
V = − sin 2θ sin φ I++ − I−− − cos 2θ sin φ I+− + I−+
+ i cos φ I+− − I−+ , (1.42)
I++ + I−− = I
I++ − I−− = cos 2θ Q + sin 2θ cos φ U − sin 2θ sin φ V
I+− + I−+ = − sin 2θ Q + cos 2θ cos φ U − cos 2θ sin φ V
i I+− − I−+ = sin φ U + cos φ V . (1.43)
These formulae show that it is possible to find a particular couple of complex unit
vectors of the form (1.41) such that the polarization tensor is diagonal. In fact, it
can be easily shown that for φ and θ implicitly defined by
tan φ = −V /U , tan 2θ = V 2 + U2 Q, (1.44)
I++ + I−− = I
I++ − I−− = ± Q2 + U 2 + V 2
I+− = I−+ = 0 ,
where the sign ambiguity in the right-hand side of the middle equation is connected
with the determination chosen for the φ and θ angles in Eqs. (1.44). On this basis
of complex unit vectors the electric vibration is decomposed in two independent
orthogonal components.
DESCRIPTION OF POLARIZED RADIATION 25
old reference
direction
new reference
direction
Fig.1.10. The rotation of the reference direction is specified by the angle α. The radiation beam
is propagating from behind the page to the reader’s eye.
Since the Stokes parameters are defined with respect to a given reference direction,
we must establish how they are changed when a different reference direction is
chosen. Referring to Fig. 1.10, and denoting by (I, Q, U, V ) and (I , Q , U , V ) the
Stokes parameters relative to the old and new reference direction, respectively, the
transformation law can be established with the help of the equations derived in the
previous section. Substituting α for θ and 0 for φ into Eqs. (1.41), and interpreting
the unit vectors e+ and e− as ea and eb , respectively, we obtain from Eqs. (1.43)
and (1.39)
I = I
Q = cos 2α Q + sin 2α U
U = − sin 2α Q + cos 2α U
V =V . (1.45)
These transformations show that I and V are invariant under rotation of the refer-
ence direction, while the linear polarization parameters undergo a rotation through
an angle 2αin the Q-U plane, which leaves unchanged the total linear polariza-
tion PL = Q2 + U 2 . It is also apparent that the reference direction has a 180◦
ambiguity, an obvious consequence of the definition of the Stokes parameters.
Introducing the position angle α0 , defined by
Q = PL cos 2α0
U = PL sin 2α0 , (1.46)
the former equations for Q and U become
Q = PL cos 2(α0 − α)
U = PL sin 2(α0 − α) .
26 CHAPTER 1
The two quantities PL and α0 are often used to characterize the linear polariza-
tion in place of Q and U . In astronomical observations the reference direction is
generally taken along the meridian through the observed object, and the position
angle is measured counterclockwise from the same meridian. In solar observations
the reference direction is sometimes chosen to represent a physical direction, like,
for instance, the radial direction from the center of the sun to the observed point,
or the parallel to the solar limb.
Another fundamental property of the Stokes parameters is their additivity for
incoherent beams. This property can be proved quite easily from Eqs. (1.32).
Indeed, if the electric vibrations of two beams are described (in complex notations)
by Ei(1) (P, t) and Ei(2) (P, t), respectively, and if the two beams are incoherent, so
that
(1)
Ei (P, t)∗ Ej(2) (P, t) = Ei(2) (P, t)∗ Ej(1) (P, t) = 0 (i, j = 1, 2) ,
Sk = Sk(1) + Sk(2) (k = 0, . . . , 3) ,
where Sk(1) and Sk(2) are the Stokes parameters of the separate beams. This property,
which can obviously be generalized to an arbitrary number of incoherent beams,
will be referred to in the following as the addition theorem.
As the Stokes parameters of a radiation beam satisfy the relation (see Eqs. (1.38)
and (1.40))
I 2 ≥ Q2 + U 2 + V 2 , (1.47)
we can write, using the addition theorem
⎛ ⎞ ⎛ ⎞ ⎛ 2 ⎞
I I − Q2 + U 2 + V 2 Q + U2 + V 2
⎜Q⎟ ⎜ ⎟ ⎜ ⎟
⎜ ⎟ ⎜ 0 ⎟ ⎜ Q ⎟
⎜ ⎟=⎜ ⎟+⎜ ⎟ .
⎝U ⎠ ⎝ 0 ⎠ ⎝ U ⎠
V 0 V
This means that any radiation beam can be considered as the incoherent super-
position of an unpolarized beam and a totally polarized beam. The latter is char-
acterized by a well-defined polarization ellipse, whose elements can be found from
Eqs. (1.8) and (1.9). The former, on the contrary, is composed of natural radiation.
Another interesting feature of the Stokes parameters stems from the possibility
of establishing a mapping between the Stokes vectors and the points of a three-
dimensional space that will be referred to as the Poincaré space. To the Stokes vec-
tor S = (I, Q, U, V )† we associate the point P having coordinates (Q/I, U/I, V /I),
as shown in Fig. 1.11. When the point P lies on the surface of the sphere of unit
radius (the Poincaré sphere), the corresponding Stokes vector represents a totally
polarized radiation beam, while the center of the sphere represents a beam of natu-
ral radiation. This mapping, proposed by Poincaré (1892) in a classical monograph,
is particularly suitable to visualize the effect produced on a light beam by a given
DESCRIPTION OF POLARIZED RADIATION 27
Fig.1.11. Mapping of a Stokes vector (I, Q, U, V ) into the representative point P inside the
Poincaré sphere of unit radius.
As we have seen in the former sections, the operational definition of the Stokes
parameters of a radiation beam involves at least four independent measurements
performed by interposing polarizers or retarders in the beam path. On the other
hand, according to the principles of Quantum Mechanics, a photon which interacts
with a polarizer is either absorbed or transmitted; in the latter case its state is in
general modified, the photon polarization being now parallel to the transmission
axis of the polarizer. All information about the original polarization state of the
photon is in any case lost, so that it is impossible to perform the four independent
measurements needed to determine its Stokes parameters. We conclude that to
speak about the Stokes parameters of a single photon has no physical meaning.
Stokes parameters can only be measured for a statistical ensemble of photons.
The connection between the measured Stokes parameters and the photon wave-
function can be easily established. Referring to Fig. 1.7, let us suppose that the
i-th photon has a wavefunction of the form
(i) (i)
|ψ (i) = c1 |ψ1 + c2 |ψ2 ,
where |ψ1 (|ψ2 ) is the wavefunction in the polarization space of a photon which
propagates along the z-axis and is transmitted through a polarizer having its trans-
mission axis in the ex (ey ) direction. Applying to the photon beam the same argu-
ments developed for wave packets in Sect. 1.6, we can express the Stokes parameters
28 CHAPTER 1
as
I = k c∗1 c1 + c∗2 c2
Q = k c∗1 c1 − c∗2 c2
U = k c∗1 c2 + c∗2 c1
V = k i c∗1 c2 − c∗2 c1 ,
where k is a dimensional constant and where the statistical averages c∗k cj are
defined by
1 (i) ∗ (i)
N
c∗k cj = c cj (k, j = 1, 2) ,
N i=1 k
N being the number of photons falling on the measuring device in a given time
interval ∆t.
(i)
Only when all the photons are in the same state, so that the amplitudes cj are
independent of i, it is possible to determine their wavefunction from the measured
Stokes parameters.1 This case, characterized by the equality
I 2 = Q2 + U 2 + V 2 ,
is the analogue of the macroscopic case of the monochromatic wave. On the con-
trary, when the states of the single photons are different from each other, we have
the more complicated situation described by Eq. (1.47), that has its macroscopic
analogue in the quasi-monochromatic wave. From the quantum-mechanical point
of view, the photons are said to be in a pure state in the former case and in a
mixture of states in the latter.
1
Apart from an inessential phase factor. According to the principles of Quantum Mechanics,
absolute phases of wavefunctions are not observable quantities.
CHAPTER 2
This chapter, and the two following, are devoted to establish the quantum-mechan-
ical bases that will be needed to approach the theoretical problems involved in the
generation and transfer of polarized radiation. In particular, we will recall in the
present chapter some important concepts about angular momentum and Racah
algebra and will introduce a set of notations and definitions that will be useful in
the following.
These arguments have been treated in various classical textbooks or monographs
(see e.g. Edmonds, 1957; Messiah, 1961; Brink and Satchler, 1968; Varshalovich
et al., 1988). The reason why they are repeated here is to spare the reader the
painful job of translating notations and conventions from one book to the other.
Obviously, the reader who is already familiar with the theory of angular momentum
can simply skip this chapter.
[Jx , Jy ] = iJz
[Jy , Jz ] = iJx
[Jz , Jx ] = iJy . (2.1)
From these equations it can be easily shown that the square of the angular mo-
mentum,
J 2 = Jx2 + Jy2 + Jz2 ,
commutes with each of the three components
[J 2 , Jx ] = [J 2 , Jy ] = [J 2 , Jz ] = 0 .
Taking J 2 and one of the three components – say Jz – as a maximum set of com-
muting operators, we can look for the common eigenvectors of these two operators
that we denote by the symbol |λm
The Hermitian character of the operators Jx , Jy , and Jz , together with the com-
mutation rules (2.1), are sufficient to deduce the eigenvalue spectrum or, in other
words, the allowed values for λ and m.
Let us introduce the operators J+ and J− , usually called the shift operators,
defined by
J± = Jx ± iJy .
Directly from the definition it is possible to prove the following relations
(J± )† = J∓ (2.2a)
[J 2 , J± ] = 0 (2.2b)
[Jz , J± ] = ±J± (2.2c)
[J+ , J− ] = 2Jz (2.2d)
J+ J− = J 2 − Jz2 + Jz (2.2e)
J− J+ = J 2 − Jz2 − Jz . (2.2f)
Using these relations it can be proved that the vectors J± |λm are still eigenvectors
of J 2 and Jz , corresponding to the eigenvalues λ and (m ± 1), respectively. From
Eqs. (2.2b) and (2.2c) we have
On the other hand, if we consider the norms of the vectors J+ |λm and J− |λm
(that cannot be negative numbers), and take into account Eqs. (2.2a), (2.2e),
and (2.2f), we have
†
λm|J+ J+ |λm = λm|J− J+ |λm = (λ − m2 − m) λm|λm ≥ 0
†
λm|J− J− |λm = λm|J+ J− |λm = (λ − m2 + m) λm|λm ≥ 0 , (2.4)
so that
λ − m2 − m ≥ 0 , λ − m2 + m ≥ 0 .
Since λ and m are real numbers (being eigenvalues of Hermitian operators), these
two inequalities give
λ≥0, |m| ≤ λ . (2.5)
Therefore, once we fix a particular eigenvalue λ for J 2 , the eigenvalues of Jz must
be confined within a limited interval.
Starting now from a given eigenvector |λm and applying repeatedly the shift
operator J+ (or J− , respectively), we obtain two distinct chains of eigenvectors
corresponding to eigenvalues of Jz increasing (or decreasing) by unity as shown
in Eq. (2.3). These two chains have to stop at a certain point so that the second
ANGULAR MOMENTUM AND RACAH ALGEBRA 31
inequality in Eq. (2.5) can be satisfied. Thus there must be a maximum eigenvalue
for Jz – say j – such that
J+ |λj = 0 .
Equations (2.4) give for this eigenvector
λ − j2 − j = 0 , (2.6)
so that
λ = j(j + 1) .
Similarly, there must be a minimum eigenvalue for Jz – say j – such that
J− |λj = 0 .
λ − j 2 + j = 0 ,
and comparing this equation with Eq. (2.6) we get the following relation between
j and j
j(j + 1) = j (j − 1) .
This is a second degree equation whose solutions are
j = −j
j = j + 1 .
The second solution must be rejected as it leads to j > j. We are then left with the
first solution which implies that the minimum eigenvalue of Jz is just the opposite
of the maximum eigenvalue.
On the other hand, we can obtain the eigenvector |λ −j from the eigenvector
|λ j by applying to the latter a finite number of times the operator J− ; this implies
that the number [j − (−j)] = 2j is an integer, so that the only allowed values for j
are integers or half-integers; moreover, the number of eigenvectors associated with
a given eigenvalue of J 2 is (2j + 1).
From now on, the eigenvectors of J 2 and Jz will be denoted by the symbol |jm ;
they are supposed to be normalized to unity and to satisfy the eigenvalue equations
Note also that the eigenvectors |jm obey the orthogonality relation
The phase relations among the various eigenvectors characterized by the same
j-value and different m-values must be established using suitable conventions. From
the properties of the shift operators we have
J+ |jm = α |j m+1 ,
where
|α|2 = j(j + 1) − m(m + 1) .
The phase of the α constant is conventionally set equal to zero, so that
J+ |jm = j(j + 1) − m(m + 1) |j m+1
= (j + m + 1)(j − m) |j m+1 , (2.9)
and, similarly
J− |jm = j(j + 1) − m(m − 1) |j m−1
= (j − m + 1)(j + m) |j m−1 . (2.10)
With these phase convention, it is easily found from Eq. (2.10) that any eigenvector
|jm is related to the ‘parent’ eigenvector |jj by the equation
(j + m)! j−m
|jm = J− |jj . (2.11)
(2j)! (j − m)!
On the other hand, the total system can also be described, from the point of
view of its angular momentum properties, in terms of a different set of operators,
namely J12 , J22 , J 2 , Jz , where
J = J1 + J2 .
It is easily seen that these four operators commute among each other; as a conse-
quence, we can introduce a new set of normalized eigenvectors of the form |j1 j2 JM
such that
Since the two different sets of eigenvectors span the same Hilbert space, they must
be connected by a unitary similarity transformation of the form
|j1 j2 JM = |j1 j2 m1 m2 j1 j2 m1 m2 |j1 j2 JM
m1 m2
|j1 j2 m1 m2 = |j1 j2 JM j1 j2 JM |j1 j2 m1 m2 . (2.12)
JM
and the same convention of the former section applies also to the phase relations
among the eigenvectors of the form |JM corresponding to a fixed J-value. The
remaining phase conventions will be established in the following.
Let us consider the eigenvector of the form |j1 j2 m1 m2 corresponding to the
maximum m-values (m1 = j1 , m2 = j2 ). We have
J 2 |j1 j2 j1 j2 = [J12 + J22 + 2J1z J2z + J1+ J2− + J1− J2+ ] |j1 j2 j1 j2
= [j1 (j1 + 1) + j2 (j2 + 1) + 2j1 j2 ] |j1 j2 j1 j2
= (j1 + j2 )(j1 + j2 + 1) |j1 j2 j1 j2 .
34 CHAPTER 2
These equations show that the eigenvector |j1 j2 j1 j2 is also an eigenvector of the
form |JM , with J = M = (j1 + j2 ). By convention we take the same phase for
the two eigenvectors, so that
The above value of J is the maximum of the possible J-values, as it can be argued
from the fact that the maximum eigenvalue of Jz is (j1 + j2 ). The eigenvectors
with the same J-value and lower M -values can be found by repeated application
of the operator J− , so that (see Eq. (2.11))
(j1 + j2 + M )!
|j1 +j2 M = [J + J2− ]j1 +j2 −M |j1 j2 j1 j2 .
(2j1 + 2j2 )! (j1 + j2 − M )! 1−
The eigenvectors of the form |JM with J < (j1 + j2 ) can be found using a set
of operators, On , that will be called the supershift operators, defined by
n
n−r n
On = (−1)n−r J1+
r
J2+ n
J1− J2− .
r=0
Let us consider the application of the operator On to the vector |j1 j2 j1 j2 . Using
Eqs. (2.9) and (2.10) we have, after some algebra
(n! )2 (2j1 )! (2j2 )!
On |j1 j2 j1 j2 =
(2j1 − n)! (2j2 − n)!
n
(2j1 − n + r)! (2j2 − r)!
× (−1)n−r |j1 j2 j1 −n+r j2 −r . (2.13)
r=0
r! (n − r)!
and applying the operator J 2 , after some heavy algebra that is left to the reader
as an exercise, we have
J 2 On |j1 j2 j1 j2 = (j1 + j2 − n)(j1 + j2 − n + 1) On |j1 j2 j1 j2 . (2.15)
Equations (2.14) and (2.15) show that the vector On |j1 j2 j1 j2 is, apart from a
normalization factor and a phase factor, a vector of the form |JJ with J =
(j1 + j2 − n). Note that n cannot be larger than the smaller of the values 2j1 and
2j2 (otherwise On |j1 j2 j1 j2 = 0), so that the allowed values for J must satisfy the
triangular condition
|j1 − j2 | ≤ J ≤ j1 + j2 .
ANGULAR MOMENTUM AND RACAH ALGEBRA 35
The norm Nn of the vector On |j1 j2 j1 j2 can be easily obtained from Eq. (2.13)
The summation above can be evaluated using some properties of the binomial
coefficients (see App. 1 of Edmonds, 1957, for a formal proof)
n
(2j1 − n + r)! (2j2 − r)! (2j1 − n)! (2j2 − n)! (2j1 + 2j2 − n + 1)!
= ,
r=0
r! (n − r)! n! (2j1 + 2j2 − 2n + 1)!
and hence
(n! )3 (2j1 )! (2j2 )! (2j1 + 2j2 − n + 1)!
Nn = . (2.16)
(2j1 − n)! (2j2 − n)! (2j1 + 2j2 − 2n + 1)!
From Eqs. (2.13) and (2.16) we can express any vector |JJ (normalized to unity)
as a function of the vectors of the form |j1 j2 m1 m2 . Apart from a phase factor we
have
The phase convention that will be used in this book is that of assuming Eq. (2.17)
(with no phase factor in front) valid for any value of J. This convention agrees with
those given by Racah (1942), Edmonds (1957), Messiah (1961), Brink and Satch-
ler (1968), Varshalovich et al. (1988), and, together with the other conventions
established previously, leads to vector-coupling coefficients that are all real .
Now that the phase conventions have been fully established, we can turn to the
evaluation of the vector-coupling coefficients. From Eq. (2.11) we have
(J + M )!
|JM = J J−M |JJ . (2.18)
(2J)! (J − M )! −
Writing
J−M
(J − M )!
J−M
J− = [J1− + J2− ]J−M = J k J J−M−k
k! (J − M − k)! 1− 2−
k=0
36 CHAPTER 2
j1 +j2 −J
(j1 − j2 + J + r)! (2j2 − r)!
× (−1)j1 +j2 −J−r .
r=0
(J − j2 − m1 + r)! (j2 − m2 − r)! (j1 + j2 − J − r)! r!
This expression can be transformed into a more symmetrical one (Racah, 1942)
j1 j2 m1 m2 |JM = δm +m2 ,M
1
Fig.2.1. The addition of two vectors having fixed projections on a given axis leads, in classical
physics, to a continuous distribution for the modulus of the resultant vector.
where the sum over t runs over all the values leading to non-negative factorials.
The vector-coupling coefficients have a very simple physical meaning. Given two
independent systems of angular momenta j1 and j2 , suppose that the z-axis pro-
jections of these momenta have been measured and found equal to m1 and m2 ,
respectively. The square modulus | j1 j2 m1 m2 |JM |2 represents the probability
of finding the values J and M , respectively, when measuring the angular momen-
tum of the total system and its z-projection. The Kronecker symbol in Eq. (2.19)
implies that the measured M -value must be equal to (m1 + m2 ), which is the same
result found for classical (non quantum-mechanical) angular momenta. In the clas-
sical case, however, the total angular momentum J has a continuous distribution of
values that depend on the relative orientation of the vectors j1 and j2 (see Fig. 2.1).
The vector-coupling coefficients can be regarded as the quantum-mechanical ana-
logue of this distribution; but the distribution is now discrete, which reflects the
quantum nature of the angular momenta.
The vector-coupling coefficients satisfy a large number of properties. Multiplying
Eqs. (2.12) by j1 j2 J M | and j1 j2 m1 m2 |, respectively, taking Eq. (2.8) into
account, and recalling that the vector-coupling coefficients are real, one gets the
orthogonality relations
j1 j2 m1 m2 |JM j1 j2 m1 m2 |J M = δJJ δMM (2.20a)
m1 m2
j1 j2 m1 m2 |JM j1 j2 m1 m2 |JM = δm m1 δm m2 . (2.20b)
1 2
JM
† †
The identity j1 j2 m1 m2 |J+ |JM = j1 j2 m1 m2 |J1− + J2− |JM leads to the re-
38 CHAPTER 2
cursion relation
J(J + 1) − M (M + 1) j1 j2 m1 m2 |J M +1 =
The other properties of the vector-coupling coefficients are more easily expressed
in terms of the 3-j symbols, which are defined by
√ j1 j2 J
j1 j2 m1 m2 |J −M = (−1)j1 −j2 −M 2J + 1 . (2.22)
m1 m2 M
Nowadays the 3-j coefficients are employed more often than the vector-coupling
coefficients. We just summarize below a set of relations that will be needed later.
Formal proofs can be easily obtained with the help of Eqs. (2.19), (2.20), and (2.22).
a) Orthogonality relations
a b c a b c
(2c + 1) = δcc δγγ (2.23a)
α β γ α β γ
αβ
a b c a b c
(2c + 1) = δαα δββ . (2.23b)
α β γ α β γ
cγ
b) The 3-j symbols are invariant under cyclic permutations of their columns and
are multiplied by (−1)a+b+c under non-cyclic ones
a b c b c a c b a
= = (−1)a+b+c , etc. (2.24)
α β γ β γ α γ β α
c) They are multiplied by (−1)a+b+c under sign inversion of the second row
a b c a b c
= (−1)a+b+c . (2.25)
α β γ −α −β −γ
In the previous section we have shown how it is possible to couple two angular
momenta J1 and J2 to get the resultant vector J. When three angular momenta
are present, say J1 , J2 , and J3 , one can follow the same line of reasoning, by first
adding two angular momenta and then adding the third one to the sum of the first
two to obtain, as a final result, the total angular momentum J.
In this procedure we are however faced with some ambiguity. In fact, we could
start from the couple (J1 , J2 ) to obtain J12 = J1 + J2 , and then add J12 to J3 to
obtain J. Alternatively, we could start from the couple (J , J ) to obtain J =
2 3 23
J2 + J3 , and then add J23 to J1 to obtain again the resultant J.
Finally, we could
introduce the intermediate vector J13 = J1 + J3 and then add J13 to J2 .
These different coupling schemes are related to the fact that, given three com-
muting angular momentum operators J1 , J2 , and J3 , three different sets of six
commuting operators can be considered, namely
I) J12 , J22 , J32 , J12
2
, J 2 , Jz
Expressing the eigenvectors of sets I, II, and III in terms of the eigenvectors of
set IV, it is possible to find the transformations connecting the different bases. The
coefficients entering these transformations are called the recoupling coefficients and
can be derived as follows.
Denoting the eigenvectors of set I by the symbol
If we now take the scalar product of the two vectors in Eqs. (2.28) and (2.29) we
obtain, using the shorthand notation of the vector-coupling coefficients
The scalar product in the left-hand side is independent of the M -value, as it can
be easily proved by evaluation of the matrix element
Fig.2.2. When adding in classical physics three angular momenta j1 , j2 , j3 to give the resultant
J,
once the sum of two of them (j1 and j2 ) is specified, the modulus of (j2 +j3 ) shows a continuous
distribution.
via Eqs. (2.9), (2.2a), and (2.2f). On the other hand, owing to the properties of the
vector-coupling coefficients, the summation in the right-hand side can be restricted
to m1 , m2 , and m3 , with the condition (m1 + m2 + m3 ) = M .
The scalar product now derived is called the recoupling coefficient and has a
simple physical meaning. Given three vectors j1 , j2 , and j3 , that combine to give
the resultant J, the square modulus of the recoupling coefficient is the probability
that, if by measuring (j1 +j2 )2 we found the value j12 (j12 +1), then, when measuring
(j2 +j3 )2 , we find the value j23 (j23 +1). As illustrated in Fig. 2.2, the modulus of the
vector j23 is undetermined in classical physics too; it has a continuous distribution
of probabilities which originates from purely geometrical reasons. The quantum
nature of the angular momenta adds to this ‘geometrical indetermination’ to give
a discrete distribution for the probabilities.
Dropping the inessential parameter M , the recoupling coefficients can be used to
express the transformations between different bases in the form
|j1 , (j2 j3 )j23 , JM = |(j1 j2 )j12 , j3 , JM
j12
Starting from Eq. (2.30), and using Eqs. (2.22), (2.32), and (2.33), it is possible
to express the 6-j symbol as a sum of products of four 3-j symbols
!
a b c
δee δ = (−1)b+c+d+β+γ+δ (2e + 1)
d e f
αβγδφ
c d e d b f b c a f a e
× . (2.34)
γ −δ δ −β φ β −γ α φ α
It should be remarked that, owing to the symmetry properties (2.24) and (2.25),
the right-hand side can be written in several equivalent ways, obtained by column
permutations or by sign change of the second row of each 3-j symbol. Since for
each 3-j symbol there are six permutations of the columns and two determinations
for the sign of the second row, 124 different realizations exist for the right-hand side
of Eq. (2.34), apart from the ordering of the 3-j coefficients. All these realizations
have however the same ‘topological invariant’ which can be visualized by graphical
methods. The theory of graphical methods for angular momentum problems can
be found in Edmonds (1957) or in Brink and Satchler (1968).
The 6-j symbol has several important properties. First of all, it is zero unless four
triangular conditions are satisfied. These conditions can be illustrated as follows
! ! ! !
◦ ◦ ◦ ◦ ◦ ◦
, , , .
◦ ◦ ◦ ◦ ◦ ◦
Another remarkable property is the invariance of the 6-j symbol both under inter-
change of any two columns and under interchange of the upper and lower arguments
in any two columns.
An analytical expression for the 6-j symbol can be obtained from Eq. (2.30) by
substituting the series expansion (2.19) of the vector-coupling coefficients. After a
long calculation, Racah (1942) gives the following expression
!
a b c
= ∆(abc) ∆(aef ) ∆(dbf ) ∆(dec) (−1)z (z + 1)!
d e f
z
× (z − a − b − c)! (z − a − e − f )! (z − d − b − f )! (z − d − e − c)!
−1
×(a + b + d + e − z)! (b + c + e + f − z)! (a + c + d + f − z)! , (2.35)
where
(a + b − c)! (a − b + c)! (−a + b + c)!
∆(abc) =
(a + b + c + 1)!
ANGULAR MOMENTUM AND RACAH ALGEBRA 43
and where the sum is extended to all the values of z leading to non-negative facto-
rials.
By specifying the value of one of the arguments, simple analytical formulae can
be obtained, e.g.:
!
a b 0 1
= δab δed (−1)a+e+f (2.36a)
d e f (2a + 1)(2d + 1)
!
a a+ 1 1 1 (a + b + c + 2)(a + b − c + 1)
1
2 2 = (−1)a+b+c+1 (2.36b)
b+ 2 b c 2 (a + 1)(2a + 1)(b + 1)(2b + 1)
!
a a+ 1 1 1 (a − b + c + 1)(c − a + b)
2 2 = (−1)a+b+c (2.36c)
b− 1
2 b c 2 (a + 1)(2a + 1)b(2b + 1)
!
a a 1 1 a(a + 1) + b(b + 1) − c(c + 1)
= (−1)a+b+c+1 (2.36d)
b b c 2 a(a + 1)(2a + 1)b(b + 1)(2b + 1)
!
a a+1 1
= (−1)a+b+c
b+1 b c
1 (a + b + c + 3)(a + b + c + 2)(a + b − c + 2)(a + b − c + 1)
× (2.36e)
2 (a + 1)(2a + 1)(2a + 3)(b + 1)(2b + 1)(2b + 3)
!
a a+1 1
= (−1)a+b+c+1
b b c
1 (a + b + c + 2)(a − b + c + 1)(a + b − c + 1)(c − a + b)
× (2.36f)
2 (a + 1)(2a + 1)(2a + 3)b(b + 1)(2b + 1)
!
a a+1 1
= (−1)a+b+c
b−1 b c
1 (c − a + b)(c − a + b − 1)(a − b + c + 2)(a − b + c + 1)
× (2.36g)
2 (a + 1)(2a + 1)(2a + 3)(2b − 1)b(2b + 1)
!
a a 2
= (−1)a+b+c
b b c
3 s(s + 1) − 43 a(a + 1)b(b + 1)
× (2.36h)
2 (2a − 1)a(a + 1)(2a + 1)(2a + 3)(2b − 1)b(b + 1)(2b + 1)(2b + 3)
where
s = c(c + 1) − a(a + 1) − b(b + 1) .
44 CHAPTER 2
Further analytical formulae have been given by Biedenharn et al. (1952), while
numerical tables have been prepared by Biedenharn (1952), Sharp et al. (1954),
Simon et al. (1954), Rotenberg et al. (1959), and Varshalovich et al. (1988). As
noted in the previous section, these tabulations have nowadays lost much of their
interest since modern computers provide the possibility of computing 6-j symbols
with simple numerical codes like that in App. 1.
Other important properties of the 6-j symbols, that can be proved either from
the analytical expression (2.35) or directly from their definition, are given below.
Sum rules:
!
a b c
(−1)2c (2c + 1) =1 (2.37)
a b f
c
!
a b c
(−1)a+b+c (2c + 1) = δf 0 (2a + 1)(2b + 1) (2.38)
b a f
c
! !
a b c a b c
(2c + 1)(2f + 1) = δf g (2.39)
d e f d e g
c
! ! !
a b c a b c a e f
(−1)f +g+c (2c + 1) = (2.40)
d e f e d g b d g
c
! ! !
a+b+c+d+e+f +g+h+i+j a b c a b c g h c
(−1) (2c + 1) =
d e f g h i e d j
c
! !
f i j f i j
= (2.41)
g d b h e a
!
a b e c a f b d f
(−1)a+b+c+d−e+f −α−δ (2f + 1) =
d c f γ α φ β δ −φ
f
a b e d c e
= . (2.43)
α β − δ γ
ANGULAR MOMENTUM AND RACAH ALGEBRA 45
When four angular momenta are present, say J1 , J2 , J3 , and J4 , different coupling
schemes can be followed, as in the case of three angular momenta discussed in the
previous section. For instance, one can start from the couples (J1 , J2 ) and (J3 , J4 )
to obtain the intermediate vectors J12 = J1 + J2 and J34 = J3 + J4 , and then add
J12 to J34 to obtain the resultant J. Alternatively, one can start from the couples
(J1 , J3 ) and (J2 , J4 ) to get the intermediate vectors J13 = J1 + J3 and J24 = J2 + J4 ,
and then add J13 to J24 to obtain again the resultant J. A third possibility would
be to introduce the intermediate vectors J14 = J1 + J4 and J23 = J2 + J3 .
These different coupling schemes are related to the fact that, given four indepen-
dent angular momenta, three different sets of eight commuting operators can be
constructed, namely
The recoupling coefficients defined in this equation are independent of M (see the
comments about Eq. (2.30)). In their place different symbols are usually intro-
duced, like the 9-j symbols defined by
The 9-j symbol can be easily expressed in terms of 3-j coefficients by means of a
procedure similar to that of the previous section for the 6-j coefficients (Eqs. (2.27)-
(2.30)). The final result is the compact formula
⎧ ⎫
⎨a b c⎬ a b c d e f g h i
d e f = (2a + 1)
⎩ ⎭ α β γ δ φ χ η ξ
g h i
a d g b e h c f i
× ,
α δ χ β η γ φ ξ
where the sum runs over all the Greek symbols except α, namely over β, γ, δ, ,
φ, χ, η, ξ.
The 9-j symbol can be also expressed in terms of 6-j symbols. We start from
the vector |(ad)g, (be)h, i and neglect the fact that the vector h is obtained by
combining b with e. From Eq. (2.31) we have
|(ad)g, (be)h, i = a, (dh)k, i|(ad)g, h, i |a, (dh)k, i . (2.45)
k
Recalling that h is the resultant of b and e, we recouple the three angular momenta
d, b, e to obtain
|a, (dh)k, i = b, (de)f, k|d, (be)h, k |a, (bf )k, i . (2.46)
f
Substituting Eq. (2.47) into Eq. (2.46) and then into Eq. (2.45), and recalling that
the vector f is the combination of d and e, we obtain
|(ad)g, (be)h, i = a, (dh)k, i|(ad)g, h, i b, (de)f, k|d, (be)h, k
kf c
× (ab)c, f, i|a, (bf )k, i |(ab)c, (de)f, i .
The three recoupling coefficients can be expressed in terms of 6-j symbols. For this
purpose it is necessary to recall the definitions (2.32)-(2.33) and to observe that,
owing to the properties of the vector-coupling coefficients, the order of the various
angular momenta can be changed according to the following rules
Using these properties and the definition of the 9-j symbol given in Eq. (2.44), we
obtain the remarkable expression
⎧ ⎫
⎨a b c⎬ ! ! !
2k a i k b f k a i k
d e f = (−1) (2k +1) . (2.48)
⎩ ⎭ h d g d h e f b c
g h i k
The 9-j symbol satisfies several important properties. First of all, the symbol is
zero unless the six triangular conditions illustrated below are satisfied
⎧ ⎫ ⎧ ⎫
◦ ◦ ◦⎪
⎨◦
⎪ ◦ ◦⎪
⎬ ⎪
⎨ ⎬
◦ ◦ ◦ , ◦ ◦ ◦ .
⎪
⎩ ⎪
⎭ ⎪
⎩ ⎪
⎭
◦ ◦ ◦ ◦ ◦ ◦
For the numerical evaluation of 9-j symbols Eq. (2.48) can be conveniently used,
and numerical tables are also available (Smith and Stevenson, 1957; Smith, 1958;
Varshalovich et al., 1988). However, as already mentioned about 3-j and 6-j sym-
bols, simple computer codes like that in App. 1 are nowadays generally preferred.
Other important properties of the 9-j symbols are given below.
Orthogonality:
⎧ ⎫⎧ ⎫
⎨a b c ⎬⎨ a b c⎬
(2c + 1)(2f + 1)(2g + 1)(2h + 1) d e f d e f = δgg δhh (2.50)
⎩ ⎭⎩ ⎭
cf g h i g h i
Sum rule:
⎧ ⎫⎧ ⎫ ⎧ ⎫
⎨a b c ⎬⎨ a d g ⎬ ⎨a b c⎬
(−1)2b+h+k−f (2g + 1)(2h + 1) d e f e b h = e d f (2.51)
⎩ ⎭⎩ ⎭ ⎩ ⎭
gh g h i j k i j k i
48 CHAPTER 2
⎧ ⎫
⎨a b c ⎬
a b c b e h
(2b + 1) d e f =
α β γ β η ⎩ ⎭
b g h i
c f i a d g d e f g h i
= (2.53)
γ φ ξ α δ χ δ φ χ η ξ
φξδχ
⎧ ⎫
⎨a b c⎬
a b c b e h c f i
(2b + 1)(2c + 1) d e f =
α β γ β η γ φ ξ ⎩ ⎭
bc g h i
a d g d e f g h i
= (2.54)
α δ χ δ φ χ η ξ
Finally, we want to remark that the methods illustrated so far for the coupling
of 2, 3, and 4 angular momentum operators can be directly generalized to any
number of operators. The addition of n angular momenta will involve recoupling
ANGULAR MOMENTUM AND RACAH ALGEBRA 49
where D(R) is an appropriate linear operator such that D† (R) = D−1 (R) and
D(R−1 ) = D−1 (R), where R−1 is the inverse rotation of R.
On the contrary, adopting the active point of view, the operators remain un-
changed, while the state vector |ψ changes into the new state vector |ψ given
by
|ψ = D(R)|ψ .
Obviously these transformations satisfy the property
ψ |O |ψ = ψ|O|ψ ,
which means that if the same rotation is performed both on the coordinate system
and on the physical system, the expectation value of any observable remains the
same.
Note that the formalism here introduced is consistent with the intuitive fact
that an arbitrary rotation R performed on the coordinate system is equivalent to
50 CHAPTER 2
positive
rotation
δ α
Fig.2.3. To find the variation δv of the components of the vector v under a positive rotation of
the coordinate system through an angle φ about the u-axis, we observe that this variation is the
same as that obtained by rotating the vector v through an angle −φ about the same axis. The
figure shows that for an infinitesimal rotation of the arbitrary vector w
through an infinitesimal
angle α we have δw = α u × w
, so that δv = − φ u × v.
the inverse rotation R−1 performed on the physical system. This can be formally
proved by considering the quantity
ψ|D(R) OD† (R)|ψ .
On one hand, this expression can be regarded as
which represents the expectation value of the operator O after a rotation of the
coordinate system (passive point of view). On the other hand, the same expression
can be considered as
ψ|D(R) O D† (R)|ψ = ψ|D† (R−1 ) O D(R−1 )|ψ ,
which represents the expectation value of the same operator after the inverse rota-
tion of the physical system (active point of view).
The expression for the operator D(R) can be easily found in the following way.
We consider a physical system referred to a given coordinate system, and fix our
attention on a particular observable like the vector v (representing for instance the
position or the velocity or the angular momentum of a particle, etc.). We now
perform a positive rotation of the coordinate system through an infinitesimal angle
φ about the unit vector u, where by positive rotation we mean, here and in the
following, the one that makes a right-handed screw advance along the direction of
u. The vector v will be changed by the quantity (see Fig. 2.3)
δv = − φ u × v . (2.58)
ANGULAR MOMENTUM AND RACAH ALGEBRA 51
where Ju = J ·u is the projection of the total angular momentum on the u-direction
(note that Du† (φ) = Du−1 (φ) = Du (−φ)).
From Eq. (2.61) we can find the expression for the operator corresponding to the
rotation through a finite angle Ω. To this aim, we divide the angle Ω into a large
number n of equal parts, and perform the finite rotation as the succession of n
infinitesimal rotations of amplitude Ω/n. We have
Ω
n
n
n! (−iΩJu )r
Du (Ω) = lim 1 − i Ju = lim
n→∞ n n→∞
r=0
r! (n − r)! nr
∞
(−iΩJu )r n!
= lim ,
r! n→∞ (n − r)! nr
r=0
∞
(−iΩJu )r −iΩJu
Du (Ω) = =e , (2.62)
r=0
r!
where, by definition, the exponential of an operator has the usual meaning given
by its Taylor expansion.
Having established the expression for the operator D corresponding to a finite
rotation about an arbitrary axis, we can easily find its expression for the most
1
For a system consisting of one particle, if r , p, l = h̄1 r × p, s, and
J = l + s are the
position, momentum, orbital angular momentum, spin, and total angular momentum operators,
respectively, Eq. (2.60), with v denoting any of these vectors as well as any linear combination of
them, can be easily deduced from the fundamental commutation rules
%
[ri , rj ] = [pi , pj ] = 0 , [ri , pj ] = ih̄ δij , [si , sj ] = i k
ijk sk .
The generalization to the case of many-particle systems is straightforward.
52 CHAPTER 2
α γ
β α
γ
general rotation of the reference system. But we must first introduce the Euler
angles, which are defined as follows.
Referring to Fig. 2.4, we first perform a rotation through an angle α (0 ≤ α < 2π)
about the z-axis of the original reference system. This rotation brings the y-axis
into a new axis, called the line of nodes, or w-axis. Next we perform a rotation
through an angle β (0 ≤ β < π) about the line of nodes. This rotation brings
the z-axis of the original system into a new axis, called the figure-axis, or z -axis.
Finally we perform a rotation through an angle γ (0 ≤ γ < 2π) about the figure-axis
to get the new system (x y z ).
For the general rotation specified by the three Euler angles (α, β, γ) we have
an expression which has the disadvantage that the rotations β and γ are relative to
axes that have been carried over by previous rotations. To overcome this drawback
we observe that1
Dw (β) = Dz (α) Dy (β) Dz (−α) (2.64)
1
Equation (2.64) is self-evident from geometrical considerations, since
Dw (β) Dz (α) = Dz (α) Dy (β) ;
however, it can also be proved analytically. From Eq. (2.62) we have
Dw (β) = e−iβJw ,
ANGULAR MOMENTUM AND RACAH ALGEBRA 53
and that
Dz (γ) = Dw (β) Dz (γ) Dw (−β) . (2.65)
Substituting Eqs. (2.65) and (2.64) into Eq. (2.63), and noting that two rotations
about the same axis commute, we obtain
−iαJz −iβJy −iγJz
D(α, β, γ) = Dz (α) Dy (β) Dz (γ) = e e e . (2.66)
This expression shows that the rotation described by Eq. (2.63) can also be achieved
by performing three successive rotations about the axes of the old reference system
in the following order: first a rotation through an angle γ about the z-axis, then
a rotation through an angle β about the y-axis, and finally a rotation through an
angle α about the z-axis. The former realization of the rotation, involving the
line of nodes and the figure-axis, is easier to visualize; the latter is more useful in
practice, since it involves rotation operators depending on the projections of the
angular momentum along the axes of a unique coordinate system.
The matrix elements of the rotation operator D(α, β, γ) between eigenstates of the
total angular momentum are called rotation matrices. These quantities, which are
explicitly defined by the relation1
DMN
J
(R) = JM |D(R)|JN (2.67)
with the symbol R representing the triad (α, β, γ), are of fundamental importance
in angular momentum theory. Their physical meaning descends directly from that
of the rotation operator D(R) defined in the previous section. Given a physical
system of angular momentum J, the square modulus
|DMN
J
(R)|2
represents the probability of finding the value N for the z -axis projection of the
momentum after its z-axis projection has been measured and found equal to M ,
the system (x y z ) being obtained from (xyz) through the rotation R. Note that
where Jw is the operator obtained from Jy by a rotation through an angle α about the z-axis,
Taking this expression into account we obtain, by a power series expansion of the exponential
−iβJy
Dw (β) = Dz (α) e Dz (−α) = Dz (α) Dy (β) Dz (−α) .
where
−iβJy
dJMN (β) = JM |e |JN .
The reduced rotation matrices can be easily calculated in the case J = 1/2. In-
troducing the spinors | + and | − to represent the eigenstates |JM having
J = 1/2 and M equal to +1/2 and −1/2, respectively, we have from Eqs. (2.9)
and (2.10), taking into account that Jy = −i (J+ − J− )/2
i i
Jy | + = |− , Jy | − = − |+ ,
2 2
so that, with easy transformations
∞
−iβJy (−iβ)n n β β
e |+ = Jy | + = cos |+ + sin |− ,
n=0
n! 2 2
and similarly
−iβJy β β
e | − − sin | + .
|− = cos
2 2
Thus the reduced rotation matrices for J = 1/2 are the following
1 1 β
d 21 1 (β) = d−
2
1
−1
(β) = cos
2 2 2 2 2
1 1 β
1 1 (β) = −d 1
2 2
d− −1
(β) = sin .
2 2 2 2 2
Turning to the general case, let us consider an angular momentum eigenstate of
the form |JJ . This eigenstate can be expressed in terms of spinors by regarding
the angular momentum J as the result of the addition of 2J momenta J1 , J2 , . . . , J2J
each equal to 1/2. In this representation we have, with obvious notations
|JJ = | + 1| + 2 ···|+ 2J ,
(J + M )! (J − M )! & '
|JM = | − | − ···|− | + | + ···|+ ,
(2J)! p
( )* +( )* +
(J−M)terms (J+M)terms
with the sum extended to all the permutations having (J − M ) spinors of the form
| − and (J + M ) spinors of the form | + .
Thus we can write
−iβJy (J + M )! (J − M )! (J + N )! (J − N )!
dMN (β) = JM |e
J
|JN =
(2J)!
& '
−iβ(J1y +J2y +···+J2Jy )
× − | − |··· − | + | + |··· + | e
p
( )* + ( )* +
(J−M)terms (J+M)terms
& '
× | − | − ···| − | + | + ···| + .
q
( )* +( )* +
(J−N )terms (J+N )terms
Let us consider a single term in the sum over q; the operator e−iβJy , acting on it,
will produce, apart from the ordering of the single vectors | + and | − , a state
vector of the form
J−N J+N
β β β β
cos | − − sin | + cos | + + sin | − =
2 2 2 2
J+N
J−N J−N −t+J+N −r t+r
J −N J +N β β
= (−1)t cos sin
t=0 r=0
t r 2 2
× | − | − ···|− | + | + ···|+ .
( )* +( )* +
(J−N −t+r)terms (J+N −r+t)terms
Whatever the ordering of the various indices, this state vector gives a non-zero
scalar product with one and only one of the terms in the sum over p, provided that
J −M =J −N −t+r
J +M =J +N −r+t.
56 CHAPTER 2
These equalities are both satisfied if r = N − M + t, and since the sum over q gives
(2J)! [(J + N )! (J − N )! ]−1 terms of the same kind
dJMN (β) = (J + M )! (J − M )! (J + N )! (J − N )!
, -2J+M−N −2t , -2t−M+N
cos β2 sin β2
× (−1)t , (2.69)
t
(J + M − t)! (J − N − t)! t! (t + N − M )!
where the sum runs over all the t values which lead to non-negative factorials.
The reduced rotation matrices have several important properties that can be
easily deduced either from their definition or from their analytical expression.
Reality:
dJMN (β)∗ = dJMN (β)
Closure:
dJMN (β1 ) dJN P (β2 ) = dJMP (β1 + β2 )
N
Symmetry:
dJMN (β) = (−1)M−N dJ−M−N (β)
= (−1)M−N dJN M (β)
= (−1)M−N dJMN (−β)
= (−1)J+M dJM−N (π − β)
= (−1)J+N dJM−N (π + β) . (2.70)
Explicit analytical formulae for J = 1 and 2 are given in Table 2.1. Additional
formulae for J = 3 to 6 can be found in Buckmaster (1964, 1966). An interesting
connection between reduced rotation matrices and Jacobi polynomials can be found
in Edmonds (1957).
Going back to the rotation matrices defined in Eq. (2.67), it is possible to establish
for them a number of important properties. First of all, as the rotation operator
D(R) is a unitary operator (D† (R) = D−1 (R)), we have
DMN
J
(αβγ)∗ = DN
J
M (−γ −β −α) , (2.71)
where (−γ −β −α) are the Euler angles characterizing the inverse rotation of (αβγ).
From the unitary character of the operator D(R) one gets the orthogonality
relations
DPJ N (R)∗ DPJ M (R) = δMN
P
∗
DMP
J
(R) DN
J
P (R) = δMN . (2.72)
P
ANGULAR MOMENTUM AND RACAH ALGEBRA 57
TABLE 2.1
Algebraic formulae for d1M N (β) (top) and d2M N (β) (bottom). C and S stand for cos β and sin β,
respectively.
N = −1 N =0 N = +1
M = −1 1
2
(1 + C) 1
√ S 1
2
(1 − C)
2
M =0 − √1 S C 1
√ S
2 2
M = +1 1
2
(1 − C) − √1 S 1
2
(1 + C)
2
N = −2 N = −1 N =0 N = +1 N = +2
3
M = −2 1
4
(1 + C)2 1
2
S(1 + C) 8
S2 1
2
S(1 − C) 1
4
(1 − C)2
3
M = −1 − 12 S(1 + C) (C − 12 )(C + 1) 2
SC (C + 12 )(1 − C) 1
2
S(1 − C)
3 3 3 3
M =0 8
S2 − 2
SC 1
2
(3C 2 − 1) 2
SC 8
S2
3
M = +1 − 12 S(1 − C) (C + 12 )(1 − C) − 2
SC (C − 12 )(C + 1) 1
2
S(1 + C)
3
M = +2 1
4
(1 − C)2 − 12 S(1 − C) 8
S2 − 12 S(1 + C) 1
4
(1 + C)2
From Eq. (2.68), using the symmetry properties (2.70), it can be easily shown
that
DMN
J
(αβγ)∗ = (−1)M−N D−M−N
J
(αβγ) . (2.73)
If R is a rotation composed of two consecutive rotations R1 and R2 , the operator
D(R) can be written as the product D(R1 ) D(R2 ).1 For the rotation matrices we
thus have, directly from their definition
DMN
J
(R) = JM |D(R1 )D(R2 )|JN = DMP
J
(R1 ) DPJ N (R2 ) , (2.74)
P
1
This statement is not trivial. Indeed the operator D(R) should be written
D(R) = D (R2 ) D(R1 ) ,
where
−iβ2 Jy
D (R2 ) = e−iα2 Jz e e−iγ2 Jz ,
with Jy , Jz the components of J on the axes y , z obtained from y, z through the rotation R1 .
Since
Jy = D(R1 ) Jy D −1 (R1 ) , Jz = D(R1 ) Jz D −1 (R1 ) ,
we have, by series expansion of the exponentials
−iβ2 Jy
D (R2 ) = D(R1 ) e−iα2 Jz D −1 (R1 ) D(R1 ) e D −1 (R1 ) D(R1 ) e−iγ2 Jz D −1 (R1 ) ,
so that
according to the definition of rotation matrices, this product can be written in the
form
−iαJz −iβJy −iγJz −iαJz −iβJy −iγJz
JM |e e e |JN J M |e e e |J N .
If we now consider the angular momentum operators appearing in the two matrix
elements as referring to two different systems, we can write
−iαJz −iβJy −iγJz
DMN
J
(αβγ) DM
J
N (αβγ) = JJ M M |e e e |JJ N N ,
where the J-operator is the resultant of the angular momenta of the two systems.
On the other hand, bearing in mind the coupling rules of two angular momenta
and the definition of 3-j symbols (Eqs. (2.12) and (2.22)), one obtains with easy
transformations1
DMN
J
(αβγ) DM
J
N (αβγ) =
J J K J J K ∗
= (2K + 1) DQQ
K
(αβγ) , (2.75)
M M Q N N Q
K
a formula that shows how the product of two rotation matrices of rank J and J
can be reduced to the linear combination of rotation matrices having rank K such
that |J − J | ≤ K ≤ (J + J ).
From Eq. (2.75), using the orthogonality relation (2.23a) of the 3-j symbols, the
following expressions are also obtained
∗
DQQ
K
(αβγ) =
J J K J J K
= (2K + 1) DMN
J
(αβγ) DM
J
N (αβγ) ;
M M Q N N Q
MM N N
J J J J J
DMN
J
(αβγ) DM N (αβγ) DM N (αβγ) =
M M M
MM M
J J J
= .
N N N
1
Note that a formal summation over Q and Q might be added in the right-hand side of
Eq. (2.75).
ANGULAR MOMENTUM AND RACAH ALGEBRA 59
Another important property of the rotation matrices is the so-called Weyl theo-
rem
2π 2π π
J ∗
dα dγ dβ sin β DMN
J
(αβγ) DM N (αβγ) =
0 0 0
2
8π
= δ δ δ , (2.76)
2J + 1 JJ MM N N
which is indeed valid only when J and J are both integers or both half-integers.
To prove the theorem, we note that Eqs. (2.73) and (2.75) give
∗ M −N
DMN
J
(αβγ) DM
J
N (αβγ) = (−1)
J J K J J K ∗
× (2K + 1) DQQ
K
(αβγ) , (2.77)
M −M Q N −N Q
K
where, if we confine ourselves to one of the two cases specified above, the sum over
K runs only over integral values. The integral in Eq. (2.76) is therefore reduced to
the sum of various integrals of the form
2π 2π π
∗
dα dγ dβ sin β DQQ
K
(αβγ) ,
0 0 0
with K, Q, and Q integers. The integrals over α and γ can be easily performed
(see Eq. (2.68)) and give the value 4π 2 δQ0 δQ 0 . On the other hand, Eq. (2.69)
yields
, -2K−2t , -2t
cos β2 sin β2
dK
00 (β) = (K! )
2
(−1)t ,
t
[(K − t)! t! ]2
and since
π/2
a! b!
(sin x)2a+1 (cos x)2b+1 dx = ,
2 (a + b + 1)!
0
we have
π
2 K!
dK
00 (β) sin β dβ = (−1)t = 2 δK0 .
K+1 t (K − t)! t!
0
4π 4π π
J ∗ 32π 2
dα dγ dβ sin β DMN
J
(αβγ) DM N (αβγ) = δ δ δ .
2J + 1 JJ MM N N
0 0 0
From the Weyl theorem in the form of Eq. (2.76) it is possible to find a highly
symmetrical expression for the integral of the product of three rotation matrices
2π 2π π
J J2 J3
dα dγ dβ sin β DM1 N1 (αβγ) DM2 N2 (αβγ) DM3 N3 (αβγ) =
1
0 0 0
2 J1 J2 J3 J1 J2 J3
= 8π .
M1 M2 M3 N1 N2 N3
It should be remarked, however, that this expression is valid only when the three
J’s are integers or when two of them are half-integers and the third is an integer.
In different cases the expression in the right-hand side is meaningless.
(i1 , i2 , . . . ik , j1 , j2 , . . . , jk = 1, 2, 3) ,
where T (j1 , j2 , . . . , jk ) and T (i1 , i2 , . . . , ik ) are the tensor components in the old
and new reference system, respectively, and where the coefficients a(il , jl ) are the
direction cosines of the new axes relative to the old axes.
ANGULAR MOMENTUM AND RACAH ALGEBRA 61
satisfy a transforma-
For instance, the Cartesian components of a usual vector A
tion of the form
Ai = a(i, j) Aj ,
j
and B
thus a vector is a Cartesian tensor of rank 1. If A are two arbitrary vectors,
defining
Cij = Ai Bj
we have
Cij = a(i, k) a(j, l) Ckl ,
kl
which shows that the dyadic product of two vectors forms a Cartesian tensor of
rank 2.
An irreducible spherical tensor of rank k is defined as a set of (2k + 1) quanti-
ties (called the components of the spherical tensor) which, under rotation of the
coordinate system, change according to the linear relation
T kq = Tpk Dpq
k
(R) (q, p = −k, −k + 1, . . . , k) , (2.78)
p
where T kq and Tpk are the tensor components in the new and old reference system,
respectively, and where R is the rotation that brings the old system into the new
one.
Contrary to the case of Cartesian tensors, the dyadic product of two irreducible
spherical tensors no longer is an irreducible spherical tensor. In fact, if Rqk and Sqk
are two irreducible tensors, the (2k + 1)(2k + 1) components of the form Rqk Sqk
change, under the rotation R, into
(Rqk Sqk ) = (Rpk Spk ) Dpq
k
(R) Dpk q (R) .
pp
Q
Recalling the conjugation property of the rotation matrices (Eq. (2.73)) and chang-
ing the dummy index Q in −P , we obtain for TQK the typical transformation law
for spherical tensors,
T K
Q = TPK DPKQ (R) .
P
and equating Ju with Jx , Jy , and Jz one gets (see Eqs. (2.7), (2.9), and (2.10))
[Jz , Az ] = 0 ,
so that, summarizing
1
A1−1 = √ (Ax − iAy )
2
1
A0 = Az
1
A11 = − √ (Ax + iAy ) . (2.82)
2
1 1
T00 = √ (−1)1−q Rq1 S−q
1
= √ (R11 S−1
1
− R01 S01 + R−1
1
S11 )
3 q 3
1
= −√ R ·S , (2.83)
3
√
which shows that, apart from a numerical factor (−1/ 3), the 0-rank tensor is
simply the scalar product of the two vectors.
For the tensor of rank 1, a direct application of Eq. (2.79), with the 3-j symbols
computed from Eqs. (2.26c,d), gives
1
1
T−1 = √ (R01 S−11
− R−11
S01 )
6
1
T01 = √ (R11 S−1
1
− R−1
1
S11 )
6
1
T11 = √ (R11 S01 − R01 S11 ) ,
6
64 CHAPTER 2
or, in terms of Cartesian components (see Eqs. (2.82) and their inverse)
i
Tx = √ (Ry Sz − Rz Sy )
6
i
Ty = √ (Rz Sx − Rx Sz )
6
i
Tz = √ (Rx Sy − Ry Sx ) .
6
Therefore the spherical components of the tensor of rank 1 determine a vector T
which, apart from a numerical factor, is simply the cross product of the vectors R
and S,
i
T = √ R ×S .
6
Finally, the components of the tensor of rank 2 are given by
1 1 1
2
T−2 = √ R−1 S−1
5
1
2
T−1 = √ (R01 S−1
1 1
+ R−1 S01 )
10
1
T02 = √ (R−11
S11 + 2R01 S01 + R11 S−1
1
)
30
1
T12 = √ (R01 S11 + R11 S01 )
10
1 1 1
T2 = √ R1 S1 .
2
(2.84)
5
It should be remarked that the adjoint (or Hermitian conjugate) of a spherical
tensor operator Tqk no longer is a spherical tensor operator. This can be easily
proved by taking the Hermitian conjugates of Eqs. (2.81), recalling that
[A, B]† = −[A† , B † ] , (J± )† = J∓ ;
we have
[Jz , (Tqk )† ] = −q (Tqk )†
[J± , (Tqk )† ] = − (k ∓ q + 1)(k ± q) (Tq∓1
k
)† .
The spherical tensor operator Oqk can be considered as the Hermitian conjugate of
the tensor Tqk ; employing for this tensor the notation T † we have
(T † )kq = Tqk ;
form
It is seen at once that the spherical components A1q of a Hermitian vector A
a Hermitian spherical tensor operator. From Eqs. (2.82) we have
1
(A1−1 )† = √ (Ax + iAy ) = −A11
2
(A10 )† = Az = A10
1
(A11 )† = − √ (Ax − iAy ) = −A1−1 ,
2
where Tqk is the q-component of a spherical tensor operator of rank k, and where
α and α are supplementary quantum numbers (relative to Hermitian operators
commuting with J 2 and Jz ), we first consider the (2k + 1)(2J + 1) vectors
Tqk |α J M ,
where J is a given angular momentum eigenvalue having the same character (in-
teger or half-integer) of (J + k), and limited between |J − k| and (J + k). We
will now show that the (2J + 1) vectors obtained by varying M in the linear
combinations (2.88) are eigenvectors of J 2 and Jz corresponding to the eigenvalues
J (J + 1) and M , respectively.
Denoting by |v(J , M ) the linear combination (2.88), we have
Jz |v(J , M ) = Jz Tqk |α J M J kM q|J M ,
Mq
and using the commutation rules of spherical tensors (Eqs. (2.81)) we obtain
Jz |v(J , M ) = (M + q) Tqk |α J M J kM q|J M
Mq
= M |v(J , M ) .
(2.89)
J+ |v(J , M ) =
= J (J + 1) − M (M + 1) Tqk |α J M +1 J kM q|J M
Mq
+ k(k + 1) − q(q + 1) Tq+1
k
|α J M J kM q|J M ,
Mq
J+ |v(J , M ) =
= Tqk |α J M J (J + 1) − M (M − 1) J k M −1 q|J M
Mq
+ k(k + 1) − q(q − 1) J kM q−1|J M ,
which can be rewritten, with the help of the recursion relation (2.21), in the form
J+ |v(J , M ) = J (J + 1) − M (M + 1) |v(J , M +1) . (2.90)
1
J 2 |v(J , M ) = Jz2 + (J+ J− + J− J+ ) |v(J , M )
2
= J (J + 1) |v(J , M ) . (2.92)
ANGULAR MOMENTUM AND RACAH ALGEBRA 67
Equations (2.89) and (2.92) show that the (2J + 1) vectors of the form
|v(J , M ) are angular momentum eigenvectors (in general not normalized) cor-
responding to the eigenvalues J and M .
Recalling the definition of |v(J , M ) we have, after inversion of Eq. (2.88) via
the orthogonality relations of the vector-coupling coefficients (Eq. (2.20b))
Tqk |α J M = J kM q|J M |v(J , M ) ,
J
and hence the scalar products (2.87) can be written in the form
αJM |Tqk |α J M = J kM q|J M αJM |v(J , M ) . (2.93)
J
On the other hand, the scalar product αJM |v(J , M ) is zero unless J = J
and M = M . Moreover, it is independent of M (see the comments after Eq. (2.30))
so that we can write
The reduced matrix element can be expressed in terms of ordinary matrix ele-
ments by inversion of Eq. (2.96). From the orthogonality relation of the vector-
coupling coefficients (Eq. (2.20a)) we have
αJ T k α J = (−1)2k J kM q|JM αJM |Tqk |α J M .
Mq
A different way for calculating the reduced matrix element is to substitute some
special values of M and M into Eq. (2.95). For the reduced matrix element of the
angular momentum operator J we have for instance (denoting J 1 by J )
and being
M
J1M 0|JM =
J(J + 1)
one obtains
αJ J α J = δαα δJJ J(J + 1) . (2.97)
An important property of the reduced matrix element can be easily deduced from
the orthogonality relation of the vector-coupling coefficients
| αJM |Tqk |α J M |2 = | αJ T k α J |2 .
Mq
The most striking consequence of the Wigner-Eckart theorem is the direct ap-
pearance of selection rules for the matrix elements of the components of spherical
tensor operators. Indeed, because of the presence of the 3-j symbol in Eq. (2.96),
the matrix element αJM |Tqk |α J M is identically zero unless
|J − k| ≤ J ≤ J + k , M = M − q .
In particular, we obtain from Eq. (2.26a) that the matrix elements of a zero-rank
spherical tensor operator are diagonal with respect to J and M , and are indepen-
dent of M .
From the Wigner-Eckart theorem an important relation can be deduced for the
reduced matrix elements of a Hermitian spherical tensor operator Hqk . Taking the
complex conjugate of Eq. (2.95) we have
∗ ∗
αJM |Hqk |α J M = (−1)2k αJ H k α J J kM q|JM . (2.98)
On the other hand, being Hqk Hermitian, from Eq. (2.86) we deduce
∗
αJM |Hqk |α J M = α J M |(Hqk )† |αJM
= (−1)r+q α J M |H−q
k
|αJM .
Applying again the Wigner-Eckart theorem to the last matrix element we get
∗
αJM |Hqk |α J M = (−1)r+q+2k α J H k αJ JkM −q|J M . (2.99)
which is the relation, valid for Hermitian spherical tensor operators only, between
reduced matrix elements of the form αJ H k α J and α J H k αJ .
In Sect. 2.7 we have shown how it is possible to construct an irreducible spherical
tensor TQK from the product of two irreducible tensors Rqk and Sqk . We want here
to show how the reduced matrix elements of these tensors are related among each
other. The Wigner-Eckart theorem, applied to the tensor TQK , gives
√ J J K
αJM |TQK |α J M = (−1)J +K+M
2J + 1 αJ T K α J .
−M M Q
From Eq. (2.79), introducing intermediate states between the tensors Rqk and Sqk
and applying again the Wigner-Eckart theorem, one obtains
√
+M +2k−K+Q
(−1)J 2J + 1
qq α J M
J J k J J k k k K
×
−M M q −M M q q q −Q
× αJ Rk α J α J S k α J =
J J K
= αJ T K α J .
−M M Q
followed by summation over M and M gives, using Eqs. (2.23a) and (2.34)1
√ !
J J K
αJ T K α J =(−1)J+J +K
2J + 1
k k J
α J
× αJ Rk α J α J S k α J , (2.102)
which shows how the reduced matrix element of a ‘composite’ spherical tensor is
related to the reduced matrix elements of the constituent tensors.
1
The deduction of Eq. (2.102) is not trivial, as it involves the contraction of four 3-j symbols
which must be suitably manipulated to obtain the formula (Eq. (2.34)) leading to the final result.
In App. 2 we give, for the unexperienced reader, an example of how the various manipulations
can be performed.
70 CHAPTER 2
A direct application of this formula to the 0-rank tensor T 0 obtained from the
scalar product of an arbitrary vector v with the angular momentum J gives (see
Eq. (2.83))
1
αJ T 0 α J = − √ αJ v · J α J
3
!
√ J J 0
= (−1)J+J 2J + 1 αJ v α J α J J α J ,
1 1 J
α J
which, using Eqs. (2.36a) and (2.97), can be written in the form
αJ v · J α J = δJJ J(J + 1) αJ v α J .
From this equation a remarkable relation, often referred to as the projection theo-
rem, can be easily proved by a double application of the Wigner-Eckart theorem,
(v · J ) J
veff = ,
J2
thus the projection theorem can be considered as the direct generalization to Quan-
tum Mechanics of this simple geometrical expression.
The projection theorem can be expressed in the equivalent form (see Eq. (2.104))
where
αJM | v · J |α JM
αJ v α J
gαα J (v ) = = (2.105)
J(J + 1) J(J + 1)
is the generalization of the Landé factor that will be encountered in the next chap-
ter.
A final application of the Wigner-Eckart theorem concerns the matrix elements of
a spherical tensor operator acting on the dynamical variables of a single component
of a composite system.
Let us consider a system composed of two parts of angular momenta J1 and J2 ,
respectively, having total angular momentum J = J1 + J2 . If Tqk (1) is a spher-
ical tensor operator (the argument (1) means that it is acting only on the first
component of the system), we have for its matrix elements
αj1 j2 JM | Tqk (1) |α j1 j2 J M =
Comparison of Eqs. (2.106) and (2.107) gives the following relation between reduced
matrix elements
Similarly, if Tqk (2) is a spherical tensor operator acting only on the second part of
the composite system, we obtain
k k
More generally, if Rq11 (1) and Sq22 (2) are two spherical tensor operators acting on
the first and second part of the composite system, respectively, and if TQK is the
spherical tensor operator constructed according to Eq. (2.79), the reduced matrix
elements are connected by the following relation
The proof of this relation, that follows from Eqs. (2.34) and (2.48), is left as an
exercise to the reader.
CHAPTER 3
ATOMIC SPECTROSCOPY
According to the theory of atomic structure (see for instance Condon and Shortley,
1935; Condon and Odabaşi, 1980), the modifications produced in atomic spectra by
an external, uniform magnetic field can be described by adding to the unperturbed
Hamiltonian H0 of the atomic system an additional term HB , called the magnetic
Hamiltonian, given by
2
e0 h + e0 (B
)·B × r )2 ,
HB = (L + 2S (3.1)
4πmc 8mc2
where e0 is the absolute value of the electron charge, m is the electron mass, h and
c have their usual meaning of Planck constant and speed of light, L and S
are the
(dimensionless) total orbital angular momentum and total spin of the electronic
is the magnetic field vector, and r is the position operator defined by
cloud, B
r = ri ,
i
where ri is the position of the i-th optical electron relative to the nucleus. For one-
electron atoms, Eq. (3.1) can be derived by taking the lowest-order, non relativistic
limit of Dirac’s equation describing the motion of a particle in an electromagnetic
field (see for instance Schiff, 1949).
The second term in Eq. (3.1) is the so-called diamagnetic term. Its importance
is very limited in practice, at least for the magnetic fields that are usually found
74 CHAPTER 3
second term 1 h3
≈ B = 1.06 × 10−10 B , (3.2)
first term 32π 3 m2 c e30
where B is expressed in G. If we exclude the extremely large fields that are found in
magnetic white dwarfs (see e.g. Angel et al., 1981, or Schmidt, 1987, for a review
on the subject) or are supposed to exist in neutron stars, the diamagnetic term can
be safely neglected in Eq. (3.1), so that we can write
e0 h )·B
= µ (L
+ 2S
)·B
,
HB = (L + 2S 0 (3.3)
4πmc
where µ0 is the so-called Bohr magneton (µ0 = 9.27 × 10−21 erg G−1 ).
If the magnetic field is so weak as to keep the magnetic energy much smaller than
the energy intervals relative to the unperturbed Hamiltonian H0 , the effect of HB
can be computed by perturbation theory (see Messiah, 1961, for an introduction to
the argument). According to this theory, the shifts of an N -fold degenerate energy
level of H0 are obtained by diagonalization of the matrix
(α) (α)
(HB )ij = ui |HB |uj (i, j = 1, . . . , N ) ,
(α)
where |uk (k = 1, . . . , N ) are the degenerate eigenvectors of H0 corresponding
to the eigenvalue Eα .
In our case, since the Hamiltonian H0 is invariant under rotations, the total
angular momentum J and its projection M along an arbitrary z-axis are good
quantum numbers, so that the eigenvectors can be written in the form |αJM
(where α is a collection of inner quantum numbers of H0 ), with
The matrix elements can be evaluated for any direction of the magnetic field in
the reference system (xyz) chosen to describe the atomic system. Writing B in the
form B = B b, where b is a unit vector in the magnetic field direction, and writing
(L+2 ) in the form (J + S
S ), where J is the total angular momentum of the atomic
system, we have
Introducing the spherical components of the three vectors, the scalar product can
be written in the form (see Eq. (2.83))
) · b =
(J + S (−1)q (Jq + Sq ) b−q ,
q
so that, applying the Wigner-Eckart theorem in the form of Eq. (2.96), and using
Eq. (2.97), one obtains
√ J 1 J
αJM |HB |αJM = µ0 B (−1) J+M+q+1
2J + 1
M q −M
q
, -
× αJ b .
J(J + 1) + αJ S (3.5)
−q
Owing to the selection rules of the 3-j symbol, these matrix elements are zero
unless (M − M ) = 0, or ±1. We have therefore a tridiagonal matrix that might
in principle be diagonalized by standard methods.
The diagonalization can however be avoided by aligning the z-axis of the reference
system with the direction of the magnetic field. In this case, in fact, the only non-
zero component of the unit vector b is b0 = 1, and we easily obtain, substituting
the value of the 3-j symbol given by Eq. (2.26d)
αJ S αJ
g =1+ (J = 0). (3.7)
J(J + 1)
Equation (3.6) shows that the magnetic Hamiltonian is now represented by a di-
agonal matrix. This means that, to the first order of perturbation theory, the
eigenstates of the total Hamiltonian (H0 + HB ) are of the form |αJM , while the
eigenvalues are given by
Thus the Hamiltonian HB removes the degeneracy, and any level characterized by
the quantum number J is split into (2J + 1) equally spaced sublevels, the splitting
being proportional to the Landé factor g and to the magnetic field.
We want to remark, however, that the states |αJM are eigenvectors of the total
Hamiltonian only when the z-axis of the reference system points in the magnetic
1
As apparent from the 3-j symbol in Eq. (3.5), the matrix elements of HB identically vanish
if J = 0. This simply means that the magnetic field does not affect, to the first order, the energy
of the atomic levels characterized by J = 0. Consequently, one can formally put g = 0 for such
levels.
76 CHAPTER 3
field direction. This means that a state having a definite value for the projection
of the total angular momentum along the direction of the magnetic field has a
well-defined energy value. When the z-axis of the reference system is not in the
direction of the magnetic field, a state of the form |αJM is an eigenvector of H0 ,
but not of (H0 + HB ).
The general expression for the Landé factor, given in Eq. (3.7), involves the
evaluation of the reduced matrix element of S. This can be easily done when
the levels are described by the Russel-Saunders (or L-S ) coupling scheme (see
e.g. Condon and Shortley, 1935). In this scheme the quantum numbers L and
S corresponding to the total orbital angular momentum and to the total spin,
respectively, are good quantum numbers, so that an eigenvector of the Hamiltonian
H0 can be written in the form
|βLSJM ,
where β is a collection of quantum numbers representing the electronic configura-
tion. The reduced matrix element is then
βLSJ ,
βLSJ S
and it can be evaluated in the easiest way with the help of the projection theorem.
From Eq. (2.105) we have
· J |βLSJM
βLSJM |S
βLSJ =
βLSJ S ,
J(J + 1)
· J in the form
and writing S
· J = 1 J 2 + S 2 − L2
S
2
we obtain for the Landé factor
gLS = 1 + γ(J, S, L) ,
In the j-j coupling scheme (see e.g. Condon and Shortley, 1935) a simple expres-
sion for the Landé factor can be obtained for levels originating from two optical
ATOMIC SPECTROSCOPY 77
β (l s )j , (l s )j , J
β (l1 s1 )j1 , (l2 s2 )j2 , J S ,
1 1 1 2 2 2
Another case where it is possible to find an analytical expression for the Landé
factor is the case of the so-called J1 -l coupling. In this coupling scheme a ‘parent’
level of orbital angular momentum L1 and spin S1 couples its total angular mo-
mentum J1 with the orbital angular momentum l of a further electron, to give an
angular momentum K which in its turn couples with the electron spin to give the
total angular momentum J. Thus the reduced matrix element to be evaluated is
the following
β (L1 S1 )J1 , l K, s, J S β (L S )J , l K, s, J ,
1 1 1
where s = 1/2. After some tedious algebra, that is left as an exercise to the reader,
one gets the formula
where gLS are the Landé factors computed from Eq. (3.8). This formula is however
of little use, because the coefficients are seldom known with a sufficient degree of
accuracy.
It should be remarked, however, that in many cases the Landé factors are known
from experimental work carried out in spectroscopy laboratories. A systematic
(but incomplete and non-updated) list of experimental g-values can be found in
78 CHAPTER 3
Moore (1949, 1952, 1958). For the FeI and FeII spectra, further g-values can be
found in Reader and Sugar (1975) and in Corliss and Sugar (1982).
For an atom in L-S coupling, the Landé factor is equal to 1 for a level having
S = 0, while it is equal to 2 for a level having L = 0. When both L and S are
non-zero, g is generally bounded between 1 and 2, although many exceptions are
found to this simple rule, especially for low J-values. In rare cases, the combination
of the quantum numbers may give a negative or vanishing value for g. L-S levels
having g = 0 are 4 D1/2 , 5 F1 , 6 G3/2 , 7 H2 , 8 I5/2 , etc., while L-S levels having g < 0
are 6 F1/2 , 7 G1 , 8 G1/2 , 8 H3/2 , etc. In other cases, large values of g can result; for
instance, L-S levels having g ≥ 2.5 are 4 P1/2 (g = 2.667), 5 P1 (g = 2.5), 6 D1/2
(g = 3.333), 7 D1 (g = 3), 8 D3/2 (g = 2.8), 8 F1/2 (g = 4), etc.
Let’s now consider the Zeeman pattern to be expected in the transition between
two atomic levels, both split by the presence of a magnetic field. If J and J are the
angular momentum quantum numbers of the lower and upper level, respectively,
and if g and g are the corresponding Landé factors, the spectral line originating
from the transition between the two levels splits, owing to the magnetic field, into
a collection of components whose frequencies are given by
µ0 B
JJ
νMM = ν0 + (g M − gM ) , (3.9)
h
where ν0 is the frequency of the unperturbed line and where M and M are the
magnetic quantum numbers of the lower and upper sublevels, respectively. Formula
(3.9) is often written in the form
= ν0 + νL (g M − gM ) ,
JJ
νMM
where
µ0 B e B
= 0
νL = (3.10)
h 4πmc
is the so-called Larmor frequency; numerically we have
νL λ2 e B
∆λB = λ20 = 0 0 2 . (3.13)
c 4πmc
ATOMIC SPECTROSCOPY 79
Numerically we have
∆λB = 4.6686 × 10−10 λ20 B , (3.14)
where ∆λB is in mÅ, λ0 in Å, and B in G.
It is important to note that, of the various Zeeman components (whose wave-
lengths are given by Eq. (3.12)) arising in a given transition, only a limited number
is actually observed: this is due to selection rules that depend on the interaction
between atoms and radiation field. The general properties of Zeeman patterns
(namely the number of observed lines, their wavelength position, their relative
strength and polarization features) depend on the kind of interaction (electric-
dipole, magnetic-dipole, electric-quadrupole, etc.). In the rest of this section we
summarize the main properties of the Zeeman patterns resulting from the electric-
dipole interaction, which is the simplest one and the most important in practice.
The reader is referred to Sect. 9.1 for a formal derivation of what is anticipated
here.
The selection rule for electric-dipole transitions is
∆M = M − M = 0, ±1 , (3.15)
1
In absorption, the situation is more complicated because the polarization of the radiation
absorbed by the σr components depends also on the polarization of the incident radiation. For
the special case of an unpolarized incident beam, the absorbed radiation has the same polariza-
tion characteristics as the radiation that would be emitted in the direction of the beam. As a
consequence, the opposite polarization will be present in the beam after absorption.
80 CHAPTER 3
σ
σ
σ π σ
σ
π
σ
σ
σ
Fig.3.1. The polarization properties of the radiation emitted by different Zeeman components
depend on the type of component and on the angle between the observing direction and the
magnetic field vector. For the radiation emitted along the x, y, and z-axes, the tip of the wave
electric field vector draws, in a fixed point, the figures here shown.
TABLE 3.1
Analytical expressions for the strengths of Zeeman components. J and M are the quantum
numbers of the lower sublevels, J and M the quantum numbers of the upper sublevels.
J = J + 1 J = J J = J − 1
where the index q = −(M − M ) = −∆M is equal to −1, 0, +1 for the σb , π, and
σr components, respectively. It can be easily seen that the relative strengths, as
defined by Eq. (3.16), are normalized to unity; from Eq. (2.23a) we have in fact
J J 1
2
SqJJ (M, M ) = 3 =1 (q = −1, 0, +1) . (3.17)
−M M −q
MM MM
Using the analytical expressions of the 3-j symbols (Eqs. (2.26)), the strengths of
the various components can be written as algebraic functions of M and M . The
results are summarized in Table 3.1.
From Eq. (3.16) one can prove some symmetry properties for the strengths. Equa-
tions (2.24) and (2.25) give
SqJJ (M, M ) = S−q
JJ
(−M, −M ) (3.18)
SqJJ (M, M ) = SqJ J (−M , −M ) . (3.19)
On the other hand, defining the wavelength displacement from line center via
Eq. (3.12),
MM = −∆λB (g M − gM ) ,
∆λJJ (3.20)
we also have
MM = −∆λ−M−M
∆λJJ JJ
(3.21)
∆λJJ J J
MM = ∆λ−M −M . (3.22)
Equations (3.18) and (3.21) show that for any σr (σb ) component there is a σb
(σr ) component of the same strength symmetrical about line center, and for any
82 CHAPTER 3
d2 x e dx − γ dx ,
= −4π 2 ν02 x − 0 ×B (3.23)
dt2 mc dt dt
where γ is a constant which accounts for the damping of the electron due to its
energy loss by radiation.
The actual value of this constant, that is introduced here in a purely phenomeno-
logical way, can be deduced by equating the work per unit time done on the electron
by the friction force
2
dL dx
= mγ
dt dt
with the energy loss by radiation per unit time
2
2e2 d2 x
W = 30 .
3c dt2
By averaging these quantities over an oscillation period, and disregarding the slight
difference – due to the magnetic field – between the actual oscillation frequency
and ν0 , we have
8π 2 e20 2 8π πe20 1
γ= ν = . (3.24)
3 mc3 0 3 mc λ20
1
In Beckers’ table the Landé factors of the levels involved in the transitions are calculated
according to the L-S coupling scheme (Eq. (3.8)).
ATOMIC SPECTROSCOPY 83
Fig.3.2. Characteristic Zeeman patterns for different transitions. The Landé factors are computed
according to the L-S coupling scheme. Following the usual convention, the π components are
drawn upward and the σ components downward. The σb components in panels (c) and (f) are
dashed for clarity; the σr and σb components at line center in panel (c) are drawn somewhat
apart but they actually coincide. From left to right, top to bottom, we have patterns of Type 0
(a), Type II (b), Type III (c,d), and Type I (e,f) (see Sect. 3.3 for the definition of Type).
84 CHAPTER 3
To find the solution to Eq. (3.23) we introduce a real unit vector u0 parallel to
B and two real unit vectors ur and us in the plane perpendicular to B, in such a
way that (ur , us , u0 ) form, in this order, a right-handed coordinate system. From
ur and us we next define two complex unit vectors u±1 by the equation
1
u±1 = √ (∓ ur + i us ) . (3.25)
2
Expanding x on the basis uα (α = 0, ±1),
x = xα uα ,
α
d2 xα dx dx
= −4π 2 ν02 xα + 4πi α νL α − γ α (α = 0, ±1) , (3.27)
dt2 dt dt
where νL is the Larmor frequency defined in Eq. (3.10). If we now look for a
solution of the form
−2πi να t
xα = Aα e ,
and observe that the quantities νL and γ are generally much smaller than the
frequency ν0 ,1 we obtain
γ
−2πi (ν0 −ανL ) t −2t
xα = Aα e e , (3.28)
where the constants Aα are to be determined from the initial conditions and depend
on the excitation mechanism of the classical dipole.
To determine the radiation emitted by our classical model atom we have just to
recall some important results from classical electrodynamics. It is well-known (see
e.g. Jackson, 1962) that an oscillating, monochromatic dipole p (t) produces in the
radiation zone an electromagnetic wave whose frequency is the same as that of the
dipole, and whose electric field is described by the equation
ikr , - 2 ikr
t) = k 2 e
E (r, Ω, × p (t) × Ω
Ω = k e p⊥ (t) , (3.29)
r r
where k is the wavenumber, r is the distance from the dipole, Ω is a unit vector in
p−Ω
the direction of propagation, and p⊥ = (Ω
·
p ) is the component of the dipole
1
For instance, for a spectral line at 5000 Å and a magnetic field of 103 G, we have (see
Eqs. (3.11) and (3.24))
νL 1.4 × 109 s−1 , γ 8.9 × 107 s−1 , ν0 6.0 × 1014 s−1 .
ATOMIC SPECTROSCOPY 85
Fig.3.3. Geometry for calculating the polarization properties of the radiation emitted by a classical
The unit vector e is the reference direction for defining the Stokes
dipole in a magnetic field B. 1
parameters.
∞
−2πiνt
pi (t) = −e0 Cαi Aα Fα (ν) e dν , (3.32)
α −∞
1
Indeed, in this formula, the direction cosines could be just defined as Cαi = uα · ei , since
the unit vectors e1 and e2 are real. However, Eq. (3.30) and the following ones up to Eq. (3.37)
are valid also when the unit vectors e1 , e2 are of the more general form of Eqs. (1.41).
86 CHAPTER 3
where
∞ γ
−2πi (ν0 −ανL −ν) t −2t i 1
Fα (ν) = e e dt = − (3.33)
2π (ν0 − ανL − ν) − iΓ
0
with Γ = γ/4π. Thus the component of the electric field along the unit vector ei
can be written in the form (as k = 2πν/c)
2 ∞
t) = − 4π e0
Ei (r, Ω, Cαi Aα Fα (ν) ν 2 e
i(kr−2πνt)
dν ,
rc2 α −∞
and hence
16π 4 e20 ∗
Ei∗ Ej = Cαi Cβj A∗α Aβ
r 2 c4
αβ
∞ ∞
i(k −k)r −2πi(ν −ν)t
× dν dν Fα (ν)∗ Fβ (ν ) ν 2 ν 2 e e .
−∞ −∞
To obtain the polarization tensor we must take the average of this quantity over a
time interval τ sufficiently long as specified in Sect. 1.4. This leads to
τ /2
−2πi(ν −ν)t 1 −2πi(ν −ν)t 1
e = lim e dt = δ(ν − ν ) ,
τ τ →∞ τ
−τ /2
∞
16π 4 e20 ∗ A∗α Aβ
Jij (r) = 2 4 Cαi Cβj ν 4 Fα (ν)∗ Fβ (ν) dν . (3.34)
r c τ
αβ −∞
The quantities Aα depend on the initial conditions for the motion of the oscillator.
For random initial conditions (as one would have, for instance, if the oscillator
were collisionally excited by a population of perturbing particles with isotropically
distributed velocities), these quantities are uncorrelated with each other, so that,
performing a statistical average over the initial conditions, we can replace A∗α Aβ
by |A|2 δαβ . On the other hand, the quantity |A|2 is related to the average energy
Ē contained in each of the three degrees of freedom of the oscillator. In fact, since
1
Note that this formula contains the normalization time τ which is not related to any physical
quantity. We will see shortly (Eq. (3.35)) how this quantity disappears from the final results.
ATOMIC SPECTROSCOPY 87
for a classical oscillator the average kinetic energy is equal to the average potential
energy, writing Aα = |A| eiφ we have from Eq. (3.28)
2
Ē = 4π 2 ν02 m Re(xα ) = 4π 2 ν02 m |A|2 cos2 [2π(ν0 − ανL )t − φ] e−γt ,
and being the time interval τ much larger than the decay time γ −1 of the oscillator,
we can write
τ
1 1 −γt 2π 2 ν02 m
Ē = 4π 2 ν02 m |A|2 lim e dt = |A|2 . (3.35)
2 τ τ →∞ γτ
0
Assum-
Let us now introduce the emission coefficient in tensorial form, εij (ν, Ω).
ing N oscillators per unit volume, we have (see the analogous Eq. (1.31))
∞
dν dt dΩ = c
εij (ν, Ω) J (r) N dt r2 dΩ , (3.36)
8π ij
−∞
= πe20 ν2 ∗ 1 Γ
εij (ν, Ω) N 02 Ē Cαi Cαj . (3.37)
mc c α
π (ν0 − ανL − ν) + Γ
2 2
1 Γ
φb =
π (ν0 + νL − ν)2 + Γ 2
1 Γ
φp =
π (ν0 − ν)2 + Γ 2
1 Γ
φr = , (3.38)
π (ν0 − νL − ν)2 + Γ 2
and expressing the direction cosines for the geometry specified in Fig. 3.3,
1
C±1 1 = √ (∓ cos θ cos φ + i sin θ sin φ)
2
C0 1 = − sin θ
1
C±1 2 = √ (± sin φ + i cos φ)
2
C0 2 = 0 , (3.39)
88 CHAPTER 3
= ε (ν, Ω)
εI (ν, Ω) = k 1 φ sin2 θ + φb + φr (1 + cos2 θ)
+ ε (ν, Ω)
11 22
2 p 2
= ε (ν, Ω)
− ε (ν, Ω)
=k 1 φ + φ
εQ (ν, Ω) 11 22 φ − b r
sin2 θ
2 p 2
= ε (ν, Ω)
εU (ν, Ω) + ε (ν, Ω)
=0
12 21
, -
= i ε (ν, Ω)
εV (ν, Ω) = k 1 φ − φ cos θ ,
− ε (ν, Ω) (3.40)
12 21
2 r b
where
πe20 2ν 2
k= N 20 Ē .
mc c
These formulae show that our collection of randomly excited classical model
atoms emits along B (θ = 0) a radiation that is right-handed (or positively) circu-
larly polarized around ν = (ν0 − νL ) (in the ‘red’ component σr ) and left-handed
(or negatively) circularly polarized around ν = (ν0 + νL ) (in the ‘blue’ component
σb ). Obviously, the handedness of circular polarization changes when the direc-
tion is reversed (θ = π). For θ = π/2, on the contrary, the emitted radiation is
linearly polarized. The direction of linear polarization is parallel to the magnetic
field in the π component and perpendicular to the magnetic field in the σb and σr
components.
To sum up, we see that the classical theory provides a satisfactory explanation
for the normal Zeeman effect with respect both to the wavelength separation of the
Zeeman components and to their polarization characteristics. In the classical theory
the frequency separation of the components is found to be νL , which corresponds
to a Landé factor g = 1. This is quite obvious since g-values other than 1 are
produced by the spin (see Eq. (3.7)), that cannot be accounted for by a classical
theory.
We also want to remark that the results obtained for the polarization of the
various components depend on the negative sign that we have assumed for the
electric charge (see Eq. (3.23)). Assuming a positive charge for the electron, νL has
to be replaced by -νL in Eqs. (3.27) and following, and a further sign change must be
performed in Eq. (3.30). This last change is irrelevant, as the Stokes parameters
are quadratic functions of the dipole components, so that the net result is the
exchange of φr and φb in Eqs. (3.40), which leads to the opposite sign for εV (ν, Ω).
In other words, a sign inversion in the electron charge produces a sign inversion of
circular polarization. This also means that by measuring the handedness of circular
polarization in the various Zeeman components it is possible to determine the sign
of the electron charge.
This was just the procedure followed by Lorentz and Zeeman to give, for the
first time, the correct negative sign for the electron charge. It is curious to note,
however, that at first they deduced the wrong sign: hence the witty remark by Segrè
(1976) that ‘when signs are involved, even two Dutch physicists as scrupulous as
Lorentz and Zeeman may make errors’.
ATOMIC SPECTROSCOPY 89
The first parameter that can be used to characterize Zeeman patterns is the so-
called effective Landé factor ḡ,1 which represents the wavelength shift from line
center of the ‘center of gravity’ of the σr components in units of ∆λB (sometimes
also called Lorentz units). For an electric-dipole transition connecting a given level,
having angular momentum J1 and Landé factor g1 , with another level of angular
momentum J2 and Landé factor g2 , the value of ḡ can be easily derived from
Eqs. (3.12) and (3.16)
/ J1 J2 0
JJ λM M − λ0
1 2 1 2
ḡ = S1 (M1 , M2 )
∆λB
M1 M2
2
J2 J1 1
= 3 (g1 M1 − g2 M2 ) . (3.41)
−M2 M1 −1
M1 M2
This expression is valid both when the index 1 refers to the lower level and the
index 2 to the upper level, and in the opposite case. This is a consequence of the
symmetry properties discussed at the end of Sect. 3.1, and reflects the fact that
Zeeman patterns depend only on the quantum numbers of the levels involved in
the transition, and not on which of them is the lower or the upper.
Obviously, because of the symmetry properties just mentioned (Eqs. (3.18)-
(3.22)), the center of gravity of the σb components is -ḡ, while the center of gravity
of the π components is 0.
Equation (3.41) can be transformed into a simpler expression by means of some
Racah algebra. First we note that, with the help of Eq. (2.26d), we can write for
M1 and M2
J1 −M1 J1 J1 1
M1 = J1 (J1 + 1)(2J1 + 1) (−1) (3.42a)
M1 −M1 0
J2 J2 1
M2 = J2 (J2 + 1)(2J2 + 1) (−1)J2 −M2 . (3.42b)
M2 −M2 0
Next we observe that the term originating from M1 in Eq. (3.41) can be written,
using Eq. (2.42), in the form
J J1 1
2
J1 J1 1
2 (−1)J1 −M1 =
−M2 M1 −1 M1 −M1 0
M1 M2
J1 J1 1 J2 J1 1
= (−1)J1 −M1
M1 −M1 0 −M2 M1 −1
M1 M1 M2
J2 J1 1
× =
−M2 M1 −1
1
The quantity ḡ is often denoted by the symbol z in the literature concerning magnetic stars.
90 CHAPTER 3
!
1 1 1 1 1 1
= (−1)1+J1 +J2 . (3.43)
−1 1 0 J1 J1 J2
Performing the same transformation on the other term of Eq. (3.41), and taking
into account the analytical expressions of the 3-j and 6-j symbols (Eqs. (2.26d)
and (2.36d)), we obtain
1 1
ḡ = (g + g2 ) + (g1 − g2 ) d , (3.44)
2 1 4
where
d = J1 (J1 + 1) − J2 (J2 + 1) .
Note that this formula, originally due to Shenstone and Blair (1929), is invariant
under interchange of the indices 1 and 2. In most cases, Eq. (3.44) is employed
together with the Landé factors obtained from the L-S coupling scheme (Eq. (3.8)).
The resulting ḡ-value, that will be denoted in the following by ḡLS , can be easily
calculated for any given electric-dipole transition (see e.g. Beckers, 1969b). A more
appropriate value for ḡ can be obtained by substituting in Eq. (3.44) the values g1
and g2 deduced by spectroscopic measurements. In some cases, and especially for
lines belonging to complex spectra and involving levels of high excitation potential,
the L-S coupling scheme fails, so that the value ḡLS may be very different from ḡ.
Comparisons between ḡ and ḡLS values for several lines of the FeI and FeII spectra
have been published by Landi Degl’Innocenti (1982a) and by Solanki and Stenflo
(1985).
Typical ḡ values range from 0.5 to 2.5, but, for some transitions, ḡ can attain
null or even negative values, while for other transitions it can be larger than 2.5. A
list of transitions characterized by such ‘anomalous’ values of the effective Landé
factor is presented in Table 3.2. The list includes all the transitions between L-S
levels having S ≤ 7/2 and L ≤ 5, and subjected to the following limitations
∆J = 0, ±1 , ∆S = 0, ±1 , ∆L = 0, ±1, ±2 .
Lines having either exceptionally large values or null values for their effective Landé
factor are of great importance for the study of solar (and stellar) magnetism. Obvi-
ously, lines with large ḡ values produce strong polarization signals, and are partic-
ularly useful for the detection and measurement of magnetic fields. On the other
hand, lines with ḡ = 0 are especially useful for velocity measurements and for
calibration. A reduced list of lines with large ḡ-values has been given by Harvey
(1973), while Sistla and Harvey (1970) have presented a list of ḡ = 0 lines. A table
of close line pairs (less than 100 Å apart), with one line having a large ḡ value and
the other a small (or null) ḡ value, has been presented by Robinson (1980).
In some studies of stellar magnetism, it can be useful to compute an average
value for the effective Landé factor over the lines contained in a given statistical
sample or in an assigned spectral interval. For instance, considering the sample
of approximately 400 unblended lines of the FeI solar spectrum given by Stenflo
and Lindegren (1977), if the effective Landé factor of each line is weighted by the
ATOMIC SPECTROSCOPY 91
TABLE 3.2
Transitions having ‘anomalous’ values for the effective Landé factor ḡLS . The transitions listed
are those obeying the restrictions specified in the text.
ḡLS < 0
4P
1 − 4F 3 −0.167 8D
3 − 6G 5 −0.6 8F
1 − 6G 3 −1.
2 2 2 2 2 2
5P
1 − 3 F2 −0.25 5F
1 − 7 G1 −0.25 8F
1 − 8H 3 −1.5
2 2
4D
1 − 6F
1 −0.333 6F
1 − 6F
1 −0.667 8F
3 − 6H
5 −1.
2 2 2 2 2 2
5D
0 − 7G
1 −0.5 6F
1 − 8G
1 −1. 8F
3 − 8H
5 −0.3
2 2 2 2
5D
1 − 5G
2 −0.25 6F
1 − 8H
3 −0.333 8F
5 − 6H
7 −0.286
2 2 2 2
6D
1 − 4F 3 −0.333 6F
3 − 6H 5 −0.3 6G
3 − 8H 3 −0.2
2 2 2 2 2 2
6D
1 − 6G 3 −0.833 6F
5 − 4H 7 −0.143 7G
1 − 7 G1 −0.5
2 2 2 2
6D
3 − 4G
5 −0.4 7F
0 − 7G
1 −0.5 8G
1 − 8G
1 −1.333
2 2 2 2
7D
1 − 5G
2 −1. 7F
1 − 5G
2 −0.25 8G
1 − 8H
3 −0.167
2 2
ḡLS = 0
3P
0 − 5 F1 7D
1 − 5 F2 7F
2 − 7 H3
6P
3 − 4F 5 8D
5 − 6G 7 7F
3 − 5 H4
2 2 2 2
4D
1 − 4D 1 5F
1 − 5 F1 8F
3 − 6G 5
2 2 2 2
4D
1 − 6G
3
5F
1 − 7F
0
6G
3 − 6G 3
2 2 2 2
5D
0 − 5 F1 5F
1 − 7 H2 7H
2 − 7 H2
5D
2 − 3 G3 5F
2 − 5 H3
ḡLS ≥ 3
4P
1 − 6D 1 3. 6D
1 − 8F 1 3.667 8D
5 − 6G 3 3.6
2 2 2 2 2 2
6P
3 − 4D
1 3. 7D
1 − 7D
1 3. 8D
7 − 6G
5 3.
2 2 2 2
6P
3 − 6F
1 3.167 7D
1 − 7F
0 3. 7F
3 − 7H
2 3.
2 2
6P
5 − 4F
3 3. 7D
2 − 5F
1 3. 7F
4 − 5H
3 3.
2 2
7P
2 − 5 F1 3.5 7D
2 − G1
7 3.25 8F
1 − 8F 1 4.
2 2
8P
5 − 6F
3 3.2 7D
3 − 5G
2 3.167 8F
5 − 6G
3 3.
2 2 2 2
5D
0 − 7D
1 3. 8D
3 − 6F
1 3.667 8F
5 − 8H
3 3.3
2 2 2 2
6D
1 − 6D
1 3.333 8D
3 − 8G
1 3.833 8F
7 − 6H
5 3.286
2 2 2 2 2 2
92 CHAPTER 3
central depression observed for the same line, one finds the value ḡLS = 1.22
(Landi Degl’Innocenti, 1985a).1
Besides the effective Landé factor, higher order moments can be introduced to
characterize Zeeman patterns. For each type of component we define the barycen-
(n)
tric n-th order moment vq to be
JJ J J2 n
vq(n) (J1 , J2 ) = Sq 1 2 (M1 , M2 ) ∆λM1 M − q ḡ ∆λB , (3.45)
1 2
M1 M2
J J J J
where ∆λM1 M 2
and Sq 1 2 (M1 , M2 ) are defined in Eqs. (3.20) and (3.16), and where
1 2
q = −1, 0, +1 for the σb , π, and σr components, respectively.
Using the symmetry properties derived in Sect. 3.1 (Eqs. (3.18)-(3.22)), and
recalling that ḡ is invariant under interchange of J1 and J2 , one easily gets
Equation (3.46) shows that the various moments are not altered by the interchange
of the lower and the upper level – an obvious consequence of the same property
holding for the Zeeman patterns. Equation (3.47) shows that the odd-order mo-
ments of the π components identically vanish, while, for the σ components, the
even-order moments of the σb and σr components are the same, and the odd-order
moments have opposite sign. Obviously, all the even-order moments are positive
quantities.
We will now show that the barycentric moments defined in Eq. (3.45) can be
reduced to a simple form which contains 3-j and 6-j symbols only. Substitution of
Eqs. (3.20) and (3.44) into Eq. (3.45) leads after some algebra to the expression
with
d+2
n
J J
Xq(n) (J1 , J2 ) = M1 − q Sq 1 2 (M1 , M2 )
4
M1 M2
n n−i
n d+2
= (−1)n−i q Zq(i) (J1 , J2 ) , (3.49)
i=0
i 4
(i)
where Zq (J1 , J2 ) is the i-th moment of M1 weighted by the strength of the q-
transition originating from M1 ,
J J
Zq(i) (J1 , J2 ) = M1i Sq 1 2 (M1 , M2 ) . (3.50)
M1 M2
1
Owing both to the L-S assumption and to the procedure itself by which it is obtained, this
value has only an approximate and statistical meaning.
ATOMIC SPECTROSCOPY 93
(i)
The quantities Zq (J1 , J2 ) can be evaluated by some Racah algebra. For this
purpose we note that the α-component of an angular momentum a can be written
in the form (see Eqs. (3.42))
a−α
a a 1
α = (−1) a(a + 1)(2a + 1) . (3.51)
α −α 0
where the sum over f runs over the values 0 and 2 only (for f = 1 the first 3-j
symbol in the right-hand side is zero).
Repeated application of the same procedure gives the following remarkable for-
mula
n/2
αn = (−1)(2n−3)a+α a(a + 1)(2a + 1)
× (2f1 + 1)(2f2 + 1) · · · (2fn−1 + 1)
f1 f2 ···fn−1
! ! !
1 1 f1 1 f1 f2 1 fn−2 fn−1
× ···
a a a a a a a a a
1 1 f1 1 f1 f2 1 fn−2 fn−1
× ···
0 0 0 0 0 0 0 0 0
a a fn−1
× (n = 2, 3, . . .) .
α −α 0
Owing to the property (2.24) of the 3-j symbols, the indices f1 , f2 ,. . . are alternately
even and odd numbers. Moreover, they are chained in such a way that the allowed
values for fi+1 are (fi ± 1), starting from f1 = 0 and f1 = 2. For example, if n = 4
the chains of allowed values for f1 , f2 , and f3 are: (0,1,0), (0,1,2), (2,1,0), (2,1,2),
(2,3,2), (2, 3, 4).
We now use the above expansion to express the quantity M1i in Eq. (3.50), and
substitute the explicit expression for the strengths given by Eq. (3.16). Performing
94 CHAPTER 3
the sum over M1 and M2 by the same method that leads to Eq. (3.43), we obtain,
for i ≥ 2
Zq(i) (J1 , J2 ) =
i/2
= (−1)(2i−3)J1 −J2 +q 3 J1 (J1 + 1)(2J1 + 1)
× (2f1 + 1)(2f2 + 1) · · · (2fi−1 + 1)
f1 f2 ···fi−1
! ! !
1 f1 1 1 f1 f2 1 fi−2 fi−1
× ···
J1 J1 J1 J1 J1 J1 J1 J1 J1
1 1 f1 1 f1 f2 1 fi−2 fi−1
× ···
0 0 0 0 0 0 0 0 0
!
1 1 fi−1 1 1 fi−1
× ,
−q q 0 J1 J1 J2
Zq(0) (J1 , J2 ) = 1
These formulae, together with Eqs. (3.49) and (3.48), give the required expres-
sions for the barycentric moments. Using Eqs. (2.26) and (2.36) one can indeed
obtain simple analytical formulae for the low-order moments. The results, that
are contained in Table 3.3, show the characteristic symmetrical behavior of the
(n)
quantities Xq (J1 , J2 ) under interchange of J1 with J2 and of q with −q.
Besides the barycentric moments of the components, it is also convenient to
introduce the moments tout court. These moments are defined by
J J J J2 n
wq(n) (J1 , J2 ) = Sq 1 2 (M1 , M2 ) ∆λM1 M , (3.53)
1 2
M1 M2
and it can be easily seen that they satisfy the same symmetry properties found for
the barycentric moments (Eqs. (3.46) and (3.47)).
The relations between the two types of moments are the following
n
n n−i
vq(n) (J1 , J2 ) = (−1) n−i
wq(i) (J1 , J2 ) q ḡ ∆λB
i=0
i
n
n n−i
wq(n) (J1 , J2 ) = vq(i) (J1 , J2 ) q ḡ ∆λB .
i=0
i
ATOMIC SPECTROSCOPY 95
TABLE 3.3
Analytical expressions for the low-order barycentric moments of Zeeman components. (∗) The
expressions have been simplified taking into account that, for electric dipole transitions, the
quantity [d(2s − d2 )] is identically zero.
(0) (0)
Zq =1 Xq =1
(1) (1)
Z0 =0 X0 =0
(1) (1)
Z±1 = ± 1
4
(d + 2) X±1 = 0
(2) (2)
Z0 = 1
10
(−d2 + 3s − 2) X0 = 1
10
(−d2 + 3s − 2)
(2) (2)
Z±1 = 1
20
(d2 + 5d + 2s + 2) X±1 = 1
80
(−d2 + 8s − 12)
(3) (3)
Z0 =0 X0 =0
(3) (3)
Z±1 = ± 1
80
(3d3 + 6d2 + 12s + 8d − 8) X±1 = ± 1
160
d(4 − d2 )
(4) (4)
Z0 = 1
140
(−3d4 + 7d2 + 15s2 − 30s + 20) X0 = 1
140
(−3d4 + 7d2 + 15s2 − 30s + 20)
(4) (4)
Z±1 = 1
280
(3d4 + 21d3 + 14d2 + 6s2 + 16s X±1 = 1
8960
(−9d4 + 56d2 + 192s2 − 832s
(∗)
s = [J1 (J1 + 1) + J2 (J2 + 1)] , d = [J1 (J1 + 1) − J2 (J2 + 1)]
(n)
After some algebraic manipulations, wq can be expressed in the form
where
n
n
G(n)
q (J1 , J2 ) = (g1 − g2 )i (g2 q)n−i Zq(i) (J1 , J2 ) . (3.55)
i=0
i
(n)
Analytical formulae for the quantities Gq (J1 , J2 ) up to n = 4 are contained in
Table 3.4.
(n) (n)
An alternative derivation of the quantities vq and wq in terms of Bernoulli
polynomials has been given by Mathys and Stenflo (1987a,b), who employ the
(n) (q)
notations µq and n! (−∆λB )n Cn , respectively. In their second paper (1987b),
(n) (q)
these authors give also extensive numerical tables for µq and Cn ; these last
quantities are however evaluated, for any given transition, by assuming for the
Landé factors the values deduced from the L-S coupling scheme.
Zeeman patterns can be classified into four different types according to the value
of their barycentric moments, with particular emphasis on the third-order mo-
ment v1(3) .
First of all, we will call Type 0 the normal and anomalous Zeeman triplets (see
(n)
Sect. 3.1): obviously, these patterns are characterized by vq = 0 for n ≥ 1. A
complete list of transitions giving rise to Type 0 patterns has been published by
Mathys and Stenflo (1987b).
96 CHAPTER 3
TABLE 3.4
(∗)
Analytical expressions for the low-order moments of Zeeman components
(0)
Gq =1
(1)
G0 =0
(1)
G±1 = ± ( 12 gs + 14 dgd ) = ± ḡ
(2)
G0 = 1
10
(3s − d2 − 2)gd2
(2)
G±1 = 1
20
(d2 + 2s − 3)gd2 + 5dgs gd + 5gs2
(3)
G0 =0
(3)
G±1 = ± 1
80
(3d2 − 7)dgd3 + 6(d2 + 2s − 3)gd2 gs + 15dgd gs2 + 10gs3
(4)
G0 = 1
140
(−3d4 + 7d2 + 15s2 − 30s + 20)gd4
(4)
G±1 = 1
560
(6d4 − 14d2 + 12s2 − 52s + 51)gd4 + 14(3d2 − 7)dgd3 gs + 42(d2 + 2s − 3)gd2 gs2
+70dgd gs3 + 35gs4
(∗)
s = [J1 (J1 + 1) + J2 (J2 + 1)] , d = [J1 (J1 + 1) − J2 (J2 + 1)] , gs = (g1 + g2 ) , gd = (g1 − g2 )
For anomalous Zeeman patterns, we will call, following Back and Landé (1925),
Type I those having v1(3) > 0, Type II those having v1(3) < 0, and Type III those
with v1(3) = 0. As it can be argued from the definition of the barycentric moments
(Eq. (3.45)), patterns of Type I have their strongest σr component toward the blue
side of the spectrum, while patterns of Type II have their strongest σr component
toward the red side of the spectrum. Finally, Type III patterns have σ components
symmetrical about their center of gravity. Some examples of Zeeman patterns of
different types are shown in Fig. 3.2.
From the analytical expressions given in Table 3.3 and from Eq. (3.48) we have
1
v1(3) = ∆λ3B (g1 − g2 )3 d (4 − d2 ) .
160
This formula shows that Type III patterns can only originate when d = 0, or, in
other words, when J1 = J2 . By contrast, Type I and Type II patterns correspond
to transitions having ∆J = ±1. In particular, Type I patterns arise when the
Landé factor of the level having the smaller J is larger than the Landé factor of the
level having the larger J, while the opposite situation leads to Type II patterns.
It should be remarked that the classification of Zeeman patterns, although based
on the σ components, characterizes at the same time the behavior of the π com-
ponents. Indeed, with increasing distance from line center the strengths of the π
components decrease (quadratically) for Type I and Type II patterns, while they
increase (also quadratically) for Type III patterns. This is apparent from Table 3.1
and Eq. (3.20).
ATOMIC SPECTROSCOPY 97
It should also be mentioned that a useful concept for certain applications is that
of minimal equivalent Zeeman pattern. Given an arbitrary Zeeman pattern, we
define its minimal equivalent as the pattern having only two π components and
two σ components (both for the blue and the red wing) with such splittings and
strengths that the barycentric moments up to n = 3 of the original pattern are
reproduced. The minimal equivalent Zeeman pattern is sketched in Fig. 3.4. It can
be shown that the strengths Si and the splittings zi (expressed in units of ∆λB ,
and satisfying the condition z2 > z1 ) are given by
1
z0 = |gd | X0(2) S0 =
2
1 1 z
z1 = ḡ − |g | X1(2) 4 + z2 − z S1 = 1+ √
2 d 2 4 + z2
1 1 z
z2 = ḡ + |gd | X1(2) 4 + z2 + z S2 = 1− √ ,
2 2 4 + z2
where
gd X1(3)
|gd | X (2) 3/2
z= .
1
Note that the value of z remains undefined when X1(2) = 0; this is a degenerate
case corresponding either to a Zeeman triplet (Type 0 pattern) or to the transition
1/2 → 1/2. In both cases the two σ components merge into a single one with
z1 = z2 = ḡ, (S1 + S2 ) = 1.
As already stated in Sect. 3.1, when the magnetic field is so strong as to produce
on a J-level a splitting comparable with the energy separation between different
98 CHAPTER 3
J-levels, the perturbation theory presented for the Zeeman effect cannot be applied
any longer. In this regime, that is called the Paschen-Back effect regime, the energy
levels have to be found by diagonalization of the total Hamiltonian H given by
H = H0 + HB ,
and, in particular, when the quantization axis for J is in the magnetic field direction
= µ0 B M δαα δJJ
√ J J 1
+ (−1)J +M+1
2J + 1 α J
αJ S δMM . (3.56)
−M M 0
Equation (3.56) shows that the magnetic Hamiltonian HB connects only eigenstates
having the same value of the magnetic quantum number M : in other words, it is
block-diagonal, each block being characterized by an assigned M -value.
The general problem of finding the eigenvalues and the eigenvectors of the matrix
H can be solved only by numerical methods. There are, however, some special cases
where the analytical calculations can be pushed somewhat further. If we suppose,
for instance, that the atomic system is exactly described by the L-S coupling
scheme, the eigenstates of H0 are of the form |βLSJM (where β summarizes the
electronic configuration quantum numbers), so that Eq. (3.56) becomes
βLSJM |HB |β L S J M =
From this expression we obtain the important result that in the L-S coupling
scheme the magnetic Hamiltonian is block-diagonal, each block being now charac-
terized by the magnetic quantum number M and by the quantum numbers (β, L, S)
specifying the term. In other words, the magnetic field produces a J-mixing of the
various levels belonging to a particular term, so that the eigenvectors of the total
Hamiltonian will be of the form
j
|βLSjM = CJ (βLS, M ) |βLSJM , (3.58)
J
where the index j labels the different states of the N -fold degenerate subspace
corresponding to assigned values of (β, L, S, M ); N is given by
N = 1 + L + S − max(|M |, |L − S|) .
Of course, in contrast with the Zeeman effect regime, J is not a good quantum
number.
We now recall that in the L-S coupling scheme the energies EβLS (J) of the
different J-levels originating from a given term (β, L, S) are determined by the
spin-orbit Hamiltonian Hso (included in H0 ), which in most cases can be written
in the form (see Condon and Shortley, 1935)
·S
Hso = ζ(βLS) L , (3.59)
·S
where ζ is a constant having the dimensions of energy. Writing L in the form
= 1 J 2 − L2 − S 2 ,
·S
L
2
we have1
Thus the whole problem of finding the energy levels of an atom described by the
L-S coupling scheme, embedded in a magnetic field, reduces to the diagonalization
of a set of matrices of the form
From Eq. (3.57), using the analytical expressions of the 3-j and 6-j symbols
(Eqs. (2.26) and (2.36)), we have that the only non-zero matrix elements are those
of the form
1 L+1
L ζ + µ0 B
λ1 = M
2 L + 12
for M = ±(L + 1/2), and
2
1 1 1
λ1,2 = − ζ + µ0 B M ± ζ2 L+ + 2ζµ0 BM + (µ0 B)2
4 2 2
ATOMIC SPECTROSCOPY 101
Fig.3.5. The energy levels of the term 2 P plotted as functions of the magnetic field strength. The
energy E (vertical scale) and the magnetic energy µ0 B (horizontal scale) are both normalized to
the fine-structure energy ζ (supposed positive). For an inverted multiplet (ζ < 0) the graph is
simply reversed about the E = 0 axis.
for M = ±(L + 1/2). In all the other cases, the diagonalization can be performed
only by numerical methods.
Figures 3.5 to 3.9 illustrate the behavior of the energy eigenvalues of the most
common L-S terms as a function of the magnetic field strength, parameterized
through the quantity γ defined by
µ0 B
γ= . (3.63)
ζ
For γ 1 the eigenvalues spread out linearly from their degenerate, zero-magnetic
field value (Zeeman effect regime). For intermediate values of γ, the eigenvalues
start crossing each other and the linearity with the magnetic field is, in general, lost
(incomplete Paschen-Back effect regime); finally, for large values of γ (γ 1), the
eigenvalues behave again linearly with γ, as shown in Fig. 3.10 (complete Paschen-
Back effect regime).
In this last regime, the spin-orbit interaction can be considered as a perturbation
in comparison with the magnetic interaction. On the other hand, the magnetic
Hamiltonian is diagonal on the basis |βLSMLMS , and the eigenvalues are given
by
βLSML MS |HB |βLSML MS = µ0 B (ML + 2MS ) ,
which explains their linear behavior with B for γ 1. As the magnetic field
increases from γ 1 to γ 1, the energy eigenvectors gradually evolve from
102 CHAPTER 3
Fig.3.10. For large values of the magnetic field, the energy eigenvalues spread out linearly. This
figure is just an enlargement of Fig.3.5 and is intended to show the behavior of the eigenvalues
for γ 1.
the form |βLSJM to the form |βLSML MS . In other words, the magnetic field
induces a gradual basis-transformation on the energy eigenstates.
As apparent from Figs. 3.5-3.9, the incomplete Paschen-Back effect regime is
characterized by several ‘crossings’ of the energy eigenvalues, each corresponding
to a well-defined value of the γ parameter. It will be shown in Sect. 10.18 that
such level-crossings induce important phenomena in resonance scattering. Table 3.5
gives, for the most common L-S terms, the values of γ relative to each level-crossing,
and the corresponding M -values of the levels involved. To illustrate how this table
should be used, consider the 3p 2 P term of NaI: from the tables of Moore (1949)
we get a value for the constant ζ equal to 11.4642 cm−1 , whence we deduce that
level-crossing takes place for B = 1.64 × 105 G and for B = 2.46 × 105 G.
In some cases, and particularly for light atoms such as He, the spin-orbit inter-
action cannot be described by the simple expression given in Eq. (3.59). In these
cases the energy levels in the presence of a magnetic field can still be obtained by
diagonalizing the matrix given in Eqs. (3.61), but the actual values EβLS (J) are
now to be found in tables of atomic energy levels (see e.g. Moore, 1949, 1952, 1958)
or in the specialized literature.
Let’s now consider how the splitting of the energy levels due to the magnetic
field affects, in the general case of the Paschen-Back regime, the spectral lines
originating in the transition between two given terms. The main properties of the
Paschen-Back patterns resulting from the electric-dipole interaction are described
ATOMIC SPECTROSCOPY 105
TABLE 3.5
Level-crossings for the most common L-S terms. The magnetic field is expressed through the
parameter γ defined in the text. The M -values refer to a normal multiplet (ζ > 0). For inverted
multiplets they must be changed in sign.
Term γ M M Term γ M M
2P 0.667 − 32 1
2
3P 0.45 −2 1
1. − 32 − 12 0.616 −2 0
2D 0.6 − 52 3
2
0.707 −1 1
0.75 − 52 1
2
0.859 −1 0
0.866 − 32 3
2
1.185 −1 0
1. − 52 −2 1
1.217 −2 0
1.257 − 32 1
2
1.5 −2 −1
1.5 − 52 −2 3
2.210 0 1
1.591 − 12 3
2
3D 0.476 −3 2
3. − 32 − 12 0.576 −3 1
2F 0.571 − 72 5
2
0.611 −1 2
0.667 − 72 3
2
0.724 −2 1
0.707 − 52 5
2
0.730 −3 0
0.8 − 72 1
2
0.786 −2 1
0.862 − 52 3
2
0.856 −1 2
0.928 − 32 5
2
0.930 −2 0
1. − 72 − 12 1. −3 −1
1.106 − 52 1
2
1.100 −2 0
1.225 − 32 3
2
1.157 −3 1
1.333 − 72 − 32 1.253 −1 1
1.356 − 12 5
2
1.287 −2 −1
1.562 − 52 − 12 1.370 −3 0
1.840 − 32 1
2
1.463 0 2
2. − 72 −2 5
1.667 −3 −1
2.174 − 12 3
2
1.760 −1 1
2.562 1
2
5
2
1.797 −2 1
3. − 52 − 32 1.813 −2 −1
6. − 32 − 12 2. −3 −2
2.5 −1 0
2.600 −2 0
4.773 0 1
106 CHAPTER 3
below.
First of all, since the projection M of the total angular momentum on the mag-
netic field direction is a good quantum number, the individual components can be
characterized by the value ∆M = (M − M ), where M refers to the lower level and
M to the upper one. Owing to the selection rule (3.15) we thus obtain that – in
strict analogy with the Zeeman effect – the components can be divided into three
distinct groups: σr (∆M = −1), π (∆M = 0), and σb (∆M = +1).
As far as the strengths of the components are concerned, it can be shown that
they are proportional to the quantities
M
SqjM,j = | βLSjM |rq |β L S j M |2 (q = −1, 0, +1) ,
where the unprimed quantum numbers refer to the lower level, the primed ones to
the upper level, and where rq are the spherical components of the position operator
of the optical electrons.
Expanding the eigenvectors as in Eq. (3.58), and taking into account that the
coefficients CJj are real, one obtains
M
SqjM,j = CJj (βLS, M ) CJj (βLS, M ) CJj (β L S , M ) CJj (β L S , M )
JJ J J
∗
× βLSJM |rq |β L S J M βLSJ M |rq |β L S J M .
Evaluating the matrix elements through the Wigner-Eckart theorem (Eq. (2.96)),
and noting that rq is an operator which acts only on the orbital part of the eigen-
vector, one gets from Eq. (2.108)
SqjM,j M =
= CJj (βLS, M ) CJj (βLS, M ) CJj (β L S, M ) CJj (β L S, M )
JJ J J
× (2L + 1) (2J + 1)(2J + 1)(2J + 1)(2J + 1)
! !
L L 1 L L 1
×
J J S J J S
J J 1 J J 1
× | βL r β L |2 δSS ,
−M M −q −M M −q
∆L = 0, ±1 L = 0 L = 0
∆S = 0
∆M = 0, ±1 .
ATOMIC SPECTROSCOPY 107
Summation over all the possible transitions gives (see Eq. (3.62b))
!2
jM,j M L L 1
Sq = (2L + 1)(2J + 1)(2J + 1)
J J S
jj MM JJ MM
2
J J 1
× | βL r β L |2 ,
−M M −q
This property makes it possible to define the normalized strengths in the form
SqjM,j M =
3
= C j (βLS, M ) CJj (βLS, M ) CJj (β L S, M ) CJj (β L S, M )
2S + 1 J
JJ J J
× (2J + 1)(2J + 1)(2J + 1)(2J + 1)
! !
L L 1 L L 1
×
J J S J J S
J J 1 J J 1
× , (3.64)
−M M −q −M M −q
with
M
SqjM,j =1 (q = −1, 0, +1) .
jj MM
so that
!2 2
M 3 L L 1 J J 1
SqJM,J = (2J + 1)(2J + 1) ,
2S + 1 J J S −M M −q
and summing over the degeneracy parameters M and M we obtain the usual
expression for the relative strengths of a fine-structure multiplet
!2
J,J JM,J M (2J + 1)(2J + 1) L L 1
S = Sq = . (3.65)
2S + 1 J J S
MM
108 CHAPTER 3
∆νq = −q νL , (3.66)
jj
with ∆νMM the frequency shift of the component corresponding to the transition
between the lower state |βLSjM and the upper state |β L Sj M from the
unperturbed frequency of the transition between the two terms (βLS) and (β L S).
To prove Eq. (3.66) we substitute into Eq. (3.67) the strengths of Eq. (3.64) and
the frequency shifts
jj λj (β L S, M ) − λj (βLS, M )
∆νMM = , (3.68)
h
where λj , λj are the eigenvalues of the matrix (3.61) for the lower and the upper
term, respectively. We then perform the sums over j and j using Eqs. (3.62b) and
(3.62c), to obtain
1 3
∆νq = (2J + 1)(2J + 1)(2J + 1)(2J + 1)
h 2S + 1
MM JJ J J
! !
L L 1 L L 1
×
J J S J J S
J J 1 J J 1
×
−M M −q −M M −q
It is easy to prove that the spin-orbit Hamiltonian gives zero contribution. Indeed,
noting that this Hamiltonian is diagonal with respect to the quantum numbers J
and M , and that its matrix elements are independent of M , and performing the
sums over M and M via Eq. (2.23a), we have for this contribution
!2
1 (2J + 1)(2J + 1) L L 1
∆νq so =
h (2S + 1) J J S
JJ
Summation over J for the first term, and over J for the second term gives, with
the use of Eq. (2.39)
1 2J + 1
∆νq so = β L SJ M |Hso |β L SJ M
h (2L + 1)(2S + 1)
J
2J + 1
− βLSJM |Hso |βLSJM .
(2L + 1)(2S + 1)
J
The first sum is simply the average energy of the fine-structure levels of the upper
term, while the second sum is the same quantity for the lower term. It can be
easily proved that both these quantities are identically zero when the spin-orbit
Hamiltonian is described by Eq. (3.59).1
The only non-vanishing contribution to Eq. (3.69) is due to the magnetic field. To
evaluate this contribution, we substitute the matrix elements given by Eq. (3.57)
and perform the sums over M and M by means of Eq. (2.42). After some tedious
Racah-algebra calculations which involve the use of Eqs. (2.38), (2.48), (2.55), and
(3.51), we obtain that only the first term in the square bracket of the matrix
element (3.57) gives a non-zero contribution, and we get the expression
!2
3 νL L L 1 1 1 1
∆νq = (−1)J+J +q (2J + 1)(2J + 1)
2S + 1 J J S q −q 0
JJ
!
1 1 1
× J (J + 1)(2J + 1)
J J J
!
1 1 1
+ J(J + 1)(2J + 1) .
J J J
Evaluating the 6-j symbols in the square bracket by Eq. (2.36d) and the 3-j symbol
by Eq. (2.26d), and performing the sums over J and J via Eq. (2.39), we finally
obtain Eq. (3.66).
The theorem just proved shows an interesting property of the Paschen-Back effect
which can be summarized by the following statement: The frequency shifts of the
centers of gravity of the σr , σb , and π components in the Paschen-Back effect regime
are those typical of the normal Zeeman effect between two spinless levels. In other
words, given a fine-structured lower term (βLS) and a fine-structured upper term
1
To prove this statement, one has just to recall the sum rules for integers
%n 1
%n 1
%n 1
k=1
k= 2
n (n + 1), k=1
k2 = 6
n (n + 1)(2n + 1), k=1
k3 = 4
n2 (n + 1)2 .
If, on the other hand, the spin-orbit Hamiltonian cannot be written in the form (3.59), the quantity
(∆νq )so may indeed differ from zero. In such cases, it represents a constant shift (independent
of q) that affects the unperturbed frequency of the transition. In other words, Eq. (3.66) still
holds provided Eq. (3.68) is replaced by
jj 1
∆νM M
= λ (β L S, M ) − λj (βLS, M ) − (∆νq )so .
h j
110 CHAPTER 3
(β L S), the centers of gravity of the σr , σb , and π components have the same
frequencies as the individual components that would originate from the transition
between a spinless lower level (βL) and a spinless upper level (β L ), both split
according to the normal Zeeman effect. This result is what should be expected
from the principle of spectroscopic stability,1 as it can be inferred from considering
the limit of negligible fine structure.
It is well-known that hyperfine structure results from the interaction of the elec-
tronic cloud with the atomic nucleus. The energy involved in this interaction is
approximately one thousand times smaller than the energy relative to the spin-
orbit interaction, so that the hyperfine-structure components of a spectral line
have typical wavelength separations of a few mÅ.
In most cases (and particularly for lines originating from astrophysical plasmas),
these components remain unresolved, because their separation is much smaller than
the line width due to different broadening mechanisms. For this reason the hyper-
fine structure of spectral lines is often neglected, especially in usual astrophysical
applications.
However, when polarization phenomena are involved, the situation is substan-
tially different, and hyperfine structure generally plays an important role (see
Sect. 9.23). We describe here the effect of a magnetic field on a hyperfine-structured
line, in order to state some basic properties of the resulting pattern.
We consider a particular isotope having nuclear spin I, and introduce the rep-
resentation |αJIF f to describe its energy eigenvectors. In this notation, apart
from the symbols already used, F is the total angular momentum (electronic plus
nuclear: F = J + I ) quantum number, while f represents its projection on the
z-axis of the coordinate system (f = −F, −F + 1, . . . , F ).
The hyperfine-structure interaction energy can be expressed as an infinite series
of electric and magnetic multipoles (see for instance Kopfermann, 1958). In most
cases, a very good approximation to the observed energy intervals is obtained by
retaining only the first two terms in the multipole series; these are the magnetic-
dipole and electric-quadrupole terms, which are given by
A(α, J, I)
αJIF f |Hhfs
(1)
|αJIF f = δF F δf f K
2
αJIF f |Hhfs
(2)
|αJIF f = δF F δf f B(α, J, I)
4
× K(K + 1) − J(J + 1)I(I + 1) , (3.70)
3
where A(α, J, I) and B(α, J, I) are the magnetic-dipole and electric-quadrupole hy-
perfine structure constants relative to a given atomic level of a given isotope, while
1
A formulation of this principle is given in Sect. 7.10.
ATOMIC SPECTROSCOPY 111
K is defined by
K = F (F + 1) − J(J + 1) − I(I + 1) . (3.71)
An extensive table of experimental A and B values can be found in Brix and
Kopfermann (1952). From such values, and from Eqs. (3.70), the energy of the
various hyperfine-structure sublevels can be easily computed. The sublevels are
characterized by the total angular momentum F and are degenerate with respect
to f .
If a magnetic field is present, so weak as to produce a splitting much smaller
than the energy differences between the various hyperfine-structure sublevels, its
effect can be deduced from perturbation theory, by diagonalizing the magnetic
Hamiltonian HB of Eq. (3.3) on the degenerate basis |αJIF f .1 Aligning the
z-axis of the reference system with the direction of the magnetic field, the matrix
elements to be evaluated are of the form
Let us evaluate the more general matrix element (that will be needed later)
between two states having different F -values. Writing (Lz + 2Sz ) in the form
(Jz + Sz ) and bearing in mind the definition of the Landé factor (Eq. (3.7)), we
obtain from Eqs. (2.96) (Wigner-Eckart theorem) and (2.108)
1 F (F + 1) + J(J + 1) − I(I + 1)
ghfs (F ) = (F = 0) . (3.73)
2 F (F + 1)
Thus for weak magnetic fields any hyperfine level of total angular momentum F
splits up into (2F + 1) magnetic sublevels, each characterized by its own f -value.
The splitting is proportional to the magnetic field and to the product of the usual
Landé factor gαJ times a Landé factor ghfs (F ) which depends on the particular
hyperfine level. This is the Zeeman effect for hyperfine structure.
1
The Hamiltonian describing the direct interaction between the magnetic field and the nuclear
spin can be safely neglected, since the corresponding energy is about 10 3 times smaller than the
energy involved in the interaction between the magnetic field and the electronic cloud.
112 CHAPTER 3
If the magnetic field is so strong that the Zeeman splitting is of the same order as
the energy difference between different hyperfine levels, perturbation theory cannot
be applied any longer, and one has to go back to the simultaneous diagonalization
of the hyperfine-structure and magnetic Hamiltonians, whose matrix elements are
given in Eqs. (3.70) and (3.72). The situation here is quite similar to that of the
Paschen-Back effect (Sect. 3.4): since the matrix elements are diagonal with respect
to f , the overall Hamiltonian can be factored into [2(I + J) + 1] submatrices, each
characterized by a particular f -value.1 Thus the eigenvalues and eigenvectors are
of the form
λi (αJI, f )
|αJIif = CFi (αJI, f ) |αJIF f (3.74)
F
(where the CFi coefficients can be chosen real) and can be found, in general,
by numerical diagonalization of the total Hamiltonian. Figure 3.11 illustrates
the behavior of the energy eigenvalues for the hyperfine-structure components of
the level 3 2 P3/2 of NaI as a function of the magnetic field (the values of the
hyperfine-structure constants are from Figger and Walther, 1974). As in the case
of the Paschen-Back effect, there are several level-crossings between the hyperfine-
structure magnetic sublevels. Such level-crossings induce important phenomena in
resonance scattering (see Sect. 10.22).
The strengths and the splittings of the various hyperfine-structure components
can be computed in strict analogy with the case of the Paschen-Back effect. For
the normalized strengths of the hyperfine components of the transition connecting
a lower level (αJ) with an upper level (α J ) one obtains
3
Sqif,i f = CFi (αJI, f ) CFi (αJI, f ) CFi (α J I, f ) CFi (α J I, f )
2I + 1
F F F F
× (2F + 1)(2F + 1)(2F + 1)(2F + 1)
! !
J J 1 J J 1
×
F F I F F I
F F 1 F F 1
× , (3.75)
−f f −q −f f −q
with
Sqif,i f = 1 (q = −1, 0, +1) .
ii f f
Fig.3.11. Energies of the hyperfine sublevels of the level 3 2 P3/2 of NaI as functions of the magnetic
field strength.
Similarly to the Paschen-Back effect, the strengths defined in Eq. (3.75) have
some important properties, which can be proved by observing that the coefficients
CFi (αJI, f ) satisfy the following relations, similar to Eqs. (3.62)
CFi (αJI, f ) CFi (αJI, f ) = δii (3.76a)
F
CFi (αJI, f ) CFi (αJI, f ) = δF F (3.76b)
i
CFi (αJI, f ) CFi (αJI, f ) λi (αJI, f ) =
i
= αJIF f |Hhfs
(1) (2)
+ Hhfs + HB |αJIF f . (3.76c)
The properties of the strengths are the following:
a) In the absence of magnetic fields, F is a good quantum number, and the eigen-
vectors |αJIif converge towards the states |αJIF f that are degenerate with
respect to f ; the coefficients CFi are given by
CFi (αJI, f ) = δiF ,
114 CHAPTER 3
so that we obtain
!2 2
f 3 J J 1 F F 1
SqF f,F = (2F + 1)(2F + 1) ,
2I + 1 F F I −f f −q
and summing over the degeneracy parameters f and f one gets the usual expression
for the relative strengths of hyperfine-structure multiplets
!2
f (2F + 1)(2F + 1) J J 1
S F,F = SqF f,F = .
(2I + 1) F F I
ff
where νL is the Larmor frequency, ∆νfiif is the frequency shift of the component
corresponding to the transition between the lower state |αJIif and the upper
state |α J Ii f from the unperturbed frequency of the transition between the
two levels (αJI) and (α J I),
1 1
ḡ = (g + gα J ) + (gαJ − gα J ) [J(J + 1) − J (J + 1)] .
2 αJ 4
Comparison with Eq. (3.44) shows that the factor ḡ is just the effective Landé
factor that one would obtain for the transition between the levels (αJ), (α J ) if
hyperfine structure were not present.
The proof of Eq. (3.77) is rather tedious and quite similar to the proof of the
analogous equation for the Paschen-Back effect. For this reason it will not be
given here and is left as an exercise to the reader.1 Equation (3.77) is particularly
important, and it can be expressed by the following statement: The frequency shifts
of the centers of gravity of the σr , σb , and π components of a hyperfine-structured
line are the same as those resulting from the same line without hyperfine structure.
This result is what should be expected from the principle of spectroscopic stability.
1
To prove that the hyperfine-structure Hamiltonian (Eqs. (3.70)) gives no contribution to
the quantity ∆νq , one needs the sum rules
%n %n
k=1
k4 = 1
30
n(n + 1)(2n + 1)(3n2 + 3n − 1), k=1
k5 = 1
12
n2 (n + 1)2 (2n2 + 2n − 1).
To evaluate the contribution of the magnetic Hamiltonian (Eq. (3.72)), the sum rule (2.41) must
be used (instead of (2.48), employed for the Paschen-Back calculation).
ATOMIC SPECTROSCOPY 115
Fig.3.12. Logarithm of the strengths of π (upper) and σr (lower) components against wavelength
for the line EuII λ 4205. The value of the magnetic field is 6 kG. Only the components having
strength larger than 10−8 are shown. The number of components of each type is approximately
400.
The numerical calculation of the strengths and splittings of the various hyperfine-
structure components of a spectral line requires the diagonalization of the total
Hamiltonian for the lower and for the upper level followed by the evaluation of
expressions (3.75). If the element has more than one isotope, the calculations must
be repeated for each of them and the splittings must be adjusted to allow for isotope
shift.
Landi Degl’Innocenti (1978a) has presented a computer program to perform such
calculations. The program has then been applied to the line EuII λ 4205 using the
data for the hyperfine-structure constants given by Krebs and Winckler (1960).
The result is shown in Fig. 3.12 for a magnetic field of 6 kG. The number of
components is indeed striking, with an intricate pattern recalling a ‘wild forest’.
The implications of such a pattern on the determination of the europium abundance
in magnetic stars have been discussed by Landi Degl’Innocenti (1975).
When an atomic system is excited through a physical process which, for any reason,
is not spatially isotropic, the various magnetic sublevels of the system, degenerate
or quasi-degenerate with respect to energy, are, in general, not evenly populated;
moreover, they are characterized by definite phase relations, and the atomic system
is said to be polarized. A typical example occurs when the atomic system is excited
by a unidirectional (or polarized) radiation beam, or by collisions with a collimated
beam of fast particles.
116 CHAPTER 3
In such situations, the description of the atomic excitation in terms of the usual
concept of populations of the atomic levels (or sublevels) is no longer adequate,
and one must resort to the more sophisticated concept of atomic density operator .
The density operator can be defined for any assigned quantum system. If pα
(pα ≥ 0) is the probability of finding the system in the pure state |ψ (α) (normalized
to 1), the density operator ρ is defined by
ρ= pα |ψ (α) ψ (α) | . (3.79)
α
Introducing a complete and orthogonal basis of unit vectors {|uk } for the Hilbert
space spanned by the vectors |ψ (α) , one has
um | ρ |un = ρmn = pα um |ψ (α) ψ (α) |un ,
α
where ρmn are the matrix elements of the density operator, also called density-
matrix elements.
Some important properties of the density operator follow directly from its defi-
nition:
a) The density operator is Hermitian,
ρ = ρ† ,
b) The density operator is positive definite. In fact, for an arbitrary state vector
|φ we have
φ| ρ |φ = pα | ψ (α) |φ |2 ≥ 0 ;
α
Tr (ρ) = 1 . (3.84)
f) The expectation value of any dynamical variable is equal to the trace of the
product of the density operator times the operator associated to the variable
A = pα ψ (α) |A|ψ (α) = Tr (ρA) . (3.86)
α
g) If a system consists of two subsystems a and b, the trace of the density operator
of the system over the states of one subsystem is the density operator of the other
one. Indeed, let {| an } and {| bm } be two complete bases for the Hilbert spaces
118 CHAPTER 3
On the other hand, if ρ(a) is the density operator of subsystem a, we also have
(a)
O = Tr(a) ρ(a) O(a) = an | ρ(a) O(a) | an .
n
As far as the density-matrix elements are concerned, their time evolution depends
on the particular basis {|un }. If we choose the basis of the energy eigenvectors,
or, in other words, if the vectors |un are such that
H|un = En |un ,
d d 2π
ρ = u | ρ |un = um | [H, ρ] |un = −2πi νmn ρmn ,
dt mn dt m ih
ATOMIC SPECTROSCOPY 119
Em − En
νmn = .
h
d 2πi , -
ρmn = − Hmp ρpn − Hpn ρmp ,
dt h p
where
Hij = ui |H|uj .
For an atomic system, the most natural basis for defining the matrix elements
of the density operator is the basis of the eigenvectors of the angular momentum,
although in some cases this is not the basis of the energy eigenvectors (see for
instance the discussion of the Paschen-Back effect in Sect. 3.4). On this basis, the
elements of the density matrix are
For any other incidence direction, or any other polarization of the absorbed
photon, the atom is excited to an upper level which is a linear combination of the
form
c−1 |1 −1 + c0 |1 0 + c1 |1 1 = cM |1M ,
M
where the three coefficients cM depend on the direction and polarization of the
absorbed photon. In this case the coherences are non-zero and take the simple
form
1M | ρ |1M = cM c∗M .
The physical meaning of coherences is just contained in this simple example. We
will see in Sect. 5.13 how a qualitative analogy can be drawn between atomic
coherences and interferences in classical oscillators.
Although coherences can be defined for any pair of magnetic sublevels, no matter
how large is the energy difference between them, the most important are those
between degenerate or quasi-degenerate sublevels. For such coherences one can
introduce more compact symbols to simplify the notations; for instance, for the
magnetic sublevels pertaining to an assigned term α one can write
For the coherences between magnetic sublevels of the same J-level the following
notation can be used
The atomic density matrix can be defined in the presence of hyperfine structure as
well. With obvious notations, for the matrix elements between different hyperfine-
structure magnetic sublevels one can write in general
and for the coherences between hyperfine-structure sublevels belonging to the same
J-level
αJIF f | ρ |αJIF f = ραJI (F f, F f ) . (3.92)
1
The more compact notation ραJ (M ) can be conveniently used in the physical situations
where all the off-diagonal elements vanish.
ATOMIC SPECTROSCOPY 121
Important relations can be established between the different quantities here in-
troduced, like for instance between those defined in Eqs. (3.91) and those defined
in Eq. (3.92). We can consider an atom with hyperfine structure as consisting of
two separate parts, characterized by the eigenstates |αJM and |IMI (where MI
is the projection of the nuclear spin along the z-axis) and by the density operators
ρ(J) and ρ(I) , respectively. Since the quantum number I of the atomic system is
fixed, we have from Eq. (3.87)
IMI | ρ |IMI = ρ(J) ,
MI
where ρ is the density operator of the whole system. Taking the matrix element of
these operators between the eigenstates |αJM and |αJM we get
αJM | ρ(J) |αJM = αJIM MI | ρ |αJIM MI ,
MI
which relates the density-matrix elements in Eqs. (3.91) to those in Eq. (3.92). In
the following we will use this kind of relations by dropping the apex (J), and we
will simply write
ραJ (M, M ) = αJIM MI | ρ |αJIM MI .
MI
× αJIF f | ρ |αJIF f ,
× ραJI (F f, F f ) . (3.93)
where ML is the projection of the orbital angular momentum on the z-axis, and
then introducing the inner structure due to the spin, one obtains
ρβL (ML , ML ) = (−1)M−M (2J + 1)(2J + 1)
JJ MM MS
L S J L S J
×
ML MS −M ML MS −M
We want to remark that the density-matrix elements on the basis of the eigen-
vectors of the angular momentum depend on the reference system chosen to define
such eigenvectors. If R is the rotation that brings a reference system (the ‘old’
one) into another reference system (the ‘new’ one), the eigenvectors of the angular
momentum in the new system are connected with those in the old system by the
relation
|JM new = D(R) |JM old ,
so that the transformation law for the density-matrix elements is the following
ρ (αJM, α J M ) =
new
∗ J
= DN
J
M (R) DN M (R) ρ (αJN, α J N ) . (3.95)
old
NN
As shown by Eq. (3.95), the transformation law for the density-matrix elements
on the basis of the eigenvectors of the angular momentum involves the product of
two rotation matrices. We can however construct – similarly to Sect. 2.7 – linear
combinations of these matrix elements whose transformation law involves just one
rotation matrix. By so doing we obtain the irreducible spherical components of the
density matrix, which are often referred to as the multipole moments of the density
matrix or the spherical statistical tensors.
ATOMIC SPECTROSCOPY 123
The proof of Eq. (3.98) will not be given here, being quite similar to an √ analogous
proof given in Sect. 2.7. We just want to remark that the factor 2K + 1 in
Eq. (3.97) is inessential: in fact, if this factor is replaced by an arbitrary function
of K, Eq. (3.98) is still satisfied. The definition given in Eq. (3.97) is the same as
Omont’s (1977), but different authors give different definitions for the multipole
moments, and no definition has a special advantage.
Comparison of Eqs. (3.98) and (2.78) shows that the transformation law for the
multipole moments involves the complex conjugate of the rotation matrix, whereas
the transformation law for irreducible spherical tensors involves the rotation matrix
itself. Thus the multipole moments – as defined in Eq. (3.97) – are not, strictly
speaking, irreducible spherical tensors. A different definition could be given by
substituting ρ (α J M , αJM ) for ρ (αJM, α J M ) in Eq. (3.97), and the multipole
moments so defined would indeed satisfy the transformation law for irreducible
spherical tensors. However, this alternative definition has no advantages over ours.
Equation (3.97) can be easily inverted using the orthogonality relations of the
3-j symbols. From Eq. (2.23b) one obtains
√
ρ (αJM, α J M ) = (−1)J−M 2K + 1
KQ
J J K
× ρK
Q (αJ, α J ) . (3.99)
M −M −Q
The conjugation property of the multipole moments follows from Eq. (3.97) and
from the Hermitian character of the density operator
∗ −Q
ρK
Q (αJ, α J ) = (−1)
J−J
ρK
−Q (α J , αJ) . (3.100)
Similarly to Sect. 3.6, one can introduce shorthand notations for the multipole
moments when dealing with restricted subspaces. For instance, for the multipole
moments relative to the magnetic sublevels of a term one can write
ρK αK
Q (αJ, αJ ) = ρQ (J, J ) ,
1
Note that for any atom (or ion) the quantum numbers J and J are both integers or both
half-integers, thus the rank K of the multipole moments is always an integer.
124 CHAPTER 3
ρK K
Q (αJ, αJ) = ρQ (αJ)
√ J J K
= (−1)J−M 2K + 1 ραJ (M, M ) , (3.101)
M −M −Q
MM
The analytical expressions for the most common multipole moments, as functions
of the ordinary density-matrix elements, are given in Tables 3.6 and 3.7.
The multipole moments can be also defined for a J-level having hyperfine struc-
ture. With self-evident notations, we have
√
ρQ (F, F )
αJI K
= (−1)F −f 2K + 1
ff
F F K
× ραJI (F f, F f ) . (3.103)
f −f −Q
Relations similar to those in Eqs. (3.93) and (3.94) can be easily established for
the multipole moments. From Eq. (3.93), after some Racah algebra involving the
contraction of 3-j symbols (Eq. (2.42)), one obtains
ρK
Q (αJ) = (−1)J+I+F +K
(2F + 1)(2F + 1)
FF
!
F F K
× ρQ (F, F )
αJI K
. (3.104)
J J I
In a similar way, from Eq. (3.94) the following relation can be proved
ρK
Q (βL) = (−1)L+S+J +K
(2J + 1)(2J + 1)
JJ
!
J J K
× ρQ (J, J )
βLS K
, (3.105)
L L S
where ρK
Q (βL), the multipole moment for a spinless L-term, is defined by
√ L L K
ρK
Q (βL) = (−1)L−ML 2K + 1 ρβL (ML , ML ) .
ML −ML −Q
ML ML
TABLE 3.6
Analytical expressions for the multipole moments ρK Q (αJ) as functions of the matrix elements
ραJ (M, M ). The index (αJ) is suppressed for conciseness. The multipole moments for negative
values of Q can be obtained through the relation ρK Q K ∗
−Q (αJ) = (−1) ρQ (αJ) .
J =0 ρ00 = ρ(0, 0)
J= 1
2
ρ00 = √1 ρ( 12 , 12 ) + ρ(− 12 , − 12 )
2
ρ10 = √1 ρ( 12 , 12 ) − ρ(− 12 , − 12 )
2
ρ11 = −ρ( 12 , − 21 )
J =1 ρ00 = √1 ρ(1, 1) + ρ(0, 0) + ρ(−1, −1)
3
ρ10 = √1 ρ(1, 1) − ρ(−1, −1)
2
ρ22 = √1 ρ( 32 , − 12 ) + ρ( 12 , − 32 )
2
ρ30 = 1
√ ρ( 32 , 32 ) − 3 ρ( 12 , 12 ) + 3 ρ(− 12 , − 12 ) − ρ(− 32 , − 32 )
2 5
3 1 √ 1 1
ρ31 = − √1 ρ( 2 , 2 ) − 3 ρ( 2 , − 2 ) + ρ(− 12 , − 32 )
5
ρ32 = √1 ρ( 32 , − 12 ) − ρ( 12 , − 32 )
2
ρ33 = −ρ( 32 , − 23 )
J =2 ρ00 = √1 ρ(2, 2) + ρ(1, 1) + ρ(0, 0) + ρ(−1, −1) + ρ(−2, −2)
5
ρ10 = √1 2 ρ(2, 2) + ρ(1, 1) − ρ(−1, −1) − 2 ρ(−2, −2)
√
10
√ √ √
ρ11 = − √1 2 ρ(2, 1) + 3 ρ(1, 0) + 3 ρ(0, −1) + 2 ρ(−1, −2)
10
ρ20 = √1 2 ρ(2, 2) − ρ(1, 1) − 2 ρ(0, 0) − ρ(−1, −1) + 2 ρ(−2, −2)
14
√ √
ρ21 = − √1 6 ρ(2, 1) + ρ(1, 0) − ρ(0, −1) − 6 ρ(−1, −2)
√
14
√ √
ρ22 = √1 2 ρ(2, 0) + 3 ρ(1, −1) + 2 ρ(0, −2)
7
126 CHAPTER 3
TABLE 3.6
(continued)
J =2 ρ30 = √1 ρ(2, 2) − 2 ρ(1, 1) + 2 ρ(−1, −1) − ρ(−2, −2)
√
10
√ √ √
ρ31 = − √1 3 ρ(2, 1) − 2 ρ(1, 0) − 2 ρ(0, −1) + 3 ρ(−1, −2)
10
ρ32 = √1 ρ(2, 0) − ρ(0, −2)
2
ρ33 = − √1 ρ(2, −1) + ρ(1, −2)
2
ρ40 = √1 ρ(2, 2) − 4 ρ(1, 1) + 6 ρ(0, 0) − 4 ρ(−1, −1) + ρ(−2, −2)
70
√ √
ρ1 = − √
4 1
ρ(2, 1) − 6 ρ(1, 0) + 6 ρ(0, −1) − ρ(−1, −2)
14
√ √ √
ρ42 = √1 3 ρ(2, 0) − 2 2 ρ(1, −1) + 3 ρ(0, −2)
14
ρ43 = − √1 ρ(2, −1) − ρ(1, −2)
2
by TQK its expectation value, and referring, for simplicity, to an atom devoid of
hyperfine structure, we have
K , -
TQ = Tr ρ TQK = α J M | ρ TQK |α J M .
α J M
and applying the Wigner-Eckart theorem (Eq. (2.96)) and Eq. (3.99), one obtains
.
K 2J + 1
TQ = αJ T K α J ρK ∗
Q (αJ, α J ) , (3.106)
2K + 1
αα JJ
which shows that the expectation value of a spherical tensor operator is strictly
connected with the multipole moments of the same rank.
Let’s now consider the physical meaning of the multipole moments. For the
0-rank moment, we have from Eq. (3.101)
1
ρ00 (αJ) = √ ρ (M, M ) , (3.107)
2J + 1 M αJ
TABLE 3.7
ρ22 ( 12 , 32 ) = −ρ( 12 21 , 32 − 32 )
√ 1 1 3 1
ρ21 ( 12 , 32 ) = 12 3 ρ( 2 2 , 2 − 2 ) − ρ( 12 − 12 , 3
2
− 32 )
ρ20 ( 12 , 32 ) = − √1 ρ( 12 1 3 1
,
2 2 2
) − ρ( 12 − 12 , 32 − 12 )
2
√
ρ2−1 ( 12 , 32 ) = 1
2
ρ( 12 1 3
,
2 2
3
2
) − 3 ρ( 12 − 12 , 32 12 )
ρ2−2 ( 12 , 32 ) = ρ( 12 − 12 , 3 3
2 2
)
√ √
J =1 ρ11 (1, 2) = √1 ρ(1 1, 2 0) + 3 ρ(1 0, 2 −1) + 6 ρ(1 −1, 2 −2)
10
√ √
J = 2 ρ10 (1, 2) = − √1 3 ρ(1 1, 2 1) + 2 ρ(1 0, 2 0) + 3 ρ(1 −1, 2 −1)
√
10
√
ρ1−1 (1, 2) = √1 6 ρ(1 1, 2 2) + 3 ρ(1 0, 2 1) + ρ(1 −1, 2 0)
10
√
ρ22 (1, 2) = − √1 ρ(1 1, 2 −1) + 2 ρ(1 0, 2 −2)
√
3
√
ρ21 (1, 2) = √1 3 ρ(1 1, 2 0) + ρ(1 0, 2 −1) − 2 ρ(1 −1, 2 −2)
6
ρ20 (1, 2) = − √1 ρ(1 1, 2 1) − ρ(1 −1, 2 −1)
√
2
√
ρ2−1 (1, 2) = √1 2 ρ(1 1, 2 2) − ρ(1 0, 2 1) − 3 ρ(1 −1, 2 0)
6
√
ρ2−2 (1, 2) = √1 2 ρ(1 0, 2 2) + ρ(1 −1, 2 1)
3
Fig.3.13. To get an intuitive grasp of the meaning of the orientation and alignment components of
the density matrix, we can regard the atomic angular momentum as having a uniform probability
of being found on the surface of the figures drawn above. For the sphere in panel (a), the atom
is neither oriented nor aligned; for the ellipsoid in panel (b) the atom is aligned but not oriented;
finally, for the ellipsoid in panel (c) the atom is both oriented and aligned. To have an example of
an atom oriented but not aligned we can go back to panel (a) and regard the angular momentum
as having more probability of being found on the upper hemisphere than on the lower one.
with
1
f (J) = ,
J(J + 1)(2J − 1)(2J + 1)(2J + 3)
the reduced matrix element αJ J 2 αJ having been evaluated via Eqs. (2.102)
and (2.36h).
The formulae now deduced show that the multipole moments of rank 2 are con-
nected with the average value of bilinear combinations of the angular momentum
components Jx , Jy , and Jz . These multipole moments are called the alignment
components of the density matrix. The word ‘alignment’, as opposed to ‘orienta-
tion’, is used in this context to mean that the atom may be regarded as aligned
with a particular straight line in space, irrespective, however, of which direction is
chosen on the line. In Fig. 3.13 we give an intuitive illustration of the meaning of
the multipole moments of rank 1 and 2.
Turning to multipole moments of higher rank, it is possible to show that those
of rank n are connected with the average values of polynomials of degree n in
the angular momentum components Jx , Jy , and Jz . No particular name has been
assigned to these moments.
130 CHAPTER 3
Another general property of multipole moments, which follows directly from the
definition (3.97), should be remarked: a moment of the form ρK Q (Q = 0) is a
linear combination of coherences between states characterized by projections of the
angular momentum differing by Q; a moment of the form ρK 0 is a linear combination
of populations.
Finally, some inequalities can be established for the multipole moments as a
consequence of Eq. (3.85). Taking the square modulus of Eq. (3.101) and summing
over K and Q we obtain
|ρK 2
Q (αJ)| = |ραJ (M, M )|2 ;
KQ MM
1
≥ |ραJ (M, M )|2 ,
2J + 1
MM
Similarly, for the multipole moments connecting two levels (αJ), (α J ) one obtains,
starting from Eq. (3.97)
2
|ρK
Q (αJ, α J )| ≤ (2J + 1)(2J + 1) ρ00 (αJ) ρ00 (α J ) .
KQ
√ J J K
ρK
0 (αJ) = 2K + 1 (−1)J−M
ραJ (M, M ) .
M −M 0
M
On the other hand, all the quantities ραJ (M, M ) are non-negative (see Eq. (3.81));
since their sum equals [2J + 1]1/2 ρ00 (αJ), we can write
3 3
3 J J K 33 0
|ρK
0 (αJ)| ≤ (2K + 1)(2J + 1) 33 ρ (αJ) ,
M̃ −M̃ 0 3 0
where M̃ is the M -value giving the largest absolute value for the 3-j symbol.
CHAPTER 4
Many of the physical phenomena involved in the generation and the transfer of po-
larized radiation in spectral lines can indeed be described by means of the classical
theories of the electron and of the radiation field. In Chap. 5 we will apply these
theories to obtain a simple derivation of the radiative transfer equations for polar-
ized radiation and of the law of resonance scattering in the presence of a magnetic
field.
However, only the most simplified cases are accurately described by the classical
theory. As far as the atomic system is concerned, the classical theory of the elec-
tron gives an appropriate representation only of a two-level atom with a normal
Zeeman triplet and an unpolarized ground level, so that its applicability is very lim-
ited. Similarly, the classical description of the radiation field prevents the correct
treatment of important phenomena such as, for instance, stimulated emission.
In the main body of this book the interaction between atomic systems and the
electromagnetic field will be described through a full quantum-mechanical formal-
ism: this will allow us to handle even the most complicated physical situations
without being obliged to introduce additional terms into the relevant equations by
using phenomenological or heuristic arguments. To this aim, we need however the
formalism of second quantization for the electromagnetic field, a formalism that we
are going to develop in this chapter.
We also want to remark that the formalism of second quantization for the radia-
tion field is a classical subsection of quantum field theory and has been fully treated
in excellent monographs (see for instance Dirac, 1935; Heitler, 1954; Akhiezer and
Berestetskii, 1965). The reason why these concepts are again developed in this
book is to establish on a firm basis a set of notations that will be employed in
the following. On the other hand, in view of the physical applications that will be
presented later, the formalism of second quantization needs not to be developed
in its general, relativistically invariant form. We will thus restrict attention to
the simpler non-relativistic formulation, while we refer the reader to the textbooks
quoted above for an introduction to invariant formalisms.
p2 mω 2 2
H(q, p) = + q , (4.1)
2m 2
where m and ω are the mass and the angular frequency of the oscillating particle,
respectively. The Hamilton equations give for the particle motion
dq ∂H(q, p) p
= =
dt ∂p m
dp ∂H(q, p)
=− = −mω 2 q.
dt ∂q
To solve these coupled equations we introduce, following Dirac (1935), two linear
combinations of the variables q and p of the form
where A is a real constant which, for the moment, does not need to be specified.
Taking the time derivative of these quantities we get the equations
da da∗
= −i ωa , = i ωa∗ ,
dt dt
which have the obvious solution
−iωt iωt
a = a0 e , a∗ = a∗0 e , (4.3)
with a0 to be determined from the initial conditions. Inversion of Eqs. (4.2) yields
the solution to the classical problem
i 1
q= (a − a∗ ) , p= (a + a∗ ) ,
2mωA 2A
with a and a∗ given by Eqs. (4.3).
The corresponding quantum-mechanical problem can be treated along the lines
of the Correspondence Principle. The Hamiltonian is still given by Eq. (4.1), but
q and p are now Hermitian operators obeying the commutation rule
[q, p] = ih̄ .
We now choose the constant A in such a way as to make this commutator equal to
unity; expressing a a† in terms of a† a we get
1
H = h̄ω a† a +
2
with
[a, a† ] = 1 . (4.4)
To find the eigenvalues and the eigenvectors of H, first of all we observe that if
|v is an eigenvector of H corresponding to the eigenvalue v,
H|v = v |v ,
we have
† H 1 v 1
v|a a|v = v| − |v = − v|v .
h̄ω 2 h̄ω 2
Since the left-hand side of this equality is the square modulus of the vector a |v ,
it must be non-negative, so that we obtain
1
v≥ h̄ω , (4.5)
2
which means that all the eigenvalues are larger than h̄ω/2.
Next we observe that, owing to the commutation relations
we have
Ha |v = (aH − h̄ω a)|v = (v − h̄ω) a |v ,
which means that the vector a |v is an eigenvector of the Hamiltonian H corre-
sponding to the eigenvalue (v − h̄ω).
Similarly, it can be easily shown that the vector a2 |v is an eigenvector of H
corresponding to the eigenvalue (v − 2h̄ω), and, in general, that the vector an |v is
an eigenvector corresponding to the eigenvalue (v−nh̄ω). The chain of eigenvectors
a |v , a2 |v , . . . , an |v , . . ., corresponding to smaller and smaller eigenvalues, must
however be limited, because otherwise property (4.5) would be violated. This
means that it must exist an eigenvector, which we denote by the symbol |0 , such
that
a |0 = 0 . (4.7)
For this eigenvector we have
† 1 1
H |0 = h̄ω a a + |0 = h̄ω |0 .
2 2
134 CHAPTER 4
0|an a†n |0 = n! .
1
|n = √ a†n |0 ,
n!
Consider the electromagnetic field enclosed in a cubic cavity having its sides equal
to L. In the absence of charges and currents, one can choose a special case of the
QUANTIZATION OF THE ELECTROMAGNETIC FIELD 135
Lorentz gauge where the scalar potential φ(r, t) is identically zero, so that the field
r, t) which obeys the wave equation
is described only by the vector potential A(
r, t) − 1 ∂2
∇2 A( A(r, t) = 0 . (4.9)
c2 ∂t2
r, t) = 0 .
div A( (4.10)
r, t) by the relations
The electric and magnetic fields are connected with A(
r, t) = − 1 ∂ A(
E( r, t)
c ∂t
r, t) = curl A(
B( r, t) . (4.11)
At any given time t the vector potential, as well as any other physical function of
r, can be expanded in Fourier series. If we require that the vector potential obey
the so-called periodicity conditions
y, z, t) = A(x
A(x, + m L, y + m L, z + m L, t)
x y z
where the summation is extended to all the values of k satisfying the periodicity
conditions
k = n 2π , n 2π , n 2π , (4.13)
x
L y L z L
with nx , ny , nz arbitrary integers. Because of the real character of the function
r, t), the complex vector C
A( (t) satisfies the conjugation property
k
(t)∗ = C
C (t) .
k −
k
We now require that the vector potential A obey the wave equation (4.9). This
means
(t)
d2 C k
= −ωk2 C (t) , (4.14)
dt2 k
where
ωk = c |k| . (4.15)
1
Note that Eq. (4.10) is not relativistically invariant.
136 CHAPTER 4
(−) e−iωk t + C
(t) = C
C (+) e iωk t ,
k
k
k
(−) and C
where C (+) are constants. Substituting into Eq. (4.12) we obtain
k k
(−) e i(k·r−ωk t) +
r, t) =
A( C (+) e i(k·r+ωk t)
C (4.16)
k k
k
k
with
(−)∗ = C
C (+) .
k −k
The vector potential in Eq. (4.16) is decomposed into progressive and regressive
waves. Taking into account that ωk = ω−k , we write the exponential of the regres-
sive wave term in the form
i( r+ωk t)
k· −i(− r −ω−k t)
k·
e =e
and then we change the summation index of this term from k to -k to obtain
r, t) =
A( (−) e i(k·r−ωk t) + C
C (−)∗ e−i(k·r−ωk t) .
k
k
k
k · C
(−) = 0 ,
k
which is a transversality condition holding for any Fourier component of the vector
potential. This relation can be satisfied in the following way. For any wavevector
k we define two complex unit vectors e (λ = 1, 2) both perpendicular to k and
kλ
perpendicular to each other,
∗
ekλ · k = 0 , ekλ · ekλ = δλλ . (4.17)
Next we write
(−) e−iωk t =
C ckλ (t) ekλ ,
k
λ=1,2
where ckλ (t) are oscillatory functions satisfying the differential equation
d
c (t) = −i ωk ckλ (t) , (4.18)
dt kλ
and we finally obtain
i
k·
r −i
k·
r
r, t) =
A( ckλ (t) ekλ e + ckλ (t)∗ ekλ
∗
e , (4.19)
kλ
QUANTIZATION OF THE ELECTROMAGNETIC FIELD 137
from which the electric and magnetic fields can be obtained via Eqs. (4.11)
r, t) = i i ∗ −ik·
k·
r r
E( ωk ckλ (t) ekλ e − ckλ (t)∗ ekλ e
c
kλ
k × c (t) e e ik·r − c (t)∗ e ∗ e−ik·r .
r, t) = i
B( (4.20)
kλ kλ kλ
kλ
kλ
The summations are extended to all the wavenumbers satisfying Eq. (4.13) and,
for each wavenumber, to the two possible polarization states. Each set of values
(k, λ) defines a mode of the radiation field inside the cavity. The number of modes
for which the modulus of k is contained within k and k + dk and its direction is
contained in the solid angle dΩ is given by
3
L 1
2
N (k) dk dΩ = 2 k dk dΩ = V k 2 dk dΩ , (4.21)
2π 4π 3
which gives the energy of the field contained in the volume V. Substituting for E
and B their expressions given in Eqs. (4.20) (with the coefficients c (t) and c (t)∗
kλ kλ
replaced by the operators ckλ and c† , respectively) the integral in Eq. (4.22)
kλ
reduces to the sum of integrals of the form
i(
k+k )·
r
e dV ,
V
which can be easily evaluated to give, owing to the periodicity conditions (4.13),
V δk,−k .
138 CHAPTER 4
We thus obtain
V ωk2
H=− ckλ c−kλ 2
ekλ · e−kλ − k × ekλ · k × e−kλ
8π c
kλλ
ωk2 ∗
+ c†kλ c†−kλ 2
ekλ · e−∗kλ − k × ekλ
∗
· k × e−∗kλ
c
ωk2
− ckλ c†kλ 2
∗
ekλ · ekλ
∗
+ k × ekλ · k × ekλ
c
4
ωk2 ∗
− c†kλ ckλ e · e ∗
+ k × ekλ · k × ekλ . (4.23)
c2 kλ kλ
Using Eqs. (4.15) and (4.17) we have that the first two terms in the right-hand side
identically vanish, while the last two terms give1
V 2
H= ωk c c† + c† c . (4.24)
4πc2 kλ kλ kλ kλ
kλ
we can express the Hamiltonian of the electromagnetic field as the sum of an infinite
number of Hamiltonians (one for each mode) formally identical to the Hamiltonian
of the harmonic oscillator,
1
H= h̄ω a a† + a†kλ akλ .
2 k kλ kλ
kλ
The commutation properties of the operators akλ and a† can be derived from the
kλ
Correspondence Principle. Since these operators are, apart from a factor, the quan-
tum equivalent of the coefficients ckλ (t) which obey the differential equation (4.18),
we require that the same differential equation hold for the expectation value of the
operator akλ on an arbitrary state vector |φ of the Hilbert space,
d
φ|akλ |φ = −i ωk φ|akλ |φ .
dt
1
The cross products in Eq. (4.23) can be easily evaluated using the relations
% %
a × b =
jk ijk
aj bk ,
k ijk lmk
= δil δjm − δim δjl ,
i
where the antisymmetrical tensor ijk is defined on p.7.
QUANTIZATION OF THE ELECTROMAGNETIC FIELD 139
d
ih̄ |φ = H |φ ,
dt
we obtain
φ| [akλ , H] |φ = h̄ωk φ|akλ |φ ,
or 1
ω φ| [a , a a† + a† a ] |φ = ωk φ|akλ |φ .
2 k kλ k λ k λ k λ k λ
k λ
Being |φ an arbitrary state vector, it can be proved that this equation is satisfied
if
[akλ , ak λ ] = [a , a† ] = 0 (4.26)
kλ k λ
k or λ = λ ), and if
when the operators refer to different modes (k =
Equation (4.28), together with the commutation rules (4.26) and (4.27), proves
that the Hamiltonian of the electromagnetic field is the sum of an infinite number
of independent Hamiltonians, each formally identical to the Hamiltonian of the
harmonic oscillator.
If we now arrange the different modes of the electromagnetic field in some definite
order, and recall the results obtained in Sect. 4.1 for the harmonic oscillator, we
have that the eigenvectors of the Hamiltonian H can be described by a state vector
of the form
1
|n1 , n2 , . . , ni , . . = (a†1 )n1 (a†2 )n2 · · (a†i )ni · · |φ0 , (4.29)
n1 ! n2 ! · · ni ! · ·
|φ0 = |0, 0, . . . , 0, . . .
Note that the presence of the term 1/2 in this expression leads to a non-zero (and
indeed infinite) energy eigenvalue even when all the occupation numbers are zero.
This represents a zero-point energy that can be eliminated from the theory by
simply shifting the scale on which energy is measured. By so doing we can write
& '
H |n1 , n2 , . . . , ni , . . . = h̄ωj nj |n1 , n2 , . . . , ni , . . . .
j
where kB is the Boltzmann constant, so that the mean number of photons in the
mode is ∞
1
n̄ = n Pn = h̄ω .
n=0 e kB T
−1
Recalling Eq. (4.21), one obtains that the energy per unit volume with angular
frequency between ω and ω + dω is given by
5
dk 1 h̄ω 3 1
u(ω) = h̄ω n̄ N (k) dΩ = 2 3 h̄ω ,
dω V π c
e kB T
−1
and that the energy density with frequency between ν and ν + dν is
dω hν 3 1
u(ν) = u(ω) = 8π 3 hν
,
dν c
e kB T
−1
which is the classical Planck’s law .
The action of the operators ai and a†i on a given state vector can be deduced
from Eqs. (4.8)
√
ai |n1 , n2 , . . . , ni , . . . = ni |n1 , n2 , . . . , ni − 1, . . .
a†i |n1 , n2 , . . . , ni , . . . = ni + 1 |n1 , n2 , . . . , ni + 1, . . .
a†i ai |n1 , n2 , . . . , ni , . . . = ni |n1 , n2 , . . . , ni , . . . .
QUANTIZATION OF THE ELECTROMAGNETIC FIELD 141
These equations fully justify the denomination of destruction and creation oper-
ators for ai and a†i , respectively. The last equation shows that a†i ai is a positive
definite Hermitian operator whose eigenvalues are the occupation numbers ni . For
this reason it is named the occupation number operator relative to the i-th mode.
The formalism introduced in the previous sections can be slightly modified to make
it more suitable for the physical applications that will be presented in Chap. 6.
Instead of labelling the modes with the wavenumber k we characterize them by the
frequency ν and the direction Ω. Thus we replace the operators a and a† by
kλ kλ
λ) and a† (ν, Ω,
the operators a(ν, Ω, λ), respectively. Moreover, instead of the unit
which still depend on the direction
vectors ekλ we will use the unit vectors eλ (Ω)
but not any longer on the radiation frequency. With these notations, and taking
Ω
into account Eqs. (4.25) and (4.15), we can write for the operator A ˆ (r ) defined in
its classical form in Eq. (4.19)1
.
ˆ
(r ) =
h &
e ik·r + a† (ν, Ω,
λ) e (Ω) λ) e (Ω)
'
∗ e−ik·r , (4.30)
A c a(ν, Ω, λ λ
2πνV
ν Ωλ
where
k = 2πν Ω
c
·Ω
e (Ω) =0, ∗= δ
· e (Ω)
eλ (Ω) (λ, λ = 1, 2) , (4.31)
λ λ λλ
and
λ), a(ν , Ω
a(ν, Ω, , λ ) = a† (ν, Ω,
λ), a† (ν , Ω
, λ ) = 0
λ), a† (ν , Ω
a(ν, Ω, , λ ) = δ δ δ . (4.32)
νν ΩΩ λλ
− a† (ν, Ω, ∗ e−ik·r
λ) e (Ω)
λ
.
ˆ (r ) = i
2πνh & λ) Ω
e ik·r
× e (Ω)
B a(ν, Ω,
V λ
'
ν Ωλ
− a† (ν, Ω,
λ) Ω ∗ e−ik·r ,
× e (Ω) (4.33)
λ
1
To avoid any possible confusion, we use in this section the symbol ˆ to distinguish the
operators from the corresponding classical quantities.
142 CHAPTER 4
= dE
I(ν, Ω) ,
dS dt dν dΩ
where dE is the energy crossing the surface dS (perpendicular to the beam) in the
time interval dt, with frequency between ν and ν + dν and direction contained in
the solid angle dΩ around the direction Ω. The corresponding quantum operator
can be derived by the following arguments. The number of modes having frequency
contained in the interval (ν, ν + dν) and direction in the solid angle dΩ is given,
for each polarization state, by (see Eq. (4.21))
ν2
N (ν) dν dΩ = V dν dΩ . (4.35)
c3
λ) is the number of photons belonging to a particular mode, the number
If n(ν, Ω,
of such photons crossing the surface dS in the time interval dt is
λ) c dt dS
n(ν, Ω, ,
V
and since each photon carries the energy hν one obtains
= hν 3 λ) .
I(ν, Ω) n(ν, Ω, (4.36)
c2
λ=1,2
and the generalization to the polarization tensor defined in Eq. (1.36) is quite
obvious
3
= hν a† (ν, Ω,
Iˆαβ (ν, Ω) α) a(ν, Ω,
β) . (4.37)
c2
The relations connecting the Stokes parameters with the components of the po-
larization tensor depend on the reference direction chosen to define the Stokes
QUANTIZATION OF THE ELECTROMAGNETIC FIELD 143
chosen to expand
parameters (see Sect. 1.6) and on the specific unit vectors eλ (Ω)
ˆ
the vector potential A (r ) (see Eq. (4.30)). Obviously, both choices are completely
arbitrary, and the relation between the Stokes parameters and the polarization
tensor is in general rather involved (see Eqs. (1.42)). It is however convenient, for
in the following way
future applications, to define the unit vectors eλ (Ω)
= √1 − e (Ω)
e+1 (Ω) + i e (Ω)
a b
2
= √1 e (Ω)
e−1 (Ω) + i e (Ω)
, (4.38)
a b
2
and e (Ω)
where ea (Ω) are the reference direction unit vector and the associated
b
unit vector, respectively (see Sect. 1.6). It can be easily shown that these vectors
satisfy Eqs. (4.17), namely
·Ω
e±1 (Ω) =0
∗ = e (Ω)
· e (Ω)
e+1 (Ω) ∗=1
· e (Ω)
+1 −1 −1
∗ = e (Ω)
· e (Ω)
e+1 (Ω) ∗=0.
· e (Ω)
−1 −1 +1
= −Iˆ (ν, Ω)
Q̂ (ν, Ω) − Iˆ (ν, Ω)
+− −+
= −i Iˆ (ν, Ω)
Û (ν, Ω) + i Iˆ (ν, Ω)
+− −+
= Iˆ (ν, Ω)
V̂ (ν, Ω) − Iˆ (ν, Ω)
, (4.39)
++ −−
where we have shortened the notations writing Iˆ++ instead of Iˆ+1 +1 , and so on.
Finally, substituting Eq. (4.37) we obtain
3
ˆ Ω)
I(ν, = hν a† (ν, Ω, +1) + a† (ν, Ω,
+1) a(ν, Ω, −1) a(ν, Ω,
−1)
c 2
3
= − hν a† (ν, Ω,
Q̂ (ν, Ω) −1) + a† (ν, Ω,
+1) a(ν, Ω, −1) a(ν, Ω,
+1)
c 2
3
= −i hν a† (ν, Ω,
Û (ν, Ω) −1) − a† (ν, Ω,
+1) a(ν, Ω, −1) a(ν, Ω,
+1)
c 2
3
= hν a† (ν, Ω,
V̂ (ν, Ω) +1) − a† (ν, Ω,
+1) a(ν, Ω, −1) a(ν, Ω,
−1) ,
c 2
144 CHAPTER 4
which give the Stokes parameters of the radiation field in terms of creation and de-
and the angular
struction operators. The connection between the operator V̂ (ν, Ω)
momentum operator of the electromagnetic field is discussed in App. 3.
The density operator ρ(R) of the radiation field can be introduced using the same
general formalism presented at the beginning of Sect. 3.6. The definition of ρ(R) is
indeed the same as that of Eq. (3.79),
(α) (α)
ρ(R) = pα |ψR ψR | ,
α
(α)
where |ψR is now a state vector of the Hilbert space considered in Sect. 4.3.
A complete basis of this Hilbert space is that of the energy eigenvectors given in
Eq. (4.29), 6 7
|n1 , n2 , . . . , ni , . . . ,
so that the matrix elements of the operator ρ(R) have the form
ρ(R)
(n ,n ,...,ni ,...) (n1 ,n2 ,...,ni ,...) = n1 , n2 , . . . , ni , . . . | ρ(R) |n1 , n2 , . . . , ni , . . . .
1 2
and analogous formulae can be written for the other physical variables associated
with the electromagnetic radiation field.
In some special cases, the density operator of the radiation field can be written
as the direct product of an infinite number of operators each spanning one mode of
1 In these
the radiation field characterized by the frequency ν and the direction Ω.
cases we have
) ⊗ ρ(R) (ν , Ω
ρ(R) = ρ(R) (ν1 , Ω ) · · · ⊗ ρ(R) (ν , Ω
) ··· .
1 2 2 i i
The expectation values of the polarization tensor components relative to the mode
) are then given by
(ν0 , Ω 0
Iαβ (ν0 , Ω ) ρ(R) = Tr Iˆ (ν , Ω
) = Tr Iˆ (ν , Ω ) ρ(R) (ν , Ω
) ,
0 αβ 0 0 αβ 0 0 0 0
1
The concept of mode is used here in a broader sense than in previous sections. Modes
are defined here irrespective of the photon polarization, so that, more precisely, they are double-
modes.
QUANTIZATION OF THE ELECTROMAGNETIC FIELD 145
hν 3
Iαβ = Tr Iˆαβ ρ(R)
0 = 2 Tr a†α aβ ρ(R)
0 ,
c
where ρ(R)
0 is the ‘reduced’ density operator acting on the mode (ν0 , Ω0 ) and where
a†α and aβ are the operators a† (ν0 , Ω
, α) and a(ν , Ω
0 0
, β), respectively.
0
It should be remarked that the density operator described above is an operator
acting on the Hilbert space of the occupation numbers. Sometimes, a different
density operator is introduced, acting on the wavefunction space of the photons
(Fano, 1949; Blum, 1981). This latter operator will not be used in this book.
CHAPTER 5
In this chapter we will discuss the physical mechanisms involved in the generation
and transfer of polarized radiation in spectral lines within the framework of classical
physics. In particular, the classical theory of the electron (already introduced in
Sect. 3.2) will be used to describe the atomic system interacting with polarized
radiation, and the electromagnetic radiation field will be described in terms of
classical electrodynamics.
The results that will be derived are of limited validity as, in most cases, the
correct results can only be obtained by a quantum-mechanical treatment. The
formalism of classical physics is, however, simpler and more transparent than the
quantum-mechanical one. Thus it is suited to give the reader a plain introduction
and an intuitive approach to the physical processes that will be treated in full
generality in the following chapters. Moreover, some of the results obtained from
classical physics coincide with special cases of the corresponding quantum results.
And it is indeed illuminating and encouraging to find, as a limiting case of an in-
volved quantum-mechanical calculation, the same result deduced from the simpler,
classical formalism.
This chapter is also intended to show the strict connection between some clas-
sical and quantum concepts. This subject – generally overlooked in textbooks on
Quantum Mechanics – can be conveniently illustrated by the physical processes
described here, such as resonance scattering and the Hanle effect. Although the
connection can be established only in a qualitative way, it is nevertheless important
for the correct understanding of polarization phenomena.
= 1 ∂D
curl B (5.1c)
c ∂t
1 ∂B
=−
curl E , (5.1d)
c ∂t
is connected
where c is the velocity of light and where the electric displacement D
with E by the equation
=E
D + 4π P ,
P being the electric polarization. From Eqs. (5.1c) and (5.1d) it easily follows that
1 ∂2D
− grad div E
∇2 E = . (5.2)
c ∂t2
2
is related to E
For an isotropic medium, D by the simple equation
= E
D ,
where is the dielectric constant of the medium. Substitution into Eqs. (5.2) and
(5.1a) leads to the wave equation for the electric field
∂2E
=
∇2 E
c2 ∂t2
=0.
divE
ν
e 2πi c (n r · Ω−ct) ,
=E
E (5.3)
0
√ is a constant such
where n = is the index of refraction of the medium and E0
that
·Ω
E =0.
0
Note that Eq. (5.3) gives for the spatial evolution of the electric field components
∂Ej ν
= 2πi n Ej ,
∂s c
vα = uα · v , (5.7)
1 1 1
u1 = u × u3 , u2 = u × u1 , u3 = u × u2 , (5.8)
g 2 g 3 g 1
1
For a dielectric crystal we simply have uα = uα . In the case of a vapor permeated by a
magnetic field, it will be shown in Sect. 5.3 that uα =
uα∗ .
150 CHAPTER 5
with
n2α = α (5.9)
and
=0,
uα · Ω
we easily see that Eqs. (5.2) and (5.1a) are satisfied. Obviously this is a very special
case because the wave is polarized along one of the principal dielectric axes and it
propagates along a direction perpendicular to the same axis.
Instead of looking for a general solution to the problem of wave propagation
within the medium,1 we will confine ourselves to establishing an evolution equation
for the components of the electric field. Our derivation will be further restricted
to media having principal dielectric constants very close to unity. In other words,
defining
α = 1 + ξα (α = 1, 2, 3) , (5.10)
we suppose that |ξα | 1.
Let us now consider a stationary plane wave of frequency ν propagating along
The vectors E
an arbitrary direction Ω. and D associated with the wave depend
) and on time t, thus Eq. (5.2)
only on s (the spatial coordinate measured along Ω
can be written in the form
∂2E 2 2 2
− ∂ (E
Ω = − 4π ν D
· Ω) . (5.11)
∂s2 ∂s2 c2
gives
Scalar multiplication by Ω
·Ω
D =0, (5.12)
so that the transversality condition (5.1a) is satisfied. These equations show that
is perpendicular to Ω
D while E has, in general, a non-zero component along the
direction of propagation.
Let’s now introduce two mutually orthogonal unit vectors ej perpendicular to
the direction of propagation,
1
This is a classical problem in crystal optics and leads to the so-called Fresnel equation of
wave normals (see Born and Wolf, 1964).
MATTER-RADIATION INTERACTION (CLASSICAL) 151
These vectors can be, for instance, the reference direction unit vector ea and the
associated unit vector eb defined in Sect. 1.6 or, more generally, the complex unit
vectors e+ and e− defined in Eqs. (1.41). Defining also
e3 = e3∗ = Ω
, (5.14)
and D
we can expand the vectors E on the basis (e , e , e ),
1 2 3
E = Ei ei , =
D Di ei ,
i i
where
Ei = ei∗ · E
, Di = ei∗ · D
.
The relation between the components Ei of the electric field in the reference system
(e1 , e2 , e3 ) and the components Eα along the triplet (u1 , u2 , u3 ) is the following
Ei = (ei∗ · uα ) Eα , Eα = (uα · ei ) Ei . (5.15)
α i
Analogous equations obviously hold for the components of the electric displace-
ment.
From the transversality condition (5.12) one obtains, using Eqs. (5.4), (5.5),
(5.15), (5.14), and (5.10)
(e3∗ · uα ) (uα · ej ) (1 + ξα ) Ej = 0 . (5.16)
j α
On the other hand, if we expand any vector ej on the basis uα via Eqs. (5.6) and
(5.7),
ej = (uα · ej ) uα ,
α
This formula shows that the ratio of the longitudinal component E3 of the electric
field to the transverse components is of order ξα . Since we have supposed |ξα | 1,
we have
| E 3 | | Ei | (i = 1, 2) .
152 CHAPTER 5
Consider now Eq. (5.11). Performing the scalar product by ej∗ (j = 1, 2) and
using Eqs. (5.4), (5.5), (5.10), and (5.15), we get1
∂ 2 Ej 4π 2 ν 2
2
= − 2 Dj
∂s c
4π 2 ν 2 ∗
3
=− 2 (ej · uα ) (uα · ek ) (1 + ξα ) Ek (j = 1, 2) .
c α
k=1
∂ 2 Ej 4π 2 ν 2 ∗
2
= − (ej · uα ) (uα · ek ) (1 + ξα ) Ek .
∂s2 c2 α
k=1
To the same order of approximation it can be easily shown, with the help of
Eq. (5.17), that this equation is equivalent to the following
∂Ej ν ∗
2 , 1 -
= ± 2πi (ej · uα ) (uα · ek ) 1 + ξα Ek . (5.18)
∂s c α
2
k=1
ν ∗
2
∂Ej
= 2πi (ej · uα ) (uα · ek ) nα Ek (j = 1, 2) . (5.20)
∂s c α k=1
This equation expresses the variation along the propagation direction of the trans-
verse components of the electric field. Its validity is limited to media whose prin-
cipal dielectric constants differ slightly from unity and it cannot be applied, in
general, to crystals. The same equation has been used (although without physical
explanation) by Jefferies et al. (1989) in their derivation of the transfer equations
1
The scalar product by e3∗ leads to the trivial identity 0 = 0.
MATTER-RADIATION INTERACTION (CLASSICAL) 153
for polarized radiation from classical physics. A simpler, heuristic derivation, that
is however basically inconsistent, has been given by Landi Degl’Innocenti (1992).
Equation (5.20) describes the spatial evolution, along the ray path, of the transverse
components of the electric field associated with a stationary plane wave of frequency
ν. This equation can be rewritten in the form
∂
Ej (s, t) = − Gjk Ek (s, t) (j, k = 1, 2) , (5.21)
∂s
k
where
ν
Gjk = −2πi (uα · ej∗ ) (uα · ek ) nα . (5.22)
c α
The tensor G, which will be referred to in the following as the propagation tensor
of the electric field, depends both on the physical nature of the medium, specified
by the (complex) refractive indices nα , and on the geometry of the propagation
specified by the two scalar products. It is worth noticing that in an isotropic
medium, where nα = n, we get from Eq. (5.17)
ν
Gjk = −2πi n δjk . (5.23)
c
From Eq. (5.21) the transfer equation for the polarization tensor can be eas-
ily derived. Recalling the definition of polarization tensor given in Chap. 1 (see
Eq. (1.25) and its generalization in Sect. 1.8) we have
and since the propagation tensor can be extracted from the statistical average
implied by the symbol · · ·, we obtain
d 2
Jjk = − G∗jl Jlk + Gkl Jjl (j, k = 1, 2) . (5.24)
ds
l=1
To derive the transfer equations for the Stokes parameters we need the relation
between these quantities and the components of the polarization tensor. We recall
154 CHAPTER 5
that this relation depends on the choice of the unit vectors e1 and e2 defined in
Eq. (5.13). The easiest choice is
where ea is the reference direction unit vector and eb the associated unit vector
(see Sect. 1.6). With this definition the relation between the Stokes parameters
and the components of the polarization tensor is given by Eqs. (1.34), which can
be inverted to give
1 1
J11 = I +Q J12 = U − iV
2k 2k
1 1
J22 = I −Q J21 = U + iV .
2k 2k
Substitution into Eqs. (5.24) leads to the following propagation equation for the
Stokes parameters
⎛ ⎞ ⎛ ⎞⎛ ⎞
I ηI ηQ ηU ηV I
d ⎜Q⎟ ⎜ ηQ ηI ρV −ρU ⎟ ⎜ Q ⎟
⎝ ⎠ = −⎝ ⎠⎝ ⎠ , (5.25)
ds U ηU −ρV ηI ρQ U
V ηV ρU −ρQ ηI V
where the seven independent quantities appearing in the propagation matrix are
given by
ηI = Re (G11 + G22 )
ηQ = Re (G11 − G22 ) ρQ = −Im (G11 − G22 )
ηU = Re (G12 + G21 ) ρU = −Im (G12 + G21 )
ηV = Im (G12 − G21 ) ρV = Re (G12 − G21 ) . (5.26)
Equations (5.25) and (5.26) are at the basis of radiative transfer for polarized
radiation and deserve some discussion. First of all it should be emphasized the
remarkable symmetry property of the 4 × 4 propagation matrix. This matrix is
constructed with only seven independent quantities and can be decomposed into a
diagonal matrix, proportional to ηI , and two off-diagonal matrices, one symmetric
and the other antisymmetric about the main diagonal.
Denoting the propagation matrix by K, we have
⎛ ⎞
ηI 0 0 0
⎜ 0 ηI 0 0 ⎟
K =⎝ ⎠+
0 0 ηI 0
0 0 0 ηI
⎛ ⎞ ⎛ ⎞
0 ηQ ηU ηV 0 0 0 0
⎜η 0 0 0 ⎟ ⎜0 0 ρV −ρU ⎟
+⎝ Q ⎠+⎝ ⎠ . (5.27)
ηU 0 0 0 0 −ρV 0 ρQ
ηV 0 0 0 0 ρU −ρQ 0
MATTER-RADIATION INTERACTION (CLASSICAL) 155
The first matrix is responsible for the absorption of the energy of the electromag-
netic wave irrespective of its polarization state. This matrix would produce by
itself an exponential decrease of the whole Stokes vector, so that the quantity ηI
can be regarded as the generalization to the polarized case of the absorption coef-
ficient encountered in the standard theory of radiative transfer. In this respect it
is worth noticing that for an isotropic medium we simply have from Eq. (5.23)
ν
ηI = 4π k,
c
k being the imaginary part of the refractive index which is connected with the
standard absorption coefficient. This first matrix will be referred to in the following
as the absorption matrix .
As apparent from Eq. (5.23), both the second and the third matrix are zero for
an isotropic medium. For reasons that will be clarified in the next section, these
matrices will be referred to in the following as dichroism matrix and dispersion
matrix , respectively.
The expression that we have obtained for the propagation matrix K, and in par-
ticular its symmetry properties, descend directly from the physical approximations
adopted. First of all we have assumed the principal dielectric constants α – which
describe the dielectric properties of the medium – to be independent of the ampli-
tude of the propagating electric field. This means that the polarization P induced
in the medium is linearly related to the electric field itself. The present theory is
therefore inadequate to treat the physical phenomena connected with non-linear
optics. Moreover, we have assumed the principal dielectric constants of the medium
to differ slightly from unity, so that the applicability of Eqs. (5.25)-(5.26) is limited
to such media. Finally, we have supposed the electromagnetic wave to propagate
in a homogeneous medium, whose properties do not depend on the spatial coordi-
nate r. It can however be shown that for non-homogeneous media the equations
now derived can still be applied provided that
3 3
3 3 3 3
3 grad n 3 3 n 2π 3 ,
α 3 α λ3
We now apply the results of the previous section to the case of an atomic vapor in
the presence of a magnetic field. We describe the atom by the classical theory of
the electron, going back to the oscillator model already used in Sect. 3.2. In the
presence of a constant magnetic field B and of an oscillating electric field of the
form
e−2πiνt ,
E
156 CHAPTER 5
d2 x e dx − γ dx − e0 E
e−2πiνt ,
= −4π 2 ν02 x − 0 ×B (5.28)
dt2 mc dt dt m
where ν0 is the frequency of the oscillator and γ is the damping constant.
To find the dipole induced by the oscillating electric field associated with the
incident electromagnetic wave we disregard the transient solution depending on
the initial conditions and we look for a solution of the form
−2πiνt
x = a e .
aα = χα Eα , (5.29)
e0 1
χα = − (α = 0, ±1) (5.30)
4π 2 m (ν02 − ν 2 ) − 2iνΓ − 2αννL
with Γ = γ/4π.
If there are N oscillators per unit volume, the electric polarization is
P = −N e0 a
=E
D + 4π P = E
− 4πN e a .
0
Therefore the principal dielectric constants α defined in Eq. (5.5) are given by
N e20 1
α = 1 + (α = 0, ±1) .
πm (ν0 − ν ) − 2αννL − 2iνΓ
2 2
MATTER-RADIATION INTERACTION (CLASSICAL) 157
Since both νL and Γ are usually much smaller than ν0 (see Sect. 3.2), the principal
dielectric constants are sharp functions of the frequency ν peaked at ν0 , so that we
can replace (ν02 − ν 2 ) by 2ν (ν0 − ν).
If we now assume – consistently with the treatment of Sect. 5.2 – the principal
dielectric constants to be close to unity, and separate the refractive indices nα in
their real and imaginary parts,
nα = (1 + δα ) + i kα (5.31)
N e20 Γ
kα =
4πmν (ν0 − ανL − ν)2 + Γ 2
N e20 ν0 − ανL − ν
δα = . (5.32)
4πmν (ν0 − ανL − ν)2 + Γ 2
It should be emphasized that the basis uα (α = 0, ±1) introduced above is such
that the unit vectors uα appearing in the expression of the propagation tensor G
(Eq. (5.22)) are given by
uα = uα∗ ,
as it can be proved directly from Eqs. (5.8). As a consequence, the propagation
tensor reduces to
ν ∗
Gjk = −2πi Cαj Cαk nα , (5.33)
c α
where Cαj are the direction cosines defined in Eq. (3.31), and the completeness
relation (5.17) becomes
∗
Cαi Cαj = δij . (5.34)
α
Equations (5.33), (5.34), and (5.31) allow the coefficients of the propagation matrix
K given in Eq. (5.26) to be written in the form1
ν
ηI = 2π kα |Cα1 |2 + |Cα2 |2
c α
ν
ηQ = 2π kα |Cα1 |2 − |Cα2 |2
c α
ν
∗
ηU = 2π kα 2 Re Cα1 Cα2
c α
ν
∗
ηV = 2π kα 2 Im Cα1 Cα2
c α
1
Note that these expressions hold provided the unit vectors ei appearing in the direction cosines
Cαi are the reference direction unit vector ea and the associated unit vector eb (see Sect. 5.2).
158 CHAPTER 5
θ
Ω
Fig.5.1. The direction of the magnetic field B is specified by the polar angle θ and the azimuth
the unit vector
angle χ, reckoned from the reference direction ea . In the plane perpendicular to B,
and Ω;
ur lies in the plane containing B
us is defined accordingly.
ν
ρQ = 2π δα |Cα1 |2 − |Cα2 |2
c α
ν ∗
ρU = 2π δα 2 Re Cα1 Cα2
c α
ν ∗
ρV = 2π δα 2 Im Cα1 Cα2 . (5.35)
c α
ur = cos θ cos χ ea + cos θ sin χ eb − sin θ Ω
us = − sin χ ea + cos χ eb
u = sin θ cos χ e + sin θ sin χ e + cos θ Ω ,
0 a b
so that
1
u±1 = √ (∓ cos θ cos χ − i sin χ) ea + (∓ cos θ sin χ + i cos χ) eb ± sin θ Ω
.
2
MATTER-RADIATION INTERACTION (CLASSICAL) 159
TABLE 5.1
α = −1 α=0 α = +1
|Cα1 |2 + |Cα2 |2 1
2
(1 + cos2 θ) sin2 θ 1
2
(1 + cos2 θ)
The Cαi combinations appearing in Eqs. (5.35) are summarized in Table 5.1. Using
Eqs. (5.32) one finally obtains the following expressions for the elements of the
propagation matrix
πe20 1 φb + φr
ηI = N φp sin2 θ + (1 + cos2 θ)
mc 2 2
πe20 1 φ + φr
ηQ = N φp − b sin2 θ cos 2χ
mc 2 2
πe20 1 φ + φr
ηU = N φp − b sin2 θ sin 2χ
mc 2 2
πe20 1
ηV = N φr − φb cos θ
mc 2
πe20 1 ψb + ψr
ρQ = N ψp − sin2 θ cos 2χ
mc 2 2
πe20 1 ψ + ψr
ρU = N ψp − b sin2 θ sin 2χ
mc 2 2
πe20 1
ρV = N ψr − ψb cos θ , (5.36)
mc 2
where
1 Γ 1 ν0 + νL − ν
φb = ψb =
π (ν0 + νL − ν)2 + Γ 2 π (ν0 + νL − ν)2 + Γ 2
1 Γ 1 ν0 − ν
φp = ψp =
π (ν0 − ν)2 + Γ 2 π (ν0 − ν)2 + Γ 2
1 Γ 1 ν0 − νL − ν
φr = ψr = . (5.37)
π (ν0 − νL − ν)2 + Γ 2 π (ν0 − νL − ν)2 + Γ 2
The functions φb , φp , φr – where the subscripts stand for ‘blue’, ‘parallel’, and ‘red’,
respectively – have already been encountered in Sect. 3.2 (see Eqs. (3.38)). They
are Lorentzian profiles centered at the frequencies (ν0 + νL ), ν0 , and (ν0 − νL ),
160 CHAPTER 5
∞
φb,p,r dν = 1 .
−∞
1 φ + φr
εI = k φp sin2 θ + b (1 + cos2 θ)
2 2
MATTER-RADIATION INTERACTION (CLASSICAL) 161
1 φ + φr
εQ = k φ − b sin2 θ cos 2χ
2 p 2
1 φ + φr
εU = k φ − b sin2 θ sin 2χ
2 p 2
1
εV = k φ − φb cos θ , (5.38)
2 r
where
πe20 2ν 2
k= N 20 Ē ,
mc c
Ē being the mean energy contained into each degree of freedom of the classical
oscillator.
Adding to Eq. (5.25) the contribution due to emission, we obtain the radiative
transfer equations for polarized radiation in the form
⎛ ⎞ ⎛ ⎞⎛ ⎞
I ηI ηQ ηU ηV I −S
d ⎜Q⎟ ⎜η ηI ρV −ρU ⎟ ⎜ Q ⎟
⎝ ⎠ = −⎝ Q ⎠⎝ ⎠ , (5.39)
ds U ηU −ρV ηI ρQ U
V ηV ρU −ρQ ηI V
where the scalar source function S is given by
2ν02
S= Ē . (5.40)
c2
In particular, under the hypothesis of Local Thermodynamic Equilibrium (LTE)
the mean energy Ē is given by kB T , where kB is the Boltzmann constant and T
the absolute temperature, so that
2ν02
SLTE = k T ,
c2 B
which is the classical expression for the Planck function BP in the limit h → 0
(approximation of Rayleigh and Jeans).
Obviously, we cannot expect to find by a classical theory the exact expression
for the Planck function, which is intimately connected with quantum concepts.
However, using a semiclassical approach, we can go back to the quantization of the
harmonic oscillator (see Sect. 4.1) and write
%
∞
−nx
ne
1
Ē = hν0 n=0
%
∞ = hν0
−nx ex − 1
e
n=0
where x = hν0 /kB T , and substituting this result into Eq. (5.40) we obtain
2hν03 1
SLTE = hν0
= BP (ν0 ) ,
c2
e kB T
−1
162 CHAPTER 5
an expression that could also be obtained by invoking the Kirchhoff law for the
emission of a source in LTE.
Equation (5.39), with the matrix coefficients given by Eqs. (5.36) and with
the Planck function in the place of the source function, was first obtained by
Unno (1956) by a heuristic approach which neglected anomalous dispersion effects
(ρQ = ρU = ρV = 0). These effects were subsequently introduced by Rachkovsky
(1962a) through classical considerations, similar to those presented in this section,
based on the refractive index. Another derivation has been recently presented by
Jefferies et al. (1989). In the meantime a quantum derivation of the same equation
was given by Landi Degl’Innocenti and Landi Degl’Innocenti (1972), who pointed
out some minor errors or ambiguities (mainly connected with sign conventions) ap-
pearing both in the original papers and in subsequent papers based on those (like
for instance in Beckers, 1969a). The reader is referred to Rees (1987) for a clear
presentation of the possible errors that may arise in the derivation of Eq. (5.39).
In the previous section we have derived the absorption profiles φb , φp , φr and the
corresponding dispersion profiles ψb , ψp , ψr for a collection of atoms supposed at
rest in the laboratory frame (the frame where the Stokes parameters are defined).
These profiles are of the form p (ν0 − ν), where ν0 is the frequency of the classical
oscillator.
Actually, however, the atoms will always be in motion. In order to realize how
the profiles are affected by these motions, let us consider a radiation beam of
frequency ν propagating along a given direction, which we identify with the line of
sight. If w is the velocity component of the atom along the line of sight, the atomic
frequency ν0 is shifted, according to the classical formula of the Doppler effect, to
the new value , w-
ν0 = ν0 1 − ,
c
where we have assumed w c (non-relativistic approximation) and where we have
adopted the sign astrophysical convention according to which w > 0 when the atom
is receding from the observer.
Therefore, for a collection of atoms having a normalized distribution of velocity
components f (w), the profile p (ν0 − ν) must be replaced by the expression
∞
w
p ν0 − ν0 − ν f (w) dw . (5.41)
c
−∞
to the Maxwellian law and characterized by the velocity wT . We thus assume for
f (w) the following expression
w−w 2
1 − A
f (w) = √ e wT
. (5.42)
π wT
Substituting Eq. (5.42) into Eq. (5.41), and introducing the reduced variables
wT Γ
∆νD = ν0 , a=
c ∆νD
νL wA ν w ν0 − ν
vB = , vA = = 0 A , v= (5.43)
∆νD wT c ∆νD ∆νD
one obtains with easy transformations
1
φα = √ H(v − vA + α vB , a)
π ∆νD
1
ψα = √ L(v − vA + α vB , a) , (5.44)
π ∆νD
where α = −1, 0, +1 for the ‘red’, ‘parallel’, and ‘blue’ component, respectively,
and where we have defined the functions
∞
a −y 2 1
H(v, a) = e dy
π (v − y)2 + a2
−∞
∞
1 −y 2 v−y
L(v, a) = e dy . (5.45)
π (v − y)2 + a2
−∞
In the above expressions ∆νD represents the Doppler width in frequency units,
and it is used to normalize all the other quantities: v is the so-called reduced
frequency, vB the normalized Zeeman splitting, vA the normalized shift due to the
bulk motion, and a the damping constant . Figure 5.3 shows the two sets of profiles
φ and ψ.
The functions defined in Eqs. (5.45) are the so-called Voigt function H(v, a) and
the associated dispersion profile L(v, a). In previous works on this subject the
function L(v, a) has been usually written in the form
L(v, a) = 2F (v, a) ,
and the name of Faraday-Voigt function has sometimes been employed for F (v, a).
It should be remarked that the quantities defined in Eqs. (5.43) can be expressed
in terms of wavelength (instead of frequency) displacements and broadenings. In-
troducing the Doppler width in wavelength units,
λ20 w
∆λD = ∆νD = λ0 T , (5.46)
c c
164 CHAPTER 5
−1
Fig.5.3. Absorption profiles φ and dispersion profiles ψ in units of ∆νD , corresponding to vB = 3,
vA = 0, and a = 0.05. Note that the magnetic field is 12 times stronger than in Fig.5.2.
λ − λ0 λ20 Γ λ0 wA
v= , a= , vA =
∆λD c ∆λD c ∆λD
λ20 νL λ20 e0 B ∆λB
vB = = = , (5.47)
c ∆λD 4πmc2 ∆λD ∆λD
where ∆λB is defined in Eq. (3.13). With these positions Eqs. (5.44) are still valid.1
The thermal velocity wT in the above equations can be related to the kinetic
temperature T and to the microturbulent velocity ξ by the expression
2kB T
wT = + ξ2 , (5.48)
µM
1
Note, however, that the profiles φα defined in Eqs. (5.44) are normalized to unity in frequency.
The corresponding profiles, normalized to unity in wavelength, are obtained by substituting ∆λD
for ∆νD in the same equations.
MATTER-RADIATION INTERACTION (CLASSICAL) 165
where µ is the atomic weight of the atom which originates the spectral line and M
is the mass of unit atomic weight. Therefore, the normalized Zeeman splitting vB
can be expressed in the form
λ0 e0 B 1
vB =
4πmc 2kB T
+ ξ2
µM
or, numerically
1
vB = 1.400 × 10−7 λ0 B , (5.49)
1.663 × 10−2 T
µ + ξ2
z = v + ia , (5.51)
we obtain with easy transformations
∞ −y 2
i e
W (z) = dy , (5.52)
π z−y
−∞
where, being a > 0, we have Im z > 0. The function W (z) is an analytical function
of the complex variable z that can be related to the complementary error function
(see Abramowitz and Stegun, 1965).
By the substitution (z − y) = u we first transform the integral in Eq. (5.52) into
the following
−u2 +2uz
i −z2 e
W (z) = e du ,
π L u
where L is a straight line parallel to the real axis in the half plane Im u > 0 (see
Fig. 5.4).
Next we observe that
−u2 +2uz
d e −u2 +2uz z2 −(u−z)2 √ z2
du = 2 e du = 2 e e du = 2 π e ,
dz L u L L
where C is a constant that can be determined by taking the limit z → 0 of Eq. (5.53)
−u2 +2uz −u2
e e
C = lim du = du .
z→0 L u L u
The last integral can be evaluated by observing that the integration variable u has
a pole at the origin. Deforming the integration path L into the path L shown in
Fig. 5.4, the contributions arising from the integration along the real axis cancel
out, while the contribution from the semicircle around the origin simply gives −iπ.
To conclude, we have obtained that the function W (z) can be written in the form
z
−z 2 2i t2 −z 2
W (z) = e 1+ √ e dt = e erfc (−iz) , (5.54)
π
0
where erfc (x) is the complementary error function defined as in Abramowitz and
Stegun (1965).
From Eqs. (5.52) and (5.54) some important properties of the functions H(v, a)
and L(v, a) can be easily derived:
i) Limiting case a = 0.
For a = 0, being z = v, Eq. (5.54) gives
v
−v 2 2i −v2 t2
W (z) = e +√ e e dt ,
π
0
−v 2 2
H(v, 0) = e , L(v, 0) = √ D(v) , (5.55)
π
MATTER-RADIATION INTERACTION (CLASSICAL) 167
ia 2
a2 2i t
W (z) = e 1+ √ e dt ,
π
0
so that
a
a2 2 −t2 a2
H(0, a) = e 1− √ e dt = e erfc (a)
π
0
L(0, a) = 0 .
−1 ∞
1 1 y yk
= 1− = ,
z−y z z z k+1
k=0
therefore
∞ ∞ ∞
i y k −y2 i Γ(n + 1/2)
W (z) = e dy = ,
π z k+1 π n=0 z 2n+1
k=0 −∞
where Γ(x) is the Euler Gamma function. By taking the leading terms of the
expansion in inverse powers of z one easily obtains the asymptotic behavior
1 a 1 3v 2 − a2
H(v, a) ∼ √ 2 2
1+ + ···
π a +v 2 (a2 + v 2 )2
1 v 1 v 2 − 3a2
L(v, a) ∼ √ 1+ + ··· . (5.57)
π a2 + v 2 2 (a2 + v 2 )2
iv) Derivatives.
From Eq. (5.54) we have
dW (z) 2i
= −2z W (z) + √ ,
dz π
168 CHAPTER 5
whence
∞ ∞
√
H(v, a) dv = π , L(v, a) dv = 0 . (5.60)
−∞ −∞
MATTER-RADIATION INTERACTION (CLASSICAL) 169
This formula proves that the profiles φα in Eqs. (5.44) are normalized to unity in
frequency.
viii) Convolutions.
Consider the integral
∞
W (v − v0 , a) W (v − v0 , a ) dv .
−∞
Using Eqs. (5.51)-(5.52) and inverting the order of the integrals one gets
∞
W (v − v0 , a) W (v − v0 , a ) dv =
−∞
∞ ∞ ∞
1 −y 2 −y 2 1
=− 2 dy e dy e dv .
π (v − v0 − y + i a) (v − v0 − y + i a )
−∞ −∞ −∞
The last integral can be performed via the residue theorem. Since both poles lie in
the half plane Im v < 0, one obtains
∞
W (v − v0 , a) W (v − v0 , a ) dv = 0 . (5.61)
−∞
Similarly,
∞
W (v − v0 , a) W (v − v0 , a )∗ dv =
−∞
∞ ∞ ∞
1 −y 2 −y 2 1
= 2 dy e dy e dv ,
π (v − v0 − y + i a) (v − v0 − y − i a )
−∞ −∞ −∞
∞
W (v − v0 , a) W (v − v0 , a )∗ dv =
−∞
∞ ∞
2i −y 2 −y 2 1
= dy e dy e .
π v0 − v0 + y − y + i (a + a)
−∞ −∞
170 CHAPTER 5
1 1
x = √ y − y , x = √ y + y , (5.62)
2 2
which leads to
∞
W (v − v0 , a) W (v − v0 , a )∗ dv =
−∞
∞ ∞ −x2
2i −x 2 e
= e dx
√ dx .
π v0 − v0 − 2 x + i (a + a)
−∞ −∞
Performing the integral in dx and recalling Eq. (5.52), one finally obtains
∞
√ v0 − v0 a + a
W (v − v0 , a) W (v − v0 ∗
, a ) dv = 2π W √ , √ . (5.63)
2 2
−∞
∞ .
π v − v a + a
H(v − v0 , a) H(v − v0 , a ) dv = H 0√ 0 , √
2 2 2
−∞
∞ .
π v − v a + a
L(v − v0 , a) L(v − v0 , a ) dv = H 0√ 0 , √
2 2 2
−∞
∞ .
π v0 − v0 a + a
L(v − v0 , a) H(v − v0
, a ) dv = L √ , √ . (5.64)
2 2 2
−∞
∞
v − v0 v − v0
W ,a W , a dv
b b
−∞
and
∞ ∗
v − v0 v − v0
W ,a W , a dv ,
b b
−∞
MATTER-RADIATION INTERACTION (CLASSICAL) 171
∞
v − v0 v − v0
L ,a L , a dv =
b b
−∞
√
b b π v − v0 a b + ab
= √ H √0 ,√
b2 + b 2 b2 + b 2 b2 + b 2
∞
v − v0 v − v0
L ,a H , a dv =
b b
−∞
√
b b π v − v0 a b + ab
= √ L √0 ,√ . (5.65)
b2 + b 2 b2 + b 2 b2 + b 2
1
a H(v0 , a) + v0 L(v0 , a) = √ .
π
x) Numerical methods.
Several algorithms have been proposed for the numerical computation of the func-
tions H(v, a) and L(v, a). The most extensively used for astrophysical applications
are those by Reichel (1968), Hui et al. (1978), and Humlicek (1982). The values
obtained using these algorithms can be checked against the tabulated values of the
complex function W (z) given by Faddeeva and Terent’ev (1961).
1
Transformation (5.62) should be replaced by the following
by − b y b y + by
x= √ , x = √ .
2
b + b 2 b2 + b 2
172 CHAPTER 5
d 3
S =− Kij Sj + εi (i = 0, . . . , 3) , (5.66)
ds i j=0
S † ≡ (S0 , S1 , S2 , S3 ) = (I, Q, U, V ) ,
ε† ≡ (ε0 , ε1 , ε2 , ε3 ) = (εI , εQ , εU , εV ) .
We assume here that both K and ε do not depend on the Stokes vector S. This
may indeed appear as a rather restrictive assumption, because there are many
physical situations, mainly of astrophysical interest, where the properties of the
medium at any given point depend in fact upon the local radiation field. It should
be remarked, however, that the angular spread of the radiation beam that we are
considering can always be made sufficiently small for the properties of the medium
to be not affected by the beam itself. Keeping this argument in mind, we can
derive some important properties of K and ε by varying arbitrarily – in a sort of
Gedankenexperiment – the value of the Stokes vector S appearing in Eq. (5.66).
If we start by considering the limiting case of very intense radiation beams, the
contribution of the emission vector in Eq. (5.66) can be simply neglected and the
transfer equations reduce to
d 3
Si = − Kij Sj (i = 0, . . . , 3) , (5.67)
ds j=0
MATTER-RADIATION INTERACTION (CLASSICAL) 173
1 d 2
I − Q2 − U 2 − V 2 =
2 ds
= −K00 I 2 + K11 Q2 + K22 U 2 + K33 V 2
+ (K10 − K01 )IQ + (K20 − K02 )IU + (K30 − K03 )IV
+ (K12 + K21 )QU + (K13 + K31 )QV + (K23 + K32 )U V . (5.68)
These are just the symmetry relations contained in Eq. (5.27). In the light of the
arguments presented above we see that these relations are a direct consequence
of: a) the definition of the Stokes parameters (which implies the inequality I 2 ≥
Q2 + U 2 + V 2 ); b) the linear character of Eq. (5.67) (which is equivalent to the
assumption K independent of S); c) the reversible character of the interaction of
radiation with the ambient medium (which implies the equal sign in Eq. (5.69)).
174 CHAPTER 5
When conditions b) and c) are fulfilled the propagation matrix K can be written
in the form ⎛ ⎞
ηI ηQ ηU ηV
⎜η ηI ρV −ρU ⎟
K=⎝ Q ⎠ .
ηU −ρV ηI ρQ
ηV ρU −ρQ ηI
We will now prove that one, or even two, of the seven independent coefficients
appearing in the expression of K can be set equal to zero by an appropriate choice
of the reference direction which defines the Stokes parameters. Recalling the results
derived in Sect. 1.9 (see Eq. (1.45) and Fig. 1.10), the Stokes vector S relative to
the ‘new’ reference direction characterized by the angle α can be written in the
form
S = R(α) S ,
where ⎛ ⎞
1 0 0 0
⎜ 0 cos 2α sin 2α 0⎟
R(α) = ⎝ ⎠ .
0 − sin 2α cos 2α 0
0 0 0 1
Thus in the new reference frame the expression for the propagation matrix is
These formulae show that while the quantities ηI , ηV , and ρV are invariant with
respect to the choice of the reference direction, ηQ and ηU change into each other
according to a rotation through an angle 2α in the ηQ -ηU plane, and the same
happens for ρQ and ρU . It follows that the two quantities ηL and ρL defined by
ηL2 = ηQ
2 2
+ ηU , ρ2L = ρ2Q + ρ2U
the transformation properties of the unit vectors ea , eb under rotation of the ref-
erence direction. The proof is straightforward and will not be given here. In the
special case of the propagation of radiation in a magnetized medium (Sect. 5.3) one
can easily see from Eqs. (5.36) that the preferred reference direction lies (indepen-
dently of frequency) in the plane containing the magnetic field and the direction
of propagation.1
The above remarks show that when a preferred reference direction can be defined
the propagation matrix K can be cast into the simpler form
⎛ ⎞
ηI ηQ 0 ηV
⎜ ηQ ηI ρV 0 ⎟
K=⎝ ⎠ .
0 −ρV ηI ρQ
ηV 0 −ρQ ηI
Going back to Eq. (5.66), we can also deduce some interesting properties of the
emission vector ε by considering the limiting case of very weak intensity of the
radiation beam. Under this limit the transfer equations reduce to
dSi
= εi (i = 0, . . . , 3) ,
ds
and by integration over the interval ∆s one gets
Si (s + ∆s) = Si (s) + εi ∆s .
ηI ≥ 2 + η2 + η2 ,
ηQ U V (5.72)
does not hold in general. However, under Local Thermodynamic Equilibrium or,
more generally, when one can write
⎛ ⎞ ⎛ ⎞
εI ηI
⎜ εQ ⎟ ⎜η ⎟
⎝ ⎠=S⎝ Q⎠ , (5.73)
εU ηU
εV ηV
In Sect. 1.9 we have introduced the concept of Poincaré sphere as a mapping be-
tween Stokes vectors and points within a sphere of unit radius (see Fig. 1.11).
Thus it is quite natural to give a geometrical interpretation to the radiative trans-
fer equations for polarized radiation in terms of the motion of the representative
point within the Poincaré sphere. This is schematically illustrated in Fig. 5.5.
In a right-handed orthogonal system (xyz) we consider the formal vectors p , η ,
ρ , defined by
Q U V
p=
, ,
I I I
η = ηQ , ηU , ηV
ρ = ρQ , ρU , ρV
ε , εQ εU εV -
= = , , ,
I I I I
and the scalar quantity
ε
I = I .
I
With these positions the transfer equations for polarized radiation can be trans-
formed into the following
dI
= −(ηI +
η · p − I ) I (5.74a)
ds
d
p
= − η · p ) p + ρ
η + ( ×p + − I p . (5.74b)
ds
MATTER-RADIATION INTERACTION (CLASSICAL) 177
on the
surface
Fig.5.5. The transfer of polarized radiation in an arbitrary medium can be described in terms of
the motion of the representative point inside the Poincaré sphere. The point cannot pierce the
surface of the sphere but may describe a curve that is tangent to it, like in the point P. The
starting point P0 represents the radiation entering the medium.
The scalar equation describes the transfer of the beam intensity and it can be
considered as the equation which controls the scaling factor connecting the Stokes
parameters with their fractional value given by the vector p . It is interesting to
note that this equation differs from the standard equation for unpolarized radiation
only for the presence of the term ( η · p ) which adds to the (generalized) absorption
coefficient ηI . Note also that the dispersion term, ρ , does not appear directly
in this equation, nevertheless it affects the intensity via the term (η · p ) (since p
depends on ρ ).
The vector equation describes the motion of the representative point inside the
Poincaré sphere. The first term in the right-hand side tends to align the fractional
polarization p with the vector − η . If we consider for instance the simple case
where η does not depend on s, and if we take p = 0 (unpolarized radiation) as the
boundary condition at s = s0 , when the ray has traveled a distance ∆s from s0 we
obtain from this term
p (∆s) = − η ∆s ,
and, as ∆s grows, the representative point might pierce the Poincaré sphere. This
is however prevented by the second term, which is opposed to the first one and
whose relative importance grows as the point approaches the surface of the sphere.
Clearly this term describes a phenomenon of saturation for the fractional linear
polarization.
In this respect it is also interesting to consider the effect of the term (η · p )
in the scalar equation for the intensity. In the simple situation discussed above
(
η = const., p = 0 at s = s0 ) this term brings a negative contribution to the
absorption coefficient, since p is directed along −η . Therefore, as the radiation
beam propagates through the medium, the polarization grows and the medium
becomes more and more transparent. We are facing again a saturation phenomenon
which is strictly connected with the former.
178 CHAPTER 5
ρ
ρ
Fig.5.6. Geometrical interpretation of Faraday rotation (a) and Faraday pulsation (b). In case
(a) only the component ρV is non-zero and we have a transformation from Q to U , and vice versa.
In case (b) the component ρU induces a transformation from Q to V , and vice versa.
S
s̃ = ,
I
we have from Eq. (5.74b)
d
p
= −(1 − s̃) η + (
η · p − s̃ ηI ) p + ρ
× p ,
ds
an equation containing the driving term, the saturation term, and the dispersion
term.
We have seen in Sects. 5.1-5.2 how it is possible to deduce the transfer equations
for polarized radiation using the concepts of classical electrodynamics. The general
equations (5.25)-(5.26) have then been applied to the case of an atomic vapor
embedded in a magnetic field. By schematizing the atomic system as a negative
charge oscillating under the action of an elastic, restoring force, we have derived
Eqs. (5.36) which give the basic quantities describing the transfer of polarized
radiation in a magnetic field.
In this section we will use the same atomic model to deduce the laws of resonance
scattering and of the Hanle effect. Starting from Eq. (3.34), we can write for the
frequency-integrated radiation emitted per unit time in the solid angle dΩ by an
atomic oscillator embedded in a magnetic field
c
=
dI˜ij (Ω) J (r) r2 dΩ
8π ij
∞
2π 3 e20 ∗
A∗α Aβ
= dΩ Cαi Cβj ν 4 Fα (ν)∗ Fβ (ν) dν , (5.75)
c3 τ
αβ −∞
incident scattered
radiation radiation
Ω Ω
Ω Ω
to the direction Ω
Fig.5.7. A radiation beam is scattered from the direction Ω in the presence of
a magnetic field.
where dΩ is an infinitesimal solid angle around the direction Ω . Note that the
frequency dependence of the components Eα in the above equation is not specified:
they are supposed to be constant over a spectral interval centered at the reso-
nance frequency ν0 and much larger than the Larmor frequency νL (flat spectrum
approximation).
If we now introduce two mutually orthogonal unit vectors perpendicular to the
,
direction Ω
ei∗ · ej = δij , ei · Ω
= 0 (i, j = 1, 2) ,
we can write
Eγ = ∗
Cγi Ei , (5.77)
i
where the direction cosines Cγi are given by
Cγi = uγ · ei∗ .
Substituting Eq. (5.77) into Eq. (5.76), and averaging over all the possible realiza-
tions of the incident electric field, we have
K ∗
A∗α Aβ = C C I (Ω ) dΩ , (5.78)
k ij αi βj ij
where k is the constant appearing in Eqs. (1.34) and Iij (Ω ) is the polarization
tensor of the incident radiation.
MATTER-RADIATION INTERACTION (CLASSICAL) 181
ν02 kB T
Iij = δij ,
c2
where we have used the classical expression for the Planck function. Substitution
into Eq. (5.78) gives
5
K ν02 kB T
A∗α Aβ =
Cαi ∗
Cβi dΩ
k c2 i
5
K ν02 kB T
= · u )(Ω
δαβ − (Ω · u ∗ ) dΩ
α β
k c2
8π K ν02 kB T
= δαβ . (5.79)
3 k c2
On the other hand, the amplitudes of the atomic oscillations in thermodynamic
equilibrium are given by (see Eq. (3.35) for a similar derivation)
γτ kB T
A∗α Aβ = δ . (5.80)
2π 2 ν02 m αβ
K 3 γτ c2
= . (5.81)
k 16π 3 ν04 m
dI˜ij (Ω)
=
∞
3 πe20 ∗
∗ dΩ
= γ Cαi Cβj Cαk Cβl Ikl (Ω ) dΩ Fα (ν)∗ Fβ (ν) dν , (5.82)
2 mc 4π
αβ kl −∞
where we have extracted the factor ν04 from the integral since the Fourier transforms
Fα (ν) are substantially non-zero only for ν ν0 (see Eq. (3.33)).
The frequency integral can be easily evaluated with the help of the residue the-
orem
∞
1 1
Fα (ν)∗ Fβ (ν) dν = , (5.83)
γ 1 + i (α − β)H
−∞
where
2πνL e B
H= = 0 . (5.84)
γ 2mcγ
182 CHAPTER 5
where
πe20
σcl =
mc
is the classical (frequency-integrated) cross section of the atomic oscillator, and T
– the so-called scattering phase matrix for the polarization tensor – is defined by
Ω
Tij,kl (Ω, )= 3
; B ∗
Cαi
Cβj Cαk ∗
Cβl
1
. (5.87)
2 1 + i (α − β)H
αβ
This matrix depends on the geometry of the scattering event and on the magnitude
and direction of the magnetic field vector; moreover, it depends on the choice of
the unit vectors (e1 , e2 ) and (e1 , e2 ). Note that expression (5.87) is valid both
for real unit vectors and, more generally, for complex unit vectors of the form of
Eqs. (1.41).
The following symmetry property can be proved directly from Eq. (5.87)
Ω
Tij,kl (Ω, ; B
)∗ = T
ji,lk (Ω, Ω ; B ) . (5.88)
If the direction of the magnetic field vector is reversed, one gets a sign inversion of
the term in H in Eq. (5.87) (see Eqs. (3.23), (3.27), and (3.33)). This leads to the
further property
Ω
Tij,kl (Ω, ; −B
)=T
lk,ji (Ω , Ω; B ) . (5.89)
In the case of zero magnetic field the expression for the matrix T reduces to a much
simpler form. Summation over the indices α and β yields
; 0) ≡ T 3
Ω
Tij,kl (Ω, ij,kl (Ω, Ω ) = D D∗ ,
2 ik jl
where the direction cosines Dik are given by
both for the incident and for the scattered radiation, the reference direction unit
vector and the associated unit vector defined in Sect. 1.6. In other words, we take
e1 = ea (Ω) e1 = ea (Ω
)
e = e (Ω)
2 b
e = e (Ω
2 b
) .
In this case the required relation is given by Eqs. (1.40), which imply that the
scattered in the solid angle dΩ by
frequency-integrated Stokes parameters dS̃i (Ω)
an atomic oscillator are given by
=
dS̃i (Ω) (σi )nm dI˜mn (Ω)
,
nm
with σi the Pauli spin matrices defined in Eq. (1.17). Using again Eqs. (1.40), we
obtain
=σ
dS̃i (Ω) Ω
Rij (Ω, ; B ) dΩ dΩ
) S (Ω (i, j = 0, . . . , 3) , (5.91)
cl j
j
4π
where the scattering matrix R for the Stokes parameters, sometimes called the
phase matrix , is given by
Ω
Rij (Ω, ; B
)=
1 Ω ; B)
= (σi )lk (σj )mn Tkl,mn (Ω,
2
klmn
3
∗ ∗ 1
= (σi )lk (σj )mn Cαk Cβl Cαm Cβn . (5.92)
4 1 + i (α − β)H
klmn αβ
In the case of zero magnetic field the matrix R reduces to the so-called Rayleigh
phase matrix
Ω
Rij (Ω, ; 0) ≡ R (Ω, ) = 3
Ω ∗
(σi )lk (σj )mn Dkm Dln ,
ij
4
klmn
Ω
Rij (Ω, ; −B
) = R (Ω , Ω;
B). (5.94)
ji
184 CHAPTER 5
scattering atom
incident
radiation
magnetic field
α
scattered radiation
Fig.5.8. The Hanle effect is illustrated by this simple scattering experiment. For zero magnetic
field the polarization of the scattered beam is perpendicular to the plane of scattering; when
a magnetic field is present, the polarization is reduced and its direction is rotated through an
angle α.
Ω
Rij (Ω, ) = R (Ω , Ω)
, (5.95)
ji
θ
θ Ω
Ω θ
γ
γ
χ
χ
χ
Fig.5.9. The most general geometry for the scattering event. The unit vector 1, lying in the plane
is the reference direction unit vector e (Ω),
perpendicular to Ω, while 2 is the associated unit
a
The unit vectors 3 and 4 are e (Ω
vector eb (Ω). ) and e (Ω
), respectively.
a b
Equation (5.92) describes the Hanle effect from the classical point of view; we
will see in Chap. 10 how this equation is generalized in the quantum theory.
In the previous section we have obtained a general expression for the scattering
phase matrix that can be applied to any geometrical configuration of the scattering
event and of the magnetic field and to any choice of the reference directions for the
Stokes parameters. For practical applications it is however useful to have explicit
analytical expressions for the scattering phase matrix in particular geometrical
situations.
The most general geometry is illustrated in Fig. 5.9, where the three directions Ω,
, and B
Ω are specified by their polar angles and where the reference directions are
defined by the angles γ and γ . In this geometry the scattering phase matrix will
depend on 8 parameters, namely θ, θ , θB , (χ − χB ), (χ − χB ), γ, γ , and H, and
the calculation of one element Rij will require the sum of 144 (= 32 × 24 ) terms.
It is not worth here to develop in detail these calculations; the interested reader is
referred to Landi Degl’Innocenti and Landi Degl’Innocenti (1988), where an ana-
lytical expression for the phase matrix is derived under the assumption γ = γ = 0
(which means that the positive Q-directions for the incident and scattered beams
lie in the meridian planes of Fig. 5.9). Being deduced from the quantum theory,
186 CHAPTER 5
Ω
θ θ
Ω
Fig.5.10. A particular geometry for the calculation of the scattering phase matrix. The polar
and Ω
angles of the directions Ω are reckoned from the magnetic field direction. The meaning of
the unit vectors 1, 2, 3, 4 is the same as in Fig.5.9.
the expressions presented in the paper just quoted are in fact more general than
those obtainable from Eq. (5.92); however, they reduce to the classical results with
easy transformations.
Some general properties of the scattering phase matrix relative to the geometrical
configuration of Fig. 5.9 will be given in Sect. 5.12. Here we want to derive the
explicit expression of this matrix for the much simpler geometry illustrated in
Fig. 5.10.
Now the magnetic field vector is parallel to the z-axis and the reference direction
for the Stokes parameters lies in the meridian plane containing the magnetic field
and the propagation direction. The various unit vectors are given by
= e = − sin χ ı + cos χ j
eb (Ω) 2
ea (Ω
) = e = cos θ cos χ ı + cos θ sin χ j − sin θ k
1
eb (Ω
) = e = − sin χ ı + cos χ j
2
u0 = k
1
u±1 = √ ∓ ı + i j ,
2
1 −iχ i −iχ
C1 1 = − √ cos θ e C1 2 = √ e
2 2
C0 1 = − sin θ C0 2 = 0
1 iχ i iχ
C−1 1 = √ cos θ e C−1 2 = √ e ,
2 2
with analogous expressions for Cαi .
From these expressions the elements of the scattering phase matrix for the polar-
ization tensor, given by Eq. (5.87), can be evaluated via some heavy algebra that
is left to the reader as an exercise. The results are the following
3 3 1
T11,11 = 1 − µ2 − µ2 + µ2 µ2 + 2µµ 1 − µ2 1 − µ2 C1 + µ2 µ2 C2
2 2 2
3 1
T11,12 = T11,21 = −µ 1 − µ2 1 − µ2 S1 − µ2 µ S2
2 2
3 1
T12,11 = T21,11 = µ 1 − µ2 1 − µ2 S1 + µµ2 S2
2 2
3 1 2 1 2
T11,22 = µ − µ C2
2 2 2
3 1 2 1 2
T22,11 = µ − µ C2
2 2 2
3 1 1
T12,12 = T21,21 = µµ + 1 − µ2 1 − µ2 C1 + µµ C2
2 2 2
3 1 1
T12,21 = T21,12 = − µµ + µµ C2
2 2 2
3 1
T12,22 = T21,22 = − µ S2
2 2
3 1
T22,21 = T22,12 = µ S2
2 2
3 1 1
T22,22 = + C , (5.96)
2 2 2 2
where
µ = cos θ µ = cos θ
C1 = cos α1 cos α1 + χ − χ S1 = cos α1 sin α1 + χ − χ
C2 = cos α2 cos α2 + 2 (χ − χ) S2 = cos α2 sin α2 + 2 (χ − χ) (5.97)
are particularly useful to express the quantities (1 ± iH)−1 and (1 ± 2iH)−1 in the
form
∓ iα1
(1 ± iH)−1 = cos α1 e = cos2 α1 ∓ i sin α1 cos α1
∓ iα2
(1 ± 2iH)−1 = cos α2 e = cos2 α2 ∓ i sin α2 cos α2 .
Note that the intensity of the magnetic field is contained only in the quantities C1 ,
C2 , S1 , S2 . In particular, for zero magnetic field one gets
C1 = cos(χ − χ) S1 = sin(χ − χ)
C2 = cos 2(χ − χ) S2 = sin 2(χ − χ) ,
C1 = S1 = C2 = S2 = 0 ,
Ω ; B
)= 3 3
R33 (Ω, µµ + 1 − µ2 1 − µ2 C1 . (5.99)
2 2
These formulae show that the V Stokes parameter (S3 ) is totally decoupled from
the other Stokes parameters in a scattering event.
In the limiting case of strong magnetic field one gets
⎛8 1 2 2 ⎞
3 + 3 (1 − 3µ )(1 − 3µ ) (1 − 3µ )(1 − µ ) 0
2 2
0
⎜ (1 − µ2 )(1 − 3µ2 ) 3(1 − µ2 )(1 − µ2 ) 0 0 ⎟
Ω
R(Ω, → ∞) = 3 ⎜
; B ⎜
⎟
⎟
8⎝ 0 0 0 0 ⎠
0 0 0 4µµ
which shows that the U Stokes parameter of the scattered radiation is zero irre-
spective of the direction and polarization of the incoming beam. If, in addition, the
incoming beam is unpolarized, the fractional polarization of the scattered radiation
is found to be
Q (1 − µ2 )(1 − 3µ2 )
= ,
I 3 − µ2 − µ2 + 3µ2 µ2
thus it is positive, negative, or zero according as (µ2 − 1/3) is negative, positive,
or zero. Defining the Van Vleck angle θV according to the formula
√
cos θV = 1/ 3 , (5.100)
which implies
θV = 54◦ .74 , (5.101)
one obtains
Q )
> 0 (linear polarization parallel to B for θV < θ < π − θV
I
Q ) for 0 < θ < θ
< 0 (linear polarization perpendicular to B V
I
or π − θV < θ < π .
190 CHAPTER 5
Ω
Ω
scattered
reference
beam unpolarized
direction
incident beam
Fig.5.11. Geometry for a 90◦ scattering. The magnetic field lies in the plane perpendicular to
the incident beam and makes an angle β with the direction of the scattered beam. The reference
direction for the Stokes parameters of the scattered radiation is perpendicular to the scattering
plane.
In other words, for an unpolarized incoming beam and in the limit of large magnetic
field (H 1), the linear polarization of the scattered beam is either parallel
or perpendicular to the magnetic field according to the direction of the incident
radiation.
Another consequence that can be easily deduced from Eqs. (5.99) concerns the
particular case of an incident beam parallel (or antiparallel) to the magnetic field
vector. Setting µ = ± 1 in Eqs. (5.99) we see that the matrix elements of the
form Ri0 (i = 0, . . . , 3) are independent of the magnetic field. Therefore, if the
incident beam is unpolarized the scattered radiation is completely insensitive to
the magnetic field.
To understand in further detail the role of the magnetic field in resonance scattering,
it is convenient to apply the results obtained in the previous section to some specific
geometrical configurations of the scattering process.
Consider first the configuration of Fig. 5.11, where we want to investigate the
polarization of the radiation scattered at 90◦ from an unpolarized incident beam.
The scattering event takes place in the presence of a magnetic field which lies in
the plane perpendicular to the incident beam.
Since the incident beam is unpolarized, the only elements of the scattering phase
matrix that must be computed are those of the form Ri0 (i = 0, . . . , 3). Taking as
reference direction the perpendicular to the scattering plane (see Fig. 5.11), these
matrix elements can be deduced from Eqs. (5.99) by the substitutions
π
µ = cos β , µ = 0 , χ − χ = .
2
This leads to the expressions
3
R00 = 3 − cos2 β − sin2 β cos2 α2
8
MATTER-RADIATION INTERACTION (CLASSICAL) 191
Fig.5.12. Polarization diagram (or Hanle diagram) relative to the scattering process illustrated in
Fig.5.11. Full lines correspond to β = const., while broken lines correspond to constant magnetic
field strength, parameterized through the quantity α2 (see Eqs.(5.84) and (5.98)).
3 2
R10 = sin β + (1 + cos2 β) cos2 α2
8
3
R20 = cos β sin α2 cos α2
4
R30 =0,
so that the fractional linear polarization of the scattered beam is given by
Q sin2 β + (1 + cos2 β) cos2 α2
=
I 3 − cos2 β − sin2 β cos2 α2
U 2 cos β sin α2 cos α2
= . (5.102)
I 3 − cos2 β − sin2 β cos2 α2
The results now deduced can be conveniently plotted in the polarization diagram
(or Hanle diagram) shown in Fig. 5.12. The solid lines are isoazimuth curves
(β = const.), while the broken lines are isostrength curves (α2 = const.). For
any assigned couple (β, α2 ) the diagram gives the polarization Q/I, U/I of the
scattered beam. Note that the values of β can be confined in the range [0, π] since
Eqs. (5.102) are invariant under the substitution β → −β.
There are several characteristics of Eqs. (5.102) that it is worthwhile to discuss
in some detail. First of all, in the case of zero magnetic field (α2 = 0) one gets
Q U
=1, =0, (5.103)
I I
192 CHAPTER 5
thus the scattered beam is totally polarized perpendicularly to the scattering plane.
On the other hand, for very strong magnetic field (α2 = π/2) one obtains
Q sin2 β U
= , =0,
I 3 − cos2 β I
Q U
= cos2 α2 , = ± sin α2 cos α2 ,
I I
where the plus sign for U/I corresponds to β = 0 and the minus sign to β = π. In
both cases the total linear polarization is given by
2 2
Q U 1
pL = + = cos α2 = √ , (5.104)
I I 1 + 4H 2
1 1
α0 = ± α2 = ± arctan 2H (5.105)
2 2
with the plus sign for β = 0 and the minus sign for β = π. This formula shows
that a magnetic field with the same direction as the scattered radiation produces
a counterclockwise rotation (for an observer looking at the scattering point) of the
plane of linear polarization; conversely, the rotation is clockwise if the magnetic
field is in the opposite direction.
Equations (5.104) and (5.105) are often used to discuss the role of the Hanle
effect in scattering polarization (see e.g. Mitchell and Zemansky, 1934). They are
however of limited use because they refer to an extremely particular case.
In the scattering configuration just considered the magnetic field produces, be-
sides a rotation of the polarization direction, a decrease of the polarization degree
(depolarization); but this is not always the case. A counter example is provided,
for instance, by forward scattering.
The relevant geometry is now illustrated in Fig. 5.13, where we consider the effect
of a magnetic field on the forward scattering of an unpolarized radiation beam. The
polarization of the scattered beam can again be deduced from Eqs. (5.99) by the
substitutions
µ = µ = cos β , χ = χ .
MATTER-RADIATION INTERACTION (CLASSICAL) 193
reference
direction
unpolarized
incident beam β scattered beam
Ω Ω
Fig.5.13. Forward scattering. The reference direction for the Stokes parameters lies in the plane
containing the magnetic field and the direction Ω.
We obtain
1 2 3 3
R00 = 1 + 3 cos2 β − 1 + sin2 β cos2 β cos2 α1 + sin4 β cos2 α2
8 2 8
3
R10 = sin2 β 1 − 3 cos2 β + 4 cos2 β cos2 α1 − (1 + cos2 β) cos2 α2
8
3
R20 = sin2 β cos β 2 sin α1 cos α1 − sin α2 cos α2
4
R30 = 0 . (5.106)
Q U
= =0,
I I
so that the scattered beam is unpolarized as the incident beam. For strong magnetic
field (α1 = α2 = π/2) one obtains
The equations that we have derived in the preceding sections involve the evaluation
of several direction cosines relating the unit vectors uα , relative to the magnetic
needed for the definition of the Stokes parameters.
field, to the unit vectors ei (Ω)
These direction cosines and, in particular, their bilinear combinations can be conve-
niently expressed in terms of ordinary rotation matrices. This leads, in most cases,
to much more compact and handy expressions for the relevant physical quantities,
like the scattering phase matrix derived in Sect. 5.7.
The introduction of rotation matrices requires, however, some remarks about
the spherical components of ordinary vectors, an argument that we have already
treated in Sect. 2.7 and that we are going to develop in further detail here.
Given an arbitrary vector v , its spherical components vq are defined by (see
Eqs. (2.82))
1
v−1 = √ (vx − i vy )
2
v0 = vz
1
v1 = − √ (vx + i vy ) , (5.107)
2
where vx , vy , vz are the Cartesian components in a right-handed coordinate system.
Since in the following we will often deal with vectors of the form
v = a + i b (5.108)
with a and b real, we want to establish some properties of the spherical components
of such vectors.
Taking the complex conjugate of Eqs. (5.107), with v given by Eq. (5.108), we
have
1
(v−1 )∗ = √ (ax + by ) − i (bx − ay )
2
(v0 )∗ = az − i bz
1
(v1 )∗ = − √ (ax − by ) − i (bx + ay ) . (5.109)
2
On the other hand, taking the complex conjugate of Eq. (5.108),
v ∗ = a − i b ,
we have for the spherical components of the vector v ∗
1
(v ∗ )−1 = √ (ax − by ) − i (bx + ay )
2
(v ∗ )0 = az − i bz
1
(v ∗ )1 = − √ (ax + by ) − i (bx − ay ) . (5.110)
2
MATTER-RADIATION INTERACTION (CLASSICAL) 195
we have
Cαi = uα · ei∗ = (−1)q (uα )q (e∗i )−q = (uα )q (ei )∗q , (5.113)
q q
This expression suggests the introduction of a tensor – the dyadic product of two
irreducible spherical tensors of rank 1 – which plays a very important role in the
description of polarization phenomena. Indeed, as we will see in the following, this
tensor – and the analogous ones that will be derived from it – appears as a natural
ingredient of almost any mathematical expression relevant to polarized radiation.
For this reason it deserves a careful definition.
In the right-handed reference system (xyz) of Fig. 5.14 we consider a particular
direction characterized by the real unit vector c. In the plane perpendicular to c
we introduce two real unit vectors, a and b, such that a, b, and c form, in this
order, a right-handed coordinate system. The orientation of (a, b, c ) relative to
(xyz) is specified by the angles θ, χ, and γ defined in Fig. 5.14, with
0≤θ≤π, 0 ≤ χ < 2π , 0 ≤ γ < 2π .
From the unit vectors a, b, c we then define the unit vectors c−1 , c0 , c+1 via the
equations
1
c−1 = √ (a + i b )
2
c0 = c
1
c+1 = √ (−a + i b ) , (5.114)
2
196 CHAPTER 5
plane
perpendicular
to
θ
γ
Fig.5.14. Geometry for the definition of the tensor Eqq (α, β,c ).
and, finally, we define the reducible spherical tensor Eqq relative to the triplet
(a, b, c ) as1
The explicit expression for Eqq (α, β, c ) can be found by the following argument.
In the reference system (a, b, c ) we have, from Eqs. (5.107) applied to the unit
vectors defined in Eqs. (5.114)
Since (cα )q is an irreducible spherical tensor of rank 1, its expression in the reference
system (xyz) of Fig. 5.14 can be obtained by application of the transformation law
of irreducible spherical tensors under rotations. From Eq. (2.78) we get
(cα )q = δαp Dpq
1
(R) = Dαq1
(R) (5.116)
p
and, similarly
(cβ )∗q = Dβq
1 ∗
(R) ,
1
The quantity Eqq is neither an irreducible spherical tensor nor the dyadic product of
two irreducible spherical tensors. The denomination ‘reducible spherical tensor’ is due to the fact
that it can be reduced to the linear combination of irreducible spherical tensors (see Eq. (5.125)).
Note also that Eqq depends not only on the direction of the unit vector c (specified by the angles
θ and χ), but also on the direction (specified by the angle γ) of the unit vector a in the plane
perpendicular to c. To shorten notations, we keep only c as an explicit argument of the tensor.
MATTER-RADIATION INTERACTION (CLASSICAL) 197
where R ≡ (−γ, −θ, −χ) is the rotation bringing the system (a, b, c ) into the system
(xyz). Thus the expression for Eqq (α, β, c ) in the reference system (xyz) is
∗
Eqq (α, β, c ) = Dαq
1
(R) Dβq
1
(R) . (5.117)
Let’s now go back to the scattering phase matrix T appearing in Eqs. (5.86) and
(5.87), and consider the case where the polarization unit vectors (e1 , e2 ) relative to
the scattered radiation and the analogous vectors (e1 , e2 ) relative to the incident
radiation are defined by the expressions (note the similarity with Eqs. (5.114))
1
e1 = √ + i e (Ω)
ea (Ω) b
≡ e (Ω)
−1
2
1
e2 = √ + i e (Ω)
−ea (Ω) b
≡ e (Ω)
+1
2
1
e1 = √ ea (Ω
) + i e (Ω
b
) ≡ e (Ω
−1
)
2
1
e2 = √ −ea (Ω
) + i e (Ω
b
) ≡ e (Ω
+1
) , (5.118)
2
e (Ω
e (Ω),
where ea (Ω), ), e (Ω
) are the reference direction unit vectors and the
b a b
associated unit vectors defined as in Fig. 5.9.
With this choice Eq. (5.86) can be rewritten as
= σ
dI˜µν (Ω) Ω
Tµν,ρσ (Ω, ; B ) dΩ dΩ
) I (Ω
cl ρσ
ρσ
4π
where µ, ν, ρ, σ = ±1, with the elements of the matrix T given by (see Eq. (5.113))
Ω
Tµν,ρσ (Ω, ; B
)=
3
= (uα )∗q (eµ (Ω))
∗ ∗ ∗
q (uβ )q (eν (Ω))q (uα )q (eρ (Ω ))q (uβ )q (eσ (Ω ))q
2
αβ qq q q
1
× (α, β = 0, ±1) . (5.119)
1 + i (α − β)H
This formula can be evaluated in any reference system. If we choose the reference
system (ur , us , u0 ) used in Sects. 3.2 and 5.3 we simply have
(uα )q = δαq , (uβ )q = δβq . (5.120)
Thus Eq. (5.119) can be written, with the help of Eqs. (5.115) and (5.117), in the
form
Eqq (µ, ν, Ω) )
E (σ, ρ, Ω
Ω
Tµν,ρσ (Ω, )= 3
; B q q
2 1 + i (q − q )H
qq
3 Dµq
1
(R) Dνq
1 ∗ ∗
(R) Dρq (R ) Dσq (R )
1 1
= , (5.121)
2 1 + i (q − q )H
qq
198 CHAPTER 5
where α is an arbitrary angle whose actual value is unimportant for the evalua-
tion of the quantity Tµν,ρσ and that can be set equal to zero. In the geometrical
configuration of Fig. 5.10, R and R are simply given by
As shown in Sect. 2.7, the dyadic product of two irreducible spherical tensors of
rank k and k can be used to construct an irreducible spherical tensor of rank K.
Therefore, recalling Eq. (5.111), one can obtain an irreducible spherical tensor by
taking appropriate linear combinations of the quantities Eqq (α, β, c ) defined in
Eq. (5.115). According to Eq. (2.79) such combinations are given by1
1 1 K
EQ
K
(α, β, c ) = (−1)1+q 3(2K + 1) Eqq (α, β, c ) , (5.124)
q −q −Q
qq
1 1 K ∗
EQ
K
(α, β, c ) = (−1)1+q
3(2K + 1) Dαq
1
(R) Dβq
1
(R) ,
q −q −Q
qq
which, using Eqs. (2.77), (2.23a), (2.73) and the fact that α and β can only take
the values ±1, can be written in the form
1 1 K
EQ
K
(α, β, c ) = 3(2K + 1) DQ
K
Q (R) (α, β = ±1) . (5.126)
α −β −Q
1
The linear combinations of Eq. (2.79) have been multiplied by the factor f (K) = − 3(2K+1) .
This slightly simplifies some expressions that will be deduced in the following.
MATTER-RADIATION INTERACTION (CLASSICAL) 199
Substitution of Eq. (5.125) into Eq. (5.121) leads to the expression of the scat-
tering phase matrix T in terms of irreducible spherical tensors
Ω
Tµν,ρσ (Ω, ; B
)=
1 )
E K (σ, ρ, Ω 1
= (−1)Q EQ
K
(µ, ν, Ω) −Q
2 1 + i QH
KQ
3 1 1 K 1 1 K
= (−1)Q (2K + 1)
2 µ −ν −Q σ −ρ −Q
KQ
1
× DQ
K
Q (R) DQ −Q (R )
K
, (5.127)
1 + i QH
I = I++ + I−− Q = − I+− + I−+
V = I++ − I−− U = − i I+− − I−+ ,
where we have shortened the notations writing I++ instead of I+1+1 and so on.
The inverse transformations are
1 1
I++ = I +V I+− = −Q + i U
2 2
1 1
I−− = I −V I−+ = −Q − i U .
2 2
If we take an appropriate representation for the Pauli spin matrices (which differs
from that of Eqs. (1.17)) we can write these transformations in a more compact
form. Defining
1 0 0 −1 0 −i −1 0
σ̂0 = , σ̂1 = , σ̂2 = , σ̂3 = (5.128)
0 1 −1 0 i 0 0 1
with
σ̂i σ̂j = δij σ̂0 + i ijk σ̂k (i, j, k = 1, 2, 3)
k
and
Tr (σ̂k σ̂l ) = 2 δkl (k, l = 0, 1, 2, 3) ,
200 CHAPTER 5
and labelling the rows and columns of the matrices in the order (−, +) (so that,for
instance, (σ̂i )+− = (σ̂i )21 ), we have1
1
3
Iαβ = S (σ̂ ) (α, β = ±1)
2 j=0 j j αβ
Sj = (σ̂j )αβ Iβα (j = 0, 1, 2, 3) . (5.129)
αβ
Since these transformations are identical to (1.40) provided the matrices σi are
exchanged with σ̂i , the scattering phase matrix for the Stokes parameters can be
easily obtained along the same lines of Sect. 5.7. The result is
Ω
Rij (Ω, )= 1
; B (σ̂ ) (σ̂ ) T Ω
(Ω, ; B
)
2 µνρσ=±1 i νµ j ρσ µν,ρσ
Tqq (i, Ω) )
T (j, Ω
Ω
Rij (Ω, ; B
)=3 q q
, (5.130)
1 + i (q − q )H
qq
as
where we have defined the tensor Tqq (i, Ω)
1
=
Tqq (i, Ω) .
(σ̂ ) E (β, α, Ω) (5.131)
2 i αβ qq
αβ=±1
From the tensor Tqq it is possible to construct – similarly to what done for
Eqq – an irreducible spherical tensor, T Q
K
, that will play an important role in the
continuation of this book
1 1 K
TQK =
(i, Ω) (−1)1+q 3(2K + 1)
Tqq (i, Ω)
q −q −Q
qq
1 1 K
= (−1)1+q 3(2K + 1)
q −q −Q
qq αβ=±1
1
× (σ̂ ) E (β, α, Ω)
2 i αβ qq
1
= .
(σ̂ ) E K (β, α, Ω) (5.132)
2 i αβ Q
αβ=±1
1
Equations (5.129) have the same form as Eqs. (1.40). Note, however, that they involve the
polarization tensor defined on the basis of the unit vectors (5.118), while the polarization tensor
in Eqs. (1.40) is defined on the basis of the reference direction unit vector and the associated unit
vector.
MATTER-RADIATION INTERACTION (CLASSICAL) 201
which can be substituted into Eq. (5.130) to obtain the expression of the phase
matrix R in terms of the irreducible spherical tensor T Q
K
1
Ω
Rij (Ω, ; B
)= (−1)Q T Q
K
(i, Ω) )
T K (j, Ω . (5.133)
−Q
1 + i QH
KQ
where
1
1 1 K
tK
P (i) = (σ̂i )αβ 3(2K + 1) . (5.135)
2 β −α −P
αβ=±1
From this equation, using some properties of the rotation matrices (Eqs. (2.71) and
∗
(2.73)) and noticing that tK K
P (i) = t−P (i), Eq. (5.133) can be rewritten in the form
1
Ω
Rij (Ω, ; B
)= tK K ∗
DPKQ (R) DQP
K −1
P (i) tP (j) (R ) , (5.136)
1 + i QH
KP P Q
which is our final equation expressing the phase matrix R in terms of rotation
matrices. In this equation we can isolate the contributions of the various K-values
(sometimes referred to as the multipole components of the phase matrix)
Ω
Rij (Ω, ; B
)= (K)
Rij (Ω, Ω ; B ) (5.137)
K
with
(K)
1
Rij (Ω, Ω ; B ) = (−1)Q T Q
K
(i, Ω) )
T K (j, Ω
−Q
1 + i QH
Q
1
∗ −1
= tK K
P (i) tP (j) DPKQ (R) DQP
K
(R ) . (5.138)
1 + i QH
PP Q
The tensors introduced in this section satisfy several properties, which are col-
lected in the following section together with a number of tables that are particularly
useful for applications. With the help of such tables we can easily get back expres-
sions (5.99) which give the scattering phase matrix in the reference system with
202 CHAPTER 5
the z-axis parallel to the magnetic field (see Fig. 5.10). In terms of multipole
components we get
⎛ ⎞
1 0 0 0
Ω
R(0) (Ω, )=⎜
; B ⎝
0 0 0 0⎟
⎠ ,
0 0 0 0
0 0 0 0
⎛ ⎞
0 0 0 0
0 0 0 0⎟
Ω
R(1) (Ω, ) = 3 µµ + 1 − µ2 1 − µ2 C ⎜
; B
1 ⎝0 0 0 0⎠ ,
2
0 0 0 1
⎛ ⎞
(1 − 3µ2 )(1 − 3µ2 ) (1 − 3µ2 )(3 − 3µ2 ) 0 0
⎜ (3 − 3µ2 )(1 − 3µ2 ) (3 − 3µ2 )(3 − 3µ2 ) 0 0⎟
Ω
R(2) (Ω, )= 1⎜
; B ⎜
⎟
⎟
8⎝ 0 0 0 0⎠
0 0 0 0
⎛ ⎞
µµ C1 µµ C1 −µ S1 0
⎜
0⎟
3 ⎜ µµ C µµ C1 −µ S1 ⎟
+ 1 − µ2 1 − µ2 ⎜ 1 ⎟
2 ⎝ µ S1 µ S1 C1 0⎠
0 0 0 0
⎛ 2 2 2 ⎞
(1 − µ )(1 − µ )C2 −(1 − µ )(1 + µ )C2 2(1 − µ )µ S2
2 2
0
⎜
3 ⎜−(1 + µ2 )(1 − µ2 )C2 (1 + µ2 )(1 + µ2 )C2 −2(1 + µ2 )µ S2 0⎟⎟
+ ⎜ ⎟ , (5.139)
8 ⎝ −2µ(1 − µ2 )S2 2µ(1 + µ2 )S2 4µµ C2 0⎠
0 0 0 0
where the notations are the same as in Eqs. (5.97). Adding the contributions of
the three matrices R(0) , R(1) , and R(2) , Eqs. (5.99) are easily obtained.
The various tensors defined in the former section are particularly suitable for the
description of polarization phenomena in spectral lines. As they will be heavily
used in the following of this book, their properties deserve a thorough discussion.
At the same time, we will take the opportunity of introducing some new tensors
that will also be useful later.
Starting with the reducible tensors, we have defined
= e (Ω)
Eqq (α, β, Ω) ∗
eβ (Ω) (q, q = 0, ±1; α, β = ±1) (5.140)
α q q
with
= √1 ∓ e (Ω)
e±1 (Ω) + i e (Ω)
, (5.141)
a b
2
MATTER-RADIATION INTERACTION (CLASSICAL) 203
= D1 (R) D1 (R)∗ ,
Eqq (α, β, Ω) (5.142)
αq βq
where
R ≡ (−γ, −θ, −χ)
∗ = E (β, α, Ω)
Eqq (α, β, Ω) . (5.145)
q q
as a function of χ, θ, and γ is
The explicit expression for the tensor Eqq (α, β, Ω)
given in Table 5.2.
defined by (see
The other reducible tensor that we have introduced is Tqq (i, Ω),
Eq. (5.131))
1 (q, q = 0, ±1)
=
Tqq (i, Ω)
(σ̂ ) E (β, α, Ω) (5.146)
2 i αβ qq (i = 0, . . . , 3) .
αβ=±1
TABLE 5.2
E−1 −1 (−, −, Ω)
= 1
4
(1 + cos θ)2
E0 0 (−, −, Ω)
= 1
2
sin2 θ
E1 1 (−, −, Ω)
= 1
4
(1 − cos θ)2
E−1 0 (−, −, Ω)
=− 1
√ sin θ (1 + cos θ) e−iχ
2 2
E−1 1 (−, −, Ω)
= 1
4
sin2 θ e−2iχ
E0 1 (−, −, Ω)
=− 1
√ sin θ (1 − cos θ) e−iχ
2 2
E−1 −1 (−, +, Ω)
= 1
4
e−2iγ sin2 θ
E0 0 (−, +, Ω)
= − 12 e−2iγ sin2 θ
E1 1 (−, +, Ω)
= 1
4
e−2iγ sin2 θ
E−1 0 (−, +, Ω)
= 1
√ e−2iγ sin θ (1 + cos θ) e−iχ
2 2
E−1 1 (−, +, Ω)
= 1
4
e−2iγ (1 + cos θ)2 e−2iχ
E0 1 (−, +, Ω)
=− 1
√ e−2iγ sin θ (1 + cos θ) e−iχ
2 2
E−1 −1 (+, −, Ω)
= 1
4
e2iγ sin2 θ
E0 0 (+, −, Ω)
= − 12 e2iγ sin2 θ
E1 1 (+, −, Ω)
= 1
4
e2iγ sin2 θ
E−1 0 (+, −, Ω)
=− 1
√ e2iγ sin θ (1 − cos θ) e−iχ
2 2
E−1 1 (+, −, Ω)
= 1
4
e2iγ (1 − cos θ)2 e−2iχ
E0 1 (+, −, Ω)
= 1
√ e2iγ sin θ (1 − cos θ) e−iχ
2 2
E−1 −1 (+, +, Ω)
= 1
4
(1 − cos θ)2
E0 0 (+, +, Ω)
= 1
2
sin2 θ
E1 1 (+, +, Ω)
= 1
4
(1 + cos θ)2
E−1 0 (+, +, Ω)
= 1
√ sin θ (1 − cos θ) e−iχ
2 2
E−1 1 (+, +, Ω)
= 1
4
sin2 θ e−2iχ
E0 1 (+, +, Ω)
= 1
√ sin θ (1 + cos θ) e−iχ
2 2
MATTER-RADIATION INTERACTION (CLASSICAL) 205
we get
∗ = T (i, Ω)
Tqq (i, Ω) . (5.148)
q q
Another property can be obtained from Eq. (5.144) using the trace properties of
the matrices σ̂i
1 1
=
Tqq (i, Ω) (σ̂i )αα = Tr (σ̂i ) = δi0 . (5.149)
q α=±1
2 2
The explicit expression for the tensor Tqq (i, Ω) is given in Table 5.3.
Using the tensors Eqq (α, β, Ω) and Tqq (i, Ω) we can construct a further reducible
which describes the polarization properties of the radiation field
tensor, Iqq (ν, Ω),
propagating along the direction Ω. If I (ν, Ω) is the polarization tensor (defined on
αβ
is the Stokes vector,
the basis of the unit vectors e±1 (Ω) of Eq. (5.141)) and Si (ν, Ω)
both relative to the frequency ν and the direction Ω, we define the tensor I (ν, Ω)
qq
according to either of the equivalent relations (see Eqs. (5.129) and (5.146))
=
Iqq (ν, Ω) I (ν, Ω)
Eqq (α, β, Ω)
βα
αβ=±1
3
= S (ν, Ω)
Tqq (i, Ω) . (5.150)
i
i=0
This tensor satisfies the conjugation property (easily deduced from Eq. (5.148))
∗ = I (ν, Ω)
Iqq (ν, Ω) , (5.151)
q q
These combinations are independent of the angle γ which defines the reference
direction for the Stokes parameters Q and U .
206 CHAPTER 5
TABLE 5.3
T−1 −1 (0, Ω)
= 1
4
(1 + cos2 θ)
T0 0 (0, Ω)
= 1
2
sin2 θ
T1 1 (0, Ω)
= 1
4
(1 + cos2 θ)
T−1 0 (0, Ω)
=− 1
√ sin θ cos θ e−iχ
2 2
T−1 1 (0, Ω)
= 1
4
sin2 θ e−2iχ
T0 1 (0, Ω)
= 1
√ sin θ cos θ e−iχ
2 2
T−1 −1 (1, Ω)
= − 1 cos 2γ sin2 θ
4
T0 0 (1, Ω)
= 1
2
cos 2γ sin2 θ
T1 1 (1, Ω)
= − 14 cos 2γ sin2 θ
T−1 0 (1, Ω)
=− 1
√ (cos 2γ cos θ − i sin 2γ) sin θ e−iχ
2 2
T−1 1 (1, Ω)
= − 4 cos 2γ (1 + cos2 θ) − 2i sin 2γ cos θ e−2iχ
1
T0 1 (1, Ω)
= 1
√ (cos 2γ cos θ − i sin 2γ) sin θ e−iχ
2 2
T−1 −1 (2, Ω)
= 1
4
sin 2γ sin2 θ
T0 0 (2, Ω)
= − 12 sin 2γ sin2 θ
T1 1 (2, Ω)
= 1
4
sin 2γ sin2 θ
T−1 0 (2, Ω)
= 1
√ (sin 2γ cos θ + i cos 2γ) sin θ e−iχ
2 2
T−1 1 (2, Ω)
= 1
4
sin 2γ (1 + cos2 θ) + 2i cos 2γ cos θ e−2iχ
T0 1 (2, Ω)
=− 1
√ (sin 2γ cos θ + i cos 2γ) sin θ e−iχ
2 2
T−1 −1 (3, Ω)
= − 1 cos θ
2
T0 0 (3, Ω)
=0
T1 1 (3, Ω)
= 1
2
cos θ
T−1 0 (3, Ω)
= 1
√ sin θ e−iχ
2 2
T−1 1 (3, Ω)
=0
T0 1 (3, Ω)
= 1
√ sin θ e−iχ
2 2
MATTER-RADIATION INTERACTION (CLASSICAL) 207
TABLE 5.4
I−1 −1 (ν, Ω)
= 1
4
− sin2 θ Q̃(ν, Ω)
(1 + cos2 θ) I(ν, Ω) − 2 cos θ V (ν, Ω)
I0 0 (ν, Ω)
= 1
2
+ Q̃(ν, Ω)
sin2 θ I(ν, Ω)
I1 1 (ν, Ω)
= 1
4
− sin2 θ Q̃(ν, Ω)
(1 + cos2 θ) I(ν, Ω) + 2 cos θ V (ν, Ω)
I−1 0 (ν, Ω)
= 1
√ sin θ − cos θ I(ν, Ω)
− cos θ Q̃(ν, Ω)
+ i Ũ (ν, Ω) e−iχ
+ V (ν, Ω)
2 2
I−1 1 (ν, Ω)
= 1 − (1 + cos2 θ) Q̃(ν, Ω)
sin2 θ I(ν, Ω) e−2iχ
+ 2i cos θ Ũ(ν, Ω)
4
I0 1 (ν, Ω)
= 1
√ sin θ cos θ I(ν, Ω) − i Ũ (ν, Ω)
+ cos θ Q̃(ν, Ω) e−iχ
+ V (ν, Ω)
2 2
= cos 2γ U (ν, Ω)
Ũ(ν, Ω) + sin 2γ Q(ν, Ω)
This tensor depends on the polarized radiation field propagating in all directions,
and will be called the (reducible) radiation field tensor . Its trace is given by (see
Eq. (5.152)) 5
dΩ
Jqq (ν) = = J(ν) ,
I(ν, Ω)
q
4π
which is the usual definition of the mean intensity of the radiation field over the
solid angle. From Eq. (5.151) it follows the conjugation property
All the tensors defined so far are reducible tensors. Under rotation of the coor-
dinate system the new components, Tqq , are obtained from the old ones, Tpp , by
where α, β, γ are the Euler angles of the rotation which brings the old reference
system into the new one.
208 CHAPTER 5
We can easily construct the associated irreducible tensors through the standard
procedure (see Eq. (5.124)). By so doing, the following irreducible tensors are
obtained
EQ
K
(α, β, Ω), TQ
K
(i, Ω), IQ
K
(ν, Ω), JQK
(ν)
corresponding to
Eqq (α, β, Ω),
Tqq (i, Ω),
Iqq (ν, Ω), Jqq (ν) ,
respectively. The relations between the irreducible tensors and the corresponding
reducible ones are repeated here for the sake of completeness
K 1+q
1 1 K
TQ = (−1) 3(2K + 1) Tqq , (5.155)
q −q −Q
qq
Obviously, the various irreducible tensors are connected with each other by the
same relations which connect the corresponding reducible tensors, namely (cf.
Eqs. (5.131), (5.150), and (5.153))
1
TQ K =
(i, Ω)
(σ̂ ) E K (β, α, Ω)
2 i αβ Q
αβ=±1
3
IQ
K
(ν, Ω) = EQ
K I (ν, Ω)
(α, β, Ω) = TQ
K S (ν, Ω)
(i, Ω)
βα i
αβ=±1 i=0
5 5
dΩ K
3
dΩ K .
K
JQ (ν) = I (ν, Ω) = T (i, Ω) Si (ν, Ω) (5.157)
4π Q 4π i=0 Q
Their conjugation properties can be deduced from the corresponding properties of
the reducible tensors, and are found to be
∗ = (−1)Q E K (β, α, Ω)
E K (α, β, Ω)
Q −Q
TQ
K ∗ = (−1)Q T K (i, Ω)
(i, Ω)
−Q
IQ
K ∗ = (−1)Q I K (ν, Ω)
(ν, Ω)
−Q
K
JQ (ν)∗ = (−1)Q J−Q
K
(ν) . (5.158)
We rewrite here, for the sake of clarity, the expressions of the tensors EQ
K
(α, β, Ω)
in terms of rotation matrices (see Eqs. (5.126), (5.134), and (5.135))
and T Q (i, Ω)
K
1 1 K
K
EQ (α, β, Ω) = 3(2K + 1) DQ K
Q (R) (α, β = ±1) ;
α −β −Q
TQ =
P (i) DP Q (R)
K
(i, Ω) tK K
(5.159)
P
MATTER-RADIATION INTERACTION (CLASSICAL) 209
Ω Ω
Ω
Ω Ω
Ω
Fig.5.15. Reference direction unit vector (ea ) and associated unit vector (eb ) relative to the
and −Ω
directions Ω is parallel to e (−Ω).
. e (Ω)
a a
with
1
1 1 K
tK
P (i) = (σ̂i )αβ 3(2K + 1) , (5.160)
2 β −α −P
αβ=±1
e (Ω),
where R is the rotation bringing the reference system (ea (Ω), Ω) into the
b
system (xyz).
One can easily deduce some remarkable properties of the coefficients tKP (i):
a) The only non-zero components of tKP (i) are those with P = 0, ±2, as the indices
α and β can only take the values ±1.
b) From Eq. (5.147) and from the properties of the 3-j symbols it follows the
conjugation relation
∗
tK K
P (i) = t−P (i) .
c) Defining the quantities
ξi = (1, 1, 1, −1)
τi = (1, 1, −1, 1)
ζi = (1, 1, −1, −1) = ξi τi (5.161)
with i = 0, . . . , 3, one gets
(σ̂i )−α−β = ξi (σ̂i )βα , (σ̂i )αβ = τi (σ̂i )βα , (σ̂i )−α−β = ζi (σ̂i )αβ ,
and hence
tK K K
P (i) = ξi (−1) tP (i)
tK K
P (i) = τi t−P (i)
tK K K
P (i) = ζi (−1) t−P (i) . (5.162)
These properties allow a simple derivation of the relation between the tensors
TQK and T K (i, −Ω),
(i, Ω) which will be needed in the following.
Q
Let us consider two opposite directions, Ω and −Ω,
and the corresponding ref-
erence direction unit vectors ea (Ω), ea (−Ω). We assume for simplicity that the
and e (−Ω)
directions ea (Ω) coincide (see Fig. 5.15). From the definition (5.159)
a
we have
210 CHAPTER 5
TABLE 5.5
0 δP,0 0 √1 δP,0
2
√
1 0 0 − 23 δP,−2 + δP,2
√
2 0 0 i 23 δP,−2 − δP,2
3
3 0 2
δP,0 0
=
TQ
K
(i, −Ω) P (i) DP Q (R ) ,
tK K
P Q
R0 ≡ (π, π, 0) ,
we obtain
= −iπP
TQ
K
(i, −Ω) tK
P (i) e (−1)K+P D−P
K
Q (R) .
P
It should be remarked that this relation is valid only when the directions ea (Ω)
coincide.
and ea (−Ω)
Let us turn to the evaluation of the irreducible tensor T Q K
(i, Ω). From Eqs.
(5.160), (5.128), and (2.26) we can easily obtain the values of the coefficients tKP (i),
which are collected in Table 5.5. The explicit expressions for T Q K can then
(i, Ω)
be derived from the definition (5.159) by direct evaluation of the rotation matrices
(Eq. (2.68) and Table 2.1). The results are contained in Table 5.6.
MATTER-RADIATION INTERACTION (CLASSICAL) 211
TABLE 5.6
Explicit expression for the tensor T Q The components with negative Q-value can be ob-
K (i, Ω).
tained from the relation T−Q ∗ . The angles χ, θ, γ are shown in Fig.5.14.
= (−1)Q T K (i, Ω)
K (i, Ω)
Q
Note that the rotation carrying the reference system (a, b,c ) into (xyz) is R ≡ (−γ, −θ, −χ).
T 00 (0, Ω)
=1
T 01 (0, Ω)
=0
T 11 (0, Ω)
=0
T 02 (0, Ω)
= 1
√ (3 cos2 θ − 1)
2 2
√
T 12 (0, Ω)
= − 23 sin θ cos θ eiχ
√
T 22 (0, Ω)
=
4
3
sin2 θ e2iχ
T 00 (1, Ω)
=0
T 01 (1, Ω)
=0
T 11 (1, Ω)
=0
T 02 (1, Ω)
=− √3
cos 2γ sin2 θ
2 2
√
T 12 (1, Ω)
= − 23 (cos 2γ cos θ + i sin 2γ) sin θ eiχ
√
T 22 (1, Ω)
= − 43 cos 2γ (1 + cos2 θ) + 2i sin 2γ cos θ e2iχ
T 00 (2, Ω)
=0
T 01 (2, Ω)
=0
T 11 (2, Ω)
=0
T 02 (2, Ω)
= 3
√ sin 2γ sin2 θ
2 2
√
T 12 (2, Ω)
=
2
3
(sin 2γ cos θ − i cos 2γ) sin θ eiχ
√
T 22 (2, Ω)
=
4
3
sin 2γ(1 + cos2 θ) − 2i cos 2γ cos θ e2iχ
T 00 (3, Ω)
=0
3
T 01 (3, Ω)
=
2
cos θ
√
T 11 (3, Ω)
= − 23 sin θ eiχ
T 02 (3, Ω)
=0
T 12 (3, Ω)
=0
T 22 (3, Ω)
=0
212 CHAPTER 5
TABLE 5.7
I00 (ν, Ω)
= I(ν, Ω)
3
I01 (ν, Ω)
=
2
cos θ V (ν, Ω)
√
I11 (ν, Ω)
= − 23 eiχ
sin θ V (ν, Ω)
I02 (ν, Ω)
= 1
√ (3 cos2 θ − 1) I(ν, Ω)
− 3 sin2 θ Q̃(ν, Ω)
2 2
√
I12 (ν, Ω)
= − 23 sin θ cos θ I(ν, Ω)
+ cos θ Q̃(ν, Ω)
+ i Ũ (ν, Ω)
eiχ
√
I22 (ν, Ω)
=
4
3 − (1 + cos2 θ) Q̃(ν, Ω)
sin2 θ I(ν, Ω) − 2i cos θ Ũ(ν, Ω)
e2iχ
= cos 2γ U (ν, Ω)
Ũ (ν, Ω) + sin 2γ Q(ν, Ω)
The derivation of the explicit expression for the tensor IQ K defined in Eq.
(ν, Ω)
(5.157) is straightforward (see Table 5.7). Note that the expressions in Table 5.7
– like those of Table 5.4 – contain only the linear combinations Q̃ and Ũ , which
are independent of the reference direction chosen to define the Stokes parameters
Q and U .
K
Finally, the irreducible tensor of the radiation field JQ (ν) is obtained by averag-
ing IQ (ν, Ω) over the solid angle. In the particular case of an unpolarized radiation
K
field having cylindrical symmetry about the z-axis only two components are not
zero, namely
5
dΩ
J00 (ν) = I(ν, θ)
4π
5
1 dΩ
J02 (ν) = √ (3 cos2 θ − 1) I(ν, θ) . (5.164)
2 2 4π
Obviously, the component J02 (ν) is also zero if the radiation field is isotropic.
The expression given in Sect. 5.10 for the scattering phase matrix and the various
relations proved in Sect. 5.11 allow a further extension of the remarkable properties
of this matrix.
Starting from Eq. (5.133) and using Eq. (5.163) one can directly prove the fol-
lowing relations (which are however valid only when the same reference directions
MATTER-RADIATION INTERACTION (CLASSICAL) 213
and −Ω,
are chosen for Ω and −Ω
and for Ω )
Ω
Rij (−Ω, ; B
) = ζ R (Ω, ; B
Ω )
i ij
Rij (Ω, ; B
−Ω ) = ζ R (Ω, ; B
Ω )
j ij
Rij (−Ω, ; B
−Ω ) = ζ ζ R (Ω, ; B
Ω ).
i j ij
The first relation means that if (I, Q, U, V ) is the Stokes vector scattered along the
then, irrespective of the polarization of the incident beam and of the
direction Ω,
intensity and direction of the magnetic field, the Stokes vector scattered along the
direction −Ω is (I, Q, −U, −V ). The two other relations have analogous meanings.
Considering next Eq. (5.136), we can write the expression of the scattering phase
matrix for the most general geometry of Fig. 5.9. The two relevant rotations are
(see Eq. (5.122))
KP P QQ
which shows explicitly that the magnetic kernel is independent of the angle γB ,
that can be set equal to zero.
The magnetic kernel satisfies several important properties which can be easily
derived from Eqs. (5.166) and (5.167) using some properties of the rotation matri-
ces:
= DQQ
K
(0 0 0) = δQQ ,
214 CHAPTER 5
and hence
Ω ; 0) = ∗
P (i) tP (j) DP P (R) ,
tK K K
Rij (Ω, (5.168)
KP P
e (Ω),
where R is the rotation bringing the reference system (ea (Ω), Ω) into the
b
system (ea (Ω ), eb (Ω ), Ω ).
lim MK
QQ (B ) = DQ0 (χB θB 0) D0Q (0 −θB −χB ) .
K K
B→∞
∗ Q−Q
MK
QQ (B ) = (−1) MK
−Q−Q (B ) .
QQ (B, π − θB , π − χB ) = (−1)
MK Q−Q
MK
−Q−Q (B, θB , χB )
MK
QQ (B, θB , π + χB ) = (−1)
Q−Q
MK
QQ (B, θB , χB ) ,
a set of formulae giving the transformation of the magnetic kernel under a 180◦
rotation of the magnetic field vector about the axes x, y, and z of Fig. 5.9, respec-
tively.
MK
QQ (−B ) =
1
= DQQ
K
(π+χB π−θB 0) DQ Q (0 −π+θB −π−χB )
K
1 + i Q H
Q
1
= DQQ
K
(χB θB 0) DQ Q (0 −θB −χB )
K
, (5.169)
1 − i Q H
Q
which shows that the reversal of the magnetic field can also be obtained by the
formal substitution H → −H.
2π π
K
) = 1
MQQ (B dχB sin θB MK
QQ (B ) dθB (5.170)
4π
0 0
MATTER-RADIATION INTERACTION (CLASSICAL) 215
1 1
µK = (5.172)
2K + 1 1 + i Q H
Q
or, explicitly
µ0 = 1
1 1 2
µ1 = 1 + 2 cos2 α1 = 1+
3 3 1 + H2
1 1 2 2
µ2 = 1 + 2 cos2 α1 + 2 cos2 α2 = 1+ 2
+ , (5.173)
5 5 1+H 1 + 4H 2
In particular, in the limiting case of strong magnetic fields the matrix R(1) is
reduced by a factor 3, while the matrix R(2) is reduced by a factor 5.
In the previous sections we have derived the expression for the scattering phase
matrix in the presence of a magnetic field both in terms of direction cosines and
in terms of rotation matrices. However, these derivations are rather involved and
there is a danger that the physical meaning of the various results may be hidden
by the mathematical formalism. Thus we think it is worthwhile to present some
qualitative arguments – based on the atomic oscillator model – aimed at clarifying
the underlying physics. At the same time we will draw some interesting analogies
between the classical and quantum-mechanical descriptions of scattering phenom-
ena.
Let us consider a simple scattering event like that illustrated in Fig. 5.16a, left.
The incident radiation beam is unpolarized (conventionally, this is represented by
216 CHAPTER 5
unpolarized linear
incident beam oscillators
scattered beam
circular
oscillators
Fig.5.16. Qualitative description of a 90 ◦ scattering event in the absence of magnetic fields and
in the presence of a magnetic field parallel to the line of sight. See text for explanation.
MATTER-RADIATION INTERACTION (CLASSICAL) 217
into a different curve (that we will rather call a ‘daisy’, see Fig. 5.16d right) and
the scattered radiation is totally unpolarized.
This simple example shows that the effect of the magnetic field is to cause a
relaxation of the phase relations (or coherences) between the different oscillators:
the coherence between the σ+ and σ− oscillators is maximum for zero magnetic field
(Fig. 5.16b) and gradually decreases with increasing field strength (Fig. 5.16c,d).
The Hanle effect is just the consequence of this relaxation process in resonance
scattering.
Different geometrical configurations can be envisaged to get a deeper insight into
the phenomena of resonance polarization and the Hanle effect. Figure 5.17 presents
an example of forward scattering; here the scattered beam is unpolarized for zero
magnetic field and partially linearly polarized along the field’s direction when the
magnetic field is present. The discussion of this case, that can be carried out along
the same lines as before, is left to the reader as an exercise (cf. Eqs. (5.106)).
The classical model that represents the atom as a collection of one linear os-
cillator and two circular oscillators of opposite directions has a simple quantum
analogue. This is the two-level atom having a lower level of angular momentum
J = 0 and an upper level of angular momentum J = 1. The excitation of one of
the classical oscillators induced by the electric field of the incident beam has its
quantum equivalent in the excitation of one of the Zeeman sublevels of the upper
level. In this analogy, the linear oscillator corresponds to the sublevel M = 0, while
the circular oscillators of frequency (ν0 ± νL ) correspond to the sublevels M = ±1,
respectively.
In scattering experiments, apart from special geometrical configurations, the var-
ious oscillators are excited unevenly by the electric field of the incident beam, and
well-defined phase relations arise between different oscillators. This means – in the
language of Quantum Mechanics – that the upper level of the atom is polarized
(see Sect. 3.6); in other words, the diagonal density-matrix elements ρJ (M , M )
are different from each other and the off-diagonal elements ρJ (M , M ) are non-
zero. In particular, the amount of excitation present in a given oscillator has its
quantum analogue in the corresponding diagonal element of the density matrix,
while the combined excitation of two different oscillators with a well-defined phase
relation has its analogue in the off-diagonal element of the density matrix between
the two corresponding sublevels. For example, in the case of panel (b) of Fig. 5.16,
the non-zero density-matrix elements are ρ (0, 0), ρ (1, 1), ρ (−1, −1), ρ (1, −1), and
ρ (−1, 1), the last two elements being related to the coherence between the circular
oscillators. Passing to panels (c) and (d) of the same Figure, the diagonal ele-
ments remain unchanged, while the off-diagonal ones are reduced by the presence
of the magnetic field. In the following of this book we will see that these qualita-
tive concepts are fully confirmed by a more rigorous treatment based on Quantum
Mechanics.
MATTER-RADIATION INTERACTION (CLASSICAL) 219
circular
oscillators
Fig.5.17. Same as Fig.5.16 for a forward scattering event. The trajectories of the oscillating
electron in the y-z plane are the same as in Fig.5.16 right.
220 CHAPTER 5
where the symbol . . . means a statistical average over all the possible collisions.
This is just the physical situation described in Sect. 3.2. We remind the reader that
exciting collisions, as defined here, have their quantum counterpart in collisions able
to induce transitions between atomic levels.
In different cases, like for instance when the atom is bombarded by a collimated
and/or oriented beam of particles, Eq. (5.175) does not hold and, as a result, the
atom emits polarized radiation in the decay process. This phenomenon is known
under the name of impact polarization, a subject that will not be deepened here.
b) Perturbing collisions, on the contrary, are elastic collisions which are not able to
interrupt the oscillatory motion of the atom and to make it restart from scratch,
MATTER-RADIATION INTERACTION (CLASSICAL) 221
but are effective in shifting the phase of the oscillation during a time interval tc
that is called the collision time.
Following a model that was proposed by Lorentz (1915) and refined by Weisskopf
(1932) and Lenz (1933), we suppose that the collision time tc is much smaller than
the other typical times involved in the physical process, namely the decay time
of the oscillator 1/γ and the mean time between collisions 1/f , where f is the
frequency of perturbing collisions1
1 1
tc , tc . (5.176)
γ f
f = Np v̄r σ ,
where Np is the number density of perturbers, v̄r is the average velocity of the perturbers relative
to atoms, and σ is the cross-section. Expressing Np in cm−3 , v̄r in km s−1 , and σ in units of a20
(a0 being the Bohr radius), we have
f = 2.80 × 10−12 Np v̄r σ s−1 .
222 CHAPTER 5
Let t = 0 indicate the time at which the first collision takes place and suppose
the previous collision was sufficiently far in time for its transient to be already
damped (statistically, there will always be such a collision and nothing prevents us
from calling it the ‘first’ collision). Employing the same notations as in Sect. 5.3,
and making the same approximation (ν02 − ν 2 ) ≈ 2ν (ν0 − ν), we can write
e0 1
x (t = 0− ) = − E u . (5.177)
8π 2 mν α (ν0 − ανL − ν) − iΓ α α
According to the above assumptions, the effect of the collision is such that
e0 1 iφ
x (t = 0+ ) = − E D e i ti ,
8π mν αi (ν0 − ανL − ν) − iΓ α αi
2
To obtain the motion of the electron after the collision we must solve Eq. (5.28).
Its general solution is the following
e0 1 −2πiνt
x (t) = − Eβ e uβ
2
8π mν (ν0 − βνL − ν) − iΓ
β
γ
−2πi (ν0 −βνL ) t − 2 t
+ Cβ e e uβ ,
β
where the constants Cβ must be determined in such a way as to satisfy the boundary
condition (5.179). Hence we obtain
e Eα iφ ∗
Cβ = − 20 Dαi e i Dβi
8π mν α
(ν0 − ανL − ν) − iΓ i
!
Eβ
− .
(ν0 − βνL − ν) − iΓ
1
uα · ti as we have supposed the unit vectors ti to
The quantity Dαi could also be written as
be real. However, we adopt the definition (5.178) to make easier a possible generalization of our
hypotheses on the effects of collisions.
MATTER-RADIATION INTERACTION (CLASSICAL) 223
Now we perform a statistical average over collisions. Since the phases φi are arbi-
trary large numbers (strong collision hypothesis), we have
iφi
e =0 (i = 1, 2, 3) .
or, in other words, an electric susceptibility variable with time. The average value of
the electric susceptibility can then be obtained by multiplying its value at time t by
the probability that no collision occurred in the interval (0, t). If f is the frequency
of collisions, we have1
∞
−f t e0 1
χβ (t) = f e χβ (t) dt = − ,
8π mν (ν0 − βνL − ν) − iΓ
2
0
where
γ f
Γ = Γ + 2Γc = + , (5.180)
4π 2π
with2
f
Γc = . (5.181)
4π
Comparison with Eq. (5.30) shows that the effect of perturbing collisions consists
in a broadening of both the absorption profile and the associated dispersion profile.
The same result could be formally obtained by the addition of a supplementary
friction force in the equation of motion of the oscillator.
1
The probability that a collision occurs in the interval (t, t + dt) is given by
dp = f dt .
Dividing the interval (0, t) into a large number N of equal parts, the probability that the first
collision occurs in the interval (t, t + dt) is given by (see Eq. (2.62) for a similar derivation)
t N
dP = lim 1 − f f dt = f e−f t dt .
N→∞ N
2
The definition Γc = f /4π (instead of f /2π) leads to simpler forms for the expressions that
will be encountered in the following.
224 CHAPTER 5
where Aα are the initial amplitudes set up by the exciting mechanism. At time t1
the oscillator undergoes the first collision, which is characterized by its particular
geometry (specified by the three unit vectors ti(1) and by the corresponding phase
shifts φ(1)
i ). Along the same lines leading from Eq. (5.177) to Eq. (5.179) one can
write for the time interval (t1 , t2 )
(1) −2πi (ν0 −ανL ) t
γ
−2t
xα (t) = Aδ Kδα e e (t1 < t < t2 ) ,
δ
(1)
where Kδα , the collisional kernel due to the first collision, is given by
(1)
with the direction cosines Dδi defined as in Eq. (5.178). Similarly, for the time
interval (tn , tn+1 ) one has
To shorten notations we define the cumulative collisional kernel due to the first
n collisions as (0)
Kδα
(n)
= Kδ K (1) (n)
ϕ . . . Kρα (5.184)
ϕ...ρ
with
Kαβ
(0) (0)
= Kαβ = δαβ , (5.185)
and we can write
−2πi (ν0 −ανL ) t
γ
−2t
xα (t) = Aδ Kδα
(n)
e e (tn < t < tn+1 ) ,
δ
where1
∞ ∞
tn+1
γ
2πiνt −[2πi (ν0 −ανL −ν) + 2 ]t
Fα (ν) = xα (t) e dt = Aδ Kδα
(n)
e dt
0 n=0 δ tn
∞
−Nα tn −Nα tn+1
e −e
= Aδ Kδα
(n)
, (5.186)
n=0
Nα
δ
with
γ
Nα = 2πi (ν0 − ανL − ν) + = 2πi (ν0 − ανL − ν) − iΓ . (5.187)
2
To investigate how the Zeeman effect and resonance polarization are affected
by perturbing collisions we must start from Eqs. (3.34) and (5.75), respectively.
Owing to the different definition of the Fourier transforms Fα (ν) and Fα (ν), these
two equations must be rewritten in the form
∞
16π 4 e20 ∗ 1
Jij (r) = 2 4 Cαi Cβj ν 4 Fα (ν)∗ Fβ (ν) dν (5.188)
r c τ
αβ −∞
and
∞
2π 3 e20 ∗ 1
˜
dIij (Ω) = dΩ Cαi Cβj ν 4 Fα (ν)∗ Fβ (ν) dν , (5.189)
c3 τ
αβ −∞
respectively.
From Eq. (5.186) we have
Fα (ν)∗ Fβ (ν) =
∞
∞
= A∗δ A Kδα
(n) ∗
K(nβ )
n=0 n =0 δ
∗ ∗
−Nα tn −Nα tn+1 −Nβ tn −Nβ tn +1
e −e e −e
× . (5.190)
Nα∗ Nβ
1
The Fourier transform Fα (ν) differs from Fα (ν) defined in Eq. (3.32) for the presence of the
amplitudes Aδ . In the special case of no collisions (φ(n)
i = 0, K(n)
αβ
= δαβ ) we have
Fα (ν) = Aα Fα (ν) .
226 CHAPTER 5
f (Nα∗ + Nβ + 2f )
n
× .
(Nα∗ + Nβ + f )n+1 (Nα∗ + f ) (Nβ + f )
Performing the sum over n, and using the expressions of Nα∗ and Nβ given in
Eq. (5.187), one finally obtains
Fα (ν)∗ Fβ (ν) =
Γ
1 ∗ δ−α Γ + i (α − β)H
= A A (−1)
4π 2
δ
δ
(ν0 − ανL − ν) + iΓ (ν0 − βνL − ν) − iΓ
1 1 K 1 1 K
× (2K + 1)
−δ Q β −α Q
KQ
−1
Γ + Γc (1 − rK )
× + i (α − β)H , (5.191)
Γ
MATTER-RADIATION INTERACTION (CLASSICAL) 227
which is the general expression for the scattering phase matrix of the polarization
tensor in the presence of perturbing collisions. It can be easily seen that properties
(5.88) and (5.89) are still satisfied.
The scattering phase matrix for the Stokes parameters in the presence of collisions
is given by the analogue of Eq. (5.92), namely
Ω ; B,
f) = 1 Ω ; B,
f) .
Rij (Ω, (σi )lk (σj )mn Tkl,mn (Ω,
2
klmn
Ω ; B,
f) = 3 )
E (σ, ρ, Ω
Tµν,ρσ (Ω, (−1)p−q Eqq (µ, ν, Ω) pp
2
qq pp
1 1 K 1 1 K
× (2K + 1)
p −p Q q −q Q
KQ
−1
Γ + Γc (1 − rK )
× + i (q − q )H .
Γ
Ω
Tµν,ρσ (Ω, ; B,
f) =
1
−1
= (−1)Q EQ
K
(µ, ν, Ω) ) Γ + Γc (1 − rK ) + i QH
E K (σ, ρ, Ω ,
−Q
2 Γ
KQ
Similarly, one finds that Eq. (5.133), which gives the scattering phase matrix for
the Stokes parameters, must be replaced by
−1
Ω
Rij (Ω, ; B,
f) = (−1)Q T Q
K
(i, Ω) ) Γ + Γc (1 − rK ) + i QH
T K (j, Ω .
−Q
Γ
KQ
These expressions show the substantial difference between magnetic field and
collisions as depolarizing agents in resonance scattering. Collisions affect all the
K-pole components (except that with K = 0, as it will be seen shortly), while the
magnetic field does not affect the Q = 0 components. It has been mentioned in
MATTER-RADIATION INTERACTION (CLASSICAL) 229
Sect. 5.13 – and will be shown in detail in Chap. 10 – that this feature is related to
the fact that the magnetic field can alter the coherences but not the populations
of the atomic levels, whereas collisions can alter both. In terms of classical dipoles
this means that collisions affect both phase correlations and amplitude differences
between oscillations, while the magnetic field affects only the former.
To discuss the results just obtained, we can write the depolarization factor in the
form
−1
Γ + Γc (1 − rK ) 1 1 1
+ i QH = = , (5.193)
Γ 1 + δK + i QH 1 + δK 1 + i QHK
where
Γc f
δK = (1 − rK ) = (1 − rK )
Γ γ
H 2πνL
HK = = . (5.194)
1 + δK γ + f (1 − rK )
This expression shows that the effect of collisions is twofold. First, each K-pole
contribution to the scattering phase matrix is reduced by the factor (1 + δK ), where
δK is a parameter proportional to the frequency of perturbing collisions or, in other
words, to the density of perturbers. It will be shown in the next section that the
values of δK depend on the specific assumptions which are made about the phase
shifts induced by perturbing collisions. The model considered here leads to the
values (see Eq. (5.199) of next section)
f 3 f
δ0 = 0 , δ1 = , δ2 = .
γ 5 γ
Thus
δ1 5
= , (5.195)
δ2 3
a result deduced through a quantum-mechanical calculation by Omont (1965) for
the case of Van der Waals, dipole-dipole interactions.
Moreover, perturbing collisions reduce the efficiency of the magnetic field in de-
polarizing (or in polarizing) the scattered radiation, the reduction being larger for
K = 1 than for K = 2.
As a specific example, let us consider the scattering geometry illustrated in
Fig. 5.10. In the absence of collisions the multipole components of the scatter-
ing phase matrix are given by Eqs. (5.139). In the presence of collisions, one finds
that the matrix R(1) must be divided by the factor (1 + δ1 ), and the quantity C1
appearing in its expression must be replaced by
C1 = cos α1 cos α1 + χ − χ ,
Similarly, the matrix R(2) must be divided by the factor (1 + δ2 ), and the quantities
C1 , S1 , C2 , S2 must be replaced by
C1 = cos α1 cos α1 + χ − χ S1 = cos α1 sin α1 + χ − χ
C2 = cos α2 cos α2 + 2(χ − χ) S2 = cos α2 sin α2 + 2(χ − χ) ,
with
tan α1 = H2 , tan α2 = 2H2 .
Finally, we want to recall that the theory presented in this section is based on
rather restrictive assumptions, and that it could in principle be generalized by
considering the effects of weak collisions, the existence of correlations between
different phase shifts, and so on.
Kδα
(n) ∗
K(n)
β ,
Kδα
(1) ∗
K(1) (1) ∗
β = Kδα K β
(1)
2πi (α−δ−β+ ) νL t1
3
(1) ∗
(1)
−iφi (1)
3 (1)
iφj
= e Dδi e Dαi D(1)
j e
(1) ∗
Dβj . (5.196)
i=1 j=1
3
∗ ∗
S= D i Dδi Dαi Dβi ,
i=1
where
Dαi = uα · ti∗ = uα · ti .
Since we must average over all the possible orientations of the unit vectors ti , and
since, on the other hand, these three unit vectors are equivalent, we have
∗ ∗
S = 3 D i Dδi Dαi Dβi , (5.197)
MATTER-RADIATION INTERACTION (CLASSICAL) 231
where i = 1, or 2, or 3.
The average is easily evaluated using spherical tensors. In the reference system
of the magnetic field (ur , us , u0 ) we have, with the help of Eqs. (5.111), (5.112),
and (5.120)
Di= (u )q (ti )∗q = δ q (ti )∗q = (ti )∗ .
q q
From Eq. (5.116), identifying the unit vectors t1 , t2 , t3 with a, b, c, respectively,
we have
(tα )q = Dαq
1
(R) ,
where R is the rotation bringing (t1 , t2 , t3 ) into (ur , us , u0 ). Since in Eq. (5.197)
the index i can be chosen arbitrarily, taking i = 3 one has
∗ ∗
S = 3 D 3 Dδ3 Dα3 Dβ3 = 3 (t0 )∗ (t0 )δ (t0 )∗α (t0 )β
= 3 D01 (R)∗ D0δ
1
(R) D0α 1
(R)∗ D0β1
(R) .
To evaluate this expression we use twice Eq. (2.77) and then Eq. (2.73) and the
Weyl theorem (2.76) to get
δ−α 1 1 K 1 1 K
S = (−1) rK (2K + 1) , (5.198)
−δ Q β −α Q
KQ
where ⎧
2 ⎨1 for K = 0
1 1 K
rK = 3 = 0 for K = 1 (5.199)
0 0 0 ⎩
2/5 for K = 2 .
Substitution into Eq. (5.196) gives
(1) ∗ (1) 1 1 K 1 1 K
Kδα K β = (−1)δ−α rK (2K + 1) .
−δ Q β −α Q
KQ
with (t1 , t2 , t3 ) the usual triplet of real, orthogonal unit vectors. One finds the
same results as before, except that Eq. (5.197) must be replaced by the following
∗ ∗
S = D µ Dδµ Dαµ Dβµ
µ
(obviously the three unit vectors of Eq. (5.201) cannot be considered equivalent).
In terms of rotation matrices we have
1
S = Dµ (R)∗ Dµδ1
(R) Dµα
1
(R)∗ Dµβ
1
(R) ,
µ
We have seen in this chapter how it is possible to describe in classical terms some
of the different phenomena which are able to induce polarization in spectral lines
or to affect its characteristics. In particular, we have seen that polarization can be
originated either by the presence of a magnetic field – which induces a frequency
splitting between the different classical dipoles – or by the presence of an intrinsic
MATTER-RADIATION INTERACTION (CLASSICAL) 233
ν ν
I II III
V Intense field regime
Collision-dominated regime
Collision-free regime
III Strong field regime
anisotropy in the radiation field which is illuminating the atom. The anisotropy of
the radiation field, that is in general related both to its angular distribution and
to its polarization state, can induce amplitude differences between the oscillations
of the dipoles and well-defined phase correlations (or coherences) between such
oscillations. We have also seen that a magnetic field has the further effect of
reducing such phase correlations, and that collisions with an isotropic distribution
of perturbers produce a relaxation of the amplitude differences and of the phase
correlations.
All these phenomena, that will in general act simultaneously in a magnetized
plasma, can be characterized by four different parameters all having the dimensions
of frequency. These parameters are the following:
a) νL , the Larmor frequency, proportional to the magnetic field magnitude, which
is defined in Eq. (3.10);
b) ∆νD , the Doppler width, which depends on the temperature of the plasma (and
on its microturbulent velocity), defined in Eqs. (5.43) and (5.48);
c) γ, the oscillator damping constant, which has its quantum analogue in the
Einstein A coefficient for spontaneous emission from the upper level;
d) f , the frequency of depolarizing collisions, which is proportional to the number
density of perturbers.
According to the relative values of these four parameters we have different phys-
ical regimes that can be suitably classified in a two-dimensional diagram (Landi
Degl’Innocenti, 1983). Such diagram, shown in Fig. 5.18, bears a magnetic field
indicator (the Larmor frequency) on the vertical axis and a density indicator (the
frequency of collisions) on the horizontal axis.
In drawing the diagram we have assumed ∆νD γ. In terms of the thermal
velocity wT we have indeed
∆νD w
= T ,
γ λγ
234 CHAPTER 5
where λ is the line wavelength. Substituting typical values relative to a spectral line
of optical wavelength formed in a stellar atmosphere (wT = 2 km s−1 , λ = 5000 Å,
γ = 107 s−1 ) we get
∆νD
= 4.0 × 102 .
γ
The various physical regimes are sketched in Fig. 5.18; each of them can be
assigned a specific denomination.1 As far as the magnetic field is concerned, we
have the following five regimes:
Ia) Zero field regime (νL γ): the magnetic field is so weak as to induce a
negligible Zeeman splitting and a negligible relaxation effect on phase correlations.
In this regime the magnetic field can just be ignored.
IIa) Hanle effect regime (γ ≈ 2πνL ∆νD ): the magnetic field is strong enough
to reduce the phase correlations between different oscillators. By contrast, the
Zeeman splitting is still very small and can be neglected.
IIIa) Strong field regime (γ νL < ∆νD ): the relaxation of coherences is now
complete, so that the different oscillators can be treated as independent. The
Zeeman splitting is a small fraction of ∆νD and is able to produce an observable
(although weak) polarization signal.
IVa) Magnetograph (or intermediate) regime (γ νL ≈ ∆νD ): the Zeeman split-
ting is comparable with the Doppler width, which results in a strong polarization
signal; the different Zeeman components are, however, still unresolved. The obser-
vations performed by solar or stellar magnetographs usually fall in this regime.
Va) Intense field regime (νL > ∆νD ): the Zeeman splitting is now so large that the
different components are well-separated in the spectrum.
To sum up, we have that the Zeeman splitting can be neglected in regimes Ia and
IIa, while in regimes IIIa, IVa, and Va coherences are completely relaxed and can
be ignored. As far as the amplitude differences between oscillators are concerned,
the magnetic field has no effect on them.
Turning now to the influence of collisions, we have three different regimes:
Ib) Collision-free regime (f γ): depolarizing collisions have a negligible influence
on atomic polarization and can be disregarded.
IIb) Collisional depolarization regime (f ≈ γ): depolarizing collisions are important
in reducing atomic polarization (both coherences and amplitude differences between
oscillators) but not so strong as to produce a complete relaxation. In this regime
a detailed knowledge of the depolarizing rates is essential for a correct description
of the physical situation.
IIIb) Collision-dominated regime (f γ): the effect of depolarizing collisions is
so strong that atomic polarization is totally destroyed. In this regime there are no
coherences between different dipoles and the oscillation amplitudes are equal; in
other words, the classical dipoles are thermalized.
1
The denominations used here are slightly different from those in Landi Degl’Innocenti
(1983).
MATTER-RADIATION INTERACTION (CLASSICAL) 235
Zeeman
effect
∆ν
γ Hanle
effect Thermal
Resonance radiation
polarization
γ
Fig.5.19. The (νL - f ) diagram is divided into fifteen different regions resulting from the intersec-
tion of the physical regimes sketched in Fig.5.18.
By combining the five magnetic regimes and the three collisional regimes, one
finds out that there are 15 characteristic regimes for polarization in spectral lines,1
each occupying a well-defined region in the (νL - f ) diagram. Figure 5.19 points out
the regions of the diagram where conventional laboratory experiments are usually
performed.
It is important to remark that the classical description given above can be trans-
posed into quantum-mechanical terms. As outlined at the end of Sect. 5.13, the
phase correlations between different dipoles have their quantum analogue in the
off-diagonal density-matrix elements, while the oscillation amplitudes of the single
dipoles have their analogue in the diagonal matrix elements. Finally, the oscillator
damping constant γ has its quantum equivalent in the Einstein A coefficient for
spontaneous emission.
1
Indeed, regime (Ia-IIIb) is equivalent to (IIa-IIIb).
CHAPTER 6
We derive in this chapter the basic equations describing the interaction of a material
system with a polarized radiation field. These equations consist essentially of two
distinct sets, one for the material system (statistical equilibrium equations) and the
other for the radiation field (radiative transfer equations). But the two sets are
coupled, as the properties of the material system turn out to be affected by the
radiation field, and vice versa. From this point of view, the theory presented in
the following can be regarded as the basis for a generalized theory of Non Local
Thermodynamic Equilibrium which takes the polarization characteristics of the
material system and of radiation into account.
The two sets of equations are deduced by strictly similar procedures, which start
from the very principles of Quantum Mechanics and are based on the use of the
density operator. The radiation field is described with the formalism of (non-
relativistic) second quantization, while the material system is characterized by its
own Hamiltonian which does not need to be specified in detail. The interaction
is also treated under the non-relativistic approximation, and explicit formulae for
electric-dipole and magnetic-dipole transitions are derived.
The formalism used in this chapter is rather complicated, but the preceding
outlines of density operator and second quantization theories (Chaps. 3 and 4,
respectively) should enable the reader to follow all the developments. We also tried
to point out, whenever possible, the physical meaning of the various equations and
the analogies with the classical treatment presented in Chap. 5.
The results that will be obtained in this chapter suffer from some basic limita-
tions that arise both from the formalism itself and from the different approxima-
tions introduced in the derivation. Indeed, such results cannot be applied when
coherences between non-degenerate levels are present, unless the spectrum of the
radiation incident on the material system is flat across a range wider than the
frequency separation of those levels. Similarly, if coherences between degenerate
levels are present, the results provide an exact description of scattering phenomena
only if the spectrum of the incident radiation is flat across a frequency range wider
than the inverse lifetime of the levels.
These limitations are analogous – in the polarized case – to those contained in
the ‘complete frequency redistribution’ approach of the standard non-LTE theory
(see e.g. Mihalas, 1978). They are ultimately related to the fact that two consec-
utive interactions of the material system with the radiation field are considered as
independent: this means, for instance, that when an emission event takes place, no
memory is kept of the frequency of the photon that induced the previous absorption
event.
238 CHAPTER 6
Notwithstanding its basic limitations, the theory has a wide range of physical and
astrophysical applications because, in many cases, the spectrum of the incident
radiation is in fact flat across the relevant frequency range. Usually coherences
are important as long as the separation between energy levels is less than about
108 s−1 (a typical value of the spontaneous de-excitation Einstein coefficient), and
such frequency interval – which corresponds to 1 mÅ in the visible – is much smaller
than the typical Doppler widths of spectral lines.
However, there are some phenomena that cannot be fully described by the formal-
ism presented in this chapter, like the quantum-interference polarization observed
in the CaII H and K lines at the solar limb (Stenflo, 1980). In that case, coherences
between energy levels separated by a very large interval (about 50 Å) are found
to play a major role, and the incident radiation field is definitely not flat across
such interval. In these cases a more general formalism, able to encompass fre-
quency redistribution effects, is required. Although some results have indeed been
obtained on this subject (Stenflo, 1994; Bommier, 1997a,b; Landi Degl’Innocenti et
al., 1997), the underlying theory is still in a preliminary phase and remains outside
the scope of this book.
This chapter is based on previous works by the authors and collaborators (Landi
Degl’Innocenti and Landi Degl’Innocenti, 1972, 1975; Landi Degl’Innocenti et al.,
1976; Landi Degl’Innocenti, 1983). The same method has been reconsidered and
illustrated in some detail by Cannon (1985), Trujillo Bueno (1990), and Stenflo
(1994).
According to the principles of Quantum Mechanics (see e.g. Messiah, 1961) the
state of a physical system at time t is described by a vector in the Hilbert space,
| ψ(t) , which satisfies the normalization condition
ψ(t) | ψ(t) = 1 .
The time evolution of the system is governed by the well-known Schrödinger equa-
tion
h d
i | ψ(t) = H | ψ(t) ,
2π dt
where i is the imaginary unit, h is the Planck constant, and H is the Hamiltonian
operator that we suppose here independent of time.
On the other hand, any observable of the system is described by a linear, Her-
mitian operator acting on the same Hilbert space. The expectation value of the
observable (i.e. the average value of the measurement of the observable ideally
performed on an infinite number of ‘copies’ of the same physical system) is given
by
O(t) = ψ(t) | Ô | ψ(t) ,
where Ô is the quantum operator corresponding to the observable O.
MATTER-RADIATION INTERACTION (QUANTUM) 239
However, there are many physical situations where the state of the system cannot
be described by a pure state | ψ(t) (nor by a superposition of pure states), but a
detailed quantum-statistical description is necessary. This can be achieved in a very
natural way using the concept of density (or statistical) operator (see Sect. 3.6).
We recall that the expectation value of the observable O is related to the density
operator ρ by the expression (see Eq. (3.86))
6 7
O(t) = Tr Ôρ . (6.1)
The time evolution of O(t) can be easily obtained from Eq. (6.1). We have
! !
d dÔ dρ
O(t) = Tr ρ + Tr Ô , (6.2)
dt dt dt
where the first term is non-zero only when the operator Ô depends explicitly on
time. This equation can be rewritten, with the help of Eq. (3.88), in the form
! !
d dÔ 2πi
O(t) = Tr ρ − Tr Ô H, ρ ,
dt dt h
or, using the cyclic property of the trace, in the more convenient form
! !
d dÔ 2πi
O(t) = Tr ρ − Tr Ô, H ρ . (6.3)
dt dt h
In many cases the total Hamiltonian H of the system can be expressed as the
sum of two terms,
H = H0 + V , (6.4)
where H0 is the so-called unperturbed Hamiltonian and V is the interaction Hamil-
tonian. In such cases it is in many respects useful to introduce a different rep-
resentation for the quantum phenomena, called the interaction picture. This is
obtained from the usual Schrödinger picture – which has been employed so far –
by the following unitary transformation
a) for the state vectors:
i 2π H0 t
| ψ(t) I = e h | ψ(t)
b) for the operators:
i 2π H0 t −i 2π H0 t
ÔI (t) = e h Ô e h , (6.5)
where the index I means that the corresponding symbol (operator, or state vector)
is defined in the interaction picture.
The equation of motion for the density operator expressed in the interaction
picture can be easily obtained from Eq. (6.5). Using Eqs. (3.88) and (6.4) we get
d 2πi
ρI (t) = − VI (t), ρI (t) , (6.6)
dt h
240 CHAPTER 6
where
i 2π H0 t −i 2π H0 t
VI (t) = e h V e h (6.7)
is the interaction Hamiltonian expressed in the interaction picture.
Substitution of Eq. (6.6) into Eq. (6.2) leads to
! !
d d 2πi
O(t) = Tr Ô (t) ρI (t) − Tr ÔI (t), VI (t) ρI (t) . (6.8)
dt dt I h
Comparison of Eqs. (6.8) and (6.3) shows that when the interaction picture is
used, the time evolution of the expectation value O(t) is determined by the inter-
action Hamiltonian only (rather than the total Hamiltonian, as in the Schrödinger
picture). When the interaction Hamiltonian is small in comparison with the un-
perturbed Hamiltonian,1 Eq. (6.8) can be usefully expanded as follows. Integration
of Eq. (6.6) between 0 and t gives
t
2πi
ρI (t) = ρI (0) − VI (t ), ρI (t ) dt ,
h
0
which can be substituted into Eq. (6.8) to obtain, using the cyclic property of the
trace
! !
d d 2πi
O(t) = Tr ÔI (t) ρI (t) − Tr ÔI (t), VI (t) ρI (0)
dt dt h
t !
4π 2
− 2 Tr ÔI (t), VI (t) , VI (t ) ρI (t ) dt . (6.9)
h
0
This is an exact equation describing the time evolution of the observable O(t)
produced by the interaction Hamiltonian. In the following sections it will be applied
to describe the interaction between a material system and the radiation field.
We consider the interaction between an arbitrary material system and the radiation
field. The material system is here schematized as the collection of N independent
subsystems (that will be called ‘atoms’) occupying a definite volume V in the
ordinary three-dimensional space. The total Hamiltonian of the coupled system
composed of one atom and the radiation field can be written, in the Schrödinger
picture, in the form
H = H0 + V ,
1
‘Small’ means here that the typical matrix elements of V are much smaller than the matrix
elements of H0 .
MATTER-RADIATION INTERACTION (QUANTUM) 241
λ) and a† (ν, Ω,
where a(ν, Ω, λ) are, respectively, the destruction and creation oper-
ators associated with the mode of frequency ν, direction Ω, and polarization state
characterized by the unit vector eλ (λ = 1, 2), and where the sum runs over all the
possible modes of the radiation field.
The expression of the atomic Hamiltonian HA depends on the particular atomic
system and on the possible presence of external agents (like, for instance, a static,
electric or magnetic field). For the time being we make no specific assumption
on HA . However, we suppose to know its normalized eigenstates | n and energy
eigenvalues En , solution of the stationary Schrödinger equation
HA | n = En | n , (6.12)
with
n | m = δnm (6.13)
and
|n n| = 1 . (6.14)
n
where ri , pi , and si are the position, momentum, and spin operators of the i-th
optical electron of the atom, and where A( r ) and B(
r ) are the operators corre-
sponding to the vector potential and to the magnetic field vector of the radiation
field, respectively.
Writing the different contributions to V in the form
V = V1 + V2 + V3 ,
we can easily find the order of magnitude of the various terms. We have just to
recall that, as an order of magnitude
h c h
p≈ , A≈ E, s≈ , B≈E,
2πa0 2πν 2π
242 CHAPTER 6
where E and ν are the electric field and the frequency of the radiation field, re-
spectively, and where a0 is the Bohr radius. Moreover, for the atomic Hamiltonian
HA we have
p2 e2
HA ≈ ≈ 0 ,
m a0
so that we obtain
V1 V3 e0 Ea0 V 2πa0
≈ ≈ , 2 ≈ . (6.16)
HA V1 hν V1 λ
At optical wavelengths the quantity hν is comparable with the atomic energy, so
that we can also write
V V e Ea E
1 ≈ 3 ≈ 0 0 = ,
HA V1 e0 Eat a0 Eat
where Eat is a typical interatomic electric field. Thus when the ratio between
the radiation electric field and the typical interatomic electric fields is small, V1
can be considered a perturbation in comparison with HA , and V3 a perturbation
(of the same order) in comparison with V1 . On the other hand, Eq. (6.16) shows
that V2 , at optical wavelengths, is also much smaller (by about four orders of
magnitude) than V1 . It follows that if we restrict attention to typical astrophysical
or laboratory plasmas, where the inequality E/Eat 1 is always well-satisfied,1
and to wavelengths λ such that λ a0 , we can simply write the interaction
Hamiltonian in the form
e r).
V = 0 pi · A(i (6.17)
mc i
This is the form that will be actually used in the following to describe the interaction
between the atomic system and the radiation field.
Substituting the expression of the operator A( r ) in terms of destruction and
creation operators (Eq. (4.30)) into Eq. (6.17), we obtain
& '
V = Q(ν, Ω, λ) + Q† (ν, Ω,
λ) a(ν, Ω, λ) a† (ν, Ω,
λ) , (6.18)
ν Ωλ
1
This inequality is not satisfied in optical experiments performed with high-power lasers,
giving rise to non-linear optics phenomena.
2
To prove that the operator multiplying a† (ν, Ω, λ) in Eq. (6.18) is just the adjoint of
Q(ν, Ω, λ) one needs the commutation rule
[ p · eλ (Ω)
, eik·r ] = 0 ,
which follows from the commutation rule [rj , pk ] = ih̄ δjk and from Eq. (4.31).
MATTER-RADIATION INTERACTION (QUANTUM) 243
with .
e h
dν = 0 . (6.19)
m 2πνV
The interaction Hamiltonian given in Eq. (6.18) is expressed in the Schrödinger
picture. To derive its expression in the interaction picture, it should be noticed
λ) and Q† (ν, Ω,
that the operators Q(ν, Ω, λ) act only on the atomic system, while
†
the operators a(ν, Ω, λ) and a (ν, Ω, λ) act only on the radiation field, which implies
λ) = H , a(ν, Ω,
HR , Q(ν, Ω, λ) = H , H = 0.
A R A
i 2π H0 t −i 2π H0 t
VI (t) = e h V e h = B(t) + B † (t) , (6.20)
where
i 2π H t 2π
!
i 2π H t 2π
!
B(t) = λ) e−i h HA t
e h A Q(ν, Ω, λ) e−i h HR t . (6.21)
e h R a(ν, Ω,
ν Ωλ
Denoting by P (t) the operator resulting from the second curly bracket, we can
easily prove that
−2πiνt
P (t) = e λ) .
a(ν, Ω, (6.22)
λ) ,
P (0) = a(ν, Ω, (6.23)
d 2πi i 2π H t 2π
λ) e−i h HR t .
P (t) = e h R HR , a(ν, Ω,
dt h
On the other hand, taking into account the expression for HR (Eq. (6.11)) and the
commutation properties of the operators a and a† (Eqs. (4.32)), one gets
λ) = −hν a(ν, Ω,
HR , a(ν, Ω, λ) ,
so that
d
P (t) = −2πi ν P (t) .
dt
Integration of this differential equation with the initial condition (6.23) leads to
expression (6.22).
244 CHAPTER 6
As far as the first curly bracket is concerned, we can express it in a more con-
venient form by introducing the complete set of eigenvectors {| n } of the atomic
Hamiltonian defined in Eqs. (6.12)-(6.14). We have
i 2π HA t λ) e −i 2π HA t
e h Q(ν, Ω, h =
i 2π HA t −i 2π HA t
= e h |n λ) | m
n | Q(ν, Ω, m| e h
nm
2πi νnm t
= λ)
dν q(ν, Ω, e |n m| , (6.24)
nm
nm
λ)
q(ν, Ω, = n| e i k·ri | m .
pi · eλ (Ω) (6.26)
nm
i
and, similarly
B † (t) = dν q † (ν, Ω,
λ) |n λ) e 2πi (νnm + ν) t ,
m | a† (ν, Ω, (6.28)
nm
ν Ωλ nm
where
†
∗ e−i k·ri | m = q(ν, Ω,
λ)
q (ν, Ω, = n | pi · eλ (Ω) λ) ∗ .
nm mn
i
λ)]
The matrix element [q(ν, Ω, nm defined in Eq. (6.26) can be somewhat simplified
by a series of transformations that are discussed here in some detail.
First we observe that the coordinate ri of the i-th electron is measured from an
arbitrary origin. Introducing the relative coordinate xi , measured from the center
of mass of the atomic system, we have
+ x ,
ri = X i
MATTER-RADIATION INTERACTION (QUANTUM) 245
is the coordinate of the center of mass. With this position, the exponential
where X
λ)]
appearing in the definition of [q(ν, Ω, nm can be written as
i
k·
ri i
k·X i
k·
xi
e = e e ,
k · x ≈ 2πa0 1 ,
i
λ
we can simply write (long-wavelength approximation)
i
k·
ri i
k·X
e = e , (6.30)
It should be remarked, however, that the theory presented in the following cannot
give – because of the assumption X = const. – a consistent description of the
phenomena associated with the motion of the atomic system (Doppler effect). We
observe that if the coordinate X is considered as an ordinary variable describing
the physical state of the atomic system (like the relative coordinates xi of the
optical electrons) the formalism of Quantum Electrodynamics is able to provide
an elegant derivation of the fundamental formulae of the Doppler effect – which
basically result from the laws of conservation of momentum and energy. The reader
is referred to Louisell (1973) for an example of such derivation; the original idea
of deducing the Doppler effect from the momentum-energy conservation is due to
Fermi (1932).
Equation (6.31) can be further simplified by considering the expression of the
atomic Hamiltonian HA . Neglecting relativistic corrections, we have
p2
i
HA = + VCoul ,
i
2m
where VCoul represents the electrostatic energy of the electrons which depends on
the coordinates xi . Using the commutation rule of the operators pi and xi , and
defining
x = xi ,
i
246 CHAPTER 6
we have
h
HA , x = −i pi ,
2πm i
where νnm is the Bohr frequency defined in Eq. (6.25). This new expression involves
the matrix elements of the operator x , which coincides, apart from the factor -e0 ,
with the dipole operator
d = −e0 x .
This justifies the alternative name of (electric) dipole approximation that is given
to the long-wavelength approximation (Eq. (6.30)).
An important point concerning the dipole approximation should be explicitly
stressed. Substitution of Eq. (6.32) into Eq. (6.27) gives
.
2πh
B(t) e.d. = i e0 νnm m
n | x · eλ (Ω)|
νV
ν Ωλ nm
× |n λ) e 2πi (νnm − ν) t ,
m | a(ν, Ω, (6.33)
where Eq. (6.19) has been used. This expression can be compared with a similar
expression that is obtained by assuming a priori that the interaction Hamiltonian
has the form
V = r)
e0 xi · E(i
i
i 2π H0 t −i 2π H0 t
VI (t) = e h V e h = B (t) + B † (t) ,
where
.
2πνh
m
B (t) = i e0 n | x · eλ (Ω)|
V
ν Ωλ nm
2πi (νnm − ν) t
× |n λ) e
m | a(ν, Ω, . (6.34)
Comparison of Eqs. (6.33) and (6.34) shows that B (t) can be obtained from
[B(t)]e.d. by replacing the factor νnm with ν. In most physical applications the
distinction between these two forms of the interaction Hamiltonian is insignifi-
cant. Roughly speaking, this is due to the presence of the rapidly oscillating factor
exp [2πi (νnm − ν) t] which, for large values of t, is zero unless νnm = ν. A full
MATTER-RADIATION INTERACTION (QUANTUM) 247
discussion on this subject, together with a formal proof on how the two differ-
ent Hamiltonians can be connected by a similarity transformation (the so-called
Göppert-Mayer transformation) can be found in Cohen-Tannoudji et al. (1987).
In this book we will take for B(t) the more compact expression of Eq. (6.34). The
final form of the interaction Hamiltonian is therefore
VI (t) = B(t) e.d. + B † (t) e.d. ,
where
2πi (νnm − ν) t
B(t) e.d. = −i cν d · eλ (Ω)
nm
|n λ) e
m | a(ν, Ω,
ν Ωλ nm
B † (t) e.d. = i ∗
cν d · eλ (Ω) |n λ) e 2πi (νnm + ν) t
m | a† (ν, Ω, (6.35)
nm
ν Ωλ nm
with
.
2πνh
cν =
V
d = −e0 xj = −e0 x
j
d · eλ (Ω)
nm
= n | d · eλ (Ω)|
m = n | d | m · e (Ω)
λ
∗
d · eλ (Ω) ∗ | m = n | d | m · e (Ω)
= n | d · eλ (Ω) ∗
nm λ
∗
= (d )nm · eλ (Ω) (6.36)
and with
∗
d · eλ (Ω) ∗ .
= d · eλ (Ω) (6.37)
nm mn
In some cases, the dipole matrix elements between two particular levels | n and
| m may well be zero. This means that the electric dipole approximation gives no
contribution to the transition between such levels, and one must go back to those
parts of the interaction Hamiltonian that have been neglected so far. The various
terms neglected in the Taylor expansion (6.29) give rise to the electric quadrupole,
electric octupole, etc., Hamiltonians and to those parts of the magnetic-multipole
Hamiltonian related to the orbital angular momentum of the electrons, while the
second term in the right-hand side of Eq. (6.15) gives rise to those parts of the
magnetic-multipole Hamiltonian related to the electron spin.
In particular, let us consider the so-called magnetic-dipole transitions. Setting
again X = 0, the contribution of the second term of the expansion (6.29) to the
interaction Hamiltonian can still be written in the form of Eq. (6.18) with the
248 CHAPTER 6
substitutions1
λ) → Q (ν, Ω,
Q(ν, Ω, λ) =
1
i
= dν k · x + k · x p · e (Ω)
pi · eλ (Ω)
i i i λ
2 i
λ) → Q† (ν, Ω,
Q† (ν, Ω, λ) =
1
i
= − dν ∗ k · x + k · x p · e (Ω)
pi · eλ (Ω) ∗ . (6.38)
i i i λ
2 i
+ k · p x · e (Ω)
+ k · xi pi · eλ (Ω) i i λ
.
where the definitions of dν and cν (Eqs. (6.19), (6.36)) have been used.
Consider now the second term of the interaction Hamiltonian in Eq. (6.15). Using
r ), and adopting the long-wavelength approximation (6.30)
expression (4.33) for B(
with X = 0, we see that this term can also be written in the form of Eq. (6.18)
with the substitutions
λ) → Q (ν, Ω,
Q(ν, Ω, λ) = i c e0 × e (Ω)
si · Ω
2 ν λ
mc i
λ) = −i c e0
λ) → Q† (ν, Ω,
Q† (ν, Ω, si · Ω ∗.
× e (Ω) (6.40)
2 ν λ
mc i
1
It can be easily shown that the operators pi ·eλ (Ω)
and k·x commute (cf. footnote 2 on p.242).
i
However, the symmetrized forms in Eqs. (6.38) lead in the most straightforward way to the correct
decomposition of the interaction Hamiltonian into the electric-quadrupole and magnetic-dipole
terms (see e.g. Shu, 1991).
MATTER-RADIATION INTERACTION (QUANTUM) 249
The terms in square brackets of Eqs. (6.39) are responsible for electric-quadrupole
transitions, while the first terms, combined with those in Eqs. (6.40), are responsible
for magnetic-dipole transitions. Passing to the interaction picture, the magnetic-
dipole Hamiltonian can thus be written in the form
VI (t) m.d. = B(t) m.d. + B † (t) m.d. ,
where
B(t) m.d. = −i cν × e (Ω)
µ·Ω
λ nm
ν Ωλ nm
2πi (νnm − ν) t
× |n λ) e
m | a(ν, Ω,
†
B (t) m.d. = i cν µ·Ω ∗
× e (Ω)
λ nm
ν Ωλ nm
× |n λ) e 2πi (νnm + ν) t .
m | a† (ν, Ω, (6.41)
is the magnetic moment associated with the electrons, and the matrix elements are
defined in strict analogy with Eqs. (6.36).
Comparison of Eqs. (6.41) and (6.35) shows that the magnetic-dipole Hamilto-
nian can be simply obtained from the electric-dipole Hamiltonian by the formal
substitutions
d →
µ, →Ω
eλ (Ω) × e (Ω)
λ
. (6.42)
Substituting Eq. (6.20) into Eq. (6.9), the time variation of any physical observable
can be written as
! !
d d 2πi
O(t) = Tr ÔI (t) ρI (t) − Tr ÔI (t), B(t) ρI (0)
dt dt h
!
2πi †
− Tr ÔI (t), B (t) ρI (0)
h
t !
4π 2
− 2 Tr ÔI (t), B(t) , B(t ) ρI (t ) dt +
h
0
250 CHAPTER 6
t !
4π 2
− 2 Tr ÔI (t), B(t) , B † (t ) ρI (t ) dt
h
0
t !
4π 2
− Tr ÔI (t), B † (t) , B(t ) ρI (t ) dt
h2
0
t !
4π 2
− Tr ÔI (t), B † (t) , B † (t ) ρI (t ) dt . (6.43)
h2
0
It should be emphasized that this is a very general, exact equation that follows
directly from the principles of Quantum Mechanics. Now we are going to introduce
a number of assumptions which will enable us to derive a self-contained set of
evolution equations for the physical quantities characterizing the interaction of an
atomic system with the radiation field.
First of all we suppose that the density operator of the full system is the direct
product of the density operators of the individual systems,
I (t) ⊗ ρI (t)
ρI (t) = ρ(R) t≥0.
(A)
for (6.44)
In other words, if Ô(R) is an operator acting only on the radiation field, and Ô(A)
is an operator acting only on the atomic system, we assume that the expectation
value of their product is equal to the product of their expectation values,
& ' & ' & '
Tr ÔI(R) ÔI(A) ρI = Tr(R) ÔI(R) ρ(R)
I Tr (A)
ÔI
(A) (A)
ρ I ,
where Tr(R) and Tr(A) denote tracing over the states of the radiation field and of
the atomic system, respectively.
The physical meaning of this approximation is that the radiation field and the
atomic system are supposed to be uncorrelated. At time t = 0 (when the interac-
tion starts) this is indeed a reasonable approximation to describe the interaction
between light and matter in an environment like a stellar atmosphere. In such an
environment an atom interacts with radiation quanta that have been emitted at
typical distances of the order of many kilometers, so that atom-radiation correla-
tions can be safely neglected. The above approximation is also valid in laboratory
experiments, provided the light source is physically separated from the cell (or
analogous device) containing the atoms. By contrast, the approximation breaks
down in other physical conditions (like for instance in lasers, where atom-radiation
correlations play a major role).
Equation (6.44) postulates, however, that the uncorrelation between the radia-
tion field and the atomic system persists for t > 0. When Eq. (6.44) is substituted
into Eq. (6.43), we obtain an equation which can be considered as the lowest-order
approximation to the exact equation of motion. This approximation is justified
when the relaxation time of the density operator, tr , is much longer than the typ-
ical temporal factors which appear in Eq. (6.43) when the explicit expressions of
MATTER-RADIATION INTERACTION (QUANTUM) 251
B(t) and B † (t) are used (Eqs. (6.27)-(6.28), or (6.35)-(6.36) in the dipole approxi-
mation).
Our second assumption concerns the properties of the radiation field, and can be
expressed by the set of conditions
& ' & '
λ) ρ(R) (t) = Tr(R) a† (ν, Ω,
Tr(R) a(ν, Ω, λ) ρ(R) (t) = 0 (6.45)
I I
& '
λ) a(ν , Ω
Tr(R) a(ν, Ω, , λ ) ρ(R) (t) = 0 (6.46a)
I
& '
Tr(R) a† (ν, Ω,
λ) a† (ν , Ω
, λ ) ρ(R) (t) = 0
I (6.46b)
& '
Tr(R) a† (ν, Ω,
λ) a(ν , Ω
, λ ) ρ(R) (t) = 0 unless ν = ν , Ω
I
.
=Ω (6.47)
Equation (6.45) means that the expectation value of any observable which is linear
either in the destruction or the creation operator of the radiation field is zero. If
we consider, for instance, the expectation value of the electric or magnetic field
(whose associated operators are given, in the Schrödinger picture, by Eqs. (4.33))
we have & 2π '
2π
r ) e−i h H0 t ρ(R) (t) = 0
r, t) = Tr(R) e i h H0 t E(
E( I
and, similarly
r, t) = 0 .
B(
We thus obtain the analogue of the classical result that the statistical average of
such quantities is zero as long as the electromagnetic field can be regarded as the
incoherent superposition of different wavetrains with random phases.
Equation (6.45) is, however, somewhat restrictive in that it prevents the consider-
ation of processes involving coherent states of the radiation field (see e.g. Glauber,
1964, for an introduction to these concepts). Nevertheless, this approximation is
quite reliable to describe the radiation field typical of a stellar atmosphere, where
optical-coherent phenomena are believed to play a totally insignificant role. The
same is true for a quite large variety of laboratory experiments.
The meaning of Eqs. (6.46) is similar; indeed, the left-hand side of Eq. (6.46a)
represents, up to a factor, the statistical average of the quantity ckλ (t) ck λ (t),
where ckλ (t) is the amplitude of the Fourier component of the electric field vector
associated with a particular mode (see Eq. (4.20)). Similarly, the left-hand side
of Eq. (6.46b) represents the statistical average of the quantity ckλ (t)∗ ck λ (t)∗ .
Again, the approximation of Eqs. (6.46) is fully justified for electromagnetic fields
that can be described as the incoherent superposition of different wavetrains with
random phases.
Finally, assumption (6.47) means that no correlations exist between different
modes of the radiation field, except for correlations between polarization states
within each mode.
Using approximations (6.44)-(6.46) we can considerably simplify Eq. (6.43) for
two important classes of operators ÔI (t), namely the operators acting only on the
252 CHAPTER 6
atomic system and the operators acting only on the radiation field and having the
form a† (ν, Ω,
λ) a(ν , Ω
, λ ). It can be easily shown that for such operators, owing
to the form of B(t) and B † (t) (Eqs. (6.27)-(6.28)), Eq. (6.43) reduces to
!
d d
O(t) = Tr Ô (t) ρI (t)
dt dt I
t !
4π 2
− 2 Tr ÔI (t), B(t) , B † (t ) ρI (t ) dt
h
0
t !
4π 2
− Tr ÔI (t), B † (t) , B(t ) ρI (t ) dt .
h2
0
Because of the Hermitian character of the density operator and of the cyclic prop-
erty of the trace, this equation can be rewritten as
!
d d
O(t) = Tr ÔI (t) ρI (t)
dt dt
t !
4π 2
− 2 Tr ÔI (t), B(t) , B † (t ) ρI (t ) dt
h
0
& '
+ c.c. ÔI (t) → ÔI† (t) , (6.48)
where the last symbol denotes the quantity that is obtained by taking the complex
conjugate of the previous term and by replacing the operator ÔI (t) with its adjoint
ÔI† (t).
This is the basic equation that will be used in the following to derive the time
evolution of the radiation field and of the material system resulting from their
mutual interaction.
A complete description of the atomic system can be obtained through the knowl-
edge of the instantaneous value of the matrix elements of the density operator
ρ(A) (t), & '
ρmm (t) ≡ m | ρ(A) (t) | m = Tr(A) | m m | ρ(A) (t) , (6.49)
where | m and | m are two arbitrary eigenvectors of the atomic Hamiltonian,
and ρ(A) (t) is the time-dependent density operator of the atomic system expressed
in the Schrödinger picture.
From Eq. (3.87) we have
& '
ρ(A) (t) = Tr(R) ρ(t) ,
MATTER-RADIATION INTERACTION (QUANTUM) 253
where ρ(t) is the density operator of the whole system, so that we can also write
& '
ρmm (t) = Tr | m m | ρ(t) .
i 2π HA t −i 2π HA t 2πi νm m t
ÔI (t) = e h | m m| e h = | m m| e . (6.51)
Thus the density-matrix element ρmm (t) represents the expectation value of an
operator given by Ô = | m m | in the Schrödinger picture, and by Eq. (6.51) in
the interaction picture.
To derive the time evolution of ρmm (t) we will now apply Eq. (6.48), taking for
B(t) and B † (t) their expressions in the electric-dipole approximation (Eqs. (6.35)).
The first term is easily evaluated; from Eqs. (6.51) and (6.50) we get
! & '
d 2πi νm m t
Tr ÔI (t) ρI (t) = 2π i νm m Tr | m m| e ρI (t)
dt
= 2π i νm m ρmm (t) . (6.52)
For the second term we have, on the contrary, a rather involved expression
ÔI (t), B(t) , B † (t ) =
= cν cν d · eλ (Ω)
)∗
d · eλ (Ω
nn rr
νΩ
ν Ωλ λ nn rr
2πi (νm m + νnn − ν) t + 2πi (νrr + ν ) t
× e C, (6.53)
C can be easily evaluated using the commutation rule of the creation and destruc-
tion operators (Eq. (4.32)), and the commutation rule of the projection operators
|a b|,|c d | = δbc | a d | − δad | c b| .
254 CHAPTER 6
After some algebra, rearranging the final result in such a way that a† -operators
precede a-operators, we get
, -
C = δmn δn r | m r | a† (ν , Ω
, λ ) a(ν, Ω,
λ) + δ δ δ
νν ΩΩ λλ
− δmn δm r | r n | a† (ν , Ω
, λ ) a(ν, Ω,
λ)
, -
− δm n δmr | n r | a† (ν , Ω , λ ) a(ν, Ω,
λ) + δ δ δ
νν ΩΩ λλ
+ δm n δnr | r m | a† (ν , Ω
, λ ) a(ν, Ω,
λ) . (6.54)
Substituting this expression into Eq. (6.53) and renaming the summation indices
in the first line of Eq. (6.54) according to
n = r → n , r → m ,
we obtain
ÔI (t), B(t) , B † (t ) =
= cν c ν d · eλ (Ω)
)∗
d · eλ (Ω
mn nm
νΩ
ν Ωλ λ nm
2πi (ν − ν) t + 2πi (ν + ν ) t
× e m n nm
| m m |
, -
× a† (ν Ω
λ ) a(ν, Ω,
λ) + δ δ δ
νν ΩΩ λλ
where ‘(other terms)’ denotes the contribution arising from the other lines of
Eq. (6.54).
Now we multiply Eq. (6.55) by ρI (t ) and evaluate the trace. Using the approxi-
mations of Eqs. (6.44) and (6.47), and observing that
we get
!
Tr ÔI (t), B(t) , B † (t ) ρI (t ) =
2πi (νm n − ν)(t−t )
= c2ν d · eλ (Ω)
∗
d · eλ (Ω) e
mn nm
ν Ωλλ nm
& 2πi νm m t
'
× Tr(A) | m m | e ρ(A)
I (t
)
& '
× Tr(R) a† (ν, Ω,
λ ) a(ν, Ω,
λ) + δ ρ(R) (t )
λλ I
The two separate traces appearing in this equation can be easily identified in terms
of physical observables. The trace over (A) is nothing but the density-matrix
element ρm m (t ) (see Eqs. (6.49), (6.50) and (6.51)), while from Eq. (4.37) we
have1
&, - ' 2
Tr(R) a† (ν, Ω, λ) + δ ρ(R) (t ) = c
λ ) a(ν, Ω, +δ ,
Iλ λ (ν, Ω)
λλ I 3 t λλ
hν
is the polarization tensor of the radiation of frequency ν prop-
where [Iλ λ (ν, Ω)]t
agating in the direction Ω at time t .
Now we have to perform the integral in dt , between 0 and t, of Eq. (6.56).
A possible way to evaluate the integral is to assume that the relaxation time tr
characterizing the evolution of the physical quantities of the coupled system is
much longer than the typical time-scale over which the oscillating factor
2πi (νm n − ν)(t−t )
e
varies. Under this assumption we can extract the quantities ρm m and Iλ λ from
the integral (and drop their explicit time dependence) to get
t !
4π 2
− 2 Tr ÔI (t), B(t) , B † (t ) ρI (t ) dt =
h
0
8π 3 c2
=−
d · eλ (Ω) ∗
d · eλ (Ω) ρm m
h2 ν 2 V mn nm
ν Ωλλ nm
3 t
+ hν δ
× Iλ λ (ν, Ω) e
2πi (νm n − ν)(t−t )
dt
c2 λλ
0
+ (other terms) ,
where the expression of c2ν has been used (see Eqs. (6.36)).
The last integral can be evaluated by standard methods; moreover, because of
the above assumption, it is natural to take its asymptotic value for t → ∞. We
have
t
2πi (νab − ν)(t−t ) 1 i 1
lim e dt = δ(νab − ν) + P ,
t→∞ 2 2π νab − ν
0
where δ(x) is the Dirac delta-function and P(x) means the principal part in the
sense of distribution theory.
If we define the complex profile Φ(ν0 − ν) to be
i 1
Φ(ν0 − ν) = φ(ν0 − ν) + i ψ(ν0 − ν) = δ(ν0 − ν) + P , (6.57)
π ν0 − ν
1
It follows from Eq. (6.22) that the operator a† (ν, Ω,
λ ) a(ν, Ω,
λ) has the same form in the
Schrödinger picture and in the interaction picture.
256 CHAPTER 6
and transform the summation over the various modes of the radiation field into a
double integral via Eq. (4.35), we obtain
t !
4π 2
− 2 Tr ÔI (t), B(t) , B † (t ) ρI (t ) dt =
h
0
5 ∞
16 π
4
dΩ
∗
=− 2 ρm m dν d · eλ (Ω)
mn
d · eλ (Ω) nm
h c
4π
λλ nm 0
+ hν 3
× Iλ λ (ν, Ω) δ Φ(νm n − ν)
c2 λλ
+ (other terms) . (6.58)
This procedure can however be criticized from several points of view. First of
all, by assuming ρm m (t ) to be independent of t , we give up the possibility of
describing those aspects of the phenomena which depend on the ‘history’ of the
density-matrix elements. This is the so-called secular approximation (cf. Cohen-
Tannoudji et al., 1988), a special case of the Markov approximation, which makes
impossible the treatment of correlation effects in successive interactions, e.g., fre-
quency redistribution effects in the absorption and re-emission process.
Furthermore, we should realize that the exact time evolution of the density-
matrix element ρmm is affected by several complicated phenomena that cannot be
accounted for by our approach, which is based on a lowest-order expansion in the
framework of Quantum Electrodynamics: the finite width of energy levels, their
energy shift due to interactions with real and virtual photons, and – when the
levels m and m are non-degenerate – the so-called quantum beats. The last phe-
nomenon consists in oscillations of ρmm whose frequencies depend on the spectral
characteristics of the incident radiation.
The first two phenomena (finite width and shift of the energy levels) can be
taken into account, in a phenomenological way, by replacing the complex profile of
Eq. (6.57) with the following
Φ(νab − ν) = φ(νab − ν) + i ψ(νab − ν)
1 Γab i νab + ∆ab − ν
= + , (6.59a)
2 + (ν
π Γab ab + ∆ab − ν)2 π Γ 2 + (ν
ab ab + ∆ab − ν)
2
where
γab γ + γb
Γab = = a , ∆ab = ∆a − ∆b , (6.59b)
4π 4π
with γa , γb the probabilities per unit time that the atom leaves level | a or | b ,
respectively, via spontaneous or stimulated transitions, and ∆a , ∆b the frequency
shifts of the two levels. Note that the real part of the profile satisfies the normal-
ization
∞
φ(νab − ν) dν = 1 . (6.59c)
−∞
MATTER-RADIATION INTERACTION (QUANTUM) 257
The existence of quantum beats affects Eq. (6.58) in such a way that, when off-
diagonal density-matrix elements connecting non-degenerate levels are present, the
central frequency νm n of the Φ profile is in fact an oscillating quantity. Therefore,
the equations obtained from our simple treatment remain valid when the precise
frequency position of the Φ profile is not crucial. This requires that the incident
radiation field should be flat, i.e. independent of frequency, across a spectral in-
terval ∆ν larger than the Bohr frequencies connecting the different m levels and,
moreover, larger than the inverse lifetime of each m level. Under this assumption
(flat-spectrum approximation), the integral over frequency of the first term in the
square bracket of Eq. (6.58) is seen to be proportional to
,
Iλ λ (νm∗ n , Ω)
∞
Φ(ν − ν) dν .
Iλ λ (ν, Ω) mn
0
In order to describe both the general case and these two special cases by a unique
equation, we write the result of the integration over frequency in the form
,
Iλ λ (νmn , Ω)
1
νmn = νmn + νm n + νm n . (6.60)
3
Obviously, the introduction of the ‘mean’ Bohr frequency is just a formal artifice.
In the general case (non-zero density-matrix elements connecting non-degenerate
levels), νmn is a ‘label’ which reminds that the radiation field at any of the Bohr
frequencies νmn , νm n , νm n should be considered; in the special cases i) and ii),
νmn reduces to the true Bohr frequency νmn .
Let’s now consider the second term in the square bracket of Eq. (6.58). This
term leads to an integral over frequency whose real part is easily evaluated. Taking
258 CHAPTER 6
into account that the real part of the Φ profile is practically a Dirac delta-function
centered at the frequency νmn , we obtain
⎧
⎨ h2 νmn
3
if νmn > 0
c
⎩
0 if νmn < 0 .
The imaginary part of the Φ profile leads, on the contrary, to a diverging integral.
Integrals of such kind are commonly encountered in Quantum Electrodynamics, and
describe the finite widths and the shifts of the energy levels due to the interaction of
the atom with virtual photons. The actual calculation of the shift of a given level
(Lamb shift) involves the removal of the divergence of the integral by standard
procedures based on the concept of mass renormalization. An example of such
calculation for the simple case of the hydrogen atom can be found in Bethe and
Salpeter (1957). In the following, these terms related to the Lamb shift will be
simply neglected in our equations. Obviously, they can be reintroduced a posteriori
in the theory by supposing that the eigenvalues En of the atomic Hamiltonian
represent the exact energies (including the Lamb shift) of the atomic levels.
The above reasoning allows Eq. (6.58) to be rewritten in the form
t !
4π 2
− 2 Tr ÔI (t), B(t) , B † (t ) ρI (t ) dt =
h
0
5
16 π 4 dΩ
∗
=− 2 ρm m
d · eλ (Ω) mn
d · eλ (Ω) nm
h c
4π
λλ nm
× Iλ λ (νmn , Ω)
16 π 4
− 3
ρm m 3
νmn Θ(νmn )
hc
λ m n
5
dΩ
∗
×
d · eλ (Ω) mn
d · eλ (Ω) nm
4π
+ (other terms) , (6.61)
where Θ(x) is the step-function, which is equal to 1 for positive values of the
argument and to 0 otherwise.
The ‘(other terms)’ appearing in this equation are those arising from the sec-
ond, third, and fourth line of Eq. (6.54). The evaluation of these terms requires
calculations quite similar to those developed above. Once these terms have been
evaluated (by introducing the relevant ‘mean’ Bohr frequencies as in Eq. (6.60)),
we substitute Eqs. (6.61) and (6.52) into Eq. (6.48) and we add the term result-
ing from the complex conjugate of Eq. (6.61) with the interchange m ← → m (this
†
interchange transforms the operator ÔI (t) defined in Eq. (6.51) into ÔI (t)).
After some algebra, which involves several operations of index renaming, we
obtain the evolution equations for the density-matrix elements ρmm (usually called
MATTER-RADIATION INTERACTION (QUANTUM) 259
d
ρ = −2π i νmm ρmm
dt mm
+ ρnn TA (m, m , n, n )
nn
+ ρpp TE (m, m , p, p )
pp
+ ρpp TS (m, m , p, p )
pp
− ρmm RA (m, m , m ) + ρm m RA (m , m , m)
m
− ρmm RE (m , m, m ) + ρm m RE (m, m , m )
m
− ρmm RS (m , m, m ) + ρm m RS (m, m , m ) , (6.62)
m
where
5
32π 4 dΩ
TA (m, m , n, n ) = 2
d · eλ (Ω) mn
∗
d · eλ (Ω) nm
h c
4π
λλ
× Iλ λ (νmn , Ω)
5
32π 4 dΩ
TE (m, m , p, p ) = 3 p m λ
∗
d · e (Ω)
d · eλ (Ω) mp
hc 4π
λ
× νpm
3
Θ(νpm )
5
32π 4 dΩ
TS (m, m , p, p ) = 2 p m λ
∗
d · e (Ω)
d · eλ (Ω) mp
h c
4π
λλ
× Iλ λ (νpm , Ω)
16π 4 5 dΩ
RA (m, m , m ) = 2
d · eλ (Ω)
pm d · λ
∗
e (Ω)
m p
p
h c
4π
λλ
× Iλ λ (νpm , Ω)
1
It should be recalled that from Eqs. (1.37), (3.80), and (6.37) one has the relations
∗ = I (ν, Ω)
Iλλ (ν, Ω) , ρ∗ab = ρba , ∗ = [ d · e (Ω)
[ d · eλ (Ω)] ∗] .
λ λ ab λ ba
260 CHAPTER 6
16π 4 5 dΩ
∗
RE (m, m , m ) = 3
d · eλ (Ω)
mn
d · eλ (Ω) nm
n
hc 4π
λ
× νmn
3
Θ(νmn )
16π 4 5 dΩ
RS (m, m , m ) = 2
d · eλ (Ω)
mn
∗
d · eλ (Ω) nm
n
h c
4π
λλ
,
× Iλ λ (νmn , Ω) (6.63)
with
1
νrs = ν + νrs + νr s + νr s
4 rs
1
νrs = νrs + νrs + νrs
3
1
νrs = νrs + νr s + νr s . (6.64)
3
The physical meaning of the various terms in Eq. (6.62) can be easily understood.
The first term in the right-hand side describes a relaxation of the coherence ρmm
due to the energy difference between levels | m and | m . This term,which is
obviously zero for the coherences between degenerate levels (and in particular for
the diagonal elements ρmm representing the populations), produces in general a
relaxation of the coherence ρmm which is the stronger, the larger the energy sepa-
ration between the corresponding levels. Roughly speaking, the coherence between
levels | m and | m turns out to be practically zero when the corresponding Bohr
frequency 2π νmm is much larger than the typical rates T and R appearing in the
right-hand side of Eq. (6.62). As these rates are usually of order 109 s−1 or smaller,
we obtain the following criterion for the disappearance of coherences between levels
| m and | m
| νmm | 1.6 × 108 Hz
or, in other words
| Em − Em | 7 × 10−7 eV . (6.65)
The other terms in the right-hand side of Eq. (6.62) represent the rates at which
the coherence ρmm either increases as a result of coherence-transfer from different
levels or decreases as a result of coherence-relaxation to different levels. Coherence-
transfer rates are denoted by the symbol T and all bear a positive sign. These terms
are due to absorption from lower levels (TA ), to spontaneous emission from upper
levels (TE ), and to stimulated emission from upper levels (TS ). Similarly, coherence-
relaxation rates – denoted by the symbol R – are due to absorption towards upper
levels (RA ), to spontaneous emission towards lower levels (RE ), and to stimulated
emission towards lower levels (RS ). All these rates bear a negative sign. The rates
RS and RE originate from the first line of Eq. (6.54); the rate TA from the second,
MATTER-RADIATION INTERACTION (QUANTUM) 261
Fig.6.1. The radiative processes contributing to the time evolution of the coherence ρmm are
schematized in this simplified Grotrian diagram. Straight lines refer to spontaneous emission
while wavy lines refer to absorption (arrows pointing upward) or stimulated emission (arrows
pointing downward). Coherence-transfer rates are represented by transitions ending in the couple
of levels (m, m ) while coherence-relaxation rates are represented by transitions originating from
these levels. Each process is labelled with the symbol of the corresponding rate.
the rates TS and TE from the third, and the rate RA from the fourth line. The
processes described by the different rates are schematically shown in Fig. 6.1.
From the explicit expressions of the various rates (Eqs. (6.63)) it is possible
to prove some important properties. With easy transformations, one obtains the
conjugation relations
TA (m, m , n, n )∗ = TA (m , m, n , n)
TE (m, m , p, p )∗ = TE (m , m, p , p)
TS (m, m , p, p )∗ = TS (m , m, p , p)
RA (m, m , m )∗ = RA (m, m , m )
RE (m, m , m )∗ = RE (m , m , m)
RS (m, m , m )∗ = RS (m , m , m) .
Another property concerns the time evolution of the trace of the density operator.
Setting m = m in Eq. (6.62) and summing over m one finds, after some algebra
which involves several operations of index renaming, that the term arising from TA
exactly cancels out with the terms arising from RA , and, similarly, TE cancels out
with RE , and TS cancels out with RS , so that
d
ρmm =0.
dt m
This property – which should be expected on the basis of Eq. (3.83) – implies that
the trace of the density operator of the atomic system can be fixed once and for
262 CHAPTER 6
all. In many cases the density operator is normalized through the equation
ρmm = 1 ,
m
but different normalizations can be safely employed. Equation (6.62), being lin-
ear in the density-matrix elements, is independent of the particular normalization
condition employed.
The expressions for the various rates given in Eqs. (6.63) contain matrix elements
e (Ω)]
of the form [d·
λ mn . The evaluation of these matrix elements is generally easier
using the spherical components of the dipole operator (rather than its Cartesian
components).1 From Eq. (2.83) we have
d · eλ (Ω)
=
(−1)q d−q eλ (Ω) q
,
q
whence
d · eλ (Ω)
mn
=
(−1)q (d−q )mn eλ (Ω) q
(6.66)
q
and, similarly
∗
∗
d · eλ (Ω) ∗ =
= d · eλ (Ω)
(−1)q (d−q )nm eλ (Ω) .
mn nm q
q
so that
∗
d · eλ (Ω) = ∗.
(−1)q (d−q )∗nm eλ (Ω) (6.67)
mn q
q
which up to
At this point we choose a specific form for the unit vectors eλ (Ω),
now were just the unit vectors – satisfying the relations (4.31) – used to expand
the vector potential. From now on we require that these vectors have the form of
Eq. (5.141). This choice provides the simplest expression for their spherical com-
ponents and, moreover, it allows us to use the spherical tensors already introduced
in Sect. 5.11.
As a first consequence, we can easily evaluate the angular integrals appearing
in the expressions for the rates TE and RE related to spontaneous emission (see
Eqs. (6.63)). From Eqs. (6.66), (6.67), (5.140) and (5.143) we have
1
This is basically related to the Wigner-Eckart theorem, see Sect. 2.8.
MATTER-RADIATION INTERACTION (QUANTUM) 263
5 dΩ
d · eλ (Ω)
∗
d · eλ (Ω) =
4π ab cd
λ
5 dΩ
=
(−1)q+q (d−q )ab (d−q )∗dc eα (Ω) ∗
eα (Ω)
4π q q
α=±1 qq
5
dΩ
= (−1)q+q (d−q )ab (d−q )∗dc
Eqq (α, α, Ω)
α=±1
4π
qq
2
= (d−q )ab (d−q )∗dc . (6.68)
3 q
Next, recalling the definition of the radiation field tensor Jqq (ν) (see Eqs. (5.150)
and (5.153)), we can rewrite the rates appearing in the statistical equilibrium equa-
tions in the more compact form
32π 4
TA (m, m , n, n ) = 2 (−1)q+q (d−q )mn (d−q )∗m n Jqq (νmn )
h c qq
64π 4
TE (m, m , p, p ) = (d−q )p m (d−q )∗pm νpm
3
Θ(νpm )
3hc3 q
32π 4
TS (m, m , p, p ) = (−1)q+q (d−q )p m (d−q )∗pm Jqq (νpm )
h2 c qq
16π 4
RA (m, m , m ) = 2c
(−1)q+q (d−q )pm (d−q )∗pm Jqq (νpm )
p
h qq
32π 4
RE (m, m , m ) = 3
(d−q )mn (d−q )∗m n νmn
3
Θ(νmn )
n
3hc q
16π 4
RS (m, m , m ) = 2
(−1)q+q (d−q )mn (d−q )∗m n Jqq (νmn ) , (6.69)
n
h c qq
where the various ‘mean’ Bohr frequencies are defined in Eqs. (6.64).
An important remark should be made about the limit of the statistical equilib-
rium equations under the hypothesis of neglecting polarization phenomena in the
atomic system and of neglecting polarization and anisotropy effects in the radiation
field. Let us assume that
ρab = ρa δab , = 1 I(ν) δ ,
Iλ λ (ν, Ω) λλ
2
where ρa is the population of level | a and I(ν) is the radiation field intensity.
From Eq. (6.62) we have
d
ρm = ρn TA (m, m, n, n) + ρp TE (m, m, p, p) + TS (m, m, p, p)
dt n p
From Eqs. (5.150), (5.153) and (5.143) the radiation field tensor is found to be
1
Jqq (ν) = I(ν) δqq .
3
According to Eqs. (6.69) the coherence-transfer rate TA becomes
32π 4 2
TA (m, m, n, n) = |d | I(νmn ) ,
3h2 c mn
where (see Eq. (2.82))
|dmn |2 = |dnm |2 = dmn · dnm = |(dq )mn |2 . (6.70)
q
Introducing the Einstein Bnm coefficient for the transition from the initial state
| n to the final state | m by the standard definition
32 π 4 2
Bnm = |d | , (6.71)
3 h2 c nm
one obtains
TA (m, m, n, n) = Bnm I(νmn ) .
Performing the same kind of calculations on the other rates one finally gets
d
ρm = ρn Bnm I(νmn )
dt n
+ ρp Apm + Bpm I(νpm )
p
− ρm Bmp I(νpm )
p
− ρm Amn + Bmn I(νmn ) , (6.72)
n
where
64π 4 3 2 3
2hνmn
Amn = νmn |dmn | = Bmn (6.73)
3hc3 c2
is the Einstein coefficient for spontaneous emission from the upper level | m to
the lower level | n .
Equations (6.72) are the standard statistical equilibrium equations whose deriva-
tion, based on heuristic arguments, can be found in various textbooks (see e.g.
Mihalas, 1978). They are just a special case of a more general set of equations
(Eqs. (6.62)) which have been derived here directly from the principles of Quan-
tum Electrodynamics. Equations (6.62) reduce to Eqs. (6.72) when polarization
and anisotropy phenomena are neglected.
MATTER-RADIATION INTERACTION (QUANTUM) 265
We derive here the evolution equations for the radiation field by essentially the same
method just employed for the atomic system. In particular, we are interested in the
related to a given frequency ν
time evolution of the polarization tensor Iαβ (ν, Ω)
whose associated quantum operator is given by Eq. (4.37).
and a given direction Ω,
As this operator remains unchanged in the interaction picture (see footnote on
p. 255), we have just to apply Eq. (6.48) with the operator ÔI (t) given by
hν 3 † β) .
ÔI (t) = a (ν, Ω, α) a(ν, Ω,
c2
Since dÔI (t)/dt = 0, and since the adjoint operator ÔI† (t) is obtained from ÔI (t)
by exchange of the indices α and β, we get
d =
I (ν, Ω)
dt αβ
t !
4π 2 ν 3 †
=− Tr a (ν, Ω, β) , B(t) , B † (t ) ρ (t ) dt
α) a(ν, Ω,
I
hc2
0
6 7
+ c.c. α →
←β . (6.74)
We start by evaluating the inner commutator, with B(t) given by Eq. (6.35). We
have
†
α) a(ν, Ω,
a (ν, Ω, β) , B(t) =
= −i )
cν d · eλ (Ω mn
νΩ
λ mn
× |m n | a† (ν, Ω, , λ ) e 2πi (νmn − ν ) t ,
β) , a(ν , Ω
α) a(ν, Ω,
and using the commutation properties of the operators a and a† (Eq. (4.32)) we
get
†
α) a(ν, Ω,
a (ν, Ω, β) , B(t) = i c
ν d · eα (Ω)
mn
mn
× |m β) e 2πi (νmn − ν) t .
n | a(ν, Ω,
With some easy algebra, and rearranging the final result in such a way that a† -
operators precede a-operators, we obtain
, -
C = δnr | m s | a† (ν , Ω
, λ ) a(ν, Ω,
β) + δ δ δ
νν ΩΩ λ β
− δsm | r n | a† (ν , Ω
, λ ) a(ν, Ω,
β) . (6.76)
Now we substitute Eq. (6.76) into Eq. (6.75), multiply by ρI (t ) and take the trace
of the resulting operator. With the help of Eqs. (6.44) and (6.47) we obtain
!
†
†
Tr
a (ν, Ω, α) a(ν, Ω, β) , B(t) , B (t ) ρI (t ) =
Similarly to the previous section, we identify the various traces in this equation
with the corresponding physical observables. Next we perform the integral in dt
between 0 and t (for t → ∞) and we substitute into Eq. (6.74) to obtain, after
some index renaming
d =
I (ν, Ω)
dt αβ
3
4π ν
= d · eα (Ω)
∗
d · eγ (Ω) ρm m
hV γ mn nm
mm n
× Φ(νmn − ν) Iγβ (ν, Ω)
4π 3 ν 4
∗
+
d · eα (Ω) d · eβ (Ω) nm
ρm m Φ(νmn − ν) +
c V
2
mn
mm n
MATTER-RADIATION INTERACTION (QUANTUM) 267
4π 3 ν
∗ ρ
−
d · eα (Ω) mn
d · eγ (Ω) n m nn
hV γ
mnn
!
× Φ(νmn − ν) Iγβ (ν, Ω)
6 7
→β ,
+ c.c. α ← (6.77)
; t + ∆t, x + Ω
Iαβ (ν, Ω c ∆t) − I (ν, Ω ; t, x )
αβ
lim =
∆t→0 ∆t
∂ ; t, x ) + c Ω
· grad I (ν, Ω
; t, x ) .
= I (ν, Ω
∂t αβ αβ
d ∂ d
→ +c .
dt ∂t ds
Next we observe that Eq. (6.77) has been obtained under the assumption that the
radiation field, defined in the normalization box of volume V, interacts with a single
atomic system. For an ensemble of uncorrelated atomic systems we have just to
multiply the right-hand side by the number of such systems, with the result that
the factor 1/V is replaced by the number density N . Equation (6.77) can then be
268 CHAPTER 6
4π 3 ν
gαβ = N d · eα (Ω)
∗ ρ Φ(ν − ν)
d · eβ (Ω)
ch mn n m nn mn
mnn
3
4π ν
hαβ = N d · eα (Ω)
∗
d · eβ (Ω) ρm m Φ(νmn − ν)
ch mn nm
mm n
3
2hν
fαβ = hαβ . (6.79)
c2
The physical interpretation of the terms in the right-hand side of Eq. (6.78) is
straightforward. The first term, which bears a negative sign and which results
from the third line in Eq. (6.77), describes the transfer effects produced by the
absorption of radiation. It is proportional to the coherence ρnn between two levels
| n and | n which are connected to the upper level | m by dipole transitions.
The imaginary part of the complex Φ profile describes the associated anomalous
dispersion effects.
In Chap. 5 we already found – using the formalism of classical electrodynamics –
a transfer equation for the polarization tensor (cf. Eq. (5.24)).1 Comparison with
Eq. (6.78) shows that the tensor g is just the quantum analogue of the propagation
tensor G defined in Eq. (5.22) (with G → g ∗ ). This analogy provides an interesting
relation between the refractive index and the matrix elements of the dipole operator
between eigenstates of the atomic system.
The second term in Eq. (6.78), which bears a positive sign and which results from
the first line in Eq. (6.77), describes the transfer effects produced by stimulated
emission of radiation. This term is proportional to the coherence ρm m between
two levels | m and | m which are connected to the lower level | n by dipole
transitions. We see the appearance, via the imaginary part of the complex Φ profile,
of anomalous dispersion effects associated with stimulated emission of radiation.
This term has no analogue in the classical equation, an obvious consequence of the
quantum nature of stimulated emission.
Finally, the third term in Eq. (6.78), which again bears a positive sign and which
results from the second line in Eq. (6.77), describes the transfer effects produced
1
Equation (5.24) involves the tensor J, while Eq. (6.78) involves the tensor I. We recall that
I and J are proportional (see Eq. (1.36)).
MATTER-RADIATION INTERACTION (QUANTUM) 269
by spontaneous emission of radiation. The classical analogue of this term has been
deduced for the special case of an atom in the presence of a magnetic field (see
Eq. (3.37)).
As in the preceding section, it is convenient to express the dipole matrix elements
appearing in Eqs. (6.79) in terms of spherical tensors, assuming at the same time
that the polarization unit vectors have the form of Eq. (5.141). Using Eqs. (6.66),
(6.67), and (5.140) we can easily rewrite the tensors g, h, f in the form
4π 3 ν
gαβ = N (−1)q+q (d−q )mn (d−q )∗mn
ch
mnn qq
ρ Φ(ν − ν)
× Eqq (α, β, Ω) nn mn
4π 3 ν
hαβ = N (−1)q+q (d−q )mn (d−q )∗m n
ch
mm n qq
ρ Φ(ν − ν)
× Eqq (α, β, Ω) mm mn
2hν 3
fαβ = hαβ . (6.80)
c2
An important remark on the radiative transfer equations concerns the special
case where polarization is neglected both in the radiation field and in the atomic
system. If we suppose that
= 1 δ ,
Iαβ (ν, Ω) I(ν, Ω) αβ ρnn = ρn δnn ,
2
we get
1 ∂ d = 1 ∂ + d
+ I(ν, Ω) I (ν, Ω)
c ∂t ds c ∂t ds α αα
1
∗
=− gαα + gαα I(ν, Ω)
α
2
1
+ hαα + h∗αα I(ν, Ω)
α
2
1
∗
+ fαα + fαα , (6.81)
α
2
where
4π 3 ν
gαα = N (−1)q+q (d−q )mn (d−q )∗mn
α
ch mn α qq
ρ Φ(ν − ν) ,
× Eqq (α, α, Ω) n mn
270 CHAPTER 6
etc. Now we ‘average over all the possible orientations of the atom’, which is
directions entering
equivalent to keeping the atom fixed and averaging over all the Ω
the tensor Eqq . Using Eq. (6.68) we obtain1
8π 3 ν
gαα = N |dmn |2 ρn Φ(νmn − ν) , (6.82)
α
3ch mn
where |dmn |2 is defined in Eq. (6.70). Performing similar calculations for the other
terms of Eq. (6.81) and bearing in mind the definition of the Einstein coefficients
(Eqs. (6.71) and (6.73)) we easily obtain
1 ∂ d +ε ,
= −k I(ν, Ω)
+ I(ν, Ω) ν ν
c ∂t ds
where2
hν
kν = B ρ − ρm φ(νmn − ν) N
4π nm nm n
hν
εν = A ρ φ(νmn − ν) N .
4π nm mn m
This is the usual radiative transfer equation whose derivation, generally based on
heuristic arguments, can be found in several textbooks (see for instance Mihalas,
1978). The quantities kν and εν are the line absorption coefficient (corrected for
stimulated emission) and the emission coefficient, respectively. The special case
now considered shows that the tensors g, h, and f defined in Eqs. (6.79) are just the
generalization of the absorption coefficient, the negative absorption coefficient due
to stimulated emission, and the emission coefficient, respectively. These quantities,
and the structure itself of the transfer equations, have been derived in this section
directly from the principles of Quantum Electrodynamics.
1
Equation (6.82) is correct only when the atomic system is evenly populated in all the
degenerate sublevels of a given energy level. Even in that case, it does not hold in the presence
of any kind of anisotropy (e.g., in the presence of a magnetic field).
2
The above expression for εν is obtained by noticing that, because of the presence of the
profile φ(νmn − ν), the Einstein coefficient Amn can also be written as
2hν 3
Amn = Bmn .
c2
MATTER-RADIATION INTERACTION (QUANTUM) 271
where σ̂i are the Pauli spin matrices of Eqs. (5.128). Substituting Eqs. (6.78), and
using the conjugation relation (5.147) and the cyclic property of the trace, one
obtains after some algebra
3 3
1 ∂ d =− A + S +ε ,
+ Si (ν, Ω) Kij Sj (ν, Ω) Kij Sj (ν, Ω) i (6.83)
c ∂t ds j=0 j=0
where2
A
Kij = Re Tr (σ̂j σ̂i g)
S
Kij = Re Tr (σ̂j σ̂i h)
εi = Re Tr (σ̂i f ) . (6.84)
where
ηIA = K00
A A
= K11 A
= K22 = K33A
= Re Tr (g) = Re g++ + g−−
A
ηQ A
= K01 A
= K10 = Re Tr (σ̂1 g) = −Re g+− + g−+
1
Note that a similar transformation on the transfer equations has been performed in Sect. 5.2,
although using a different basis of polarization unit vectors.
2
It is understood that the components of the tensors f, g, h are labelled in the same way as
the spin matrices σ̂i (see the remark following Eq. (5.128)).
272 CHAPTER 6
ηUA A
= K02 = K20A
= Re Tr (σ̂2 g) = Im g+− − g−+
ηVA = K03
A
= K30A
= Re Tr (σ̂3 g) = Re g++ − g−−
ρAQ = K23
A
= −K32 A
= Im Tr (σ̂1 g) = −Im g+− + g−+
ρAU = K31
A
= −K13 A
= Im Tr (σ̂2 g) = −Re g+− − g−+
ρAV = K12
A
= −K21 A
= Im Tr (σ̂3 g) = Im g++ − g−−
εI = ε0 = Re Tr (f ) = Re f++ + f−−
εQ = ε1 = Re Tr (σ̂1 f ) = −Re f+− + f−+
εU = ε2 = Re Tr (σ̂2 f ) = Im f+− − f−+
εV = ε3 = Re Tr (σ̂3 f ) = Re f++ − f−− , (6.86)
and where ηIS , . . . , ρSV have the same expressions as ηIA , . . . , ρAV with hαβ substi-
tuted for gαβ . All these quantities can be easily written in terms of the reducible
spherical tensor Tqq (i, Ω) introduced in Chap. 5 (Eq. (5.146)). Defining the formal
vectors
ηiA = (ηIA , ηQ
A A
, ηU , ηVA ) (i = 0, 1, 2, 3)
ηiS = (ηIS , ηQ
S S
, ηU , ηVS ) (i = 0, 1, 2, 3)
A A A A
ρi = (ρQ , ρU , ρV ) (i = 1, 2, 3)
ρSi = (ρSQ , ρSU , ρSV ) (i = 1, 2, 3) , (6.87)
ch
mnn qq
× Re (d−q )mn (d−q )∗mn Tqq (i, Ω)
ρ Φ(ν − ν)
nn mn
8π 3 ν
=
ηiS (ν, Ω) N (−1)q+q
ch
mm n qq
× Re (d−q )mn (d−q )∗m n Tqq (i, Ω)
ρ Φ(ν − ν)
mm mn
6 7
= η A (ν, Ω)
ρAi (ν, Ω) Re → Im
i
6 7
= η S (ν, Ω)
ρSi (ν, Ω) Re → Im
i
3
= 2hν η S (ν, Ω)
εi (ν, Ω) , (6.88)
i
c2
where the symbol {Re → Im} means that ρi is obtained from the corresponding ηi
by substitution of the imaginary part for the real part.
MATTER-RADIATION INTERACTION (QUANTUM) 273
The radiative transfer equations become much simpler in the special case of
Local Thermodynamic Equilibrium, since the density-matrix elements of the atomic
system reduce to the expression (Boltzmann law)
En
−
e kB T
ρnn = δnn ,
Z(T )
where kB is the Boltzmann constant, T the absolute temperature of the medium,
and Z(T ) the partition function. With the use of the conjugation relation (5.148),
we have from Eqs. (6.88)
3
= 8π ν N
ηiA (ν, Ω) (−1)q+q
ch mn qq En
−
kB T
e
× (d−q )mn (d−q )∗mn Tqq (i, Ω)
φ(νmn − ν)
Z(T )
= 8π 3 ν
ηiS (ν, Ω) N (−1)q+q
ch mn qq Em
−
kB T
× (d−q )mn (d−q )∗mn e
Tqq (i, Ω) φ(νmn − ν) ,
Z(T )
and since the profiles φ(νmn − ν) are practically Dirac delta-functions
− hν
= e
ηiS (ν, Ω) kB T
ηiA (ν, Ω)
3 hν
= 2hν e− kB T η A (ν, Ω)
εi (ν, Ω) = η A (ν, Ω)
− η S (ν, Ω)
B , (6.89)
2 i i i P
c
where
2hν 3 1
BP = hν
c2
e −1 kB T
is the Planck function. Thus the radiative transfer equations can be written in the
compact form
1 ∂ d
3
+ Si = − Kij Sj − BP Uj ,
c ∂t ds j=0
where1
A
Kij = Kij − Kij
S
1
Note that the relation between the quantities ρA S
i and ρi analogous to the first of Eqs. (6.89),
hν
= e− kB T ρA (ν, Ω)
ρSi (ν, Ω) ,
i
Uj = δj0
is the unity vector U = (1, 0, 0, 0)† . In matrix form the equation reads
⎛ ⎞ ⎛ ⎞⎛ ⎞
I ηI ηQ ηU ηV I − BP
1 ∂ d ⎜Q⎟ ⎜ ηQ ηI ρV −ρU ⎟⎜ Q ⎟
+ ⎝ ⎠ = −⎝ ⎠⎝ ⎠,
c ∂t ds U ηU −ρV ηI ρQ U
V ηV ρU −ρQ ηI V
with ηI = ηIA − ηIS , ηQ = ηQ
A
− ηQ
S
, etc.
The statistical equilibrium equations and the radiative transfer equations have been
derived in the preceding sections under the assumption of electric-dipole transitions,
but they can be easily rewritten for magnetic-dipole transitions. As shown at the
end of Sect. 6.3, we have just to change the interaction Hamiltonian according to
Eqs. (6.42).
It immediately follows that the basic equations (6.62) and (6.78) remain un-
changed, while the matrix elements in the expressions of the radiative rates (Eqs.
(6.63)) and in the expressions of the tensors f , g, h (Eqs. (6.79)) must be replaced
according to
d · eλ (Ω)
nm
→ × e (Ω)
µ·Ω λ
nm
.
When spherical tensors are introduced, it can be easily seen that the matrix ele-
ments of the form (dq )nm must be replaced by (µq )nm , and the tensor Eqq (α, β, Ω)
by
E (α, β, Ω) = Ω × e (Ω)
∗
× e (Ω)
Ω (α, β = ± 1) ,
qq m.d. α q β q
where the symbol in curly bracket means the substitution of the matrix elements
(µq )nm for (dq )nm . If we consider the matrix form (6.85) of the transfer equations,
we have therefore – besides the just mentioned substitution – a sign change of the
A A
quantities ηQ , ηU , ρAQ , ρAU , ηQ
S S
, ηU , ρSQ , ρSU , εQ , εU .
CHAPTER 7
In this chapter we will apply the basic equations derived in Chap. 6 (statistical
equilibrium and radiative transfer equations) to the special case where the material
system interacting with the radiation field is an atom – or, more properly, an
ensemble of uncorrelated identical atoms. We will also assume the atoms to be
embedded in an external, static magnetic field, on account of the great importance
of this topic in astrophysical plasmas. Obviously, the results that will be derived
in this chapter are subjected to the limitations outlined in the introduction to
Chap. 6.
The two sets of equations depend on the atomic system through the quantities:
i) ρnm = n | ρ | m , the matrix elements of the atomic density operator between
energy eigenstates, describing populations (n = m) and coherences (n = m);
ii) νnm = (En − Em )/h, the corresponding Bohr frequencies;
iii) n | d | m (or n | µ | m ), the matrix elements of the electric (or magnetic)
dipole operator between energy eigenstates. In these expressions | n and En are
the eigenvectors and eigenvalues of the Hamiltonian of the unperturbed atom, plus
the Hamiltonian describing the interaction with the external magnetic field.
The quantities | n and En depend on the spectral properties of the atom and
on the intensity of the magnetic field (L-S or different coupling schemes, possi-
ble presence of hyperfine structure, Zeeman or Paschen-Back effect regime, etc.).
Moreover, one should in principle consider the coherences between any couple of
levels | n , | m , which would make the problem extremely involved. However,
depending on the specific atomic structure and magnetic field value, certain coher-
ences can be disregarded. This is related to a general property of the statistical
equilibrium equations, according to which the coherence ρnm is the smaller, the
larger the energy difference between levels | n and | m .
We consider in the following three schematic situations, which cover most cases
of interest for practical applications:
i) atom with no hyperfine structure, Zeeman effect regime, coherences between
magnetic sublevels (M, M ) of each (αJ)-level (neglecting those between different
levels): Sects. 7.1-7.4;
ii) atom with no hyperfine structure, L-S coupling scheme, incomplete Paschen-
Back effect regime, coherences between magnetic sublevels (M, M ) of each (βLSJ)
level and between magnetic sublevels of different levels (βLSJ), (βLSJ ) belonging
to the same term (neglecting coherences between different terms): Sects. 7.5-7.8;
iii) atom with hyperfine structure, incomplete Paschen-Back effect regime (of hy-
perfine structure), coherences between magnetic sublevels (f, f ) of each hyperfine
level (αJIF ) and between magnetic sublevels of different hyperfine levels (αJIF ),
278 CHAPTER 7
(αJIF ) belonging to the same level (neglecting coherences between different lev-
els): Sect. 7.9.
The statistical equilibrium and the radiative transfer equations will be written
both in the standard representation of the atomic density operator (which is easier
to work with, starting from the formalism of Chap. 6) and in the spherical statis-
tical tensor representation, which leads to more compact and physically significant
expressions.
At the end of the chapter we will also study the effect of collisions on the statistical
equilibrium equations, under the assumption of an isotropic distribution of colliding
particles.
where all the symbols have the same meaning as in Sect. 3.1.
The formalism presented in Chap. 6 can now be applied to this particular system.
To begin with the simplest case, we will neglect here all coherences between mag-
netic sublevels that do not belong to the same (αJ)-level. In other words, we sup-
pose that all the atomic density-matrix elements of the form1 αJM | ρ |α J M
are zero except for (αJ) = (α J ),
This approximation is usually justified when the energy separation between two
different levels – say (αJ) and (α J ) – is sufficiently large that the corresponding
Bohr frequency
E − Eα J
ναJ, α J = αJ (7.2)
h
1
In this chapter we denote by ρ the atomic density operator ρ(A) .
EQUATIONS FOR ATOMIC SYSTEMS 279
is much larger (in absolute value) than the Einstein A coefficient for spontaneous
emission.1 Under such approximation all the different atomic levels are decoupled
and the atomic system is referred to as a multi-level atom. For this system, the
flat-spectrum approximation discussed in Sect. 6.5 requires that the radiation field
incident on the atom should be constant across frequency intervals wider than
the Zeeman separations and than the inverse lifetimes of the relevant magnetic
sublevels. Therefore, it is sufficient to specify the incident radiation field at the
Bohr frequencies (7.2) relating different levels.
We can apply the formalism of Chap. 6 to the multi-level atom in the presence
of a magnetic field by performing the following substitutions
i) for the energy eigenvectors:
| n → |αJM ;
En → EαJ + µ0 gαJ B M ,
whence
iii) for the Bohr frequencies:
En − Em
νnm = → ναJM, α J M = ναJ, α J + νL (gαJ M − gα J M ) , (7.3)
h
where ναJ,α J is defined in Eq. (7.2) and νL is the Larmor frequency (cf. Eq. (3.10));
iv) for the matrix elements of the spherical components of the dipole operator:
where we have used the assumption (7.1) and we have introduced the notation of
Eq. (3.91a).
1
We will see however in the following (see Sect. 10.17) that the coherences between
different J-levels are fundamental to explain certain polarization phenomena in multiplets even
when ναJ, α J A.
280 CHAPTER 7
m → αJM
n → α J M p → αu Ju Mu
m → αJM
n → α J M p → αu Ju Mu ,
m → αJM
where the indices (for ‘lower’) and u (for ‘upper’) denote any level of energy lower
(or higher, respectively) than EαJ . We obtain
d
ρ (M, M ) = −2πi νL gαJ (M − M ) ραJ (M, M )
dt αJ
+ ρα J (M , M ) TA (αJM M , α J M M )
α J M M
+ ρα J (Mu , Mu ) TE (αJM M , αu Ju Mu Mu )
u u
αu Ju Mu Mu
+ TS (αJM M , αu Ju Mu Mu )
&
− ραJ (M, M ) RA (αJM M ) + RE (αJM M )
M
+ RS (αJM M )
where the various rates (for which we have introduced shorthand notations) are
given by
TA (αJM M , α J M M ) =
32π 4
= 2 (−1)q+q (d−q )αJM, α J M (d−q )∗αJM , α J M Jqq (ναJ, α J )
h c
qq
TE (αJM M , αu Ju Mu Mu ) =
64π 4
= (d−q )α J M , αJM (d−q )∗α J M , αJM να3 J , αJ
3hc3 q u u u u u u u u
EQUATIONS FOR ATOMIC SYSTEMS 281
TS (αJM M , αu Ju Mu Mu ) =
32π 4
= 2 (−1)q+q (d−q )α J M , αJM (d−q )∗α J M , αJM Jqq (να J , αJ )
h c
u u u u u u u u
qq
RA (αJM M ) =
16π 4
= 2
(−1)q+q (d−q )α J M , αJM (d−q )∗α J M , αJM Jqq (να J , αJ )
h c
u u u u u u u u
αu Ju Mu qq
RE (αJM M ) =
32π 4
= (d−q )αJM, α J M (d−q )∗αJM , α J M ναJ,
3
α J
3hc3 q
α J M
RS (αJM M ) =
16π 4
= (−1)q+q (d−q )αJM, α J M (d−q )∗αJM , α J M Jqq (ναJ, α J ) .
h2 c
α J M qq
Because of Eq. (7.4), the expressions for the rates contain the square modulus of
the reduced matrix element of the dipole operator between the (αJ)-level and one
of the lower or upper levels,
| αJ d α J |2 or | αu Ju d αJ |2 .
These quantities can be easily expressed in terms of the Einstein coefficients for
transitions between different levels. Given two levels (α J ) and (αu Ju ), with
Eα J > Eα J , the line strength of the transition is defined by the symmetrical
u u
expression
where the last equality follows from Eq. (2.101). The Einstein coefficients for
spontaneous and stimulated emission and for absorption are connected with the
line strength by the relations1
64π 4 3
(2Ju + 1) A(αu Ju → α J ) = ν S(αu Ju , α J )
3hc3 αu Ju , α J
32π 4
(2Ju + 1) B(αu Ju → α J ) = S(αu Ju , α J )
3h2 c
32π 4
(2J + 1) B(α J → αu Ju ) = S(α J , αu Ju ) , (7.7)
3h2 c
1
Note that from Eqs. (6.71) and (6.73) the Einstein coefficients for the transition between
282 CHAPTER 7
with
2h 3
A(αu Ju → α J ) = ν B(αu Ju → α J )
c2 αu Ju , α J
(2J + 1) B(α J → αu Ju ) = (2Ju + 1) B(αu Ju → α J ) . (7.8)
Using these relations, we can rewrite the expressions of the rates in the form
64π 4 3
Aα J Mu , α J M = ν | αu Ju Mu | d |α J M |2
u u 3hc3 αu Ju Mu , α J M
32π 4
Bα J Mu , α J M = | αu Ju Mu | d |α J M |2
u u 3h2 c
32π 4
Bα J M , αu Ju Mu = | α J M | d |αu Ju Mu |2 .
3h2 c
1
RA (αJM M ) = (2J + 1) B(αJ → αu Ju )
2
αu Ju
M−M Ju J 1 Ju J 1
× 3 (−1)
−M u M −q −Mu M −q
qq Mu
1
RE (αJM M ) = (2J + 1) A(αJ → α J )
2
α J
J J 1 J J 1
× (−1)M−M
−M M −q −M M −q
qM
1
= δMM A(αJ → α J ) (7.9e)
2
α J
1
RS (αJM M ) = (2J + 1) B(αJ → α J )
2
α J
J J 1
J J 1
× 3
−M M −q −M M −q
qq M
m → αu Ju Mu n → α J M
m → αu Ju Mu n → α J M ,
we obtain1
hν
=
ηiA (ν, Ω) N (2J + 1) B(α J → αu Ju )
4π
α J αu Ju
M −M Ju J 1 Ju J 1
× (−1) 3
−M u M −q −M u M −q
M M Mu qq
ρ
× Re Tqq (i, Ω) α J (M , M ) Φ(να J Mu , α J M − ν) (7.10a)
u u
hν
=
ηiS (ν, Ω) N (2Ju + 1) B(αu Ju → α J )
4π
α J αu Ju
Ju J 1 Ju J 1
× 3
−M u M −q −M u M −q
Mu Mu M qq
ρ
× Re Tqq (i, Ω) α J (Mu , Mu ) Φ(να J Mu , α J M − ν) (7.10b)
u u u u
& '
= η A (ν, Ω)
ρAi (ν, Ω) Re → Im (7.10c)
i
& '
= η S (ν, Ω)
ρSi (ν, Ω) Re → Im (7.10d)
i
3
= 2hν η S (ν, Ω)
εi (ν, Ω)
i
c2
hν
= N (2Ju + 1) A(αu Ju → α J )
4π
α J αu Ju
Ju J 1 Ju J 1
× 3
−M u M −q −M u M −q
Mu Mu M qq
ρ
× Re Tqq (i, Ω) α J (Mu , Mu ) Φ(να J Mu , α J M − ν) . (7.10e)
u u u u
We will now convert the equations of the previous section into the spherical statis-
tical tensor representation.
and carrying out the summation over M and M we obtain, with the use of
Eqs. (3.97) and (3.99)
d K
ρ (αJ) = −2πi νL gαJ Q ρKQ (αJ)
dt Q
K
+ ρQ (α J ) TA (αJKQ, α J K Q ) +
α J K Q
EQUATIONS FOR ATOMIC SYSTEMS 285
K
+ ρQu (αu Ju ) TE (αJKQ, αu Ju Ku Qu )
u
αu Ju Ku Qu
+ TS (αJKQ, αu Ju Ku Qu )
− Q (αJ) RA (αJKQK Q ) + RE (αJKQK Q )
ρK
K Q
+ RS (αJKQK Q ) , (7.11)
where
TA (αJKQ, α J K Q ) =
(2K + 1)(2K + 1)
J−M+J −M J J K J J K
× (−1)
M −M −Q M −M −Q
MM M M
× TA (αJM M , α J M M )
TE (αJKQ, αu Ju Ku Qu ) =
(2K + 1)(2Ku + 1)
J J K Ju Ju Ku
× (−1)J−M+Ju −Mu
M −M −Q Mu −Mu −Qu
MM Mu Mu
× TE (αJM M , αu Ju Mu Mu )
TS (αJKQ, αu Ju Ku Qu ) =
(2K + 1)(2Ku + 1)
J J K Ju Ju Ku
× (−1)J−M+Ju −Mu
M −M −Q Mu −Mu −Qu
MM Mu Mu
× TS (αJM M , αu Ju Mu Mu )
RA (αJKQK Q ) = (2K + 1)(2K + 1)
J J K
×
M −M −Q
MM M
J J K
× RA (αJM M )
M −M −Q
!
Q −Q J J K
+ (−1) RA (αJM M )
M −M −Q
286 CHAPTER 7
RE (αJKQK Q ) = (2K + 1)(2K + 1)
J J K
×
M −M −Q
MM M
J J K
× RE (αJM M )
M −M −Q
!
J J K
+ (−1)Q −Q R (αJM M
)
M −M −Q E
RS (αJKQK Q ) = (2K + 1)(2K + 1)
J J K
×
M −M −Q
MM M
J J K
× RS (αJM M )
M −M −Q
!
J J K
+ (−1)Q −Q R (αJM M
) .
M −M −Q S
These expressions can be further developed by substitution of Eqs. (7.9). For the
rate TA we have
K
We can now introduce the irreducible radiation field tensor JQ (ν) using Eq. (5.156),
which gives
.
2Kr + 1 1 1 Kr K
1+q
Jqq (ν) = (−1) JQ r (ν) . (7.13)
3 q −q −Qr r
Kr Qr
1
To arrange the sign factor in this expression it should be kept in mind that – as apparent
from the various 3-j symbols – the indices K, Q, K , Q , Kr , Qr are integers.
EQUATIONS FOR ATOMIC SYSTEMS 287
The sum of the product of the five 3-j symbols can be evaluated via Eq. (2.52).
After some manipulations similar to those of App. 2 we obtain1
With analogous procedures involving the use of Eq. (2.34) for TE , Eq. (2.52) for
TS , Eq. (2.23a) for RE , and Eq. (2.42) – applied twice – for RA and RS , we get
!
Ju Ju K
× (−1)1+J+Ju +K (7.14b)
J J 1
1
Note that the sign factor appearing in this formula, (−1)K +Q , can be written in various
different ways. As Q is an integer, we can also write (−1)K −Q . As the 9-j symbol is zero unless
(K + K + Kr ) is an even integer (because of the symmetry properties following Eq. (2.48)), the
sign factor can also be written (−1)K+Kr +Q ; etc.
288 CHAPTER 7
RA (αJKQK Q ) = (2J + 1) B(αJ → αu Ju )
αu Ju
× 3(2K + 1)(2K + 1)(2Kr + 1)
Kr Qr
! !
K K Kr 1 1 Kr K K Kr
× (−1)1+Ju −J+Kr +Q
J J J J J Ju Q −Q Qr
K
× ζ+ JQ r (να J , αJ ) (7.14d)
r u u
RE (αJKQK Q ) = δKK δQQ A(αJ → α J ) (7.14e)
α J
RS (αJKQK Q ) = (2J + 1) B(αJ → α J )
α J
× 3(2K + 1)(2K + 1)(2Kr + 1)
Kr Qr
! !
K K Kr 1 1 Kr K K Kr
× (−1)1+J −J+Q
J J J J J J Q −Q Qr
K
× ζ+ JQ r (ναJ, α J ) , (7.14f)
r
where1
1
1 + (−1)K+K +Kr .ζ+ =
2
The equations above were first deduced by Bommier and Sahal-Bréchot (1978).
Bommier (1977) had formerly derived the corresponding equations in the standard
representation.
1
Note that the quantity ζ+ is 1 or 0 according as the integer (K + K + Kr ) is even or odd,
respectively.
EQUATIONS FOR ATOMIC SYSTEMS 289
Ju J 1 Ju J 1
× (−1)1+J −M +q
−Mu M −q −Mu M −q
M M Mu qq
1 1 K J J K
×
q −q −Q M −M −Q
× Re T Q
K ρK (α J ) Φ(ν
(i, Ω) − ν) (7.15a)
Q
α J Mu , α J M
u u
hν
=
ηiS (ν, Ω) N (2Ju + 1) B(αu Ju → α J )
4π
α J αu Ju
× 3(2K + 1)(2Ku + 1)
KQKu Qu
1+Ju −Mu +q Ju J 1 Ju J 1
× (−1)
−Mu M −q −Mu M −q
Mu Mu M qq
1 1 K Ju Ju Ku
×
q −q −Q Mu −Mu −Qu
× Re T Q
K ρKu (α J ) Φ(ν
(i, Ω) − ν) (7.15b)
Q u u u
α J Mu , α J M
u u
& '
= η A (ν, Ω)
ρAi (ν, Ω) Re → Im (7.15c)
i
& '
= η S (ν, Ω)
ρSi (ν, Ω) Re → Im (7.15d)
i
3
= 2hν η S (ν, Ω)
εi (ν, Ω) . (7.15e)
i
c2
Φ(να J Mu , α J M − ν) → Φ(να J , α J − ν) ,
u u u u
the summations over the magnetic quantum numbers can % be Kcarried out using
Q T Q (i, Ω) ρQ (αJ) is
K
Eq. (2.34). Taking also into account that the quantity
real – as apparent from the conjugation properties (3.102) and (5.158) – one gets
hν
=
ηiA (ν, Ω) N (2J + 1) B(α J → αu Ju ) ×
4π
α J αu Ju
290 CHAPTER 7
√ !
1 1 K ρK (α J )
× 3 (−1)1+J +Ju +K TQ
K
(i, Ω) Q
J J Ju
KQ
× φ(να J , α J − ν) (7.16a)
u u
= hν N
ηiS (ν, Ω) (2Ju + 1) B(αu Ju → α J )
4π
α J αu Ju
√ !
1 1 K ρK (α J )
× 3 (−1)1+J +Ju TQ K
(i, Ω) Q u u
Ju Ju J
KQ
× φ(να J , α J − ν) (7.16b)
u u
& '
= η A (ν, Ω)
ρAi (ν, Ω) φ(ν − ν) → ψ(να − ν) (7.16c)
i α J , α J
u u
J , α J
u u
& '
= η S (ν, Ω)
ρSi (ν, Ω) φ(ν − ν) → ψ(να − ν) (7.16d)
i α J , α J
u u
J , α J
u u
3
= 2hν η S (ν, Ω)
εi (ν, Ω) , (7.16e)
2 i
c
The physical meaning of the various rates defined in Sect. 7.1.a is strictly analogous
to that of the more general rates introduced in Chap. 6 (see the discussion follow-
ing Eq. (6.65)). For instance, the quantity TA (αJM M , α J M M ) represents the
transfer rate, due to absorption, from the ‘lower-level coherence’ ρα J (M , M )
to the coherence ραJ (M, M ). Similarly, the quantity RA (αJM M ) represents
the relaxation rate, due to absorption to upper levels, connecting the coherence
ραJ (M, M ) with the coherence ραJ (M, M ). The rates introduced in Sect. 7.2.a
have a similar physical interpretation. The quantity TA (αJKQ, α J K Q ), for in-
stance, represents the transfer rate, due to absorption, from the spherical statistical
K
tensor of the lower level ρQ (α J ) to the spherical statistical tensor ρK
Q (αJ).
We can easily find the conjugation properties of the different rates. Bearing in
mind Eq. (5.154), we have from Eqs. (7.9)
RA (αJM M )∗ = RA (αJM M )
RS (αJM M )∗ = RS (αJM M ) .
= TE (αJKQ, αu Ju Ku Qu )
The equations derived in Sects. 7.1 and 7.2 are rather involved, which is quite
natural as they describe a complex physical situation. It is interesting to consider
in detail the special case where atomic polarization can be disregarded; obviously,
the equations become considerably simpler in this case.
Let us assume that coherences between Zeeman sublevels can be neglected or,
in other words, that the atomic density matrix is diagonal. This case – which
will be referred to as no-coherence case – covers the union (in the sense of the
mathematical set theory) of the physical regimes IIIa, IVa, Va, IIIb outlined in
Sect. 5.16. The statistical equilibrium equations in the standard representation
might be deduced directly from Eq. (7.5). However, the resulting equations would
be of limited use because of the severe assumption on the spectrum of the incident
radiation. Since no coherences between magnetic sublevels are present, we can now
make – still consistently with the flat-spectrum approximation – the less restrictive
assumption that the incident radiation field is flat across frequency intervals wider
than the inverse lifetimes of the magnetic sublevels, allowing for its dependence on
the individual Bohr frequencies ναJM, α J M . Performing in Eqs. (6.62), (6.69) the
same substitutions as in Sect. 7.1.a and introducing shorthand notations for the
292 CHAPTER 7
rates, we have1
d
ραJ (M ) = ρα J (M ) tA (αJM, α J M )
dt
α J M
+ ρα J (Mu ) tE (αJM, αu Ju Mu )
u u
αu Ju Mu
+ tS (αJM, αu Ju Mu )
where
J J 1
2
tE (αJM, αu Ju Mu ) = (2Ju + 1) A(αu Ju → αJ) u (7.18b)
−Mu M −q
q
rE (αJM ) = A(αJ → α J ) (7.18e)
α J
rS (αJM ) = (2J + 1) B(αJ → α J )
α J
2
J J 1
× 3 Jqq (ναJM, α J M ) , (7.18f)
−M M −q
qM
For the radiative transfer coefficients we obtain from Eqs. (7.10), with the help
of Eq. (5.148)
= hν N
ηiA (ν, Ω) (2J + 1) B(α J → αu Ju )
4π
α J αu Ju
2
Ju J 1 ρ
× 3 Tqq (i, Ω) (M )
−Mu M −q α J
M Mu q
× φ(να J Mu , α J M − ν) (7.19a)
u u
= hν N
ηiS (ν, Ω) (2Ju + 1) B(αu Ju → α J )
4π
α J αu Ju
2
Ju J 1 ρ
× 3 Tqq (i, Ω) (Mu )
−Mu M −q α J
u u
M Mu q
× φ(να J M , α J M − ν) (7.19b)
u u u
& '
= η (ν, Ω) φ(ν
αu Ju Mu , α J M − ν) → ψ(ναu Ju Mu , α J M − ν)
A A
ρi (ν, Ω) i (7.19c)
& '
= η S (ν, Ω)
φ(ν
ρSi (ν, Ω) i α J M , α J M − ν) → ψ(να J M , α J M − ν)
u u u u u u
(7.19d)
3
= 2hν η S (ν, Ω)
εi (ν, Ω) . (7.19e)
2 i
c
Similar simplifications occur in the equations written in the spherical statistical
tensor representation. The no-coherence hypothesis results here in the fact that all
the components of the multipole moments ρK Q vanish except those having Q = 0,
as apparent from the definition (3.101). Multiplying both sides of Eq. (7.17) by
√ J J K
(−1)J−M 2K + 1 ,
M −M 0
carrying out the summation over M , and using Eqs. (3.99), (3.101) and (7.13), we
obtain the following simplified statistical equilibrium equations1
d K K
ρ0 (αJ) = ρ0 (α J ) tA (αJK, α J K )
dt
α J K
K
+ ρ0 u (αu Ju ) tE (αJK, αu Ju Ku ) + tS (αJK, αu Ju Ku )
αu Ju Ku
− ρK
0 (αJ) rA (αJKK ) + rE (αJKK ) + rS (αJKK ) ,
K
1
The expressions for the rates tE (αJK, αu Ju Ku ) and rE (αJKK ) are derived with the help
of Eqs. (2.34) and (2.23a), respectively.
294 CHAPTER 7
where
tA (αJK, α J K ) = (2J + 1) B(α J → αJ) 3(2K + 1)(2K + 1)(2Kr + 1)
Kr
J J K J J K
× (−1)1+J−J
M −M 0 M −M 0
MM q
2
J J 1 1 1 Kr K
× J0 r (ναJM, α J M ) (7.20a)
−M M −q q −q 0
tS (αJK, αu Ju Ku ) = (2Ju + 1) B(αu Ju → αJ) 3(2K + 1)(2Ku + 1)(2Kr + 1)
Kr
J J K Ju Ju Ku
× (−1) 1+J−Ju
M −M 0 Mu −Mu 0
MMu q
2
Ju J 1 1 1 Kr K
× J0 r (να J Mu , αJM ) (7.20c)
−Mu M −q q −q 0 u u
rA (αJKK ) = (2J + 1) B(αJ → αu Ju ) 3(2K + 1)(2K + 1)(2Kr + 1)
αu Ju Kr
J J K J J K
× (−1) 1+q
M −M 0 M −M 0
MMu q
2
Ju J 1 1 1 Kr K
× J0 r (να J Mu , αJM ) (7.20d)
−Mu M −q q −q 0 u u
rE (αJKK ) = δKK A(αJ → α J ) (7.20e)
α J
rS (αJKK ) = (2J + 1) B(αJ → α J ) 3(2K + 1)(2K + 1)(2Kr + 1)
α J Kr
J J K J J K
× (−1) 1+q
M −M 0 M −M 0
MM q
2
J J 1 1 1 Kr K
× J0 r (ναJM, α J M ) . (7.20f)
−M M −q q −q 0
EQUATIONS FOR ATOMIC SYSTEMS 295
For the radiative transfer coefficients we have from Eqs. (7.15), with the use of
the conjugation relations (3.102) and (5.158)
hν
=
ηiA (ν, Ω) N (2J + 1) B(α J → αu Ju ) 3(2K + 1)(2K + 1)
4π
α J αu Ju KK
2
Ju J 1 1 1 K J J K
× (−1)1+J −M +q
−Mu M −q q −q 0 M −M 0
M Mu q
ρK (α J ) φ(ν
× T 0K (i, Ω) − ν) (7.21a)
0 α J Mu , α J M
u u
= hν N
ηiS (ν, Ω) (2Ju + 1) B(αu Ju → α J ) 3(2K + 1)(2Ku + 1)
4π
α J αu Ju KKu
2
1+Ju −Mu +q Ju J 1 1 1 K Ju Ju Ku
× (−1)
−Mu M −q q −q 0 Mu −Mu 0
M Mu q
K
ρ u (α J ) φ(ν
× T 0K (i, Ω) − ν) (7.21b)
0 u u α J Mu , α J M
u u
& '
= η A (ν, Ω)
ρAi (ν, Ω) φ(ν − ν) → ψ(ν − ν) (7.21c)
i αu Ju Mu , α J M αu Ju Mu , α J M
& '
= η S (ν, Ω)
ρSi (ν, Ω) φ(ν − ν) → ψ(ν − ν) (7.21d)
i α J M ,α J M
u u u
α J M ,α J M u u u
3
= 2hν η S (ν, Ω)
εi (ν, Ω) . (7.21e)
2 i
c
We will now consider the interaction of an atomic system with a polarized radiation
field in more general terms, dropping some of the basic restrictions adopted so far.
First, we will allow for coherences between different J-levels, besides those between
magnetic sublevels of any individual J-level. Second, we will drop the limitation
on the magnetic field intensity,1 so that the validity of the new equations is not
restricted to the Zeeman effect regime. The only restrictions that will be retained
are the following:
a) the atom is devoid of hyperfine structure;
b) the atom is described by the L-S coupling scheme (cf. Sect. 3.1), so that
the different J-levels are grouped in terms, each term being characterized by the
quantum numbers β, L, and S;
c) coherences between J-levels pertaining to different terms are negligible.
1
The magnetic field must be however sufficiently weak for the diamagnetic term in the atomic
Hamiltonian to be negligible. This implies B 1010 G (see the discussion in Sect. 3.1).
296 CHAPTER 7
The model atom described by these approximations will be named in the following
the multi-term atom. For such system, the flat-spectrum approximation discussed
in Sect. 6.5 requires that the radiation field incident on the atom should be constant
across frequency intervals larger than the frequency separation between different
levels belonging to the same term. The radiation field can thus be characterized
by specifying its value at the Bohr frequencies
E(βLS) − E(β L S )
νβLS, β L S = , (7.22)
h
where E(βLS), E(β L S ) are the energies of the relevant terms (which disregard
both the spin-orbit and the magnetic Hamiltonian). This approximation is indeed
rather restrictive, but it can only be released within the framework of more general
theories able to encompass frequency redistribution effects (see the introductory
discussion to Chap. 6).
We now apply the equations derived in Chap. 6 to the multi-term atom embedded
in an arbitrary magnetic field. To this aim we must go back to the results obtained
in Sect. 3.4 for the Paschen-Back effect regime – which obviously describe, as a
particular case, also the Zeeman effect regime. Using the notations of Sect. 3.4
(which implies that the z-axis of the reference system is in the magnetic field
direction) we have for the eigenvectors and eigenvalues of the atomic Hamiltonian
HA |βLSjM = E(βLS) + λj (βLS, M ) |βLSjM ,
| n → |βLSjM ; (7.24)
En − Em
νnm = → νβLSjM, β L S j M =
h
λj (βLS, M ) − λj (β L S , M )
= νβLS, β L S + ; (7.26)
h
EQUATIONS FOR ATOMIC SYSTEMS 297
iv) for the matrix elements of the spherical components of the dipole operator:
where we have used the reality of the CJj coefficients (cf. Sect. 3.4), the Wigner-
Eckart theorem (Eq. (2.96)), and Eq. (2.108);
v) finally, for the atomic density-matrix elements:
ρnm = n | ρ | m → βLSjM | ρ |β L S j M =
= δββ δLL δSS βLSjM | ρ |βLSj M
= δββ δLL δSS ρβLS (jM, j M ) , (7.28)
m → βLSjM
n → β L Sj M p → βu Lu Sju Mu
m → βLSj M
n → β L Sj M p → βu Lu Sju Mu ,
m → βLSj M
where the quantum numbers (β L S) and (βu Lu S) refer to any term of energy lower
or higher, respectively, than E(βLS). We obtain
298 CHAPTER 7
d
ρ (jM, j M ) = −2πi νβLS (jM, j M ) ρβLS (jM, j M )
dt βLS
+ ρβ L S (j M , j M ) TA (βLSjM j M , β L Sj M j M )
β L j M j M
+ ρβ L S (ju Mu , ju Mu ) TE (βLSjM j M , βu Lu Sju Mu ju Mu )
u u
βu Lu ju Mu ju Mu
+ TS (βLSjM j M , βu Lu Sju Mu ju Mu )
&
− ρβLS (jM, j M ) RA (βLSj M j M )
j M
+ RE (βLSj M j M ) + RS (βLSj M j M )
+ ρβLS (j M , j M ) RA (βLSj M jM )
'
+ RE (βLSjM j M ) + RS (βLSjM j M ) , (7.29)
where
λj (βLS, M ) − λj (βLS, M )
νβLS (jM, j M ) = . (7.30)
h
Similarly to Sect. 7.1.a, we now define the line strength of the (electric-dipole)
transition between two terms (β L S) and (βu Lu S) – with E(βu Lu S) > E(β L S) –
by the symmetrical expression
The Einstein coefficients for spontaneous and stimulated emission and for absorp-
tion are connected with the line strength by the relations1
1
Starting from Eqs. (6.71) and (6.73), it can be proved that the Einstein coefficients for the
transition between the individual levels (β L Sj M ) and (βu Lu Sju Mu ) are connected with the
Einstein coefficients defined here by the relations
%
Aβ L Sju Mu , β L Sj M = A(βu Lu S → β L S)
%
j M u u
j M
Bβ L Sju Mu , β L Sj M = B(βu Lu S → β L S)
% u u
ju Mu
Bβ L Sj M , βu Lu Sju Mu = B(β L S → βu Lu S)
The proof can be carried out with the help of Eqs. (7.27), (3.62), (2.23a) and (2.39).
EQUATIONS FOR ATOMIC SYSTEMS 299
64π 4 3
(2Lu + 1) A(βu Lu S → β L S) = ν S(βu Lu S, β L S)
3hc3 βu Lu S, β L S
32π 4
(2Lu + 1) B(βu Lu S → β L S) = S(βu Lu S, β L S)
3h2 c
32π 4
(2L + 1) B(β L S → βu Lu S) = S(β L S, βu Lu S) , (7.32)
3h2 c
with
2h 3
A(βu Lu S → β L S) = ν B(βu Lu S → β L S)
c2 βu Lu S, β L S
(2L + 1) B(β L S → βu Lu S) = (2Lu + 1) B(βu Lu S → β L S) . (7.33)
j j
× CJ (β L S, M ) CJ (β L S, M ) (2J + 1)(2J + 1)(2J + 1)(2J + 1)
! !
J J 1 J J 1 L L 1 L L 1
×
−M M −q −M M −q J J S J J S
j j
× CJu (βu Lu S, Mu ) CJu (βu Lu S, Mu ) (2J + 1)(2J + 1)(2Ju + 1)(2Ju + 1)
u u
! !
Ju J 1 Ju J 1 Lu L 1 Lu L 1
× (7.34b)
−Mu M −q −Mu M −q J Ju S J Ju S
j j
× CJu (βu Lu S, Mu ) CJu (βu Lu S, Mu ) (2J + 1)(2J + 1)(2Ju + 1)(2Ju + 1)
u u
! !
Ju J 1 Ju J 1 Lu L 1 Lu L 1
×
−Mu M −q −Mu M −q J Ju S J Ju S
1
RA (βLSjM j M ) = (2L + 1) B(βLS → βu Lu S)
2
βu Lu
× 3 (−1)q+q CJj (βLS, M ) CJj (βLS, M )
JJ Ju qq Mu
Ju J 1 Ju J 1
× (2J + 1)(2J + 1) (2Ju + 1)
−Mu M −q −Mu M −q
! !
Lu L 1 Lu L 1
× Jqq (νβ L S, βLS ) (7.34d)
J Ju S J Ju S u u
1
RE (βLSjM j M ) = δjj δMM A(βLS → β L S) (7.34e)
2
β L
1
RS (βLSjM j M ) = (2L + 1) B(βLS → β L S)
2
β L
× 3 CJj (βLS, M ) CJj (βLS, M )
JJ J qq M
J J 1 J J 1
× (2J + 1)(2J + 1) (2J + 1)
−M M −q −M M −q
! !
L L 1 L L 1
× Jqq (νβLS, β L S ) . (7.34f)
J J S J J S
The expressions for RA and RS have been derived using Eq. (3.62b); the expression
for RE using Eqs. (3.62b), (2.23a), (2.39), and (3.62a).
m → βu Lu Sju Mu n → β L Sj M
m → βu Lu Sju Mu n → β L Sj M ,
= hν N
ηiA (ν, Ω) (2L + 1) B(β L S → βu Lu S)
4π
β L Sβu Lu
j j
× 3 (−1)q+q CJ (β L S, M ) CJ (β L S, M )
j j J J ju Ju Ju M M Mu qq
j j
× CJu (βu Lu S, Mu ) CJu (βu Lu S, Mu ) (2J + 1)(2J + 1)(2Ju + 1)(2Ju + 1)
u u
! !
Ju J 1 Ju J 1 Lu L 1 Lu L 1
×
−Mu M −q −Mu M −q J Ju S J Ju S
ρ
× Re Tqq (i, Ω) β L S (j M , j M ) Φ(νβ L Sju Mu , β L Sj M − ν) (7.35a)
u u
= hν N
ηiS (ν, Ω) (2Lu + 1) B(βu Lu S → β L S)
4π
βu Lu Sβ L
j j
× 3 CJ (β L S, M ) CJ (β L S, M )
ju ju Ju Ju j J J Mu Mu M qq
j j
× CJu (βu Lu S, Mu ) CJu (βu Lu S, Mu ) (2J + 1)(2J + 1)(2Ju + 1)(2Ju + 1)
u u
! !
Ju J 1 Ju J 1 Lu L 1 Lu L 1
×
−Mu M −q −Mu M −q J Ju S J Ju S
ρ
× Re Tqq (i, Ω) β L S (ju Mu , ju Mu ) Φ(νβ L Sju Mu , β L Sj M − ν) (7.35b)
u u u u
& '
= η A (ν, Ω)
ρAi (ν, Ω) Re → Im (7.35c)
i
& '
= η S (ν, Ω)
ρSi (ν, Ω) Re → Im (7.35d)
i
3
= 2hν η S (ν, Ω)
εi (ν, Ω) . (7.35e)
i
c2
It is worth pointing out an important difference between the case of the multi-level
atom considered in Sect. 7.1 and the case of the multi-term atom of Sect. 7.5. In
the former case, because of the Zeeman effect regime assumption, the energy eigen-
vectors coincide with the eigenvectors of angular momentum, so that there is no
302 CHAPTER 7
ρβLS (jM, j M ) = CJj (βLS, M ) CJj (βLS, M ) ρβLS (JM, J M )
JJ
ρβLS (jM, j M ) = CJj (βLS, M ) CJj (βLS, M )
JJ
√ J J K
× (−1) J−M
2K + 1 ρQ (J, J )
βLS K
(7.36)
M −M −Q
KQ
with the inverse formulae, that can be deduced using Eqs. (3.62b) and (3.97)
ρβLS (JM, J M ) = CJj (βLS, M ) CJj (βLS, M ) ρβLS (jM, j M )
jj
ρQ (J, J )
βLS K
= CJj (βLS, M ) CJj (βLS, M )
jMj M
√ J J K
× (−1)J−M 2K + 1 ρβLS (jM, j M ) . (7.37)
M −M −Q
In the following we derive the statistical equilibrium equations and the radiative
transfer coefficients directly in the spherical statistical tensor representation.
√ J J K
CJj (βLS, M ) CJj (βLS, M ) (−1)J−M 2K + 1
M −M −Q
d
βLS K
ρQ (J, J )
βLS K
= −2πi NβLS (KQJJ , K Q J J ) ρQ (J , J )
dt
K Q J J
β L S K
+ ρQ (J , J ) TA (βLSKQJJ , β L SK Q J J )
β L K Q J J
βu Lu S Ku
+ ρQ (Ju , Ju ) TE (βLSKQJJ , βu Lu SKu Qu Ju Ju )
u
βu Lu Ku Qu Ju Ju
+ TS (βLSKQJJ , βu Lu SKu Qu Ju Ju )
βLS K
− ρQ (J , J ) RA (βLSKQJJ K Q J J )
K Q J J
where
RA (βLSKQJJ K Q J J ) =
= CJj (βLS, M ) CJj (βLS, M )
jMj M j M
J−J
J J K
× (−1) (2K + 1)(2K + 1)
M −M −Q
× CJj (βLS, M ) CJj (βLS, M )
J J K
× RA (βLSj M j M )
M −M −Q
+ (−1)M −M CJj (βLS, M ) CJj (βLS, M )
J J K
× RA (βLSj M jM ) (7.40c)
M −M −Q
RE (βLSKQJJ K Q J J ) =
= CJj (βLS, M ) CJj (βLS, M )
jMj M j M
J J K
× (−1)J−J (2K + 1)(2K + 1)
M −M −Q
× CJj (βLS, M ) CJj (βLS, M )
J J K
× RE (βLSj M j M )
M −M −Q
+ (−1)M −M CJj (βLS, M ) CJj (βLS, M )
J J K
× RE (βLSjM j M ) (7.40d)
M −M −Q
NβLS (KQJJ , K Q J J ) = δQQ (−1)J−J (2K + 1)(2K + 1)
1 J J K
J J K
× δ
h JJ
M −M −Q M −M −Q
MM
where Hso is the spin-orbit Hamiltonian and HB the magnetic Hamiltonian. The
matrix elements in the right-hand side can be easily evaluated. Taking into account
that Hso is diagonal with respect to J, expressing the matrix elements of HB via
Eq. (3.57), and performing the summation over M and M by means of Eqs. (2.23a),
(3.42) and (2.42), we obtain after some algebra
where
ΓLS (J, J ) = (−1)J−J ΓLS (J , J)
= δJJ J(J + 1)(2J + 1)
!
J J 1
+ (−1) 1+L+S+J
(2J + 1)(2J + 1)S(S + 1)(2S + 1) (7.42)
S S L
and where
EβLS (J) − EβLS (J )
νβLSJ, βLSJ = , (7.43)
h
with EβLS (J) defined in Eq. (3.60). The quantity ΓLS (J, J ) is a sort of generalized
Landé factor. Use of Eq. (2.36d) shows that
ΓLS (J, J) = J(J + 1)(2J + 1) gLS (J) , (7.44)
where gLS (J) is the usual Landé factor introduced in Eq. (3.8).
The expressions for the different rates can be further developed by substituting
Eqs. (7.34) into Eqs. (7.40). Performing the summations over the indices j, j , j ,
j etc. with the use of Eqs. (3.62b), and substituting Eq. (7.13), the calculations
reduce to evaluating the sums over angular momentum components of products of
several 3-j symbols. This can be done using Eq. (2.52) for the rates TA and TS ,
306 CHAPTER 7
Eq. (2.34) for TE , Eq. (2.42) applied twice for RA and RS , and Eq. (2.23a), also
applied twice, for RE . The final results are the following
RA (βLSKQJJ K Q J J ) = (2L + 1) (2Ju + 1) B(βLS → βu Lu S)
βu Lu Ju
+Kr +Q K K Kr
× 3(2K + 1)(2K + 1)(2Kr + 1) (−1)1+Ju −J
Q −Q Qr
Kr Qr
1
× δJJ (2J + 1)(2J + 1)
2 ! !
J−J Lu L 1 Lu L 1
× (−1)
J Ju S J Ju S
! !
K K Kr 1 1 Kr
× +
J J J J J Ju
EQUATIONS FOR ATOMIC SYSTEMS 307
+ δJ J (2J + 1)(2J + 1)
! !
Lu L 1 Lu L 1
× (−1)K+K +Kr
J Ju S J Ju S
! !
K K Kr 1 1 Kr K
× JQ r (νβ L S, βLS ) (7.45d)
J J J J J Ju r u u
RS (βLSKQJJ K Q J J ) = (2L + 1) (2J + 1) B(βLS → β L S)
β L J
1+J −J +Q K K Kr
× 3(2K + 1)(2K + 1)(2Kr + 1) (−1)
Q −Q Qr
Kr Qr
1
× δJJ (2J + 1)(2J + 1)
2 ! !
J−J L L 1 L L 1
× (−1)
J J S J J S
! !
K K Kr 1 1 Kr
×
J J J J J J
+ δJ J (2J + 1)(2J + 1)
! !
L L 1 L L 1
× (−1)K+K +Kr
J J S J J S
! !
K K Kr 1 1 Kr K
× JQ r (νβLS, β L S ) . (7.45f)
J J J J J J r
The above expressions for the relaxation rates can be further simplified by carrying
out the summations on the J quantum numbers. Using Eq. (2.41) for RA and RS ,
and Eq. (2.39) for RE we obtain
RA (βLSKQJJ K Q J J ) = (2L + 1) B(βLS → βu Lu S)
βu Lu
× 3(2K + 1)(2K + 1)(2Kr + 1)
Kr Qr
!
1+Lu −S+J+Q L L Kr K K Kr
× (−1) ×
1 1 Lu Q −Q Qr
308 CHAPTER 7
1
× δJJ (2J + 1)(2J + 1)
2
! !
L L Kr K K Kr
×
J J S J J J
+ δJ J (2J + 1)(2J + 1) (−1)J −J +K+K +Kr
! !
L L Kr K K Kr K
× JQ r (νβ L S, βLS ) (7.46a)
J J S J J J r u u
RS (βLSKQJJ K Q J J ) = (2L + 1) B(βLS → β L S)
β L
× 3(2K + 1)(2K + 1)(2Kr + 1)
Kr Qr
!
1+L −S+J+Kr +Q L L Kr K K Kr
× (−1)
1 1 L Q −Q Qr
1
× δJJ (2J + 1)(2J + 1)
2
! !
L L Kr K K Kr
×
J J S J J J
+ δJ J (2J + 1)(2J + 1) (−1)J −J +K+K +Kr
! !
L L Kr K K Kr K
× JQ r (νβLS, β L S ) . (7.46c)
J J S J J J r
1
The expressions appearing in that paper contain indeed some additional terms, due to
K
the tensor FQ r , in the rates RA and RS . These terms are of very limited importance and their
r
presence is not justified in the flat-spectrum approximation.
EQUATIONS FOR ATOMIC SYSTEMS 309
= hν N
ηiA (ν, Ω) (2L + 1) B(β L S → βu Lu S)
4π
β L Sβu Lu
× 3(2K + 1)(2K + 1)
KQK Q
j j
× (−1)1+J −M +q CJ (β L S, M ) CJ (β L S, M )
j J J J ju Ju Ju M M Mu qq
j j
× CJu (βu Lu S, Mu ) CJu (βu Lu S, Mu ) (2J + 1)(2J + 1)(2Ju + 1)(2Ju + 1)
u u
Ju J 1 Ju J 1 1 1 K
×
−Mu M −q −Mu M −q q −q −Q
! !
J J K Lu L 1 Lu L 1
×
M −M −Q J Ju S J Ju S
β L S K
× Re T Q
K
(i, Ω) ρQ (J , J ) Φ(νβ L Sju Mu , β L Sj M − ν) (7.47a)
u u
= hν N
ηiS (ν, Ω) (2Lu + 1) B(βu Lu S → β L S)
4π
βu Lu Sβ L
× 3(2K + 1)(2Ku + 1)
KQKu Qu
j j
× (−1)1+Ju −Mu +q CJ (β L S, M ) CJ (β L S, M )
ju Ju Ju Ju j J J Mu Mu M qq
j j
× CJu (βu Lu S, Mu ) CJu (βu Lu S, Mu ) (2J + 1)(2J + 1)(2Ju + 1)(2Ju + 1)
u u
Ju J 1 Ju J 1 1 1 K
×
−Mu M −q −Mu M −q q −q −Q
! !
Ju Ju Ku Lu L 1 Lu L 1
×
Mu −Mu −Qu J Ju S J Ju S
βu Lu S Ku
× Re T Q
K
(i, Ω) ρQ (Ju , Ju ) Φ(νβ L Sju Mu , β L Sj M − ν) (7.47b)
u u u
& '
= η A (ν, Ω)
ρAi (ν, Ω) Re → Im (7.47c)
i
& '
= η S (ν, Ω)
ρSi (ν, Ω) Re → Im (7.47d)
i
= 2hν 3 S
εi (ν, Ω) ηi (ν, Ω) . (7.47e)
c2
310 CHAPTER 7
These expressions become much simpler when the frequency splittings – due to
fine structure and to the magnetic field – of the lines of each multiplet can be
neglected (see the analogous case discussed at the end of Sect. 7.2.b). Replacing
the profiles
Φ(νβ L Sj M , β L Sj M − ν)
u u u u
Φ(νβ L S, β L S − ν) ,
u u
and using Eqs. (3.62b), (2.34), (3.100) and (5.158), one obtains
= hν N
ηiA (ν, Ω) (2L + 1) B(β L S → βu Lu S)
4π
β L Sβu Lu
× (−1)1+Ju +J +K (2Ju + 1) 3(2J + 1)(2J + 1)
KQ J J Ju
! ! !
Lu L 1 Lu L 1 1 1 K
×
J Ju S J Ju S J J Ju
× TQ
K
(i, Ω) ρQ (J , J )
β L S K
φ(νβ L S, β L S − ν) (7.48a)
u u
hν
=
ηiA (ν, Ω) N (2L + 1) B(β L S → βu Lu S)
4π
β L Sβu Lu
× (−1)1−Lu +S+J 3(2J + 1)(2J + 1)
KQ J J
! !
L L K 1 1 K
×
J J S L L Lu
× TQ
K
(i, Ω) ρQ (J , J )
β L S K
φ(νβ L S, β L S − ν) . (7.48b)
u u
Similarly
= hν N
ηiS (ν, Ω) (2Lu + 1) B(βu Lu S → β L S)
4π
βu Lu Sβ L
× (−1)1+J +Ju (2J + 1) 3(2Ju + 1)(2Ju + 1)
KQ Ju Ju J
! ! !
Lu L 1 Lu L 1 1 1 K
×
J Ju S J Ju S Ju Ju J
× TQ
K
(i, Ω) ρQ (Ju , Ju )
βu Lu S K
φ(νβ L S, β L S − ν) (7.48c)
u u
EQUATIONS FOR ATOMIC SYSTEMS 311
× TQ
K
(i, Ω) ρQ (Ju , Ju )
βu Lu S K
φ(νβ L S, β L S − ν) . (7.48d)
u u
3
= 2hν η S (ν, Ω)
εi (ν, Ω) . (7.48g)
i
c2
The radiative rates appearing in the statistical equilibrium equations for the multi-
term atom have simple conjugation properties. In the energy-eigenvector repre-
sentation, we have from Eqs. (7.34), using Eq. (5.154) and the reality of the CJj
coefficients (see Sect. 3.4)
with analogous relations for RE and RS ; moreover, the rates TE and RE are real.
In the spherical statistical tensor representation, we have from Eqs. (7.45a-c) and
(7.46a-c), with the help of Eqs. (5.158)1
RA (βLSKQJJ K Q J J )∗ =
+Q+J −J +Q
= (−1)J−J RA (βLSK −QJ JK −Q J J ) ,
with analogous relations for RE and RS ; moreover, the rates TE and RE are real.
As apparent from Sects. 7.5 and 7.6, the description of the multi-term atom requires
much more complicated equations than the multi-level atom treated previously:
this is basically related to the fact that two types of coherences (between different
J-levels and between magnetic sublevels of each J-level) are considered in this
model. It can be easily realized that if we neglect coherences between J-levels,
retaining only those between magnetic sublevels of individual J-levels, the case of
the multi-term atom becomes very similar to the case of the multi-level atom. In
fact, after the formal identifications
the statistical equilibrium equations for the two cases must indeed coincide provided
the incident radiation field is such that
K
JQ (να J , α J ) ≡ JQ
K
(νβ L SJu , β L SJ )
K
= JQ (νβ L S, β L S ) (7.50)
u u u u u u
for each multiplet. The radiative transfer coefficients for the two cases must also
coincide provided the frequency splittings of the Φ profiles due to fine structure are
neglected,
ρQ (J, J )
βLS K
= δJJ βLS K
ρQ (J, J) = δJJ βLS K
ρQ (J) , (7.52)
and we get
EQUATIONS FOR ATOMIC SYSTEMS 313
d
βLS K
βLS K
ρQ (J) = −2πi NβLS (KQJJ, K Q J J ) ρQ (J )
dt
K Q J
β L S K
+ ρQ (J ) TA (βLSKQJJ, β L SK Q J J )
β L K Q J
βu Lu S Ku
+ ρQ (Ju ) TE (βLSKQJJ, βu Lu SKu Qu Ju Ju )
u
βu Lu Ku Qu Ju
+ TS (βLSKQJJ, βu Lu SKu Qu Ju Ju )
βLS K
− ρQ (J ) RA (βLSKQJJK Q J J ) + RE (βLSKQJJK Q J J )
K Q J
+ RS (βLSKQJJK Q J J ) . (7.53)
From Eqs. (7.41) and (7.44) we have
NβLS (KQJJ, K Q J J ) =
K K 1
= δQQ νL (−1) 2J−Q
(2K + 1)(2K + 1)
−Q Q 0
!
K−K K K 1
× δJJ J(J + 1)(2J + 1) 1 + (−1) gLS (J) ,
J J J
which can be transformed using some Racah algebra. Because of the triangular
condition (K, K , 1) we can write
1 + (−1)K−K = 2 δKK ,
and using Eqs. (2.26d) and (2.36d) we obtain
NβLS (KQJJ, K Q J J ) = δKK δQQ δJJ νL Q gLS (J) , (7.54)
which makes the first term in the right-hand side of Eq. (7.53) equal to the corre-
sponding term in Eq. (7.11).1
As far as the other terms are concerned, it is readily seen that Eq. (7.53) – with
the identifications (7.49) – has the same form as Eq. (7.11), thus we have just to
compare the expressions for the rates. From Eq. (7.45a) we have
TA (βLSKQJJ, βL SK Q J J ) = (2L + 1) B(β L S → βLS)
× (2J + 1) (2J + 1) 3(2K + 1)(2K + 1)(2Kr + 1)
Kr Qr ⎧ ⎫
⎨ J J 1 ⎬ !2
L L 1 K K Kr
× (−1)K +Q J J 1
⎩ ⎭ J J S −Q Q −Qr
K K Kr
K
× JQ r (νβLS, β L S ) , (7.55)
r
1
Of course, Eq. (7.54) contains the Landé factor evaluated in the L-S coupling scheme, while
Eq. (7.11) contains the more general Landé factor defined in Eq. (3.7).
314 CHAPTER 7
which should be compared with Eq. (7.14a). To this end we need the relation be-
tween the Einstein coefficient for the transition between two terms and the Einstein
coefficients for the transition between two individual J-levels.
Equation (7.6) with substitutions (7.49) gives
and using Eqs. (2.108) and (7.31) we obtain the relation connecting the strengths1
!2
Lu L 1
S(β L SJ , βu Lu SJu ) = (2J + 1) (2Ju + 1)
J Ju S
× S(β L S, βu Lu S) . (7.56)
Substitution into Eq. (7.55) shows that the expressions in the right-hand side of
Eqs. (7.55) and (7.14a) are identical when approximation (7.50) is satisfied.
Quite similar calculations can be performed for the other rates. Using the further
relations between Einstein coefficients2
!2
Lu L 1
B(βu Lu SJu → β L SJ ) = (2Lu + 1) (2J + 1)
J Ju S
× B(βu Lu S → β L S) (7.57b)
1
From Eq. (7.56) we have, with the help of Eq. (2.39)
%
J Ju
S(β L SJ , βu Lu SJu ) = (2S + 1) S(β L S, βu Lu S) ,
therefore the relative strengths of the multiplet, normalized to unity, are given by
!2
(2J + 1)(2Ju + 1) Lu L 1
S J ,Ju = ,
2S + 1 J Ju S
which is just Eq. (3.65).
2
Note that Eq. (7.57c) implies the approximation
νβ3 L SJu , β L SJ / νβ3 L S, β L S =1.
u u u u
Note also that the Einstein coefficients in Eqs. (7.57) are related by the equations
%
J
A(βu Lu SJu → β L SJ ) = A(βu Lu S → β L S)
%
B(βu Lu SJu → β L SJ ) = B(βu Lu S → β L S)
%
J
Ju
B(β L SJ → βu Lu SJu ) = B(β L S → βu Lu S) ,
one finds that all the rates in Eq. (7.53) coincide with the corresponding rates of
Eqs. (7.14), which proves that the statistical equilibrium equations for the multi-
term atom reduce to the equations for the multi-level atom under the assumptions
mentioned above.
The same check can be repeated for the radiative transfer coefficients. Substitu-
tion of Eq. (7.52) into Eqs. (7.48) and use of approximation (7.51) yields exactly
the same expressions derived for the multi-level atom (Eqs. (7.16)).
In the preceding sections of this chapter we have developed the basic equations for
a standard atom devoid of hyperfine structure. Now we are going to generalize the
description of the atomic system by including this further degree of freedom.
If I is the nuclear spin of a given atomic species, the energy eigenvectors – in the
absence of magnetic fields – have the form |αJIF f (cf. Sect. 3.5 for the meaning
of symbols). In principle, one could develop the statistical equilibrium equations
and the radiative transfer equations for the general case implying coherences of the
form
αJIF f | ρ |αJ IF f .
However, with the exception of the hydrogen atom (which presents, moreover, an
additional degeneracy of the eigenvalues with respect to the azimuthal quantum
number l, and therefore requires a more involved formalism), it is in most cases
sufficient to restrict the description of the atom with hyperfine structure to the
J-diagonal density-matrix elements
The model atom described by this approximation will be called in the following
the multi-level atom with hyperfine structure. For such system, the flat-spectrum
approximation (cf. Sect. 6.5) implies that the radiation field incident on the atom
should be constant across a frequency interval wider than the frequency shifts and
inverse lifetimes of the hyperfine-structure sublevels involved. This means that the
radiation field can be characterized by specifying its value at the Bohr frequencies
EαJ − Eα J
ναJ, α J = ,
h
where EαJ , Eα J are the energies of the relevant levels (which disregard both the
hyperfine-structure and the magnetic Hamiltonian). This approximation is usually
well-satisfied in astrophysical plasmas.
316 CHAPTER 7
| n → |αJIif ; (7.59)
En − Em
νnm = → ναJIif, α J Ii f =
h
λ (αJI, f ) − λi (α J I, f )
= ναJ, α J + i ; (7.61)
h
iv) for the matrix elements of the spherical components of the dipole operator (see
the analogous derivation of Eq. (7.27)):
If we now compare Eqs. (7.59)-(7.63) with Eqs. (7.24)-(7.28), we see that the
equations for the multi-level atom with hyperfine structure can be deduced from
the corresponding equations of the multi-term atom by means of the formal sub-
stitutions
Thus the statistical equilibrium equations and the radiative transfer coefficients for
the multi-level atom with hyperfine structure, expressed in the energy-eigenvector
representation, are simply given by Eqs. (7.29),1 (7.34) and (7.35) with the formal
substitutions (7.64).
The conversion to the spherical statistical tensor representation is also straight-
forward. Using Eqs. (7.58) and (3.103) we obtain the relation
ραJI (if, i f ) = CFi (αJI, f ) CFi (αJI, f ) ραJI (F f, F f )
FF
= CFi (αJI, f ) CFi (αJI, f )
FF
F −f
√ F F K
× (−1) 2K + 1 ρQ (F, F )
αJI K
,
f −f −Q
KQ
which is just the same as Eq. (7.36) with substitutions (7.64). It follows that the
equations for the multi-level atom with hyperfine structure, expressed in the spher-
ical statistical tensor representation, can be easily deduced from the corresponding
equations for the multi-term atom by performing the same substitutions.
For the statistical equilibrium equations we have (cf. Eq. (7.38))
d
ρQ (F, F )
αJI K
= −2πi ÑαJI (KQF F , K Q F F ) αJIρK
Q (F , F )
dt
K Q F F
α J I K
+ ρQ (F , F ) TA (αJIKQF F , α J IK Q F F )
α J K Q F F
αu Ju I Ku
+ ρQ (Fu , Fu ) TE (αJIKQF F , αu Ju IKu Qu Fu Fu )
u
αu Ju Ku Qu Fu Fu
+ TS (αJIKQF F , αu Ju IKu Qu Fu Fu )
αJI K
− ρQ (F , F ) RA (αJIKQF F K Q F F )
K Q F F
1
The term in Eq. (7.30) is discussed later in greater detail.
318 CHAPTER 7
The only term which cannot be deduced by direct substitution of Eqs. (7.64) is
the first term in the right-hand side, as expected because of the intrinsic difference
between the fine-structure and the hyperfine-structure Hamiltonians. To evaluate
this term we must go back to Eqs. (7.39) and (7.30). Applying substitutions (7.64)
and using Eqs. (3.76b,c) we get
ÑαJI (KQF F , K Q F F ) = δQQ (−1)F −F (2K + 1)(2K + 1)
1 F F K
F F K
× δF F
h
f −f −Q f −f −Q
ff
× αJIF f |Hhfs
(1) (2)
+ Hhfs + HB |αJIF f
F F K
F F K
− δF F
f −f −Q f −f −Q
ff
× αJIF f |Hhfs
(1) (2)
+ Hhfs + HB |αJIF f ,
(1) (2)
with the Hamiltonians Hhfs and Hhfs given by Eqs. (3.70) and (3.71). The matrix
elements in the right-hand side can be evaluated using Eqs. (3.70) and (3.72).
Carrying out the summations over f and f via Eqs. (2.23a) and (2.42) one obtains
where
EαJI (F ) − EαJI (F )
ναJIF, αJIF =
h
1
= αJIF f |Hhfs
(1) (2)
+ Hhfs |αJIF f − αJIF f |Hhfs
(1) (2)
+ Hhfs |αJIF f (7.67)
h
and where we have introduced a generalized Landé factor for hyperfine structure
through the position1
1
A slightly different generalized Landé factor has been introduced by Landolfi and Landi
Degl’Innocenti (1985). Note the similarity of Eq. (7.68) with Eq. (7.42). Note also that
ΓJ I (F, F ) = F (F + 1)(2F + 1) ghfs (F ) ,
where ghfs (F ) is given in Eq. (3.73).
EQUATIONS FOR ATOMIC SYSTEMS 319
ΓJI (F, F ) = (−1)F −F ΓJI (F , F )
!
F F 1
= (−1)1+J+I+F J(J + 1)(2J + 1)(2F + 1)(2F + 1) . (7.68)
J J I
All the other terms in Eq. (7.65) can be derived by carrying out the substitu-
tions (7.64). We write down here, for future reference, the expressions for the
radiative rates (cf. Eqs. (7.45a-c) and (7.46a-c))
RA (αJIKQF F K Q F F ) = (2J + 1) B(αJ → αu Ju )
αu Ju
× 3(2K + 1)(2K + 1)(2Kr + 1)
Kr Qr
!
1+Ju −I+F +Q J J Kr K K Kr
× (−1) ×
1 1 Ju Q −Q Qr
320 CHAPTER 7
1
× δF F (2F + 1)(2F + 1)
2
! !
J J Kr K K Kr
×
F F I F F F
+ δF F (2F + 1)(2F + 1) (−1)F −F +K+K +Kr
! !
J J Kr K K Kr K
× JQ r (να J , αJ ) (7.69d)
F F I F F F r u u
RS (αJIKQF F K Q F F ) = (2J + 1) B(αJ → α J )
α J
× 3(2K + 1)(2K + 1)(2Kr + 1)
Kr Qr
!
1+J −I+F +Kr +Q J J Kr K K Kr
× (−1)
1 1 J Q −Q Qr
1
× δF F (2F + 1)(2F + 1)
2
! !
J J Kr K K Kr
×
F F I F F F
+ δF F (2F + 1)(2F + 1) (−1)F −F +K+K +Kr
! !
J J Kr K K Kr K
× JQ r (ναJ, α ). (7.69f)
F F I F F F r
J
Finally, the radiative transfer coefficients can be obtained by applying the same
substitutions to Eqs. (7.47). In the special case where all the profiles of the various
hyperfine components can be considered coincident, we have from Eqs. (7.48)
hν
=
ηiA (ν, Ω) N (2J + 1) B(α J → αu Ju )
4π
α J αu Ju
× (−1)1−Ju +I+F 3(2F + 1)(2F + 1) ×
KQ F F
EQUATIONS FOR ATOMIC SYSTEMS 321
! !
J J K 1 1 K
×
F F I J J Ju
× TQ
K
(i, Ω) ρQ (F , F )
α J I K
φ(να J , α J − ν) (7.70a)
u u
= hν N
ηiS (ν, Ω) (2Ju + 1) B(αu Ju → α J )
4π
αu Ju α J
× (−1)1−J +I+Fu +K 3(2Fu + 1)(2Fu + 1)
KQ Fu Fu
! !
Ju Ju K 1 1 K
×
Fu Fu I Ju Ju J
× TQ
K
(i, Ω) ρQ (Fu , Fu )
αu Ju I K
φ(να J , α J − ν) (7.70b)
u u
& '
= η A (ν, Ω)
ρAi (ν, Ω) φ(ν − ν) → ψ(να − ν) (7.70c)
i α J , α J
u u
J , α J
u u
& '
= η S (ν, Ω)
ρSi (ν, Ω) φ(ν − ν) → ψ(να − ν) (7.70d)
i α J , α J
u u
J , α J
u u
3
= 2hν η S (ν, Ω)
εi (ν, Ω) . (7.70e)
i
c2
The equations obtained in the previous section for the multi-level atom with hy-
perfine structure must be compatible with the corresponding equations derived in
Sect. 7.2 for the multi-level atom without hyperfine structure. By this we mean
that when hyperfine structure has a negligible effect (strictly speaking, when the
two constants A and B in Eqs. (3.70) are zero), it should be possible to get back
the equations of Sect. 7.2 from those of Sect. 7.9. This is what we are going to
prove here, thus obtaining – in a particular case – a confirmation of the principle
of spectroscopic stability. Among the several possible formulations of this princi-
ple, the most satisfactory is the following: ‘If two different descriptions are used
to characterize a quantum system – a detailed description which takes an inner
quantum number into account and a simplified description which disregards it –
the predicted results must be the same in all physical experiments where the struc-
ture described by the inner quantum number is unimportant’. In the particular
case that we are considering here, the inner quantum numbers are the nuclear spin
quantum numbers (I, MI ).
We will now derive from Eq. (7.65) the corresponding equation for the multipole
moments ρK Q (αJ) describing the atom irrespective of its hyperfine structure.
Equation (3.104) shows that the time evolution of ρK Q (αJ) can be obtained by
322 CHAPTER 7
Now we perform the summation over F and F for the various terms in the right-
hand side of Eq. (7.65). For the term containing TA we obtain from Eq. (7.69a)
the following expression
!
F F K
(−1)J+I+F +K (2F + 1)(2F + 1)
J J I
FF
α J I K
× ρQ (F , F ) (2J + 1) B(α J → αJ)
α J K Q F F
× 3(2F + 1)(2F + 1)(2F + 1)(2F + 1)(2K + 1)(2K + 1)(2Kr + 1)
Kr Qr
⎧ ⎫
⎨ F F 1 ⎬ ! !
K +Q +F −F J J 1 J J 1
× (−1) F F 1
⎩ ⎭ F F I F F I
K K Kr
K K Kr K
× JQ r (ναJ, α J ) .
−Q Q −Qr r
The summation over F and F can be performed with the help of Eq. (2.56), and
the expression above reduces to
!
F F K
(−1)J +I+F +K (2F + 1)(2F + 1)
J J I
α J K Q F F
α J I K
× ρQ (F , F ) (2J + 1) B(α J → αJ)
⎧ ⎫
⎨J J 1 ⎬
× 3(2K + 1)(2K + 1)(2Kr + 1) (−1)K +Q J J 1
⎩ ⎭
Kr Qr K K Kr
K K Kr K
× JQ r (ναJ, α J )
−Q Q −Qr r
which, using again Eq. (3.104), can be cast into the form
K
ρQ (α J ) TA (αJKQ, α J K Q ) ,
α J K Q
EQUATIONS FOR ATOMIC SYSTEMS 323
where all the rates are exactly the same as those of Eqs. (7.14).
Finally, we evaluate the first term in the right-hand side using the expression
for Ñ given in Eq. (7.66). With the help of Eqs. (2.41), (2.26d) and (2.36d) we get
!
d K F F K
ρQ (αJ) = −2πi (−1)J+I+F +K (2F + 1)(2F + 1)
dt
J J I
FF
× ναJIF, αJIF ρQ (F, F )
αJI K
− 2πi νL gαJ Q ρK
Q (αJ)
K
+ ρQ (α J ) TA (αJKQ, α J K Q )
α J K Q
K
+ ρQu (αu Ju ) TE (αJKQ, αu Ju Ku Qu ) + TS (αJKQ, αu Ju Ku Qu )
u
αu Ju Ku Qu
− Q (αJ) RA (αJKQK Q ) + RE (αJKQK Q )
ρK
K Q
+ RS (αJKQK Q ) . (7.72)
When hyperfine structure is negligible, the Bohr frequencies ναJIF, αJIF are zero
(cf. Eq. (7.67)), and Eqs. (7.72) reduce to the statistical equilibrium equations
for the multi-level atom without hyperfine structure (Eq. (7.11)) – which confirms
the principle of spectroscopic stability. Equation (7.72) also shows that the ‘in-
ner’ description of the atomic system involving hyperfine structure can be avoided
whenever the frequency shifts among the F -sublevels of each (αJ)-level are negli-
gible in comparison with the relaxation rates of the same level (roughly speaking,
324 CHAPTER 7
when they are much smaller than the inverse lifetime of the level). This provides
a quantitative criterion to ascertain whether hyperfine structure can be neglected
in a real physical problem.
As far as the radiative transfer coefficients are concerned, a direct application of
Eq. (3.104) shows that Eqs. (7.70), which have just been obtained by neglecting
the frequency shifts between hyperfine components, are nothing but Eqs. (7.16)
in
written in a different form. Referring for instance to the expression for ηiA (ν, Ω)
Eq. (7.70a), and recalling that, according to Eq. (3.104)
ρK
Q (α J ) =
!
J +I+F +K F F K
= (−1) (2F + 1)(2F + 1) ρQ (F , F )
α J I K
,
J J I
F F
we obtain
hν
=
ηiA (ν, Ω) N (2J + 1) B(α J → αu Ju )
4π
α J αu Ju
√ !
1−J −Ju −K 1 1 K ρK (α J )
× 3 (−1) TQ
K
(i, Ω) Q
J J Ju
KQ
× φ(να J , α J − ν) ,
u u
which coincides with the corresponding expression in Eq. (7.16a). This is a further
aspect of the principle of spectroscopic stability.
The same reasoning followed about hyperfine structure can be repeated for fine
structure, which is – in terms of the principle of spectroscopic stability – the ‘inner’
structure of the atom due to the electronic spin. According to this principle, the
equations describing the multi-term atom should exactly reduce to the equations
for the ‘spinless atom’ when the effects due to the presence of spin are negligible.
To check this property we start from Eq. (7.38), with N given by Eqs. (7.41)-
(7.43) and the rates by Eqs. (7.45a-c) and (7.46a-c), and – bearing in mind Eq.
(3.105) – we multiply both sides by the factor
!
L+S+J +K
J J K
(−1)
(2J + 1)(2J + 1)
L L S
and sum over J and J . Neglecting the fine-structure shifts νβLSJ, βLSJ in the
final result, we obtain the equation1
1
The first term in the right-hand side can be deduced using the Racah-algebra relation
! !
a b 1 a b 1
a−b
c (c + 1)(2c + 1) + (−1) d (d + 1)(2d + 1) =
c c d d d c
= δab (−1)a+c+d+1 a (a + 1) 2a + 1 ,
d K
ρ (βL) = −2πi νL Q ρK Q (βL)
dt Q
K
+ ρQ (β L ) TA (βLKQ, β L K Q )
β L K Q
K
+ ρQu (βu Lu ) TE (βLKQ, βu Lu Ku Qu ) + TS (βLKQ, βu Lu Ku Qu )
u
βu Lu Ku Qu
− Q (βL) RA (βLKQK Q ) + RE (βLKQK Q )
ρK
K Q
+ RS (βLKQK Q ) , (7.73)
where all the rates are the same as those in Eqs. (7.14) with the formal substitutions
α→β α → β αu → βu
J →L J → L Ju → Lu . (7.74)
All the expressions derived in the preceding sections for the radiative rates and for
the radiative transfer coefficients involve some 3-j, or 6-j, or 9-j symbols which
are responsible for the appearance of several selection rules. These rules originate
from the electric-dipole approximation and can be considered as an obvious gener-
alization of the selection rules that are usually met in conventional spectroscopy.
In the following we list the selection rules for the radiative rates written in the
spherical statistical tensor representation. The list is divided into three parts,
which refer to the different atomic models considered previously. We recall that
– as apparent from Eqs. (5.153) and (5.155) – the only non-zero components of the
K
radiation field tensor JQ r are those with Kr = 0, 1, 2.
r
326 CHAPTER 7
∆J = J − J = 0, ±1; 00
⎧
⎨0 for Kr = 0
∆K = K − K = 0, ±1; 00 for Kr = 1
⎩
0, ±2; 00 for Kr = 2
∆J = Ju − J = 0, ±1; 00
∆K = Ku − K = 0
∆J = Ju − J = 0, ±1; 00
⎧
⎨0 for Kr = 0
∆K = Ku − K = 0, ±1; 00 for Kr = 1
⎩
0, ±2; 00 for Kr = 2
∆K = K − K = 0
∆L = L − L = 0, ±1; 00
∆J = J − J = 0, ±1; 00
∆J = J − J = 0, ±1; 00
⎧
⎨0 for Kr = 0
∆K = K − K = 0, ±1; 00 for Kr = 1
⎩
0, ±1, ±2; 00, 01, 10 for Kr = 2
∆L = Lu − L = 0, ±1; 00
∆J = Ju − J = 0, ±1; 00
∆J = Ju − J = 0, ±1; 00
∆K = Ku − K = 0
∆L = Lu − L = 0, ±1; 00
∆J = Ju − J = 0, ±1; 00
∆J = Ju − J = 0, ±1; 00
⎧
⎨0 for Kr = 0
∆K = Ku − K = 0, ±1; 00 for Kr = 1
⎩
0, ±1, ±2; 00, 01, 10 for Kr = 2
⎧
⎨0 for Kr = 0
∆K = K − K = 0, ±1; 00 for Kr = 1
⎩
0, ±1, ±2; 00, 01, 10 for Kr = 2
∆J = J − J = 0
∆J = J − J = 0
∆K = K − K = 0
⎧
⎨0 for Kr = 0
∆K = K − K = 0, ±1; 00 for Kr = 1
⎩
0, ±1, ±2; 00, 01, 10 for Kr = 2
All the equations of the former sections have been derived in a reference system
having its z-axis (i.e. the quantization axis for angular momentum) in the magnetic
field direction – obviously, the z-axis direction is arbitrary for zero magnetic field.
It is now important to discuss how these equations are modified when a different
reference system is chosen.
EQUATIONS FOR ATOMIC SYSTEMS 329
with the inverse transformation (that can be easily deduced using Eqs. (2.72))
ρK
Q (αJ) = ρK
Q (αJ) DQQ
K
(R) . (7.77)
old new
Q
Note that these transformations imply that the S matrix defined above is block-
diagonal, with its non-zero elements connecting only statistical tensors of the same
level and of the same rank.
330 CHAPTER 7
With the help of some algebra it is easily shown that the statistical equilibrium
equations in the new reference system are the following
d K
ρ (αJ) = −2πi νL gαJ KQQ
K
ρK
Q (αJ)
dt Q new new
Q
K
+ ρQ (α J ) TA (αJKQ, α J K Q )
new new
α J K Q
K
&
+ ρQu (αu Ju ) TE (αJKQ, αu Ju Ku Qu )
u new new
αu Ju Ku Qu
'
+ TS (αJKQ, αu Ju Ku Qu )
new
&
− ρK
Q (αJ) RA (αJKQK Q ) + RE (αJKQK Q )
new new new
K Q
'
+ RS (αJKQK Q ) , (7.78)
new
∗
KQQ
K
= DQ
K
Q (R) Q DQ
K
Q (R) (7.79)
Q
and where
TA (αJKQ, α J K Q ) =
new
K
∗
= DQK
Q (R) TA (αJKQ , α J K Q ) DQQ (R)
old
Q Q
TE,S (αJKQ, αu Ju Ku Qu ) =
new
K
∗
= DQ K
Q (R) TE,S (αJKQ , αu Ju Ku Qu ) DQu Q (R)
old u u
Q Qu
RA,E,S(αJKQK Q ) =
new
∗
= DQ
K
Q (R) RA,E,S(αJKQ K Q ) DQ
K
Q (R) . (7.80)
old
Q Q
It can be proved that the expressions for the radiative rates are formally invariant.
Consider for instance Eq. (7.14a); from Eqs. (7.80) we have
EQUATIONS FOR ATOMIC SYSTEMS 331
TA (αJKQ, α J K Q ) =
new
K
∗
= DQ
K
Q (R) D
Q Q (R) (2J + 1) B(α J → αJ)
Q Q
× 3(2K + 1)(2K + 1)(2Kr + 1)
Kr Qr
⎧ ⎫
⎨J J 1 ⎬
K +Q K K Kr K
×(−1) J J 1 JQ r (ναJ, α ) ,
⎩ ⎭ −Q Q −Qr r
J
old
K K Kr
K
where JQ r (ναJ, α J ) old is the radiation field tensor evaluated in the old reference
r
system. After some algebra, which involves the use of Eqs. (2.77), (2.23a) and
(2.73), one obtains the expression
we see that the rate TA has the same expression in the old and new reference
systems.
It can be easily proved that this property holds for the other rates as well. Thus
we can state that all the radiative rates entering the statistical equilibrium equations
are invariant under rotation of the reference system.1
This result – which has been deduced here for the multi-level atom – is also valid
for the other model atoms considered in this chapter, namely the multi-term and
the multi-level atom with hyperfine structure. This means that the radiative rates
of Eqs. (7.45a-c), (7.46a-c) and (7.69) are formally invariant under rotation of the
reference system.
1
This is of course a formal invariance. The actual value of the rates TA , TS , RA , RS for an
assigned incident radiation field does depend on the reference system because the radiation field
K changes according to Eq. (7.81).
tensor JQ
332 CHAPTER 7
Moreover, this result is valid irrespective of the representation used for the den-
sity operator. We have proved it here for the equations written in the spherical
statistical tensor representation, but it is also valid in the standard representation.
The proof is of course more involved, since the transformation law of the density-
matrix elements under rotation of the reference system is more complicated than
the transformation law of multipole moments (cf. Eqs. (3.96) and (3.98)).1
It should be remarked that, in spite of the formal invariance of the radiative rates,
the statistical equilibrium equations in a reference system other than the ‘magnetic’
system are in general more involved because of the presence of the magnetic term
(Eq. (7.79)).2 This term has its simplest expression in the reference system having
the z-axis in the magnetic field direction.
The radiative transfer coefficients, like the radiative rates, have always been
expressed in the ‘magnetic’ reference system in the former sections; thus we are
again faced with the problem of finding their expressions in a different system.
Referring, for instance, to the emission coefficient εi (ν, Ω), its expression in the
‘magnetic’ reference system can be formally written as the scalar product
= C† · ρ ,
ε (ν, Ω)
i i
where Ci is a vector having dimension N . In the new reference system, one has
from Eq. (7.75)
= C † (new) · ρ(new) ,
εi (ν, Ω) i
where
† (new) † (old)
Ci = Ci S −1 .
This is the general transformation law for the vector Ci† .
To give an example, let us consider the expression of εi (ν, Ω) for the multi-level
atom in the approximation of neglecting the dependence of the Φ profiles on the
magnetic quantum numbers (Eq. (7.16e)). Recalling Eqs. (7.8) and (7.77) we can
write
= hν N
εi (ν, Ω) (2Ju + 1) A(αu Ju → α J )
4π
α J αu Ju
√ !
1 1 K
× 3 (−1) 1+J +Ju
TQ K
(i, Ω)
Ju Ju J old
KQ
× ρK
Q (αu Ju ) DQQ
K
(R) φ(να J , α J − ν) ,
new u u
Q
1
The proof is however straightforward. Considering for instance the multi-level atom, one
should start from Eq. (7.5), multiply both sides by the factor DM ∗
N (R) DM N (R) and sum over
J J
M and M to get, using Eq. (3.96) and its inverse, the statistical equilibrium equations in the
new reference system. The expressions for the rates in the old system are given by Eqs. (7.9).
The corresponding expressions in the new system can then be derived via several Racah-algebra
calculations.
2
An exception occurs when the radiation field is cylindrically symmetrical about an axis.
In that case it may be simpler to write the statistical equilibrium equations in a reference system
having the z-axis directed along this symmetry axis (see e.g. Sect. 10.8).
EQUATIONS FOR ATOMIC SYSTEMS 333
where R is the rotation which carries the ‘magnetic’ reference system into the new
one. Since from Eq. (2.78)
TQ
K
(i, Ω) DQQ
K
(R) = TQ
K
(i, Ω) ,
old new
Q
has the same expression in the old and new reference systems.
we see that εi (ν, Ω)
As this property holds for the other radiative transfer coefficients too, we conclude
that when the dependence of the Φ profiles on the magnetic quantum numbers can
be neglected, all the radiative transfer coefficients are invariant under rotation of
the reference system.1
Similar invariance properties can be proved for the multi-term atom and for the
multi-level atom with hyperfine structure (Eqs. (7.48) and (7.70)).
In the previous sections we have derived the statistical equilibrium equations for the
atomic density-matrix elements by considering only the interaction of the atomic
system with the radiation field. In real plasmas atoms are also subjected, in general,
to a wide variety of collisions both with particles of several species (like electrons,
ions, and different atoms) and – in the case of laboratory experiments – with the
walls of the cell containing the plasma.
A detailed theory of collisions is beyond the aim of this book. Here we want just
to deduce some general results about the effect of collisions on the physical state
of the atomic system under the assumption of an isotropic distribution of colliding
particles.
In Chap. 5 (see Sects. 5.14 and 5.15) we presented a thorough discussion, based on
classical physics, of the role of collisions in polarization phenomena. The distinction
there introduced between ‘exciting’ and ‘perturbing’ collisions can be transferred
to the quantum description of collisional interactions.
Let us consider a multi-level atom with energy eigenvectors of the form |αJM ,
and let us suppose, for the time being, that no magnetic field (nor any other
anisotropic agent) is present in the physical environment. The energy eigenvectors
will then be degenerate with respect to the magnetic quantum number M . We can
divide collisions into inelastic and elastic. Inelastic collisions induce transitions
between the level |αJM and an upper level |αu Ju Mu with a corresponding energy
loss of the colliding particle. Such collisions, which are the quantum analogue
1
Contrary to the radiative rates, this is a real – not just a formal – invariance. It is clear
that once the multipole moments in an assigned reference system are given, the radiative transfer
coefficients are univocally determined. If the reference system is rotated, both the multipole
moments and the components of the tensor T Q are changed, but the quantity
K (i, Ω)
%
Q
TQ ρK (αJ)
K (i, Ω)
Q
is invariant.
334 CHAPTER 7
of the (classical) ‘exciting’ collisions, have their inverse process in the so-called
superelastic collisions, which induce transitions between the level |αJM and a
lower level |α J M with a corresponding energy increase of the colliding particle.
Elastic collisions induce transitions between the level |αJM and levels of the form
|αJM or, in other words, between magnetic sublevels belonging to the same
(αJ)-level. In these collisions, which are the quantum analogue of the (classical)
‘perturbing’ collisions, the energy of the colliding particle does not change during
the interaction.
The effect of all these collisions on the physical state of the atomic system can
be described by suitable rates (which will be divided into transfer and relaxation
collisional rates, with the same meaning as the radiative rates) appearing in the
statistical equilibrium equations for the density-matrix elements of the atomic sys-
tem. We will now derive some properties of these rates which follow from general
physical principles under the hypothesis of an isotropic distribution of colliding
particles.
d
ρ (M, M ) = CI (αJM M , α J M M ) ρα (M , M ) , (7.82)
dt αJ J
α J M M
where the quantum numbers (α J ) denote any atomic level having energy lower
than the level (αJ).
We now consider a new reference system obtained from the old one by the rota-
tion R. Using Eq. (3.96) and its inverse, we have
d
ρ (M, M ) =
dt αJ new
J J
∗ ∗
= DN
J
M (R) DN M (R) DN M (R) DN M (R)
J
α J M M N N N N
!
× CI (αJN N , α J N N ) ρα J (M , M ) . (7.83)
new
On the other hand, the assumption of isotropic collisions implies that all quantiza-
tion directions are equivalent, therefore Eqs. (7.82) and (7.83) have to be identical.
It follows that the collisional rates must satisfy the relation
J J
CI (αJM M , α J M M ) = DN
J ∗
M (R) DN M (R) DN M (R) DN M (R)
J ∗
N N N N
∗
DN
J
M (R) DN M (R) =
J
J J K J J K
= (−1)N −M (2K + 1) DPKQ (R)∗ ,
N −N P M −M Q
K
J J
DN M (R) DN M (R)∗ =
N −M J J K J J K
= (−1) (2K + 1) DPK Q (R)∗ ,
N −N P M −M Q
K
DPKQ (R)∗ DPK Q (R)∗ =
K K K K K K
= (2K + 1) DPK Q (R) , (7.85)
P P P Q Q Q
K
one gets
CI (αJM M , α J M M ) = CI (αJN N , α J N N )
N N N N
× (2K + 1)(2K + 1)(2K + 1) (−1)N −M+N −M
KK K
J J K J J K J J K
×
N −N P M −M Q N −N P
J J K K K K K K K
× DPK Q (R) .
M −M Q P P P Q Q Q
As the right-hand side must be independent of the rotation R, the index K can
only take the value K = 0. This implies K = K , P = −P , Q = −Q . We thus
obtain, with the help of Eq. (2.26a)1
.
J−M +J −M 2J + 1
CI (αJM M , α J M M ) = (−1)
2J + 1
J J K J J K
× (2K + 1) CI(K) (αJ, α J ) , (7.86)
M −M Q M −M Q
K
1
The factor
2J + 1 2J + 1
is introduced in Eq. (7.86) to get simpler relations between these rates and the usual collisional
rates connecting atomic populations (see below).
336 CHAPTER 7
where we have defined the multipole components of the inelastic collisional rates
by the equation
2J + 1
CI(K) (αJ, α J ) =
2J + 1
+J −N J J K J J K
× (−1)J−N
N −N P N −N P
N N N N P
and can be written, under the assumption of isotropic collisions, in the form
.
J−M +Ju −Mu 2Ju + 1
CS (αJM M , αu Ju Mu Mu ) = (−1)
2J + 1
J J K Ju Ju K
× (2K + 1) CS(K) (αJ, αu Ju ) ,
M −M Q Mu −Mu Q
K
EQUATIONS FOR ATOMIC SYSTEMS 337
where
2J + 1
CS(K) (αJ, αu Ju ) =
2Ju + 1
J−N +Ju −Nu J J K Ju Ju K
× (−1)
N −N P Nu −Nu P
N N Nu Nu P
are the multipole components of the superelastic collisional rates. These quantities
are real and CS(0) (αJ, αu Ju ) is non-negative.
Let’s now discuss the relaxation rates due to inelastic and superelastic collisions.
In a given reference system, these rates contribute to the time evolution of a par-
ticular density-matrix element via an equation of the form
d
ραJ (M, M ) = − f (αJM M M ) ραJ (M, M )
dt M
+ g(αJM M M ) ραJ (M , M ) .
The conjugation properties of the density-matrix elements (Eq. (3.80)) require that
g(αJM M M ) = f (αJM M M )∗ ,
d 1
ραJ (M, M ) = − S(αJM M M ) ραJ (M, M )
dt
2
M
1
+ S(αJM M M )∗ ραJ (M , M ) . (7.90)
2
d
ρ (M, M ) =
dt αJ new
1
∗
=− DN
J
M (R) DN M (R) S(αJN N N )
J
2
M M NN N
∗
× DN
J
M (R) DN M (R) ραJ (M , M )
J
new
1
∗ ∗
+ DN
J
M (R) DN M (R) S(αJN N N )
J
2
N N N !
∗
× DN
J
M (R) DN
J
M (R) ραJ (M , M ) . (7.91)
new
338 CHAPTER 7
Owing to the isotropy of collisions, Eqs. (7.90) and (7.91) must be identical, there-
fore
S(αJM M M ) δMM =
∗ ∗
= DN J
M (R) DN M (R) S(αJN N N ) DN M (R) DN M (R)
J J J
(7.92)
N N N
N N
Since the right-hand side must be independent of the rotation R, the index K can
only take the value K = 0. This implies M = M , N = N , and using Eq. (2.26a)
one gets
1
S(αJM M ) = δM M S(αJN N ) .
2J + 1
N
d
ρ (M, M ) = − S0 (αJ) ραJ (M, M ) ,
dt αJ
where the relaxation collisional rate
1
S0 (αJ) = Re S(αJM M )
2J + 1
M
The conservation of the total number of atoms implies that the quantities CI(K) ,
(K)
CS and S0 are not independent. Denoting by nαJ the population of the level
(αJ), we require that (cf. Eq. (3.89))
d d
nαJ = ραJ (M, M ) = 0 . (7.94)
dt dt
αJ αJ M
Using Eq. (7.93) and the relation (see Eqs. (2.23a) and (2.26a))
J−M J J K
(−1) =
M −M Q
M
√ J J K
J J 0
√
= 2J + 1 = 2J + 1 δK0 δQ0 , (7.95)
M −M Q M −M 0
M
we obtain
d (0) (0)
nαJ = CI (αJ, α J ) nα J + CS (αJ, αu Ju ) nα J
dt u u
α J αu Ju
In the first term we replace (αJ) by (αu Ju ) and (α J ) by (αJ), and in the second
term we replace (αJ) by (α J ) and (αu Ju ) by (αJ). This yields
S0 (αJ) = CI(0) (αu Ju , αJ) + CS(0) (α J , αJ) , (7.97)
αu Ju α J
340 CHAPTER 7
d (0) (0)
nαJ = CI (αJ, α J ) nα J + CS (αJ, αu Ju ) nα J
dt u u
α J αu Ju
− CI(0) (αu Ju , αJ) + CS(0) (α J , αJ) nαJ .
αu Ju α J
This equation can be compared with the standard statistical equilibrium equation
that is found in many classical textbooks, like for instance in Mihalas (1978). One
finds an exact correspondence between the two formulations, which allows a direct
comparison of the different rates. In the notation used by Mihalas, Cu denotes
the inelastic collisional rate for the transition from the lower level to the upper
level u, while Cu denotes the superelastic collisional rate for the transition from
the level u to the level . The relation with our quantities is the following
Because of this simple relation, we can directly apply the results derived in Mi-
halas (1978) to the zero-rank multipole components of the collisional rates. In
particular, if the colliders are electrons with a velocity distribution f (v), such that
∞
f (v) dv = 1 ,
0
we have
∞
(0)
CI (αu Ju , α J ) = Ne σα J , αu Ju (v) f (v) v dv ,
v0
with m the electron mass and Eα J , Eα J the energies of the two levels.
u u
If the colliding particles have a Maxwellian distribution of velocities, a simple
relation can be established between the zero-rank multipole components CI(0) and
CS(0) . From detailed-balance arguments one gets the following Einstein-Milne rela-
tion
Eα − Eα
u Ju J
(0) 2J + 1
CS (α J , αu Ju ) = e kB Tc
CI(0) (αu Ju , α J ) , (7.98)
2Ju + 1
EQUATIONS FOR ATOMIC SYSTEMS 341
d
ραJ (M, M ) = CE (αJM M , αJM M ) ραJ (M , M )
dt
M M
1 1
− S(αJM M M ) ραJ (M, M ) + S(αJM M M )∗ ραJ (M , M ) .
2 2
M
If collisions are isotropic one finds, similarly to the preceding subsection, that this
equation can be cast into the form
d J J K
ραJ (M, M ) = (−1)M −M (2K + 1)
dt M −M Q
M M K
J J K
× CE(K) (αJ) ραJ (M , M )
M −M Q
where
(K) N −N J J K J J K
CE (αJ) = (−1)
N −N P N −N P
N N N N P
are the multipole components of the elastic collisional rates, which satisfy the prop-
erties
CE(K) (αJ)∗ = CE(K) (αJ) , CE(0) (αJ) ≥ 0 .
342 CHAPTER 7
From Eq. (7.99) one can prove, using Eq. (7.95), that
d d
nαJ = ραJ (M, M ) = 0 ,
dt dt
M
.
2Ju + 1
J−M +Ju −Mu
+ (−1)
2J + 1
αu Ju Mu Mu K
J J K Ju Ju K
× (2K + 1)
M −M Q Mu −Mu Q
−M
+ (−1)M
M M K
J J K J J K
× (2K + 1)
M −M Q M −M Q
− CI(0) (αu Ju , αJ) + CS(0) (α J , αJ) + CE(0) (αJ)
αu Ju α J
× ραJ (M, M ) .
.
d K 2J + 1 (K)
ρQ (αJ) = C (αJ, α J ) ρK
Q (α J )
dt 2J + 1 I
α J
.
2Ju + 1 (K)
+ C (αJ, αu Ju ) ρK Q (αu Ju )
2J + 1 S
αu Ju
− CI(0) (αu Ju , αJ) + CS(0) (α J , αJ)
αu Ju α J
+ D(K) (αJ) ρK
Q (αJ) , (7.101)
where we have introduced the depolarizing rates due to elastic collisions defined
by1
D(K) (αJ) = CE(0) (αJ) − CE(K) (αJ) . (7.102)
Equation (7.101) shows that, under the assumption of isotropic collisions, only
spherical statistical tensors with the same K and Q value are coupled by collisional
transitions.
Depolarizing rates have received little attention both from the experimental and
from the theoretical point of view.2 This is obviously due to the fact that the actual
values of depolarizing rates are needed only for the interpretation of polarization
phenomena. A detailed discussion of depolarizing rates can be found in Lamb and
ter Haar (1971). According to these authors, the main contribution to depolar-
izing rates in astrophysical plasmas arises from long-range collisions with neutral
perturbers (generally hydrogen atoms). In such collisions, the interaction between
atom and perturber is described by a Van der Waals potential, which depends on
the atom-perturber distance R as
where h is the Planck constant and C6 (αJ) is the Van der Waals constant for the
level in question.
Let us consider a particular collision having impact parameter b. If v̄r is the
average relative velocity between atom and collider, the interaction energy varies
with time according to
1
VVdW (t) = h C6 (αJ) ,
(b2 + v̄r2 t2 )3
1
It follows that D (0) (αJ) = 0, which shows at once that the population of the (αJ)-level is
unaffected by elastic collisions (see Eq. (3.107)).
2
Some theoretical results on depolarizing rates by isotropic collisions with hydrogen atoms have
recently appeared in the literature. Applications to NaI and other atoms of the same isoelectronic
sequence can be found in Kerkeni (2002) and Kerkeni et al. (2003). Applications to p states of
neutral atoms have been presented by Derouich et al. (2003).
344 CHAPTER 7
∞
2π 3π 2 C6 (αJ)
φ= VVdW (t) dt = . (7.103)
h 4 b5 v̄r
−∞
The cumulative effect of these phase shifts is responsible for the depolarizing rates
via a mechanism similar to that illustrated in Sect. 5.14 for classical oscillators.
We can give a rough estimate of the depolarizing rate by writing D(K) in the form
D(K) ≈ f = σ v̄r nH ,
where f is the collision frequency, nH the number density of hydrogen atoms, and
σ – the depolarizing cross-section – is given by
σ = π b2c ,
bc being the critical impact parameter which produces a phase shift equal to some
large value φ0 1. This approximation is equivalent to assuming that all collisions
having b < bc are totally effective in depolarizing the atom, while the others can
be neglected. Taking for φ0 the value 0.61,1 one gets from Eq. (7.103)
0.2
C6 (αJ)
bc = 1.65 , (7.104)
v̄r
whence
0.4 0.6
f = 8.5 C6 (αJ) v̄r nH . (7.105)
To get a rough estimate of C6 (αJ) we can use the expression given by Aller
(1963)
e2
C6 (αJ) = 0 p r2 (αJ) ,
h
where e0 is the electron
charge,
h the Planck constant, p the polarizability of the
hydrogen atom, and r2 (αJ) the mean square radius of the electronic cloud in the
level (αJ). Since p 6.7 × 10−25 cm3 (Allen, 1973), one gets, expressing r2 (αJ)
in atomic units
C6 (αJ) 6.5 × 10−34 r2 (αJ) cm6 s−1 . (7.106)
1
This value is discussed in some detail by Aller (1963) in connection with the frequency of
collisions responsible for the broadening of spectral lines.
EQUATIONS FOR ATOMIC SYSTEMS 345
Values for r2 (αJ) can be deduced by means of scaled Fermi-Thomas potentials
(see e.g. Warner, 1969); otherwise one can refer to an approximate expression
which, for neutral atoms, reduces to (Unsöld, 1955)
2
2
5 13.6
r (αJ) = ,
2 I − EαJ
where I is the ionization potential of the atom and EαJ is the excitation energy of
the (αJ)-level, both expressed in eV.
On the other hand, assuming a Maxwellian distribution of velocities at temper-
ature T , the average relative velocity is
8kB T 1
v̄r = 1+ , (7.107)
π mH µ
where kB is the Boltzmann constant, mH is the mass of the hydrogen atom, and
µ is the atomic weight of the atom.1 Substitution of Eqs. (7.106) and (7.107) into
Eq. (7.105) gives
−10
2 0.4 1
0.3
f 1.4 × 10 r (αJ) T 1+ nH s−1 , (7.108)
µ
D(K) (αJ) =
2J ! !
J J 0 J J K
= (2J + 1) (−1)2J+K − (−1)K ΓE(K ) (αJ) ,
J J K J J K
K =1
which shows that depolarizing rates are unaffected by a zero-rank tensor operator.
If the interaction is described by a single tensor operator of rank 1 (or by a tensor
operator of rank 1 plus a tensor operator of rank 0) we have, using Eq. (2.36d)
D(1) (αJ) 1 1
= J
= 0, ,
D(2) (αJ) 3 2
1
It is worth noticing that Eqs. (7.104), (7.106) and (7.107) give for the collision time
0.2 , - −0.6
1
tc = bc / v̄r 3.8 × 10−12 r 2 (αJ) T 1+ s,
µ
thus tc 10−13 - 10−14 s for a typical stellar atmosphere.
346 CHAPTER 7
whereas for a single tensor operator of rank 2 (or a tensor operator of rank 2 plus
a tensor operator of rank 0) we have, using Eq. (2.36h)
D(1) (αJ) 1 4J 2 + 4J − 3 1
= J
= 0, . (7.109)
D(2) (αJ) 3 4J 2 + 4J − 7 2
In particular, the Van der Waals interaction depends on the atomic variables
through the quadrupolar momentum of the atomic cloud, which is a symmetric
Cartesian tensor that can be decomposed into the sum of a spherical tensor of
rank 0 plus a spherical tensor of rank 2, so that the ratio D(1) (αJ)/D(2) (αJ) is
given by Eq. (7.109). We can compare this expression with the results obtained in
Sect. 5.14, where the atom was schematized as a collection of classical oscillators.
The corresponding ratio (δ1 /δ2 ) was found to be (see Eq. (5.195))
δ1 5
= ,
δ2 3
which is just the value given by Eq. (7.109) for J = 1 (the only quantum-mechanical
atomic level for which the classical analogy works).
h νL kB T
(with νL the Larmor frequency and kB the Boltzmann constant), or, numerically
where B is in G and T in K. This inequality is well-satisfied both for the sun and
for magnetic stars.
As far as depolarizing rates are concerned, the situation is more involved. Besides
the condition of Eq. (7.110) (which still must be satisfied), there is the further
constraint that the typical collision time tc should be much smaller than the Larmor
time (2πνL )−1 . Thus
B 1.1 × 10−7 t−1c ,
EQUATIONS FOR ATOMIC SYSTEMS 347
where B is in G and tc in s. This condition is also well-satisfied both for the sun
and for magnetic stars (see footnote on p. 345).
d
ρ(JM, J M ) =
βLS
CI (βLSJM J M , β L S J M J M )
dt
β L S J M J M
× β L S
ρ(J M , J M ) .
Assuming that collisions are isotropic and that no magnetic field is present (or
that the magnetic field is sufficiently weak to satisfy the constraints discussed in
Sect. 7.13.d), one could derive, along the same lines of the former subsections, the
general properties of the rates CI (βLSJM J M , β L S J M J M ). However, this
more complicated type of collisional rates will not be studied in this book.
A major point concerning collisional rates is that, under a wide variety of astro-
physical and laboratory conditions, they can be simply added to the corresponding
radiative rates in the statistical equilibrium equations; symbolically
d
Density-Matrix Element =
dt
= Radiative Rates + Collisional Rates × Density-Matrix Element .
Although this equation may seem physically evident, it is indeed correct only when
the so-called impact approximation is valid (see Lamb and ter Haar, 1971, for a
thorough discussion of this subject). The impact approximation – that we have
already encountered in Sect. 5.14, see Eq. (5.176) – requires that the collision time
is much smaller than the relaxation time due to radiative rates, a condition that is
generally well-satisfied both in laboratory plasmas and in stellar atmospheres.
CHAPTER 8
In previous sections of this book we have derived the radiative transfer equations for
polarized radiation from general principles. In Sect. 5.2 the derivation was based
on a classical approach involving the familiar concept of index of refraction, while
in Sect. 6.7 it was based on a more powerful quantum-electrodynamical approach.
As already remarked, the structure of the resulting equations is obviously the same.
In Sect. 5.5 we have indeed outlined how this structure is intimately connected with
very general physical principles.
This chapter is devoted to the solution of the radiative transfer equations for
polarized radiation in their most general formulation. We will also consider some
particular cases where the solution can be expressed in purely analytical form.
Most practical applications of radiative transfer for polarized radiation concern
the problem of line formation in a magnetic field, and the consequent analysis of
polarimetric observations of solar active regions or of magnetic stars. These special
aspects of the general problem of radiative transfer for polarized radiation will be
discussed in the following chapters.
8.1. Generalities
The radiative transfer equations for polarized radiation have been derived from the
principles of Quantum Electrodynamics in Chap. 6. For a stationary medium (i.e.
a medium whose properties do not vary over time) we have from Eq. (6.83)
d 3 3
=−
Si (ν, Ω) A
Kij +
Sj (ν, Ω) S
Kij +ε
Sj (ν, Ω) (i = 0, 1, 2, 3) , (8.1)
i
ds j=0 j=0
and stressing the explicit dependence of the various quantities on the coordinate s ,
we have
d
I(s) = −K(s) I(s) + ε(s) , (8.2)
ds
350 CHAPTER 8
where1 ⎛ ⎞ ⎛ ⎞
S0 (s) I(s)
⎜ S (s) ⎟ ⎜ Q(s) ⎟
⎜ ⎟ ⎜ ⎟
I(s) = ⎜ 1 ⎟ = ⎜ ⎟ .
⎝ S2 (s) ⎠ ⎝ U (s) ⎠
S3 (s) V (s)
The matrix K, as deduced both from classical and from quantum physics, has the
form (see Eqs. (5.25) and (6.85))
⎛ ⎞
ηI ηQ ηU ηV
⎜η −ρU ⎟
⎜ ηI ρV ⎟
K=⎜ Q ⎟ , (8.3)
⎝ ηU −ρV ηI ρQ ⎠
ηV ρU −ρQ ηI
d
I(s) = −k(s) I(s) + ε(s) , (8.4)
ds
where I is the intensity of the beam and where k and ε are the absorption and
emission coefficients, respectively. It is seen that the two equations have a similar
structure, with the obvious difference that the scalar quantities I and ε of Eq. (8.4)
are replaced, in the ‘polarized’ case, by the four-component vectors I and ε, while
the scalar absorption coefficient k is replaced by the 4 × 4 propagation matrix K.
The peculiarity of the transfer of polarized radiation originates just from the matrix
character of Eq. (8.2), which produces a typical effect of ordering that will be
explained in the following section.
O(s, s) = 1 , (8.7)
Derivation of Eq. (8.6) with respect to s and use of Eq. (8.5) show that the
evolution operator satisfies the differential equation
d
O(s, s ) = −K(s) O(s, s ) . (8.9)
ds
In a similar way we get
d
O(s, s ) = O(s, s ) K(s ) . (8.10)
ds
Equation (8.9) can be formally integrated to give
s
O(s, s ) = 1 − K(s1 ) O(s1 , s ) ds1 . (8.11)
s
The operator O(s1 , s ) in the right-hand side can in turn be expressed via Eq. (8.11),
and iterating the procedure we obtain
∞
s s1 sn−1
O(s, s ) = 1 + (−1)n
ds1 ds2 · · · dsn K(s1 ) K(s2 ) · · · K(sn ) .
n=1 s s s
O(s, s ) =
∞ s s s & '
(−1)n
=1+ ds1 ds2 · · · dsn P K(s1 ) K(s2 ) · · · K(sn ) , (8.12)
n=1
n!
s s s
1
The concept of evolution operator expressed as a 4 × 4 real matrix was introduced by Landi
Degl’Innocenti and Landi Degl’Innocenti (1985). A similar concept in terms of 2 × 2 complex
matrices is due to Van Ballegooijen (1985, 1987) and is developed in Sects. 8.8 and 8.9. The
presentation given here follows rather closely the derivation by Landi Degl’Innocenti (1987). It
appears that the general concept of evolution operator has been known for a long time in matrix
theory (see e.g. Gantmacher, 1966, where it is referred to as matricant).
352 CHAPTER 8
where we have introduced the Dyson chronological product of the n matrices K(s1 ),
K(s2 ), . . . , K(sn ), defined by
& '
P K(s1 ) K(s2 ) · · · K(sn ) = K(si ) K(sj ) · · · K(sk ) , (8.13)
si ≥ sj ≥ . . . ≥ s k .
In other words, the chronological product orders the matrices in the bracket in such
a way that larger values of the argument s stand to the left of smaller values.
The formal solution of the non-homogeneous transfer equation (Eq. (8.2)) can
be easily expressed in terms of the evolution operator O(s, s ). Bearing in mind
Eqs. (8.7) and (8.9), it is seen that the expression
s
I(s) = O(s, s ) ε(s ) ds + O(s, s ) I(s ) (8.14)
s
s
I(s) = O(s, s ) ε(s ) ds . (8.15)
−∞
The operator O(s, s ) introduced in this section is nothing but the generalization
to the radiative transfer problem for polarized radiation of the usual attenuation
operator : s
− k(s ) ds
o(s, s ) = e s (8.16)
RADIATIVE TRANSFER FOR POLARIZED RADIATION 353
that solves the transfer equation for ‘unpolarized’ radiation (Eq. (8.4)). It can be
easily proved that the homogeneous equation
d
I(s) = −k(s) I(s)
ds
has the solution
I(s) = o(s, s ) I(s ) , (8.17)
and that the non-homogeneous equation has the solution
s
I(s) = o(s, s ) ε(s ) ds + o(s, s ) I(s ) . (8.18)
s
Comparison of Eqs. (8.17)-(8.18) with Eqs. (8.6)-(8.14) shows the strict correspon-
dence between the operators O(s, s ) and o(s, s ). However, Eq. (8.12) – contrary
to Eq. (8.16) – contains a chronological product. This peculiarity is intimately
connected with the physical fact that two slabs, A and B, acting differently on the
polarization properties of a radiation beam, do not ‘commute’: in other words, the
emerging polarization is in general different if slab A is located in front of slab B
or vice versa. On the contrary, the ordering of a train of absorbing slabs does not
matter for ‘unpolarized’ radiation.
Owing to the presence of a chronological product in Eq. (8.12), the expression for
the evolution operator O(s, s ) cannot be reduced, in general, to simpler forms.
An important exception occurs when the propagation matrix K(s) at a particular
point s1 commutes with the same matrix at any other point s2 ,
K(s1 ), K(s2 ) = K(s1 ) K(s2 ) − K(s2 ) K(s1 ) = 0. (8.19)
∞ s n
(−1)n
O(s, s ) = 1 + K(s ) ds . (8.20)
n=1
n!
s
∞ ∞
1 dn f n
f (z) = n
z = fn z n ,
n=0
n! dz z=0 n=0
354 CHAPTER 8
with A0 ≡ 1 . In particular
∞
−A (−1)n n
e = A , (8.21)
n=0
n!
There are three remarkable cases where Eq. (8.19) holds and, therefore, Eq. (8.22)
can be applied. These are the following:
a) the matrix K is independent of s ;
b) the matrix K can be written in the form
with H independent of s ;1
c) the matrix K has an arbitrary dependence on s but it has a particular structure,
with several elements identically zero.
This expression can be transformed, with some algebra, in the linear combination
of four matrices, each multiplied by a suitable exponential factor. The relevant
calculations are carried out in App. 5 and allow the evolution operator to be written
in the form
−(s−s ) η
O(s, s ) = e I
1
× cosh (s − s ) Λ+ ( ) + cos (s − s ) Λ− (η , ρ
η, ρ ) M1 (η , ρ
)
2
− sin (s − s ) Λ− ( ) M2 (
η, ρ η, ρ
)
− sinh (s − s ) Λ+ ( ) M3 (
η, ρ η, ρ)
!
1
+ cosh (s − s ) Λ+ ( η, ρ ) − cos (s − s ) Λ− (η , ρ
) M4 (η , ρ
) (8.23)
2
1
Obviously a) is a special case of b). The two cases will be however treated separately, as
both of them have important applications.
RADIATIVE TRANSFER FOR POLARIZED RADIATION 355
or, alternatively
−(s−s ) [ ηI + Λ+ ( ρ)]
η , −(s−s ) [ ηI − Λ+ ( ρ)]
η ,
O(s, s ) = e N1 (
η, ρ
) + e N2 (η , ρ
)
−(s−s ) [ ηI + i Λ− ( ρ)]
η , −(s−s ) [ ηI − i Λ− ( ρ)]
η ,
+e N3 (
η, ρ
) + e N4 (η , ρ
) ,
Λ+ (
η, ρ
) = Λ− (
η, ρ
) = 0
−(s−s ) ηI 1
O(s, s ) = e 1 − (s − s ) G(
η, ρ
) + (s − s )2 G(η , ρ
)2 ,
2
with G given by Eq. (A5.25).
8.3.b Propagation matrix of the form [c1 (s) 1 + c2 (s) H ] , with H independent of s
Equation (8.22) gives, with the help of Eq. (A5.6)
where
s s
C1 (s, s ) = c1 (s ) ds , C2 (s, s ) = c2 (s ) ds .
s s
Obviously the matrix H has the same structure as the matrix K. Writing
⎛ ⎞
hI hQ hU hV
⎜h −rU ⎟
⎜ hI rV ⎟
H =⎜ Q ⎟ ,
⎝ hU −rV hI rQ ⎠
hV rU −rQ hI
Using the matrices A3 and B3 defined in App. 5 (Eq. (A5.1c)) we can write
where
1 1
a3 (s) = η (s) + i ρV (s) , b3 (s) = a3 (s)∗ = η (s) − i ρV (s) .
2 V 2 V
As the three matrices in the right-hand side of Eq. (8.25) are commuting matrices
(see Eq. (A5.3)), the matrix K(s) satisfies Eq. (8.19), thus the evolution operator
is given by Eq. (8.22). Using Eq. (A5.6) we get
−HI 1 −1
2 (HV + i RV ) A3 −1
2 (HV − i RV ) B3
O(s, s ) = e e e ,
where
s s s
HI = ηI (s ) ds , HV = ηV (s ) ds , RV = ρV (s ) ds . (8.26)
s s s
RADIATIVE TRANSFER FOR POLARIZED RADIATION 357
s s
HQ = ηQ (s ) ds , RQ = ρQ (s ) ds .
s s
Finally, if ⎛ ⎞
ηI (s) 0 ηU (s) 0
⎜ 0 −ρU (s) ⎟
⎜ η (s) 0 ⎟
K(s) = ⎜ I
⎟ (8.29)
⎝ ηU (s) 0 ηI (s) 0 ⎠
0 ρU (s) 0 ηI (s)
one gets
⎛ ⎞
cosh HU 0 − sinh HU 0
⎜ sin RU ⎟
−HI ⎜ 0 cos RU 0 ⎟
O(s, s ) = e ⎜ ⎟ ,
⎝ − sinh HU 0 cosh HU 0 ⎠
0 − sin RU 0 cos RU
where
s s
HU = ηU (s ) ds , RU = ρU (s ) ds .
s s
358 CHAPTER 8
K u(i) = λi u(i) (i = 1, 2, 3, 4) .
det (K − λ 1 ) = 0. (8.30)
η = (ηQ , ηU , ηV ) ,
ρ
= (ρQ , ρU , ρV ) , (8.31)
whose solution is
λ1 = ηI + Λ+ (
η, ρ
)
λ2 = ηI − Λ+ (
η, ρ
)
λ3 = ηI + i Λ− (
η, ρ
)
λ4 = ηI − i Λ− (
η, ρ
) , (8.33)
where Λ+ (η, ρ
) and Λ− (
η, ρ
) are the quantities – which now acquire a more precise
significance – defined in Eqs. (A5.18).
If both Λ+ (η, ρ
) and Λ− (η, ρ
) are non-zero (so that the four eigenvalues λi are
distinct), we can consider the matrix X such that
X −1 K X = K , (8.34)
1
If Λ+ (η , ρ ) = 0 and/or Λ− (η , ρ ) = 0, the matrix K cannot be reduced, in general, to
diagonal form. In such cases the method described in this section cannot be applied. A similar
method, based on a different reduction for K, could indeed be employed (see e.g. Heading, 1958,
p.63), but it will not be considered here. Note that the case ρQ = ρU = ρV = 0, which implies
Λ− (η , ρ ) = 0, is a special one: K becomes a symmetric matrix and can be diagonalized.
RADIATIVE TRANSFER FOR POLARIZED RADIATION 359
Multiplying Eq. (8.2) by X −1 from the left, and introducing the identity XX −1 = 1
between K(s) and I(s), we get
d
X −1 I(s) = −K (s) X −1 I(s) + X −1 ε(s) . (8.35)
ds
Since K depends in general on s, the matrix X depends also on s, thus it does
not commute with the operator d/ds. In the special case where X is independent
of s, Eq. (8.35) becomes
d
I (s) = −K (s) I (s) + ε (s) , (8.36)
ds
where
As the matrix K (s) is diagonal, Eq. (8.36) represents a set of four decoupled
differential equations (in other words, the linear combinations of Stokes parameters
given in Eq. (8.37a) are the eigenvectors of the radiative transfer equations). Thus
they have the solution (cf. Eqs. (8.4), (8.18) and (8.16))
s
I (s) = O (s, s ) ε (s ) ds + O (s, s ) I (s ) ,
s
with :s
− λi (s ) ds
ei (s, s ) = e s (i = 1, 2, 3, 4) .
Inversion of Eq. (8.37a) yields the solution of the radiative transfer equations in
the form
s
I(s) = X I (s) = X O (s, s ) ε (s ) ds + X O (s, s ) I (s ) . (8.39)
s
It is easily seen that this expression, although different from the solution based on
the evolution operator (Eq. (8.14)), is however strictly connected with it. Compar-
ison of Eqs. (8.39) and (8.14) shows, with the help of Eqs. (8.37), that the evolution
360 CHAPTER 8
operator O(s, s ) and the matrix O (s, s ) defined in Eq. (8.38) are connected by
the relation
O(s, s ) = X O (s, s ) X −1 . (8.40)
It should be emphasized that the assumption X independent of s, which is at the
basis of the diagonalization method, implies that condition (8.19) is satisfied. This
is easily seen by inversion of Eq. (8.34), which gives
K(s1 ), K(s2 ) = X K (s1 ) X −1 X K (s2 ) X −1 − X K (s2 ) X −1 X K (s1 ) X −1
= X K (s1 ), K (s2 ) X −1 = 0 ,
because the matrix K is diagonal. It follows that the solution of the radiative
transfer equations based on the diagonalization of the propagation matrix is equiv-
alent to the solution based on the evolution operator with the evolution operator
given by Eq. (8.22).
The actual calculation of the matrices X and X −1 can be performed by a stan-
dard method that is presented in App. 6. We give here the final results.
Defining the formal vectors
ζ = η × ρ
≡ (ηU ρV − ηV ρU , ηV ρQ − ηQ ρV , ηQ ρU − ηU ρQ ) (8.41)
η − Λ+ ζ + (
f (1) = Λ2+ η · ρ ) ρ
f (2) = Λ2+
η + Λ+ ζ + (
η · ρ ) ρ
f (3) = −Λ2− η − i Λ− ζ + (
η·ρ
) ρ
f (4) = −Λ2− η + i Λ− ζ + (
η·ρ
) ρ
Substitution of Eqs. (8.42) into Eq. (8.37a) yields explicit expressions for the eigen-
vectors of the radiative transfer equations. These are the linear combinations (two
real and two complex)
I1 = l+ I + f1(1) Q + f2(1) U + f3(1) V with eigenvalue ηI + Λ+
I2 = −l+ I + f1(2) Q + f2(2) U + f3(2) V with eigenvalue ηI − Λ+
I3 = i l− I + f1(3) Q + f2(3) U + f3(3) V with eigenvalue ηI + i Λ−
I4 = −i l− I + f1(4) Q + f2(4) U + f3(4) V with eigenvalue ηI − i Λ− .
If the propagation matrix is constant, so that the elements of the matrix O (s, s )
defined in Eq. (8.38) reduce to
−(s−s ) λi
ei (s, s ) = e
(with λi independent of s), the expression for the evolution operator obtained by
substitution of Eqs. (8.42) into Eq. (8.40) must coincide with Eq. (8.23). The
formal proof is however rather involved; moreover, it should be kept in mind that
Eqs. (8.42) are ill-defined in several particular cases (see App. 6).
Finally, we want to remark that the diagonalization method is particularly suit-
able to solve the radiative transfer equations in the special cases that have been
collected in Sect. 8.3.c. Let us suppose, for instance, that the propagation matrix
has the form of Eq. (8.24). It is easily shown that the radiative transfer equations
for the linear combinations of Stokes parameters
1 1
I1 = I +V I3 = Q + iU
2 2
1 1
I2 = I −V I4 = Q − iU (8.43)
2 2
are
d 1
I = − ηI + ηV I1 + εI + εV
ds 1 2
d 1
I = − ηI − ηV I2 + εI − εV
ds 2 2
d 1
I = − ηI − i ρV I3 + ε + i εU
ds 3 2 Q
d 1
I = − ηI + i ρV I4 + εQ − i εU , (8.44)
ds 4 2
362 CHAPTER 8
which is a set of four decoupled equations. The solution to these equations is trivial
(cf. Eqs. (8.4), (8.18) and (8.16)), and the solution for the Stokes parameters is
readily obtained by inversion of Eqs. (8.43).1 This is of course the same solution
that is obtained from Eq. (8.14) with the evolution operator of Eq. (8.27). It is
important to stress that the eigenvectors of the radiative transfer equations have in
this case a simple physical meaning: the combinations I1 and I2 in Eq. (8.43) are
just the amount of right and left circular polarization contained in the radiation
beam.
Analogous considerations can be repeated for the propagation matrices described
by Eqs. (8.28) and (8.29). In these cases the eigenvectors of the transfer equations
represent the amount of linear polarization along different directions.
The analytical expression for the evolution operator derived in Sect. 8.3.a takes a
simpler form when the medium where the radiation is propagated is either purely
dichroic (ρQ = ρU = ρV = 0) or purely dispersive (ηI = ηQ = ηU = ηV = 0).
The purely dichroic case is seldom encountered in practice. Its analysis is however
important because, in some cases, it can be useful to tackle a transfer problem
neglecting – as a first-order approximation – the influence of anomalous dispersion
(this leads in general to considerable simplifications). The opposite case of a purely
dispersive medium is met in some astrophysical applications (the propagation of
radio waves in a magnetized plasma is a typical example). Further examples of
purely dichroic or purely dispersive media can be found in laboratory optics: an
ideal (dichroic) polarizer is composed of a purely dichroic medium, while an ideal
retarder is composed of a purely dispersive medium.
Λ− (
η , 0) = 0 ,
and the evolution operator can be written, using Eqs. (8.23) and (A5.21), in the
form
1
Note that in this case, being
η2 ρ2 = ηV ρV = (η · ρ )2 ,
2 2
the condition of Eq. (A6.14) is not satisfied, thus the matrices X, X−1 in Eqs. (8.42) are ill-
defined (actually the denominator of the quantity g− vanishes).
RADIATIVE TRANSFER FOR POLARIZED RADIATION 363
⎧ ⎛ ⎞
⎪ 0 0 0 0
⎪
⎪
⎨1 ⎜0 2
ηU + ηV2 −ηQ ηU −ηV ηQ ⎟
−x ηI ⎜ ⎟
O(s, s ) = e ⎜ ⎟
⎪
⎪ η2 ⎝0 −ηQ ηU ηV2 + ηQ
2
−ηU ηV ⎠
⎪
⎩
0 −ηV ηQ −ηU ηV 2
ηQ 2
+ ηU
⎛ ⎞
η2 0 0 0
1 ⎜ ηV ηQ ⎟
2
⎜ 0 ηQ ηQ ηU ⎟
+ cosh(xη) 2 ⎜ ⎟
η ⎝ 0 ηQ ηU 2
ηU ηU ηV ⎠
0 ηV ηQ ηU ηV ηV2
⎛ ⎞⎫
0 ηQ ηU ηV ⎪
⎪
⎜
1 ⎜η ⎟⎪⎬
0 0 0 ⎟
− sinh(xη) ⎜ Q ⎟ , (8.45)
η ⎝ ηU 0 0 0 ⎠⎪⎪
⎪
⎭
ηV 0 0 0
where x = (s − s ).
The concept of Mueller matrix is a very useful tool that is often used to characterize,
from a mathematical point of view, the properties of an optical device with respect
to the polarization of the light that is crossing the device itself. If I is the Stokes
vector of the light beam that is entering the device, and I the Stokes vector at the
exit of the device, the Mueller matrix is defined by
I = M I . (8.47)
I = MN · · · MB MA I . (8.48)
Comparison of Eq. (8.47) or (8.48) with Eq. (8.6) shows that the Mueller matrix of
an optical device is nothing but the evolution operator describing the propagation of
radiation inside the material medium of which the device is composed.1 Therefore,
the expression of the Mueller matrix can be deduced once the geometrical and
optical properties of the medium are specified.
As an example, let us consider the case of a dichroic linear polarizer of geometrical
thickness l. The medium composing the polarizer can be characterized by a triplet
of real, orthogonal unit vectors (ut , ua , up ) – parallel to the transmission axis,
the ‘absorption’ axis and the propagation direction, respectively – such that the
corresponding indices of refraction are
nt = 1 + i kt , na = 1 + i ka
(the index np is unimportant). Using the notations of Sect. 5.1, the unit vectors
uα are given by2
uα = uα (α = t, a, p) .
Aligning the reference direction with the unit vector ut , we obtain for the propa-
gation tensor (see Eq. (5.22))
ν ν
G11 = −2π i n , G22 = −2π i n , G12 = G21 = 0 ,
c t c a
1
Actually, the concept of Mueller matrix is broader, because it can also be used to characterize
optical devices where light is partially reflected or deviated laterally. Of course the identification
of Mueller matrix and evolution operator does not hold in these cases.
2
Note that the medium composing a dichroic polarizer is a very special case of an absorbing
anisotropic medium.
RADIATIVE TRANSFER FOR POLARIZED RADIATION 365
The Mueller matrix of the polarizer, MP , can be deduced directly from Eq. (8.45)
by substituting l for x . We obtain
⎧⎛ ⎞ ⎛ ⎞
⎪ 0 0 0 0 1 0 0 0
⎨
−l ηI ⎜0 0 0 0⎟ ⎜0 1 0 0⎟
MP = e ⎝ ⎠ + cosh(l ηQ ) ⎝ ⎠
⎪
⎩ 0 0 1 0 0 0 0 0
0 0 0 1 0 0 0 0
⎛ ⎞⎫
0 1 0 0 ⎪
⎬
⎜1 0 0 0⎟
− sinh(l ηQ ) ⎝ ⎠
0 0 0 0 ⎪ ⎭
0 0 0 0
⎛ ⎞
Kt + Ka Kt − Ka 0 0
⎜
1 ⎜ t − Ka
K Kt + Ka 0 0 ⎟
⎟
= ⎜ ⎟,
2⎝ 0 0 2 Kt Ka 0 ⎠
0 0 0 2 Kt Ka
where
−4π ν
c kt l −4π ν
c ka l
Kt = e , Ka = e .
For an ideal polarizer, one has Kt = 1 (no absorption along the transmission axis)
and Ka = 0 (complete absorption along the orthogonal axis), thus
⎛ ⎞
1 1 0 0
1 ⎜1 1 0 0⎟
MP = ⎝ ⎠,
2 0 0 0 0
0 0 0 0
which is the correct Mueller matrix for an ideal polarizer with its transmission axis
parallel to the reference direction (see e.g. Clarke and Grainger, 1971).
As a further example, let us consider the Mueller matrix of a retarder. In this case
the medium is characterized by a triplet of real, orthogonal unit vectors (uf , us , up )
– parallel to the fast axis, the slow axis and the propagation direction – such that
the corresponding indices of refraction are given by
n f = 1 + δf , ns = 1 + δ s
with δs > δf (again the index np is unimportant). Aligning the reference direction
with the unit vector uf , we have from Eq. (5.22)
ν ν
G11 = −2π i n , G22 = −2π i n , G12 = G21 = 0 ,
c f c s
366 CHAPTER 8
The Mueller matrix of the retarder, MR , is derived from Eq. (8.46). After some
easy algebra we get
⎛ ⎞
1 0 0 0
⎜0 1 0 0 ⎟
MR = ⎝ ⎠,
0 0 cos δ sin δ
0 0 − sin δ cos δ
ν l
δ = −ρQ l = 2π δs − δf l = 2π ns − nf
c λ
(l is the thickness of the retarder).
Using the method outlined above, it is possible to obtain the Mueller matrix of
any optical device once the geometrical and optical properties of its constituents
are known.
d
I(s) = − K0 (s) + δK(s) I(s) + ε0 (s) + δε(s) . (8.49)
ds
We look for a solution to this equation of the form
d
I (s) = −K0 (s) I0 (s) + ε0 (s) (8.50)
ds 0
RADIATIVE TRANSFER FOR POLARIZED RADIATION 367
and where δI(s) satisfies the equation (resulting from Eqs. (8.49) and (8.50))
d
δI(s) = − K0 (s) δI(s) − δK(s) I0 (s) − δK(s) δI(s) + δε(s) . (8.51)
ds
δI(s) I0 (s) ,
so that the third term in the right-hand side of Eq. (8.51), being a ‘second-order’
term, can be neglected. We have therefore
d
δI(s) = − K0 (s) δI(s) + δε(s) − δK(s) I0 (s) . (8.52)
ds
Equations (8.50) and (8.52) can be solved using the evolution operator O0 (s, s ) as-
sociated with the unperturbed propagation matrix K0 (s). For a medium extending
in s from −∞ to s , one gets from Eq. (8.15)
s
I0 (s) = O0 (s, s ) ε0 (s ) ds (8.53)
−∞
s
δI(s) = O0 (s, s ) δε(s ) − δK(s ) I0 (s ) ds
−∞
s
= O0 (s, s ) δε(s ) ds
−∞
s s
− ds O0 (s, s ) δK(s ) ds O0 (s , s ) ε0 (s ) . (8.54)
−∞ −∞
Equations (8.53) and (8.54) solve, in a perturbative way, the transfer problem.
This method of solution is especially useful when the operator O0 can be written
in analytical form. Moreover, through an adaptation of this same method, it is
possible to introduce the concept of response function, which will be done in the
following chapter (see Sect. 9.16) in connection with the problem of line formation
in a magnetic field.
368 CHAPTER 8
The transfer equation (8.2) can be cast into a different form by introducing, in the
place of the Stokes vector I , a 2 × 2 matrix which is strictly related to the polar-
ization tensor. Indeed, both the classical derivation of Sect. 5.2 and the quantum-
mechanical derivation of Sect. 6.6 lead quite naturally to a transfer equation for the
polarization tensor (cf. Eqs. (5.24) and (6.78)). These equations were later con-
verted into a transfer equation for the Stokes vector, which in both cases resulted
in being of the form of Eq. (8.2).
In this section we will proceed in the opposite direction, showing in full generality
how it is possible to transform Eq. (8.2) into a transfer equation having the form
of Eq. (5.24) or (6.78).
From the Stokes vector I we define the tensor Ĩ via the equation (cf. Eqs. (1.40)
and (5.129))
1
3
I˜αβ = (τ ) I (α, β = 1, 2) , (8.55)
2 i=0 i αβ i
where the matrices τi are any representation of the Pauli spin matrices. In particu-
lar, we can take τi = σi or τi = σ̂i (where σi and σ̂i are given by Eqs. (1.17) and
(5.128), respectively) or, more generally, τi = x σi x−1 , where x is any unitary
2 × 2 matrix (x−1 = x† ).1
Whatever representation is chosen, the matrices τi are Hermitian,
† †
τi† = x σi x−1 = x−1 σi† x† = x σi x−1 = τi ,
1 1
3 3
∗
I˜αβ = (τi )∗αβ Ii = (τ ) I = I˜βα .
2 i=0 2 i=0 i βα i
1
The arbitrariness of the matrices τi is strictly connected with the arbitrariness of the choice
of the unit vectors defining the polarization tensor (cf. Sect. 1.8).
RADIATIVE TRANSFER FOR POLARIZED RADIATION 369
Deriving Eq. (8.55) with respect to s , and using Eqs. (8.2) and (8.58), one obtains
the transfer equation for I˜αβ
1
3 3
d ˜
Iαβ (s) = − (τi )αβ (τj )γδ Kij (s) I˜δγ (s) + eαβ (s) , (8.59)
ds 2 i=0 j=0
γδ
where
1
3
eαβ (s) = (τ ) ε (s) .
2 i=0 i αβ i
We now observe that, because of its symmetry (and reality) properties, the ma-
trix K can be written in the following form, suggested by Eq. (6.84)
Kij = Re Tr (τj τi q) , (8.60)
one gets
3
Kij = Re ak Tr (τj τi τk ) .
k=0
The trace is easily evaluated from Eqs. (8.56) and (8.57), which give
(if one of the indices is zero, the trace is given directly from Eq. (8.57)). We obtain
or1
1 1
a0 = η a2 = ηU + i ρU
2 I 2
1 1
a1 = ηQ + i ρQ a3 = ηV + i ρV . (8.63)
2 2
Substituting Eq. (8.60) into Eq. (8.59), and using the remarkable property of the
Pauli spin matrices
3
(τi )αβ (τi )µν = 2 δαν δβµ , (8.64)
i=0
d ˜
Iαβ (s) = − qαγ (s) I˜γβ (s) + qβγ (s)∗ I˜αγ (s) + eαβ (s) ,
ds γ
d
Ĩ(s) = − q(s) Ĩ(s) − Ĩ(s) q † (s) + e(s) , (8.65)
ds
†
where q † , the transpose conjugate of q ( qαβ ∗
= qβα ), is
3
†
q = a∗k τk .
k=0
The above derivation shows that the radiative transfer equation (8.65) is fully
equivalent to Eq. (8.2).
with
E(s, s) = 1 (8.68)
1 being the 2 × 2 identity matrix) and
(1
d d d †
Ĩ(s) = E(s, s ) Ĩ(s ) E † (s, s ) + E(s, s ) Ĩ(s ) E (s, s ) ,
ds ds ds
while substitution of Eq. (8.67) into the right-hand side of Eq. (8.66) gives
d
Ĩ(s) = − q(s) E(s, s ) Ĩ(s ) E † (s, s ) − E(s, s ) Ĩ(s ) E † (s, s ) q † (s) .
ds
Comparison of these two equations yields
d
E(s, s ) = − q(s) E(s, s ) . (8.69)
ds
In a similar way one finds
d
E(s, s ) = E(s, s ) q(s ) . (8.70)
ds
Because of the strict analogy between Eqs. (8.69)-(8.70) and Eqs. (8.9)-(8.10),
we can expand the operator E(s, s ) in the form (cf. Eq. (8.12))
∞ s s s & '
(−1)n
E(s, s ) = 1 + ds1 ds2 · · · dsn P q(s1 ) q(s2 ) · · · q(sn ) ,
n=1
n!
s s s
where P is the Dyson chronological product defined in Eq. (8.13). Moreover, it can
be easily proved, with the help of Eqs. (8.68) and (8.69), that the formal solution
of the non-homogeneous equation (Eq. (8.65)) is
s
Ĩ(s) = E(s, s ) e(s ) E † (s, s ) ds + E(s, s ) Ĩ(s ) E † (s, s ) .
s
372 CHAPTER 8
Analytical forms of the reduced evolution operator E(s, s ) can be found when-
ever
q(s1 ), q(s2 ) = 0 (8.71)
for any couple of points s1 , s2 in the interval (s , s). In this case one gets (cf.
Eq. (8.22)) : s
− q(s ) ds
E(s, s ) = e s . (8.72)
Similarly to Sect. 8.3, one can distinguish three different cases where Eq. (8.71)
is satisfied. As an example, let us consider the case q = const., where Eq. (8.72)
reduces to
−(s−s ) q
E(s, s ) = e .
Use of Eqs. (8.61) and (A5.6) gives
−(s−s ) a0 −(s−s )
a·
τ
E(s, s ) = e e , (8.73)
where
a = (a1 , a2 , a3 ) , τ = (τ1 , τ2 , τ3 ) ,
with ai given by Eqs. (8.63). The exponential in the right-hand side of Eq. (8.73)
can be expanded by a method similar to that of App. 5. The result is1
−1
2 x ηI
sinh(xa)
E(s, s ) = e cosh(xa) 1 − a · τ , (8.74)
a
where
x = (s − s )
√ 1
a = a2 = Λ (
η, ρ
) + i σ Λ− (η , ρ
)
2 +
with Λ± (
η, ρ
) and σ defined in Eqs. (A5.18) and (A5.19), respectively.
The reduced evolution operator E(s, s ) is of course related to the evolution
operator O(s, s ) defined in Sect. 8.2. To find this relation we start from Eq. (8.58),
written at point s , and substitute for Ĩ(s) its expression in terms of the reduced
evolution operator (Eq. (8.67)). Next we use Eq. (8.55), written at point s , and
compare with the definition (8.6) to obtain
1
Oij (s, s ) = Tr τi E(s, s ) τj E † (s, s ) (i, j = 0, 1, 2, 3) . (8.75)
2
This formula shows that the 16 real elements Oij are bilinear combinations of the
4 complex elements Eαβ . The explicit expressions are rather cumbersome and
depend on the representation used for the matrices τi .
1
If (η2 − ρ2 ) = η ·ρ = 0 (so that Λ+ (η , ρ ) = Λ− (η , ρ ) = 0), Eq. (8.74) should be replaced by
1
E(s, s ) = e − 2 x ηI
1 − x a ·
τ .
RADIATIVE TRANSFER FOR POLARIZED RADIATION 373
On the other hand, E(s, s ) depends also on the representation because of its
dependence on the matrix q (see Eqs. (8.69) and (8.61)). It is however possible
to obtain an ‘intrinsic’ relation (i.e. a relation independent of the representation)
between the operators O(s, s ) and E(s, s ) by expanding the latter on the ba-
sis {τi },1
3
E(s, s ) = E (k) (s, s ) τk . (8.76)
k=0
1 (k)
3 3
Oij (s, s ) = E (s, s ) E (l) (s, s )∗ Tr τi τk τj τl , (8.77)
2
k=0 l =0
1
E (k) (s, s ) = Tr τk E(s, s ) .
2
Derivation with respect to s leads, with the help of Eqs. (8.69), (8.61) and (8.76),
to the set of differential equations (independent of the representation)
1
3 3
d (k)
E (s, s ) = − a E (j) (s, s ) Tr τi τj τk . (8.78)
ds 2 i=0 j=0 i
It follows that the general problem of finding the evolution operator is equivalent
to solving a system of 8 linear, homogeneous, real differential equations in the
8 unknowns
Re E (k) (s, s ) , Im E (k) (s, s ) (k = 0, 1, 2, 3) ,
Once the system is solved, the reduced evolution operator E(s, s ) can be computed
from Eq. (8.76), and the evolution operator O(s, s ) from Eq. (8.77).
Equations (8.78) and (8.77) are developed in detail in App. 7. Similar equations
have been obtained, with a different formalism, by Sánchez Almeida (1992).
1
In the case q = const. considered above, E(s, s ) is already expressed in this form (see
Eq. (8.74)).
CHAPTER 9
The problem of line formation in a magnetic field is a classical one in the theory of
stellar atmospheres.1 In its most general formulation it consists in the following.
Consider a wavelength interval of the electromagnetic spectrum where one or sev-
eral spectral lines are present. The radiation contained in the interval flows through
a stellar atmosphere permeated by a magnetic field and interacts with the atoms
(responsible for those spectral lines) via the usual processes of absorption, emis-
sion (spontaneous and stimulated) and scattering. The result is, quite generally,
the formation of absorption (or emission) lines with typical polarization features.
The problem is to relate the polarization features of the emerging radiation, de-
scribed by its Stokes parameters, to the physical properties of the atmosphere and
to the magnetic field vector.
The overall problem is extremely complicated and ultimately requires a full
non-equilibrium (or non-LTE) approach, whose basic physics will be outlined in
Chap. 14. However, in most astrophysical applications only the deepest layers of
stellar atmospheres are involved. In that case, depolarizing collisions can be as-
sumed to be sufficiently strong to destroy any atomic polarization that might be
induced by the (anisotropic and polarized) radiation field. As we will see in the
following, this assumption greatly simplifies the general problem and, in particular,
allows the emission term in the radiative transfer equation to be described by a
scalar source function (which reduces to the Planck function in LTE). The present
chapter is devoted to the subject of line formation in a magnetic field under the
assumption of complete atomic depolarization.
1
In this chapter we use the term ‘stellar atmosphere’ in a rather broad sense. Many of
the results that will be derived in the following sections can as well be applied to laboratory or
astrophysical plasmas of arbitrary geometrical shape, like slabs, cylinders, etc. In many cases,
these can indeed be regarded as ‘pieces’ of a stellar atmosphere.
376 CHAPTER 9
θ
Ω
is the propagation
Fig.9.1. The magnetic field direction is specified by the angles θ and χ. Ω
direction of the radiation while ea is the reference unit vector.
given by Eqs. (7.19), where (because of the isolated spectral line assumption) the
summations over the atomic levels are limited to one lower level (α J ) and one
upper level (αu Ju ), and where (because of the no-atomic-polarization assumption)
we have to substitute
N (2J + 1) ρα J (M ) → N
N (2Ju + 1) ρα J (Mu ) → Nu ,
u u
N and Nu being the number of atoms per unit volume in the lower and upper
level, respectively. Equation (7.19a) reduces to
= hν N B(α J → α J )
ηiA (ν, Ω)
4π u u
J J 1
2
× φ(ν
Tqq (i, Ω) αu Ju Mu , α J M − ν) , (9.2)
3 u
−Mu M −q
M Mu q
where γ is an arbitrary angle (specifying the orientation of the x and y axes) which
These expressions can be deduced from
does not enter the expressions of Tqq (i, Ω).
Table 5.3 which, however, refers to the rotation R ≡ (−γ, −θ, −χ). Performing the
substitutions (γ → −χ , θ → −θ, χ → −γ), we obtain
= 1 1 + cos2 θ
= T (0, Ω)
T−1−1 (0, Ω) 11
4
= − 1 sin2 θ cos 2χ
= T (1, Ω)
T−1−1 (1, Ω) 11
4
= T (2, Ω) 1
= − sin2 θ sin 2χ
T−1−1 (2, Ω) 11
4
= −T (3, Ω)
T−1−1 (3, Ω) = − 1 cos θ
11
2
1
= sin2 θ
T00 (0, Ω)
2
= 1 sin2 θ cos 2χ
T00 (1, Ω)
2
= 1 sin2 θ sin 2χ
T00 (2, Ω)
2
T (3, Ω) =0.
00
Equation (9.2) can then be cast into the form (see also Eqs. (6.87))
where
hν
kLA = N B(α J → αu Ju ) (9.5)
4π
is the frequency-integrated absorption coefficient in the line1 and where
2
Ju J 1
φq = 3 φ(να − ν) (q = −1, 0, +1) . (9.6)
−Mu M −q J Mu , α J M
u u
M Mu
1
Since the φq profiles are normalized to unity in frequency (cf. Eqs. (6.59c) and (2.23a)), we
have
ηIA (ν, Ω)
dν = k A .
L
line
LINE FORMATION IN A MAGNETIC FIELD 379
where
2
Ju J 1
ψq = 3 ψ(να − ν) (q = −1, 0, +1). (9.8)
−Mu M −q u u
J Mu , α J M
M Mu
= k A φ (ν, Ω)
ηIA (ν, Ω)
L I
A
ηQ = k A φ (ν, Ω)
(ν, Ω) = k A ψ (ν, Ω)
ρAQ (ν, Ω)
L Q L Q
A
ηU = k A φ (ν, Ω)
(ν, Ω) = k A ψ (ν, Ω)
ρAU (ν, Ω)
L U L U
= k A φ (ν, Ω)
ηVA (ν, Ω) = k A ψ (ν, Ω)
ρAV (ν, Ω) ,
L V L V
with
Using these notations, one gets for the stimulated emission coefficients (Eqs. (7.19b)
and (7.19d))
380 CHAPTER 9
= η S (ν, Ω)
η0S (ν, Ω) = k S φ (ν, Ω)
I L I
= η S (ν, Ω)
η1S (ν, Ω) = k S φ (ν, Ω)
Q L Q
= η S (ν, Ω)
η2S (ν, Ω) = k S φ (ν, Ω)
U L U
= η S (ν, Ω)
η3S (ν, Ω) = k S φ (ν, Ω)
V L V
= ρS (ν, Ω)
ρS1 (ν, Ω) = k S ψ (ν, Ω)
Q L Q
= ρS (ν, Ω)
ρS2 (ν, Ω) = k S ψ (ν, Ω)
U L U
= ρS (ν, Ω)
ρS3 (ν, Ω) = k S ψ (ν, Ω)
,
V L V
where
hν
kLS = N B(αu Ju → α J ) ,
4π u
and for the spontaneous emission coefficients (Eq. (7.19e); see also Eqs. (7.8))
= ε (ν, Ω)
ε0 (ν, Ω) = ε φ (ν, Ω)
I L I
= ε (ν, Ω)
ε1 (ν, Ω) = ε φ (ν, Ω)
Q L Q
= ε (ν, Ω)
ε2 (ν, Ω) = ε φ (ν, Ω)
U L U
= ε (ν, Ω)
ε3 (ν, Ω) = ε φ (ν, Ω)
, (9.10)
V L V
where
hν
εL = N A(αu Ju → α J ) . (9.11)
4π u
Collecting all the various terms, the contribution of the line to the radiative transfer
equation for polarized radiation takes the form
⎛ ⎞ ⎛ ⎞⎛ ⎞
I φI φQ φU φV I − SL
d ⎜⎜Q⎟
⎟ ⎜φ
⎜ Q φI ψV −ψU ⎟ ⎜
⎟⎜ Q ⎟
⎟
⎜ ⎟ = − kL ⎜ ⎟⎜ ⎟, (9.12)
ds ⎝ U ⎠ ⎝ φU −ψV φI ψQ ⎠⎝ U ⎠
V φV ψU −ψQ φI V
where
kL = kLA − kLS
is the (frequency-integrated) line absorption coefficient corrected for stimulated
emission, and where
ε
SL = A L S (9.13)
kL − kL
is the line source function. Using the well-known relations among the Einstein
coefficients (see Eqs. (7.8)), these two quantities can be rewritten as
−1
Nu g̃ 2hν03 N g̃u
kL = kL 1 −
A
, SL = 2 −1 , (9.14)
N g̃u c Nu g̃
LINE FORMATION IN A MAGNETIC FIELD 381
where g̃ = (2J + 1) and g̃u = (2Ju + 1) are the degeneracies of the lower and upper
level, respectively. In the particular case of LTE, one simply gets
hν0 hν0 −1
− 2hν03
kL = kLA 1 − e kB T , SL = e kB T − 1 = BP (ν0 ) .
c2
Adding the contributions of continuum processes and line processes, one obtains
from Eqs. (9.1) and (9.12)
⎛ ⎞ ⎛ ⎞⎛ ⎞
I 1 0 0 0 I − Sc
⎜
d ⎜ ⎟ ⎟ ⎜ 0⎟ ⎜ ⎟
Q ⎜0 1 0 ⎟⎜ Q ⎟
⎜ ⎟ = − kc ⎜ ⎟⎜ ⎟
ds ⎝ U ⎠ ⎝0 0 1 0 ⎠⎝ U ⎠
V 0 0 0 1 V
⎛ ⎞⎛ ⎞
φI φQ φU φV I − SL
⎜φ −ψU ⎟ ⎜ ⎟
⎜ φI ψV ⎟⎜ Q ⎟
− kL ⎜ Q ⎟⎜ ⎟. (9.15)
⎝ φU −ψV φI ψQ ⎠⎝ U ⎠
φV ψU −ψQ φI V
It should be noticed that the equations derived in this section prove some prop-
erties of the Zeeman patterns that were anticipated in Sect. 3.1. First of all, the
φq and ψq profiles in Eqs. (9.6) and (9.8) can be written in the form
J J
φq = Sq u (M , Mu ) φ(να J Mu , α J M − ν)
u u
M Mu
J J
ψq = Sq u (M , Mu ) ψ(να J Mu , α J M − ν) , (9.16)
u u
M Mu
J J
where Eq. (3.16) has been used. Thus the quantity Sq u (M , Mu ) is indeed the
relative strength of the Zeeman component (σb , π or σr according as q = −1, 0,
+1, respectively) connecting the sublevels (J M ) and (Ju Mu ). The strengths are
normalized as in Eq. (3.17) and satisfy the relations (3.18)-(3.19). Furthermore,
the polarization features of the radiation emitted by each type of component can
be deduced from Eq. (9.10), and it is easily seen that they are just as illustrated
in Sect. 3.1 (cf. Figs. 9.1 and 3.1).
We are now in a position to compare the line transfer equation derived from Quan-
tum Mechanics with the corresponding equation derived from classical physics in
Chap. 5. Direct comparison of Eqs. (9.12) and (5.39) shows that the structure
of the two equations is exactly the same and allows us to relate the line source
382 CHAPTER 9
function SL , as given by Eq. (9.14), with the ‘classical’ source function defined
in Eq. (5.40). Equating the two expressions (and neglecting in SL the contribu-
tion due to stimulated emission, a phenomenon that has no classical analogue)
one finds a relationship between the average energy per degree of freedom of the
classical oscillator, Ē, and the number density of atoms in the lower and upper
level
N g̃
Ē = hν0 u u .
N g̃
This is just the expression that is obtained for a two-level atom having a proba-
bility Nu /g̃u of being found in the upper level (of energy hν0 ) and a much larger
probability N /g̃ of being found in the lower level (of zero energy).
Comparison of Eqs. (9.12) and (5.39) also shows that the elements of the propa-
gation matrix coincide provided the following transformations are performed (see
Eqs. (9.9) and (5.36), and Figs. 9.1 and 5.1)
πe20
N → kL (9.17)
mc
and
φb → φ−1 ψb → ψ−1
φp → φ0 ψp → ψ0
φr → φ1 ψr → ψ1 . (9.18)
πe20
kL → kLA = N f (α J → αu Ju ) , (9.19)
mc
where f (α J → αu Ju ) is the oscillator strength of the transition. The dimension-
less parameter f can be regarded as an efficiency parameter for the transition:
multiplication of the number of atoms in the lower level by f yields the equivalent
number of classical oscillators. Comparison of Eqs. (9.19) and (9.5) shows that the
oscillator strength is given by
hν mc
f (α J → αu Ju ) = B(α J → αu Ju ) ,
4π 2 e20
8π 2 mν
f (α J → αu Ju ) = | α J d αu Ju |2
3 h e20
1 mc3 2Ju + 1
= 2 A(αu Ju → α J ) . (9.20)
8π e20 ν 2 2J + 1
LINE FORMATION IN A MAGNETIC FIELD 383
The oscillator strength is generally a quite small number which can attain a value
of order unity only for the strongest spectral lines.
The correspondence between classical oscillators and atomic transitions also ap-
pears from transformation (9.18). Some qualitative aspects of this correspondence
have been anticipated in Sect. 3.2. More quantitatively, recalling Eqs. (9.3) and
(6.59a), and neglecting the frequency shifts, we have from Eqs. (9.16)
J J 1 Γ
φq = Sq u (M , Mu )
π Γ 2 + [ ν0 + νL (gu Mu − g M ) − ν ]2
M Mu
J J 1 ν0 + νL (gu Mu − g M ) − ν
ψq = Sq u (M , Mu ) , (9.21)
π Γ 2 + [ ν0 + νL (gu Mu − g M ) − ν ]2
M Mu
The absorption and anomalous dispersion profiles given by Eqs. (9.21) are still
inadequate to describe the physical situation of a stellar atmosphere. This is be-
cause the quantum theory presented in Chap. 6 is based on the assumption that
384 CHAPTER 9
the material system (the atom in this case) interacting with the radiation field is
isolated and at rest. The obvious consequence is that the absorption and anoma-
lous dispersion profiles are broadened by a single mechanism, namely natural (or
radiation) damping.
Collisional broadening can be carried into our formalism in a rather straightfor-
ward way. The effect of collisions is simply described by replacing Γ , the natural
line width appearing in Eqs. (9.21), with the sum Γ of the natural and collisional
width, defined by
Γ = Γ + 2Γc . (9.22)
This result has been proved in Sect. 5.14 within the framework of the classical
theory (see Eq. (5.180)). It still holds, however, when more refined, quantum-
mechanical models of the atom-radiation interaction are considered (see for instance
Loudon, 1983).
To get an estimate of the collisional width, we recall that Γc is proportional to
the frequency f of perturbing collisions (Γc = f /4π, see Sect. 5.14), and that f
has already been estimated in Sect. 7.13.c in connection with our discussion of
depolarizing rates.1 If we assume that perturbing collisions are mainly due to
long-range interactions with neutral perturbers (hydrogen atoms),2 we get from
Eqs. (5.181) and (7.105)3
0.4 0.6
Γc = 0.68 C6 (αu Ju ) v̄r nH ,
where C6 (αu Ju ) is the Van der Waals constant for the upper level involved in the
transition, v̄r the average relative velocity atom-perturber, and nH the number
density of hydrogen atoms. Using Eq. (7.108) we can also write
−11
2 0.4 1
0.3
Γc 1.1 × 10 r (αu Ju ) T 1+ nH s−1 ,
µ
where r2 (αu Ju ) is the mean square radius of the electronic cloud in the level
(αu Ju ) expressed in atomic units, T is the temperature in K, µ is the atomic weight
of the atom, and nH is expressed in cm−3 .
When collisions are taken into account, the profiles φq and ψq in Eqs. (9.21)
become
1
It should be remarked that both the phenomena of collisional broadening and atomic-level
depolarization are basically due to the same physical mechanism, as shown in Sect. 5.14.
2
By so doing, we neglect collisions with electrons and ions. In typical stellar atmospheres,
their contribution to collisional line broadening is quite modest (with the exception of hydrogen
and helium lines). An excellent discussion of collisional broadening due to charged particles can
be found in Mihalas (1978) or in Griem (1974).
3
This result agrees with the often quoted expression of Unsöld (1955)
2/5
ΓU 17 C6 v̄r3/5 nH .
J J 1 Γ
φq = Sq u (M , Mu )
π Γ + [ ν0 + νL (gu Mu − g M ) − ν ]2
2
M Mu
J J 1 ν0 + νL (gu Mu − g M ) − ν
ψq = Sq u (M , Mu )
,
π Γ + [ ν0 + νL (gu Mu − g M ) − ν ]2
2
M Mu
where
Γ = Γ + 2Γc .
As far as the motion of the atomic system is concerned, we already noticed (see
the discussion following Eq. (6.31)) that the quantum theory presented in Chap. 6
cannot account for the Doppler effect. However, this effect can be reintroduced in
our formalism by the same arguments outlined in Sect. 5.4. By so doing we obtain
J J 1 1
φq = Sq u (M , Mu ) √ H v − vA + vB (gu Mu − g M ), a
π ∆νD
M Mu
J J 1 1
ψq = Sq u (M , Mu ) √ L v − vA + vB (gu Mu − g M ), a , (9.23)
π ∆νD
M Mu
where the functions H(v, a) and L(v, a) are defined in Eqs. (5.45), and where
wT Γ
∆νD = ν0 , a=
c ∆νD
νL wA ν w ν0 − ν
vB = , vA = = 0 A , v= . (9.24)
∆νD wT c ∆νD ∆νD
Note that these equations are the same as Eqs. (5.43) except for the definition of the
damping constant a, where Γ is substituted for Γ in order to allow for collisional
broadening. We recall that wA and wT represent the line-of-sight component of
the bulk velocity of the ambient medium and the thermal velocity, respectively.
As mentioned in Sect. 5.4, the various quantities entering the definitions of the
dimensionless parameters v, vA , vB , a can be expressed in wavelength units rather
than in frequency units. Introducing the Doppler width in wavelength units
λ20 w
∆λD = ∆νD = λ0 T , (9.25)
c c
we can write (cf. Eqs. (5.47))
λ − λ0 λ20 Γ λ0 wA
v= , a= , vA =
∆λD c ∆λD c ∆λD
λ20 νL λ20 e0 B ∆λB
vB = = 2
= . (9.26)
c ∆λD 4πmc ∆λD ∆λD
386 CHAPTER 9
For practical applications, and especially for those concerning the diagnostics of
magnetic fields in the solar and stellar atmospheres, it may be convenient to write
the transfer equation (9.15) in other forms, using different definitions for the ab-
sorption and anomalous dispersion profiles and/or substituting optical depth for
geometrical depth.
Consider first the propagation matrix multiplying kL in the right-hand side of
Eq. (9.15). This matrix contains the seven independent quantities φI , φQ , φU , φV ,
ψQ , ψU , ψV defined in Eqs. (9.9), which are linear combinations of the profiles φq
or ψq defined in Eqs. (9.23). It can be easily shown via Eqs. (3.17) and (5.60) that
the profiles φq are normalized to unity in frequency,
∞
φq dν = 1 .
−∞
ηq = ∆νD φq
J J 1
= Sq u (M , Mu ) √ H v − vA + vB (gu Mu − g M ), a (9.27a)
π
M Mu
and, by analogy
ρq = ∆νD ψq
J J 1
= Sq u (M , Mu ) √ L v − vA + vB (gu Mu − g M ), a , (9.27b)
π
M Mu
we obviously have
∞
ηq dv = 1 , (9.28)
−∞
ηb = η−1 ρb = ρ−1
ηp = η0 ρp = ρ0
ηr = η1 ρr = ρ1 , (9.29)
LINE FORMATION IN A MAGNETIC FIELD 387
the transfer equation for polarized radiation can be written in the form
⎛ ⎞ ⎧⎛ ⎞⎛ ⎞
I ⎪
⎪ 1 0 0 0 I − Sc
⎪ ⎜
⎜
d ⎜Q⎟ ⎟ ⎨
⎜0 1 0 0⎟ ⎜
⎟⎜ Q ⎟
⎟
⎜ ⎟ = − kc ⎜ ⎟⎜ ⎟
ds ⎝ U ⎠ ⎪
⎪ ⎝0 0 1 0 ⎠⎝ U ⎠
⎪
⎩
V 0 0 0 1 V
⎛ ⎞⎛ ⎞⎫
hI hQ hU hV I − SL ⎪
⎪
⎜h −rU ⎟ ⎜ ⎟⎪
⎬
⎜ hI rV ⎟⎜ Q ⎟
+ κL ⎜ Q ⎟⎜ ⎟ , (9.30)
⎝ hU −rV hI rQ ⎠⎝ U ⎠⎪ ⎪
⎪
⎭
hV rU −rQ hI V
and where
1 ηb + ηr
hI = ηp sin2 θ + 1 + cos2 θ
2 2
1 ηb + ηr
hQ = ηp − sin2 θ cos 2χ
2 2
1 η + ηr
hU = ηp − b sin2 θ sin 2χ
2 2
1
hV = ηr − ηb cos θ
2
1 ρb + ρr
rQ = ρp − sin2 θ cos 2χ
2 2
1 ρ + ρr
rU = ρp − b sin2 θ sin 2χ
2 2
1
rV = ρr − ρb cos θ . (9.32)
2
With the present definitions,1 the ratio between the line absorption coefficient at
line center and the continuum absorption coefficient is given, for zero magnetic field
1
Note that κL , as well as hI , hQ , hU , hV , rQ , rU , rV , are dimensionless quantities. For
the sake of brevity, the parameter κL will be sometimes referred to in the following as the ‘ratio
between line and continuum absorption coefficients’.
388 CHAPTER 9
ray path
top
α
stellar
atmosphere
bottom
Fig.9.2. The coordinates s and z generally used for radiative transfer in stellar atmospheres.
√
and no damping, by κL / π (see Eqs. (5.55), (9.27), (3.17) and (9.32)). Different
definitions can be found in the√literature, where the symbol √ η0 is often used to
represent the ratio kL /(kc ∆νD π ) (thus resulting a factor π smaller than κL ),
and where the profiles ηb , ηp , ηr , ρb , ρp , ρr are defined as in Eqs. (9.29) and
√ √
(9.27) without the factor 1 / π (thus resulting a factor π larger than ours).
When dealing with radiative transfer in astrophysical plasmas, it is generally
convenient to refer to the familiar concept of optical depth. The coordinate s mea-
sured along the ray path can be replaced by the optical depth τc in the continuum
adjacent to the line, or by the optical depth τr in the continuum at a given reference
frequency νr , or by the optical depth τL in the line. These quantities are defined
by
kL
dτc = −kc (ν0 ) ds , dτr = −kc (νr ) ds , dτL = − ds , (9.33)
∆νD
where the minus sign means that the increasing direction for optical depth is op-
posite to the propagation direction. For plane-parallel atmospheres, optical depths
can be referred to the coordinate z measured outward along the vertical to the
atmosphere. If we denote by t such optical depths and if α is the angle defined in
Fig. 9.2, we obviously have
where
1 k (ν ) ∆νD
κc = = c 0 . (9.37)
κL kL
Particularly important is the case of LTE, where both the source functions Sc and
SL reduce to the Planck function BP ; in this case Eq. (9.35) can be written in the
form
⎛ ⎞ ⎛ ⎞⎛ ⎞
I 1 + kI kQ kU kV I − BP
d ⎜ ⎟ ⎜
⎜ Q ⎟ ⎜ kQ 1 + kI fV −fU ⎟ ⎜
⎟⎜ Q ⎟
⎟
⎜ ⎟=⎜ ⎟⎜ ⎟ , (9.38)
dτc ⎝ U ⎠ ⎝ kU −fV 1 + kI fQ ⎠⎝ U ⎠
V kV fU −fQ 1 + kI V
where
kI = κL hI
kQ = κL hQ fQ = κL rQ
kU = κL hU fU = κL rU
kV = κL hV fV = κL rV , (9.39)
Let us consider the transfer equation in the form (9.35). For any wavelength λ,
Eq. (9.35) is a system of four linear, first-order, ordinary differential equations in
390 CHAPTER 9
the Stokes parameters I(τc ), Q(τc ), U (τc ), V (τc ) characterizing a radiation beam
flowing along a given direction Ω. The solution to the system requires a set of
boundary conditions, which depend of course on the specific physical situation.
Here we restrict attention to the case of a semi-infinite stellar atmosphere.
If the atmosphere is not illuminated from outside (obviously this is not the case
for a star in a binary system), the boundary conditions at the outermost layer are
simply
I(τc = 0) = Q(τc = 0) = U (τc = 0) = V (τc = 0) = 0 (9.40)
for any λ and any inward propagation direction ( Ω · k < 0, with k the outward-
directed unit vector perpendicular to the stellar surface). The boundary conditions
in the deep layers are more delicate and require a previous discussion of the source
functions Sc and SL . It is reasonable to assume that the atmospheric plasma at
large optical depths is in thermodynamic equilibrium, so that
lim Sc (τc ) = lim SL (τc ) = BP T (τc ) . (9.41)
τc →∞ τc →∞
for any λ and any Ω. Equations (9.40)-(9.42) still hold if the optical depth τ is
c
replaced by any of the optical depths considered in Sect. 9.4.
Once the propagation direction and the boundary conditions are specified, the
Stokes parameters at any assigned optical depth depend on the coefficients of
Eq. (9.35), which, in general, are themselves functions of optical depth. Inspection
of Eqs. (9.35), (9.32), (9.29), (9.27) and (9.26) shows that the value of the Stokes
parameters for a specific spectral line at a given distance (λ − λ0 ) from line center
is determined by the following parameters:
– the continuum source function Sc , which is generally well-approximated by the
local Planck function BP ;
– the line source function SL , which reduces to the Planck function only when the
LTE approximation is valid;
– the ratio κL between line and continuum absorption coefficient;
– the magnetic field intensity B, entering the definition of vB ;
– the inclination angle θ of the magnetic field vector defined in Fig. 9.1;
– the azimuth angle χ of the magnetic field vector, also defined in Fig. 9.1;
– the Doppler width of the line ∆λD , entering the definitions of the reduced wave-
length v, the damping constant a, and the quantities vA and vB ;
– the natural plus collisional width Γ , defined in Eq. (9.22), entering the damping
constant a ;
– the line-of-sight component of the macroscopic velocity of the ambient medium
wA , entering the definition of vA .
LINE FORMATION IN A MAGNETIC FIELD 391
In addition, the Stokes parameters depend on the Zeeman pattern of the line and
on the wavelength λ0 which affects the parameters vB , vA and a.
In the special case where the parameter wA is independent of optical depth, its
effect on the Stokes parameters profiles is just a global wavelength shift of the
spectrum. It may then be convenient to neglect the term vA in Eqs. (9.27) and use
a different definition for the reduced wavelength,
λ − λ0
v ≡ v − vA = , (9.43)
∆λD
where , w -
λ0 = λ0 1 + A . (9.44)
c
This case will be referred to in the following as the constant velocity case.1 The
opposite case, where wA is variable with optical depth, will be referred to as the
velocity gradient case, although, more properly, one should speak of ‘the case where
the line-of-sight component of the bulk velocity of the ambient medium is variable
with optical depth’.
The wavelength dependence of the Stokes parameters emerging from a stellar
atmosphere reflects, broadly speaking, the wavelength dependence of the quantities
hI , hQ , hU , hV , rQ , rU , rV defined in Eqs. (9.32). As an illustrative example,
Fig. 9.3 shows this dependence for different Zeeman patterns and for assigned values
of the relevant parameters. The quantities hU and rU , not shown in the figure,
vanish identically since the azimuth angle χ has been set to zero.
The existence of wavelength symmetries about line center appears clearly from
Fig. 9.3, and a general proof of this fact can be easily obtained. From Eqs. (9.27)
we get, with the help of Eqs. (3.18), (5.59), and (9.26)
where , w -
∆λ = λ − λ0 1 + A (9.45)
c
is the wavelength distance from (local) line center. Thus from Eqs. (9.29)
hI (−∆λ) = hI (∆λ)
hQ (−∆λ) = hQ (∆λ) rQ (−∆λ) = − rQ (∆λ)
hU (−∆λ) = hU (∆λ) rU (−∆λ) = − rU (∆λ)
hV (−∆λ) = −hV (∆λ) rV (−∆λ) = rV (∆λ) , (9.47)
1
Obviously, it includes the more restrictive case of a static atmosphere (wA = 0).
392 CHAPTER 9
Fig.9.3. The elements of the propagation matrix are plotted as functions of the reduced wavelength
for the following transitions (cf. Fig.3.2 on p. 83): (a) 1 S0 − 1 P1 (normal Zeeman triplet); (b)
5 D − 7 D ; (c) 5 P − 5 F ; (d) 5 D − 5 F ; (e) 5 P − 5 D ; (f) 5 D − 5 G . The relevant parameters
2 3 1 1 3 3 3 4 1 2
are: vA = 0, vB = 1.5, θ = 60◦ , χ = 0◦ , a = 0.05. Note that cases (c) and (f) show characteristic
reversals in hQ , hV , rQ , rV relative to the other cases. Transition (c) corresponds to ḡ > 0 and
Ḡ < 0 (see Sect. 9.6), while transition (f) corresponds to ḡ < 0 and Ḡ < 0.
1
A different proof of the properties expressed by Eqs. (9.54) and (9.56), based on the analogy
between the transfer equation and the motion of the representative point in the Poincaré sphere,
has been presented by Landi Degl’Innocenti and Landi Degl’Innocenti (1981).
LINE FORMATION IN A MAGNETIC FIELD 393
Consider again Eq. (9.35), which we rewrite in the compact matrix form
dI
= I − Sc U + κ L H I − SL U , (9.48)
dτc
where
I = (I, Q, U, V )†
and ⎛ ⎞
hI hQ hU hV
⎜h −rU ⎟
⎜ hI rV ⎟
H =⎜ Q ⎟. (9.50)
⎝ hU −rV hI rQ ⎠
hV rU −rQ hI
Equations (9.47) show that wavelength inversion about (local) line center corre-
sponds to a unitary similarity transformation of the matrix H
X I (b) (−∆λ) = I (b) (∆λ), we can conclude that the two vectors coincide, or, in
other words
I(−∆λ) = I(∆λ)
Q(−∆λ) = Q(∆λ)
U (−∆λ) = U (∆λ)
V (−∆λ) = −V (∆λ) (9.54)
at any optical depth.
Another general property of the solutions to the transfer equation can be proved
under certain assumptions. It concerns the local inversion of the magnetic field
direction at each point along the ray path,
)→B
B(τ (τ ) = −B(τ
).
c c c
χ
θ
−Ω Ω
Fig.9.4. The angles θ and χ specifying the magnetic field direction are referred to the right-handed
The unit vector e is parallel to e .
orthogonal system (ea ,eb , Ω). a a
provided the boundary values I (b) , I (b) are themselves connected by these rela-
tions. In the preferred reference frame (Sect. 5.5) we simply have a sign switch of
the Stokes parameters U and V . It should be remarked that, while the magnetic
field strength and inclination (as well as the other parameters affecting the trans-
fer equation) are allowed to vary with optical depth, the azimuth angle must be
constant in order for Eqs. (9.56) to hold.
A further symmetry property of the solutions to the transfer equation concerns
the 180◦ rotation of the magnetic field vector about the propagation direction, at
each point along the ray path. This transformation, corresponding to
leaves the propagation matrix unchanged, so that the Stokes parameters are also
unchanged. This is an obvious consequence of the invariance of the Stokes param-
eters under a 180◦ rotation of the reference frame about the propagation direction.
Finally, we point out a symmetry property relating the transfer equations of two
radiation beams flowing in opposite directions. Let (ea , eb ) and (ea , eb ) denote the
unit vectors defining the Stokes parameters for the directions Ω and −Ω, respec-
tively. We choose the reference directions ea and ea to be the same. If the magnetic
field vector at a given point is specified in the reference system (ea , eb , Ω) by the
angles θ and χ (see Fig. 9.4), the corresponding angles in the system (ea , eb , −Ω)
are given by
θ = π − θ , χ = −χ . (9.58)
If, in addition, a macroscopic velocity field is present, having a component wA
its component along the opposite direction is w = −w .
along the direction Ω, A A
The relation between the propagation matrices H and H (referring to −Ω and Ω,
respectively) can be easily deduced. From Eqs. (9.27) we obtain, with the help of
Eqs. (5.59) and (3.18)
where
∆λ = λ − λ0 (9.59)
396 CHAPTER 9
is the wavelength distance from the rest line center. Bearing in mind Eqs. (9.29)
and (9.58), we have from Eqs. (9.32)
It can be shown with some matrix algebra that this transformation can be written
as a unitary similarity transformation
where
⎛ ⎞
1 0 0 0
⎜0 1 0 0⎟
Z = Z −1 =⎝ ⎠ .
0 0 −1 0
0 0 0 1
Denoting by I and I the Stokes vectors of the radiation beams flowing in the
and Ω
directions −Ω respectively, we have from Eq. (9.48)
d
I (∆λ) = I (∆λ) − Sc U + κL H (∆λ) I (∆λ) − SL U ,
dτc
d
Z I (∆λ) = Z I (∆λ) − Sc U + κL H(−∆λ) Z I (∆λ) − SL U ,
dτc
Ĩ = Z I = (I , Q , −U , V )† ,
d & '
Ĩ (∆λ) = − Ĩ (∆λ) − Sc U + κL H(−∆λ) Ĩ (∆λ) − SL U . (9.61)
dτc
It follows that the transfer equations for Ĩ (∆λ) and I(−∆λ) are the same except
for a sign switch.
LINE FORMATION IN A MAGNETIC FIELD 397
When the magnetic field is weak or, more precisely, when the Zeeman splitting is
much smaller than the typical width of the profiles ηq and ρq defined in Eqs. (9.27),
it is possible to deduce some properties of the solutions to the transfer equation
without actually solving it. The above condition means
∆λB
ḡ vB = ḡ 1,
∆λD
where ḡ is the effective Landé factor. This sets an upper bound to the magnetic
field strength, which depends (see Eq. (5.49)) both on the spectral line (through
its wavelength and the atomic weight of the element) and on the ambient medium
(through its kinetic temperature and microturbulent velocity). For instance, for
an iron line of optical wavelength formed in the solar atmosphere, we have from
Eq. (5.49)
ḡ B 2500 G .
The following derivation is based on an original, perturbative scheme presented
by Landi Degl’Innocenti and Landi Degl’Innocenti (1973) and later extended by
Jefferies et al. (1989).
Let us consider a radiation beam flowing along a given direction in a stellar
) the magnetic field vector at optical depth τ
atmosphere, and let us denote by B(τ c c
along the ray path. The transfer equation is given by Eq. (9.35). Now suppose this
reference (or ‘true’) atmosphere is replaced by a ‘fictitious’ atmosphere identical to
the former except for the substitution
) → α B(τ
B(τ ), (9.62)
c c
which entails
vB (τc ) → α vB (τc ) ,
where α is a dimensionless, real parameter which at the end of the calculation will
be set to 1. All the quantities in Eq. (9.35) – except the source functions and
the ratio κL between line and continuum absorption coefficient – become functions
of α, hence they can be expanded in power series of α. For the Stokes parameters
we can write1
I = I0 + I1 α + I2 α2 + I3 α3 + · · ·
Q = Q0 + Q1 α + Q2 α2 + Q3 α3 + · · · (9.63)
with similar equations for U and V . To obtain the series expansions of the elements
of the propagation matrix, we write the ηq and ρq profiles in the form
∞ ∞
∂ n ηq αn ∂ n ρq αn
ηq = , ρq = . (9.64)
n=0
∂αn α=0 n! n=0
∂αn α=0 n!
1
In this section we denote by I0 , I1 , I2 . . . the coefficients of the power series expansion
of the intensity I (first Stokes parameter). These should not be confused with the symbols
(I0 , I1 , I2 , I3 ) used in other sections to denote the full Stokes vector (I, Q, U, V ).
398 CHAPTER 9
The zero-order coefficients are given by (see Eqs. (9.27) and (3.17))
1
ηq α=0 = √ H(v − vA , a) ≡ η
π
1
ρq α=0 = √ L(v − vA , a) ≡ ρ , (9.65)
π
thus they are simply the absorption profile and the associated dispersion profile
for zero magnetic field. The higher-order coefficients can be expressed in terms of
derivatives of η and ρ with respect to wavelength. From Eqs. (9.27) and (9.26) we
have
∂ n ηq J J ∂nη
= Sq u (M , Mu ) (gu Mu − g M )n ∆λnB
∂αn α=0 ∂λn
M Mu
n
∂ ρq J J ∂nρ
= Sq u (M , Mu ) (gu Mu − g M )n ∆λnB , (9.66)
∂αn α=0 ∂λn
M Mu
where
∂nη 1 ∂n
= n
√ H(v − vA , a)
∂λn ∆λD π ∂v n
∂nρ 1 ∂n
n
= n
√ L(v − vA , a) . (9.67)
∂λ ∆λD π ∂v n
The sum over M and Mu appearing in Eqs. (9.66) has already been calculated in
Sect. 3.3. Using Eqs. (3.53), (3.20) and (3.54) we get
∂ n ηq ∂ nη
= (−1)n G(n) n
q (J , Ju ) ∆λB
∂αn α=0 ∂λn
n
∂ ρq ∂nρ
= (−1)n G(n) n
q (J , Ju ) ∆λB , (9.68)
∂αn α=0 ∂λn
(n)
where the symbols Gq , defined in Eq. (3.55), are expressed analytically (up to
n = 4) in Table 3.4. Substituting into Eqs. (9.64) and bearing in mind Eqs. (9.29),
we obtain from Eqs. (9.32) the following expansions up to third-order terms in α
1 2
hI = η + α ∆λ2B η G(2)
0 sin θ + G1 (1 + cos θ) + · · ·
2 (2) 2
4
1 2
hQ = α ∆λ2B η G(2)
0 − G1
(2)
sin2 θ cos 2χ + · · ·
4
1
hU = α2 ∆λ2B η G(2)0 − G1
(2)
sin2 θ sin 2χ + · · ·
4
1 3
hV = − α ∆λB η G(1)
1 cos θ − α ∆λ3B η G(3)
1 cos θ + · · ·
6
LINE FORMATION IN A MAGNETIC FIELD 399
1 2
rQ = α ∆λ2B ρ G(2)
0 − G1
(2)
sin2 θ cos 2χ + · · ·
4
1
rU = α2 ∆λ2B ρ G(2)0 − G1
(2)
sin2 θ sin 2χ + · · ·
4
1 3
rV = − α ∆λB ρ G(1)
1 cos θ − α ∆λ3B ρ G(3)1 cos θ + · · · , (9.69)
6
where
∂η ∂ρ ∂2η
η = , ρ = , η = , etc. (9.70)
∂λ ∂λ ∂λ2
Substitution of the series expansions (9.69) and (9.63) into the transfer equation
(9.35) leads to four equations each representing an equality between two power
series in α. Since α is arbitrary, the coefficients of each power must separately be
equal. To the lowest perturbative order we have
dI0 dU0
= I0 − Sc + κL η (I0 − SL ) = (1 + κL η) U0
dτc dτc
dQ0 dV0
= (1 + κL η) Q0 = (1 + κL η) V0 . (9.71)
dτc dτc
... = 0 . (9.72)
any direction, the intensity that would be present in the same atmosphere without
magnetic field. This was to be expected since, according to Eqs. (9.63) and (9.62),
the zero-order terms correspond to a non-magnetic atmosphere.
Proceeding along the same lines, we obtain a system of differential equations for
each perturbative order. The results up to the third order are the following:
– Zero order
I = I¯
0
Q0 = U0 = V0 = 0 .
– First order
dV1
= (1 + κL η) V1 − ∆λB G(1)
1 cos θ κL η (I0 − SL )
dτc
I1 = Q1 = U1 = 0 .
– Second order
dI2 1
0 sin θ + G1 (1 + cos θ) κL η (I0 − SL )
= (1 + κL η) I2 + ∆λ2B G(2) 2 (2) 2
dτc 4
− ∆λB G(1)
1 cos θ κL η V1
dQ2 1
0 − G1
= (1 + κL η) Q2 + ∆λ2B G(2) (2)
sin2 θ cos 2χ κL η (I0 − SL )
dτc 4
dU2 1
0 − G1
= (1 + κL η) U2 + ∆λ2B G(2) (2)
sin2 θ sin 2χ κL η (I0 − SL )
dτc 4
V2 = 0 .
– Third order
I3 = 0
dQ3
= (1 + κL η) Q3 − ∆λB G(1)
1 cos θ κL ρ U2
dτc
1
− ∆λ2B G(2) 0 − G1
(2)
sin2 θ sin 2χ κL ρ V1
4
dU3
= (1 + κL η) U3 + ∆λB G(1)
1 cos θ κL ρ Q2
dτc
1
+ ∆λ2B G(2) 0 − G1
(2)
sin2 θ cos 2χ κL ρ V1
4
dV3 1
= (1 + κL η) V3 − ∆λ3B G(3)
1 cos θ κL η (I0 − SL )
dτc 6
− ∆λB G(1)
1 cos θ κL η I2
1
+ ∆λ2B G(2) 2 (2) 2
0 sin θ + G1 (1 + cos θ) κL η V1 .
4
LINE FORMATION IN A MAGNETIC FIELD 401
I = I0 , Q = Q2 , U = U2 , V = V1 . (9.73)
Thus the transfer equation for the Stokes parameters reduces, under the limit of
weak magnetic field, to the following system
dI
= (1 + κL η) I − (Sc + κL η SL ) (9.74a)
dτc
dV
= (1 + κL η) V − ∆λB ḡ cos θ κL η (I − SL ) (9.74b)
dτc
dQ 1
= (1 + κL η) Q − ∆λ2B Ḡ sin2 θ cos 2χ κL η (I − SL ) (9.74c)
dτc 4
dU 1
= (1 + κL η) U − ∆λ2B Ḡ sin2 θ sin 2χ κL η (I − SL ) , (9.74d)
dτc 4
G(1)
1 = ḡ , (9.75)
with ḡ the effective Landé factor of the line defined in Eq. (3.44), and where
(2)
1 − G0 .
Ḡ = G(2) (9.76)
Ḡ = ḡ 2 − δ , (9.77)
where the notation of Eq. (9.70) has been used. Multiplying both sides by the
factor −∆λB ḡ cos θ, and assuming this factor (i.e. the line-of-sight component of
the magnetic field) to be independent of optical depth, we see that the resulting
equation is formally identical to Eq. (9.74b), which means that the functions
∂I
V and − ∆λB ḡ cos θ
∂λ
obey the same differential equation. On the other hand, since from Eqs. (9.73)
and (9.72)
∂I ∂
lim V (τc ) = 0 , lim = B T (τc ) = 0 ,
τc →∞ τc →∞ ∂λ ∂λ P
they also satisfy the same boundary conditions. We conclude that, at any optical
depth
∂I
V (λ) = − ∆λB ḡ cos θ . (9.80)
∂λ
ii) Next consider Eqs. (9.74c) and (9.74d). Multiplying the former by sin 2χ , the
latter by cos 2χ , subtracting the resulting equations and assuming χ to be inde-
pendent of optical depth, we get
d
sin 2χ Q − cos 2χ U = (1 + κL η) sin 2χ Q − cos 2χ U ,
dτc
sin 2χ Q − cos 2χ U = 0 ,
or, for Q = 0
U (λ)
= tan 2χ . (9.81)
Q(λ)
LINE FORMATION IN A MAGNETIC FIELD 403
iii) If we retain the assumption χ = const., we can choose to measure the Stokes
parameters in the preferred frame (so that χ = 0; see Sect. 5.5). Denoting by Q̃
and Ũ the Stokes parameters in this frame, we have from Eqs. (9.74c) and (9.81)
dQ̃ 1
= (1 + κL η) Q̃ − ∆λ2B Ḡ sin2 θ κL η (I − SL )
dτc 4
Ũ = 0 . (9.82)
On the other hand, derivation of Eq. (9.79) with respect to wavelength yields
2
d ∂ I ∂2I ∂I
= 1 + κ L η + κL η (I − SL ) + 2 κL η .
dτc ∂λ2 ∂λ2 ∂λ
In the constant velocity case, the last term vanishes at the wavelength λ0 cor-
responding to line center (see Eqs. (9.44) and (9.67)). Multiplying the resulting
equation by the factor −∆λ2B Ḡ sin2 θ/4 and assuming this factor (hence the trans-
verse component of the magnetic field) to be independent of optical depth, we
obtain by comparison with the first of Eqs. (9.82) that the two functions
1 ∂2I
Q̃(λ0 ) and − ∆λ2B Ḡ sin2 θ
4 ∂λ2 λ=λ0
satisfy the same differential equation. As the boundary conditions are also the
same, we deduce that, at any optical depth
1 ∂2I
Q̃(λ0 ) = − ∆λ2B Ḡ sin2 θ . (9.83)
4 ∂λ2 λ=λ0
iv) Another expression for Q̃, valid for any λ , can be obtained under a set of more
restrictive assumptions. Consider Eq. (9.79) and multiply both sides by the factor
1 η
− ∆λ2B Ḡ sin2 θ .
4 η
If this factor is independent of optical depth, which requires the τc -independence
of:
a) the transverse component of the magnetic field;
b) the line-of-sight velocity wA ;
c) the parameters ∆λD and Γ controlling the shape of the η profile,
then we see by comparison with the first of Eqs. (9.82) that, at any optical depth1
1 η ∂I
Q̃(λ) = − ∆λ2B Ḡ sin2 θ . (9.84)
4 η ∂λ
1
Note that Eq. (9.84) is meaningless at line center, since Eqs. (9.67) and (9.54) yield
∂η ∂I
= = 0.
∂λ λ=λ ∂λ λ=λ
0 0
The value of Q̃(λ0 ) is however given by the (more general) equation (9.83).
404 CHAPTER 9
TABLE 9.1
Expressions for the Stokes parameters in the weak field limit. The expressions are valid provided
the physical parameters marked by a star are independent of optical depth. B and B⊥ stand
for B cos θ and B sin θ, respectively.
U (λ)
Q(λ)
= tan 2χ (9.81) any λ
∂2I
Q̃(λ0 ) = − 14 ∆λ2B Ḡ sin2 θ ∂λ2 λ
(9.83) line center
0
η
∂I
Q̃(λ) = − 14 ∆λ2B Ḡ sin2 θ η ∂λ
(9.84) any λ
3 1
∂I
Q̃(λw ) = 4
∆λ2B Ḡ sin2 θ λw −λ0 ∂λ λ
(9.85) line wings
w
the profile η can be written in the form (see Eqs. (9.65), (9.43), and (5.57))
1 a ∆λ2D
η= ,
π (λw − λ0 )2
so that
η 3
=− .
η λw − λ0
This ratio is independent of ∆λD and Γ , therefore assumption c) can be dropped
and Eq. (9.84) becomes
3 1 ∂I
Q̃(λw ) = ∆λ2B Ḡ sin2 θ . (9.85)
4 λw − λ0 ∂λ λ=λw
The results just derived are summarized in Table 9.1, together with the assump-
tions required – in addition to the weak field hypothesis – for the various expressions
to hold. In particular, it should be emphasized that Eq. (9.80) is rather general
and that it can also be applied in the ‘velocity gradient case’ (see Sect. 9.5).
Some general features of the Stokes parameters profiles arise from the analytical
expressions in Table 9.1. First we consider Eq. (9.80), and we refer to a typical
stellar atmosphere where both line and continuum source functions are monotoni-
cally increasing functions of optical depth. Such atmosphere produces absorption
LINE FORMATION IN A MAGNETIC FIELD 405
lines and, for any line, the quantity ∂I/∂λ is negative in the blue wing and positive
in the red wing. For any propagation direction making an angle θ < π/2 with the
magnetic field direction, it follows that the V Stokes parameter is positive in the
blue wing and negative in the red wing, while the opposite holds for θ > π/2.1 It
also follows that there is a particular wavelength (more precisely, an odd number
of wavelengths) where the V profile vanishes.
Another consequence of Eq. (9.80) concerns the dependence of circular polar-
ization on the characteristics of spectral lines. Let us suppose that the intensity
profile emerging from a static stellar atmosphere can be written in the form2
η(λ)
I(λ) = Ic 1 − dc , (9.86)
η(λ0 )
where Ic is the intensity of the continuum adjacent to the line and where
Ic − I(λ0 )
dc =
Ic
V ∆λB
≈ cos θ ḡ dc . (9.87)
Ic ∆λD
This formula shows that V /Ic is inversely proportional to ∆λD , which means that
circular polarization in narrow lines is larger than in broad lines. Furthermore, it
is larger in lines of heavy elements compared with light elements, because ∆λD
decreases with increasing atomic weight (see Eqs. (9.25) and (5.48)). For a given
element, the largest circular polarization is produced by lines having the largest
effective Landé factor ḡ , the largest central depression dc , and the largest wave-
length λ0 . This last effect stems from the fact that ∆λB scales as λ20 (Eqs. (9.26))
while ∆λD scales as λ0 (Eq. (9.25)).
The above properties suggest the introduction of a dimensionless parameter, the
circular polarization sensitivity index of a spectral line, having the form
λ0
sV = ḡ dc , (9.88)
λref
where λref is a reference wavelength, like for instance 5000 Å. A search for the
most sensitive lines in the FeI solar spectrum (restricted to the list of about 400
1
We consider here the most frequent case of ḡ > 0. For the very few lines having ḡ < 0 the
situation is reversed.
2
This is generally an acceptable approximation for unsaturated absorption lines. The following
remarks also apply, as a rule, to the more general case where η(λ) in Eq. (9.86) is replaced by a
bell-shaped profile having a typical width ∆λp . For saturated lines one usually has ∆λp > ∆λD .
406 CHAPTER 9
TABLE 9.2
The lines of the FeI solar spectrum are listed in order of decreasing circular polarization sensitivity
index (computed from Eq.(9.88) with λref = 5000 Å). The value of ḡ is calculated according to
L-S coupling. Zeeman triplets are identified by a star in the last column.
unblended lines in the interval 4365-6859 Å given by Stenflo and Lindegren, 1977)
yields the list in Table 9.2.
Let us now turn to the discussion of linear polarization, described by Eqs. (9.83),
(9.84), and (9.85). For an absorption line the quantity [∂ 2 I/∂λ2 ]λ=λ is positive,
0
so that, according to Eq. (9.83), Q̃(λ0 ) is negative when Ḡ is positive and vice
versa. Equation (9.85) shows that the sign of Q̃ in the wings is opposite to the sign
at line center. It follows that the Q̃ profile vanishes at two particular wavelengths
(more precisely, at an even number of wavelengths). These wavelengths can be
easily found only when Eq. (9.84) holds (which implies however more restrictive
assumptions – see Table 9.1). In that case they are the solutions to the equation
η = 0, which, using Eqs. (9.65) and (5.58), can be written in the form1
2a
√ − (1 + 2a2 − 2v 2 ) H(v, a) − 4av L(v, a) = 0 .
π
√
For zero damping (a = 0) this equation has the two solutions v0 = ± 1/ 2. For
a = 0 one can find an analytical, approximate solution if a 1. The calculation,
that is left as an exercise to the reader, leads to the following result
1
v0 = ± √ + α a ,
2
where, denoting by D(x) the Dawson function defined in Eq. (5.56),
.
e √ 1
α = 2 2 D √ − 1 0.2957 .
2π 2
1
We assume here, to avoid formal complications, that the ambient medium velocity wA is
zero. In any case, wA must be constant for Eq. (9.84) to hold.
LINE FORMATION IN A MAGNETIC FIELD 407
Another consequence of Eq. (9.84) concerns the special case where the intensity
profile I(λ) has the form of Eq. (9.86). Since in that case
η ∂ 2 I ; ∂I
= ,
η ∂λ2 ∂λ
Eq. (9.84) becomes
1 ∂2I
Q̃(λ) = − ∆λ2B Ḡ sin2 θ ,
4 ∂λ2
which ‘extends’ the validity of expression (9.83) to all wavelengths (of course this
holds only for weak, unsaturated lines). Use of Eq. (9.67) yields, as an order of
magnitude (cf. Eq. (9.87))
Q̃ ∆λ2B
≈ sin2 θ Ḡ dc ,
Ic ∆λ2D
∆J = 0, ±1 , ∆S = 0, ±1 , ∆L = 0, ±1, ±2 .
It appears from Table 9.4 that both ḡ and Ḡ are zero in some cases. It can be easily
shown that this property, combined with the selection rule ∆J = 0, ±1, implies
1
The analysis has been performed using L-S Landé factors; the average Ḡ was obtained
by weighting the Ḡ factors of individual lines by their central depression.
408 CHAPTER 9
TABLE 9.3
The lines of the FeI solar spectrum are listed in order of decreasing linear polarization sensitivity
index (computed from Eq.(9.89) with λref = 5000 Å ; upper panel). The lower panel lists the
most sensitive lines with Ḡ < 0. Ḡ is calculated in L-S coupling. A star in the last column
identifies Zeeman triplets.
5250.21 1 5D
0 − 7D
1 9.000 7.085
6302.50 816 5P
1 − 5 D0 6.250 6.485
6173.34 62 5P
1 − 5 D0 6.250 5.936
4690.14 820 5P
1 − 7 D1 7.375 4.296
6232.64 816 5P
2 − 5 D1 3.983 3.974
6842.69 1197 5P
1 − 5 P1 6.250 3.921
4704.95 821 5P
1 − 5D
0 6.250 3.885
5506.78 15 5F
2 − 5 D3 3.900 3.884
6136.99 62 5P
2 − 5 D1 3.983 3.673
6336.83 816 5P
1 − 5 D1 3.250 3.644
5197.93 1091 5F − 5P −3.125 −1.351
1 1
4598.12 554 5D − 5F −1.125 −0.740
1 1
5705.46 1087 5F − 5D −1.125 −0.583
1 1
5432.95 1143 5G − 5F −0.600 −0.453
2 2
5916.25 170 3H − 3F −0.559 −0.422
4 4
TABLE 9.4
A star identifies the transitions having ḡ = 0 or Ḡ = 0. The Landé factors are calculated in L-S
coupling. The list includes all the transitions obeying the constraints specified in the text.
that the Landé factors of both the upper and lower level are zero. Therefore, these
transitions produce no polarization whatever the magnetic field strength.
As for the transitions with ḡ = 0 and Ḡ = 0, it can be proved that the Landé
factors of both levels are non-zero. According to Eq. (9.80), such transitions pro-
duce no circular polarization under the weak field limit. They can produce circular
polarization if the magnetic field is not weak.
According to Eq. (9.84), the transitions with Ḡ = 0 and ḡ = 0 produce no linear
polarization in the weak field limit. They can in principle give rise to linear polar-
ization when the magnetic field is strong. However, as shown by Sánchez Almeida
and Vela Villahoz (1993), all the transitions in Table 9.4 involving the terms 4 D1/2
or 6 G3/2 are characterized by peculiar Zeeman patterns where a compensation oc-
curs between σ and π components, which eventually causes the elements hQ , hU ,
rQ , rU of the propagation matrix to be identically zero. It follows (cf. Eq. (9.35))
that the transfer equations for the Stokes parameters Q and U are decoupled from
those for I and V ; if the boundary values for Q and U are also zero (Eqs. (9.42)), the
linear polarization for such transitions is zero whatever the magnetic field strength.
As a concluding remark, we emphasize that most of the results contained in
this section, and especially those related to the shape of the Stokes profiles and
to the dependence of polarization on the characteristics of spectral lines, remain
qualitatively valid even when the weak field assumption is dropped, that is when
∆λB ≈ ∆λD .
Consider again the transfer equation in the form (9.35). A formal solution to this
equation is easily obtained by adjusting the formalism of the evolution operator
developed in Sect. 8.2. For this purpose it is convenient to cast Eq. (9.35) in the
matrix form1
d
I(τc ) = C(τc ) I(τc ) − j(τc ) , (9.90)
dτc
where
⎛
⎞
I
⎜Q⎟
I=⎝ ⎠
U
V
⎛ ⎞
1 + kI kQ kU kV
⎜ k −fU ⎟
⎜ 1 + kI fV ⎟
C = 1 + κL H = ⎜ Q ⎟ (9.91)
⎝ kU −fV 1 + kI fQ ⎠
kV fU −fQ 1 + kI
1
The notation used here is rather uncommon: the matrix C appearing in this equation is
often denoted in the literature with the symbol K. We use this notation to avoid confusion with
the matrix K introduced in Chap. 8 (see Eq. (8.2)).
410 CHAPTER 9
⎞ ⎛
Sc + kI SL
⎜ k S ⎟
⎜ Q L ⎟
j = Sc 1 + κ L SL H U = ⎜ ⎟ , (9.92)
⎝ kU SL ⎠
kV SL
with U and H given by Eqs. (9.49) and (9.50), and the coefficients kI , kQ , kU ,
kV , fQ , fU , fV by Eqs. (9.39).
Equation (9.90) is formally identical to Eq. (8.2), except for a sign switch due to
the use of the optical depth (which increases in the direction opposite to propaga-
tion) as independent variable. Thus we can define an evolution operator O(τc , τc )
via the equation (cf. Eq. (8.6))
O(τc , τc ) = 1 (9.94)
O(τc , τc ) = O(τc , τc ) O(τc , τc ) (τc ≤ τc ≤ τc ) (9.95)
d
O(τc , τc ) = C(τc ) O(τc , τc ) (9.96)
dτc
d
O(τc , τc ) = − O(τc , τc ) C(τc ) , (9.97)
dτc
τi ≤ τj ≤ . . . ≤ τk . (9.99)
∞
I(τc ) = O(τc , τc ) j(τc ) dτc (9.101)
τc
provided
lim O(τc , τc ) j(τc ) = 0 . (9.102)
τc →∞
It can be easily proved, by the same arguments used in Sect. 8.3, that the evolution
operator corresponding to a C matrix independent of optical depth is
−(τc − τc ) C
O(τc , τc ) = e , (9.103)
The formal solution derived in the previous section allows us to express the Stokes
parameters of the radiation propagating along any direction at any specified point
of a magnetized stellar atmosphere in terms of the physical characteristics of the
atmosphere itself. To specify such characteristics means to give a model of the at-
mosphere or, in other words, a set of mathematical functions which fix, at any point
in the atmosphere, the value of all the physical quantities affecting the propagation
matrix and the emission vector.
Probably the simplest atmospheric model1 is the so-called Milne-Eddington model
which results from the following set of assumptions:
a) the atmosphere is plane-parallel, semi-infinite, and in LTE (Sc = SL = BP );
b) all the quantities affecting the propagation matrix C, namely κL , B, θ, χ ,
∆λD , Γ , wA are depth-independent, so that, for any propagation direction, C is
constant;
c) the Planck function is linear in tc , the continuum optical depth measured along
the vertical
BP = B0 + B1 tc = B0 (1 + β tc ) . (9.105)
Under these assumptions we can easily find, using the formulae of the previous
section, an analytical solution to the transfer equation.
1
The even simpler reversing-layer model that leads to the Seares formulae (see Sect. 9.13)
should be considered, more properly, a slab model.
412 CHAPTER 9
Owing to assumptions a) and c), Eq. (9.92) reduces to (see also Eq. (9.91))
j(tc ) = B0 (1 + β tc ) C U ,
and using assumption b) we obtain from Eqs. (9.103), (9.101) and (9.100)1
∞ tc − tc
− C dtc
I(tc , µ) = B0 e µ
(1 + β tc ) C U (µ > 0)
µ
tc
tc tc − tc
− C dtc
I(tc , µ) = B0 e |µ| (1 + β tc ) C U (µ < 0) ,
|µ|
0
with µ = cos α and α defined in Fig. 9.2. Both integrals can be performed by
means of the substitution (tc − tc )/µ = x. Bearing in mind the definition of the
exponential of a matrix (Eq. (8.21)), it can be easily proved that
b
−x C −a C −b C
e dx = e − e C −1 (9.106)
a
b
−x C −a C −b C
e x dx = a 1 + C −1 e − b 1 + C −1 e C −1 , (9.107)
a
C C −1 = C −1 C = 1 .
I(tc , µ) = B0 (1 + β tc ) 1 + βµ C −1 U (µ > 0)
I(tc , µ) = B0 (1 + β tc ) 1 − β |µ| C −1
− tc C
− 1 − β |µ| C −1 e |µ| U (µ < 0) . (9.108)
In particular, the radiation emerging from the atmosphere (tc = 0 , µ > 0) is given
by
I(0, µ) = B0 1 + βµ C −1 U . (9.109)
1
It is understood that the boundary condition at tc = 0 is I (µ < 0) = 0; using Eqs. (A5.23)
and (9.111) it can be shown that condition (9.102) is also satisfied.
LINE FORMATION IN A MAGNETIC FIELD 413
TABLE 9.5
where
2 + f 2 + f 2 − k 2 − k 2 − k 2 ) − (k f + k f + k f )2
∆ = (1 + kI )4 + (1 + kI )2 (fQ U V Q U V Q Q U U V V
The matrix C −1 can be calculated analytically from the expression of the ma-
trix C given in Eq. (9.91). A straightforward calculation yields the expressions
contained in Table 9.5, whence one gets the so-called Unno-Rachkovsky solutions 1
& '
I(0, µ) = B0 1 + βµ ∆−1 (1 + kI ) (1 + kI )2 + fQ2
+ fU2 + fV2
&
Q(0, µ) = −B0 βµ ∆−1 (1 + kI )2 kQ − (1 + kI ) (kU fV − kV fU )
'
+ fQ (kQ fQ + kU fU + kV fV )
1
Note that the quantity (kQ fU − kU fQ ) appearing in the expression of V (0, µ) – and of
the matrix elements (C −1 )03 and (C −1 )30 – has been written down just to point out the ‘cyclic’
character of the formulae for Q , U , V ; however, as obvious from Eqs. (9.39) and (9.32), it is
identically zero.
414 CHAPTER 9
&
U (0, µ) = −B0 βµ ∆−1 (1 + kI )2 kU − (1 + kI ) (kV fQ − kQ fV )
'
+ fU (kQ fQ + kU fU + kV fV )
&
V (0, µ) = −B0 βµ ∆−1 (1 + kI )2 kV − (1 + kI ) (kQ fU − kU fQ )
'
+ fV (kQ fQ + kU fU + kV fV ) . (9.110)
(kQ fQ + kU fU + kV fV )2 ≤ (kQ
2 2
+ kU + kV2 ) (fQ
2
+ fU2 + fV2 ) (9.111)
implies
∆ ≥ (1 + kI )2 + (fQ
2
+ fU2 + fV2 ) (1 + kI )2 − (kQ
2 2
+ kU + kV2 ) .
On the other hand, it can be shown via Eqs. (9.39) and (9.32) that the term in
the second bracket is positive. It follows that, whatever the values of the various
parameters, one always has ∆ > 0.
Equations (9.110) are particularly important because they provide simple ana-
lytical expressions for the Stokes parameters profiles of the line radiation emerging
from a magnetized stellar atmosphere. Although the Milne-Eddington model (sum-
marized by approximations a), b) and c) at the beginning of this section) is indeed
the simplest schematization of a real stellar atmosphere, the Unno-Rachkovsky
formulae contain the basic physics of radiative transfer for polarized radiation
and have been widely used in astrophysical applications. They were first de-
rived by Unno (1956) who neglected, however, magneto-optical effects. By setting
fQ = fU = fV = 0 in Eqs. (9.110), one obtains the simplified Unno solutions
I(0, µ) = B0 1 + βµ ∆−1
0 (1 + kI )
where
∆0 = (1 + kI )2 − kQ
2
− kU
2
− kV2 .
The equations acquired their standard form with the introduction of magneto-
optical effects by Rachkovsky (1962b).
The Stokes parameters profiles given by the Unno-Rachkovsky solutions depend
on 9 different parameters: the seven quoted under point b) above plus the coeffi-
cients B0 and β representing the surface value and the slope of the Planck function.
Obviously there is a further dependence on the Zeeman pattern and on the rest
LINE FORMATION IN A MAGNETIC FIELD 415
Ic (0, µ) − I(0, µ)
RI (0, µ) =
Ic (0, µ)
βµ & '
= 1 − ∆−1 (1 + kI ) (1 + kI )2 + fQ
2
+ fU2 + fV2
1 + βµ
Q(0, µ) βµ
RQ (0, µ) = =− ∆−1
Ic (0, µ) 1 + βµ
& '
× (1 + kI )2 kQ − (1 + kI )(kU fV − kV fU ) + fQ (kQ fQ + kU fU + kV fV )
U (0, µ) βµ
RU (0, µ) = =− ∆−1
Ic (0, µ) 1 + βµ
& '
× (1 + kI )2 kU − (1 + kI )(kV fQ − kQ fV ) + fU (kQ fQ + kU fU + kV fV )
V (0, µ)
RV (0, µ) =
Ic (0, µ)
βµ & '
=− ∆−1 (1 + kI )2 kV + fV (kQ fQ + kU fU + kV fV ) . (9.112)
1 + βµ
1
An atlas of Stokes profiles, based on the Unno-Rachkovsky formulae, has been presented by
Arena and Landi Degl’Innocenti (1982).
416 CHAPTER 9
Fig.9.5. The Stokes parameters profiles, as given by the Unno-Rachkovsky formulae (9.110), are
plotted as functions of the reduced wavelength for the same transitions considered in Fig.9.3.
The profiles are normalized to B0 , and the values of the relevant parameters are the following
(‘standard case’): βµ = 5, κL = 20, vA = 0, vB = 1.5, θ = 60◦ , χ = 0◦ , a = 0.05 (the last
5 values are the same as in Fig.9.3).
Eqs. (1.45))
Q(χ + ∆χ) = cos(2 ∆χ) Q(χ) − sin(2 ∆χ) U (χ)
U (χ + ∆χ) = sin(2 ∆χ) Q(χ) + cos(2 ∆χ) U (χ) .
It should be remarked that Eqs. (9.110) provide analytical expressions for the
LINE FORMATION IN A MAGNETIC FIELD 417
Fig.9.6. Influence of the different parameters, varied one at a time, on the Stokes profiles. The
profiles refer to a Zeeman triplet and are calculated as in Fig.9.5; the full line corresponds to the
‘standard case’. (a) κL = 5 (dotted), κL = 40 (dashed); (b) vB = 0.5 (dotted), vB = 3 (dashed);
(c) θ = 30◦ (dotted), θ = 80◦ (dashed); (d) χ = 30◦ (dotted), χ = 60◦ (dashed); (e) a = 0.0
(dotted), a = 0.2 (dashed).
emerging Stokes parameters profiles (tc = 0, µ > 0). To obtain analytical expres-
sions valid for any optical depth and direction we must resort to Eqs. (9.108). The
result for µ > 0 is trivial
I(tc , µ) = I(0, µ) + B0 β tc
Q(tc , µ) = Q(0, µ)
U (tc , µ) = U (0, µ)
V (tc , µ) = V (0, µ) ,
while for µ < 0 one needs the expressions for the matrix elements of C −1 contained
418 CHAPTER 9
in Table 9.5 and those for the exponential of a matrix deduced in App. 5. The
resulting formulae are rather involved and will not be given here.
C(tc ) = 1 + κL (tc ) H ,
with H = const. According to Eqs. (9.104), (9.101), (9.102) and (9.92), the emerg-
ing Stokes parameters are given by
∞ t
− µc −
KL (tc )
H dt
I(0, µ) = e e µ
Sc (tc ) 1 + κL (tc ) SL (tc ) H U c , (9.113)
µ
0
where
tc
KL (tc ) = κL (tc ) dtc ,
0
provided
t
− µc −
KL (tc )
H
lim e e µ
Sc (tc ) 1 + κL (tc ) SL (tc ) H U = 0 . (9.114)
tc →∞
Using Eq. (9.113), explicit expressions for I(0, µ) can be obtained once the functions
Sc (tc ), SL (tc ), and κL (tc ) are specified.
LINE FORMATION IN A MAGNETIC FIELD 419
κL = const.
−α1 tc
Sc (tc ) = B0 (1 + β tc ) + A1 e
−α1 tc −α2 tc
SL (tc ) = B0 (1 + β tc ) + A1 e − A2 e (9.115)
with α1 and α2 positive, so that Eq. (9.114) is satisfied, one has the possibility of
adjusting the various parameters so as to simulate a variety of different behaviors
for the continuum and line source functions. In particular, the last term in SL (tc )
allows for a drop of the line source function below the continuum source function
at optical depths tc ≤ 1/α2 .
The integral resulting from the substitution of Eqs. (9.115) into Eq. (9.113) can
be carried out with the help of Eqs. (9.106) and (9.107). After some algebra we
obtain
&
I(0, µ) = B0 1 + βµ C −1 + A1 1 − α1 µ (α1 µ 1 + C)−1
'
− A2 1 − (1 + α2 µ) (α2 µ 1 + C)−1 U , (9.116)
A1
Ic (0, µ) = B0 (1 + βµ) + .
1 + α1 µ
A remarkable expression for the emerging Stokes parameters can be obtained if,
in addition to assumptions a ) and b ) specified above, we adopt the LTE hypoth-
esis. To derive this expression we use the line optical depth tL (instead of tc ) as
independent variable.
Using Eqs. (9.36) and (9.34), we can write the transfer equation in the form
dI
µ = (κc 1 + H) I − (κc Sc 1 + SL H ) U ,
dtL
where κc is given by Eq. (9.37) and H by Eq. (9.50). According to Eqs. (9.104),
(9.101) and (9.102), the emerging Stokes parameters are given by
∞
−
Kc (tL ) t
− µL H dt
I(0, µ) = e µ
e κc (tL ) Sc (tL ) 1 + SL (tL ) H U L , (9.117)
µ
0
420 CHAPTER 9
Fig.9.7. Stokes parameters profiles as functions of reduced wavelength, computed from Eq.(9.116)
for a normal Zeeman triplet and the source functions Sc and SL plotted in the lower part of the
figure. The profiles are normalized to the continuum intensity, the source functions to B0 . Full
line: A1 = A2 = 0 (model (a), i.e. a Milne-Eddington atmosphere); dotted line: A1 /B0 = 4,
α1 = 5, A2 = 0 (model (b)); dashed line: A1 /B0 = A2 /B0 = 4, α1 = 5, α2 = 10 (model (c)).
The parameters β and µ are set to 5 and 1 respectively. The remaining parameters have the same
values as in Fig.9.5.
with
tL
Kc (tL ) = κc (tL ) dtL ,
0
provided
−
Kc (tL ) t
− µL H
lim e µ
e κc (tL ) Sc (tL ) 1 + SL (tL ) H U = 0 .
tL →∞
Equation (9.117), which is basically equivalent to Eq. (9.113), can be further de-
veloped in the LTE case, where
Since
K (t ) !
−
Kc (tL ) t
− µL H d − cµ L
t
− L H
e µ
e κc (tL ) 1 + H = − µ e e µ ,
dtL
where BP (0) = BP (tL = 0). Using Eq. (A5.23), the matrix exp(−tL H/µ) can be
written in the form
t
− µL H
4 t
− µL λi ( r)
h,
e = e Ni ( h, r ) , (9.119)
i=1
where the matrices Ni are given by Eqs. (A5.24) and λi are the eigenvalues of the
matrix H (cf. Sect. 8.4)
with Λ± ( h, r ) given by Eqs. (A5.18). Equation (9.119) enables the emerging Stokes
parameters to be expressed in terms of Laplace transforms. We recall that the
Laplace transform of the function f (x ; µ) is defined, in the half plane Re p > 0, by
∞
−px
L(p ; µ) = e f (x ; µ) dx .
0
As outlined in Sect. 8.3.c, there are some special cases where, because certain
elements of the propagation matrix are zero, the transfer equation can be solved
in closed form without any additional assumption. Two such cases occur when, at
each point along the ray path, the magnetic field vector is parallel or perpendicular
to the propagation direction.
Let us consider the former case (θ = 0◦ or 180◦ in Fig. 9.1). From Eqs. (9.32) we
have
1
hI = ηb + ηr
2
hQ = hU = 0 rQ = rU = 0
1 1
hV = ± ηr − ηb rV = ± ρr − ρb , (9.121)
2 2
where the plus sign in the expressions of hV and rV refers to θ = 0◦ and the minus
sign to θ = 180◦ . Substitution into the transfer equation (9.35) yields
dI
= (1 + κL hI ) I + κL hV V − (Sc + κL hI SL )
dτc
dQ
= (1 + κL hI ) Q + κL rV U
dτc
dU
= (1 + κL hI ) U − κL rV Q
dτc
dV
= (1 + κL hI ) V + κL hV I − κL hV SL . (9.122)
dτc
These equations can be solved either by the evolution operator technique (cf.
Eq. (8.27) which refers to a similar case) or by diagonalization. We apply here
the second method, which is very simple in this case (cf. Eqs. (8.43) and (8.44)).
As apparent from Eqs. (9.122), the Stokes parameters Q and U are only coupled
with each other: since the equations for Q and U do not contain source terms, we
have, at any optical depth1
Q=U =0
provided the boundary values of Q and U are also zero. As far as I and V are
concerned, summation and subtraction of the first and fourth of Eqs. (9.122) gives,
1
Note that Eqs. (8.44) do not imply the solution Q = U = 0, although the propagation matrix
has the same form. This is because all atomic coherences are neglected in the present chapter, so
that the condition hQ = hU = 0 implies that the Q and U components of the emission vector are
zero (see Eq. (9.92)).
LINE FORMATION IN A MAGNETIC FIELD 423
in the θ = 0◦ case1
d
(I + V ) = (1 + κL ηr ) (I + V ) − (Sc + κL ηr SL )
dτc
d
(I − V ) = (1 + κL ηb ) (I − V ) − (Sc + κL ηb SL ) , (9.123)
dτc
where Eqs. (9.121) have been used. These equations are uncoupled and have the
formal solutions (cf. Eqs. (8.4) and (8.18))
∞ : τc
− 1 + κL (τc ) ηr (τc ) dτc
(I + V )τ =0 = e 0
c
0
∞ : τc
− 1 + κL (τc ) ηb (τc ) dτc
(I − V )τ =0 = e 0
c
0
which allow the emerging I and V Stokes parameters to be written in the form
1
I(0) = (I + V )τ =0 + (I − V )τ =0
2 c c
1
V (0) = (I + V )τ =0 − (I − V )τ =0 . (9.125)
2 c c
Equations (9.123) show that magneto-optical effects do not influence the transfer
of radiation: this is indeed one of the few cases where these effects can a priori
be neglected (cf. Sect. 9.22). It also appears that the quantity (I + V ) – which
represents twice the amount of right circular polarization contained in the radiation
beam, see Sect. 1.6) – obeys the same transfer equation as the intensity of an
(unpolarized) beam propagating through a non-magnetic atmosphere, with the
only difference that the profile ηr would be replaced, in the latter case, be the
profile η defined in Eq. (9.65). The same holds for (I − V ), with ηb playing the role
of ηr . If we restrict attention to a Zeeman triplet (normal or anomalous), the ηr
and ηb profiles are nothing but the η profile shifted in wavelength by the amount
g ∆λB to the red or to the blue, respectively2 (see Eqs. (9.26), (9.27) and (9.29)).
Therefore, if the amplitude (besides the direction) of the magnetic field is constant
along the ray path, and if suitable boundary conditions (like those in Eqs. (9.42))
1
The case θ = 180◦ leads to the same equations with the interchange of ηr and ηb .
2
Obviously, the opposite shifts occur if the Landé factor is negative.
424 CHAPTER 9
1 ¯ ¯ + g ∆λ )
I(λ) = I(λ − g ∆λB ) + I(λ B
2
1 ¯ ¯ + g ∆λ ) ,
V (λ) = I(λ − g ∆λB ) − I(λ B (9.126)
2
¯
where I(λ) is the intensity that would be observed if the magnetic field were
switched off leaving unchanged all the other physical parameters (including the
source functions).
Let’s now turn to the case where the magnetic field vector is perpendicular to
the propagation direction. For θ = 90◦ , Eqs. (9.32) reduce to
1 η + ηr
hI = η + b
2 p 2
1 η + ηr 1 ρ + ρr
hQ = η − b cos 2χ rQ = ρ − b cos 2χ
2 p 2 2 p 2
1 η + ηr 1 ρ + ρr
hU = η − b sin 2χ rU = ρ − b sin 2χ
2 p 2 2 p 2
hV = 0 rV = 0 ,
dI
= (1 + κL hI ) I + κL hQ Q + κL hU U − (Sc + κL hI SL )
dτc
dQ
= (1 + κL hI ) Q + κL hQ I − κL rU V − κL hQ SL
dτc
dU
= (1 + κL hI ) U + κL hU I + κL rQ V − κL hU SL
dτc
dV
= (1 + κL hI ) V + κL rU Q − κL rQ U .
dτc
Q̃ = Q cos 2χ + U sin 2χ
Ũ = U cos 2χ − Q sin 2χ , (9.127)
1
The case θ = 180◦ yields a sign switch in the expression of V (λ).
2
These combinations are nothing but the Stokes parameters defined in the preferred frame
(see Sect. 5.5).
LINE FORMATION IN A MAGNETIC FIELD 425
and setting
1 η + ηr
hW = hQ (θ = 90◦ , χ = 0◦ ) = η − b
2 p 2
1 ρ + ρr
rW = rQ (θ = 90◦ , χ = 0◦ ) = ρ − b ,
2 p 2
one gets
dI
= (1 + κL hI ) I + κL hW Q̃ − (Sc + κL hI SL )
dτc
dQ̃
= (1 + κL hI ) Q̃ + κL hW I − κL hW SL
dτc
dŨ
= (1 + κL hI ) Ũ + κL rW V
dτc
dV
= (1 + κL hI ) V − κL rW Ũ .
dτc
These are strictly similar to Eqs. (9.122) and can be solved by the same method.
Since Ũ and V are only coupled with each other and no source term is present in
their equations, one has
Ũ = V = 0 .
On the other hand, summation and subtraction of the first two equations yields
the decoupled equations
d , - , -
(I + Q̃) = 1 + κL ηp (I + Q̃) − Sc + κL ηp SL
dτc
d , η + ηr - , η + ηr -
(I − Q̃) = 1 + κL b (I − Q̃) − Sc + κL b SL ,
dτc 2 2
∞ : τc ηb (τc ) + ηr (τc )
− 1 + κL (τc ) dτc
(I − Q̃)τ =0 = e 0 2
c
0
ηb (τc ) + ηr (τc )
× Sc (τc ) + κL (τc ) SL (τc ) dτc . (9.128)
2
426 CHAPTER 9
Finally, inversion of Eqs. (9.127) leads to the following expressions for the emerging
Stokes parameters
1
I(0) = (I + Q̃)τ =0 + (I − Q̃)τ =0
2 c c
1
Q(0) = (I + Q̃)τ =0 − (I − Q̃)τ =0 cos 2χ
2 c c
1
U (0) = (I + Q̃)τ =0 − (I − Q̃)τ =0 sin 2χ
2 c c
V (0) = 0 .
The above equations show that magneto-optical effects do not affect the Stokes
parameters in this case either.
dI
= I − Sc + κL hI (I − SL ) + hQ Q + hU U + hV V
dτc
dQ
= Q + κL hQ (I − SL ) + hI Q
dτc
dU
= U + κL hU (I − SL ) + hI U
dτc
dV
= V + κL hV (I − SL ) + hI V . (9.129)
dτc
hQ = a cos 2χ , hU = a sin 2χ , hV = b ,
with
1 η + ηr 1
a= η − b sin2 θ , b= η − ηb cos θ . (9.130)
2 p 2 2 r
Setting
a b
√ = cos δ , √ = sin δ , (9.131)
a + b2
2 a + b2
2
LINE FORMATION IN A MAGNETIC FIELD 427
we have
The Stepanov solution derives from the assumption that both χ and δ are inde-
pendent of τc : this assumption enables Eqs. (9.129) to be cast into diagonal form.
Contrary to the angle χ (the azimuth of the magnetic field vector defined in
Fig. 9.1), the angle δ, which depends on wavelength, has no precise physical mean-
ing. However, as apparent from Eqs. (9.130) and (9.131), the assumption δ = const.
implies that the parameters B, θ, ∆λD , Γ , wA should be independent of optical
depth. Thus the approximations underlying the Stepanov solution are equivalent
to approximation b ) considered in Sect. 9.9 plus the additional approximation of
neglecting magneto-optical effects.
Let us substitute Eqs. (9.132) into Eqs. (9.129), and let us replace the last
three equations by the linear combinations obtained by summation of the sec-
ond, third, and fourth equation multiplied, respectively, by the factors cos δ cos 2χ ,
cos δ sin 2χ , sin δ ; then by the factors sin δ cos 2χ , sin δ sin 2χ , − cos δ ; and finally
by the factors sin 2χ , − cos 2χ , 0. The following set of equations is obtained
d
I = I − Sc + κL hI (I − SL ) + h2Q + h2U + h2V PA
dτc
d
P = PA + κL h2Q + h2U + h2V (I − SL ) + hI PA
dτc A
d
P = (1 + κL hI ) PB
dτc B
d
P = (1 + κL hI ) PC , (9.133)
dτc C
where
The last two equations in the set (9.133) show that, provided Q , U , and V are
zero at the boundary,
PB = PC = 0 . (9.135)
The solution to the algebraic system (9.134) with the condition (9.135) is
Finally, by summing and subtracting the first two equations in the set (9.133) one
gets the decoupled equations
dI (+)
= 1 + κL h(+) I (+) − Sc + κL h(+) SL
dτc
dI (−)
= 1 + κL h(−) I (−) − Sc + κL h(−) SL , (9.136)
dτc
where
I (+) = I + PA h(+) = hI + h2Q + h2U + h2V
Equations (9.136) are the so-called Stepanov equations (Stepanov, 1958), and can
be easily integrated (cf. the analogous solutions (9.124) or (9.128)). The Stepanov
equations are seldom used in modern research. They have been illustrated mainly
because of their historical importance in the problem of line formation in a magnetic
field.
In the limiting case of very strong magnetic field, the various Zeeman components
spread out considerably in wavelength and the spectral line consists of a num-
ber of independent components, each associated with the corresponding Zeeman
component.1
This physical situation, which will be referred to as ‘intense field’ and which is
characterized by the inequality (see Eqs. (9.26))
∆λB
vB = 1,
∆λD
any assigned wavelength, only one Zeeman component contributes to the elements
of the propagation matrix. In other words, we assume that the expressions of the
profiles ηq and ρq in a wavelength interval about the Zeeman component connecting
two specific sublevels (J M ) and (Ju Mu ) are (cf. Eqs. (9.27))
J J 1
ηq = Sq u (M , Mu ) √ H v − vA + vB (gu Mu − g M ), a
π
J J 1
ρq = Sq u (M , Mu ) √ L v − vA + vB (gu Mu − g M ), a , (9.137)
π
λ0
δ(∆λB ) ∆λB , δ(wA ) ∆λB . (9.138)
c
Let us consider the transfer equation in the form of Eq. (9.35), and let us restrict
attention to a wavelength interval about the Zeeman component connecting two
particular sublevels (J M ) and (Ju Mu ). This component – σb , π or σr according to
the value of (M − Mu ) – will be referred to as the i-component . Using assumption
(9.137), the elements of the propagation matrix can be written in the form (see
Eqs. (9.29) and (9.32))
(i)
hI = αI si η (i)
(i) (i)
hQ = αQ si η (i) rQ = αQ si ρ(i)
(i) (i)
hU = αU si η (i) rU = αU si ρ(i)
(i) (i)
hV = αV si η (i) rV = αV si ρ(i) , (9.139)
430 CHAPTER 9
TABLE 9.6
σb 1
4
(1 + cos2 θ) − 14 sin2 θ cos 2χ − 14 sin2 θ sin 2χ − 12 cos θ
1 1 1
π 2
sin2 θ 2
sin2 θ cos 2χ 2
sin2 θ sin 2χ 0
σr 1
4
(1 + cos2 θ) − 14 sin2 θ cos 2χ − 14 sin2 θ sin 2χ 1
2
cos θ
where
J J
si = Sq u (M , Mu )
1
η (i) = √ H v − vA + vB (gu Mu − g M ), a
π
1
ρ(i) = √ L v − vA + vB (gu Mu − g M ), a , (9.140)
π
(i) (i) (i) (i)
and where the angular factors αI , αQ , αU , αV are defined in Table 9.6. We will
now show that the transfer equation can be cast into diagonal form if the ratios
(i) (i) (i)
αQ αU αV
(i)
, (i)
, (i)
αI αI αI
are assumed to be independent of optical depth. As apparent from Table 9.6, this
means that the direction of the magnetic field vector is assumed to be constant
along the ray path.
Setting
(i) (i) (i)
αQ α αV
Q = (i) P , U= U (i)
P , V = (i)
P , (9.141)
αI αI αI
and noticing that, both for σ and π components
(i) 2 (i) 2 (i) 2 (i) 2
αQ + αU + αV = αI ,
we obtain from Eqs. (9.35)
dI (i)
= I − Sc + κL αI si η (i) (I + P − SL )
dτc
dP (i)
= P + κL αI si η (i) (I + P − SL ) .
dτc
Summation and subtraction of these two equations yields the decoupled equations
d (i) (i)
(I + P ) = 1 + 2κL αI si η (i) (I + P ) − Sc + 2 κL αI si η (i) SL
dτc
d
(I − P ) = (I − P ) − Sc . (9.142)
dτc
LINE FORMATION IN A MAGNETIC FIELD 431
It is interesting to note that the latter of Eqs. (9.142) has the same form as the
transfer equation for the continuum intensity Ic , while the former is just the trans-
(i) (i)
fer equation for the intensity If of a fictitious, non-magnetic line having a ratio κf
between line and continuum absorption coefficient given by
(i) (i)
κf = 2 κL αI si (9.143)
(i)
I − P = Ic , (I + P ) = If ,
and using Eqs. (9.141), we can write for the Stokes parameters in a wavelength
interval about the i-component
1 (i)
I (i) (τc ) = I (τ ) + If (τc )
2 c c
(i)
1 αQ (i)
Q(i) (τc ) = − I (τ ) − If (τc )
2 α(i) c c
I
(i)
1 αU (i)
U (i) (τc ) = − I (τ ) − If (τc )
2 α(i) c c
I
(i)
1 αV (i)
V (i) (τc ) = − I (τ ) − If (τc ) , (9.144)
2 α(i) c c
I
where
∞
−(τc − τc )
Ic (τc ) = Sc (τc ) e dτc
τc
∞ : τc
−
(i)
1 + κf η (i) dτc
dτc .
(i) (i) (i) τc τc
If (τc ) = Sc + κ f η SL τ e (9.145)
c
τc
The equations above show that magneto-optical effects do not affect the Stokes
parameters. This is a direct consequence of the two basic assumptions of this
section, namely that the magnetic field is very strong and has a fixed direction.
Bearing in mind the geometrical interpretation of the transfer equation presented in
Sect. 5.6, we see from Eqs. (9.139) that the formal vectors η and ρ
point in the same
(or the opposite) direction in the Poincaré space (Q/I, U/I, V /I); furthermore,
since the angles θ and χ are assumed to be constant, this direction is fixed. It
follows that the formal vector p is also parallel to the same direction, so that the
term ρ × p in Eq. (5.74b) – the only term due to magneto-optical effects – is zero.
432 CHAPTER 9
In the special case of the Milne-Eddington model atmosphere, the expressions for
the emerging Stokes parameters in a wavelength interval about the i-component
become very simple. We write them down because they will be needed in subse-
quent applications (see Sects. 9.19, 9.20 and 9.21). According to assumptions a),
b) and c) of Sect. 9.8, we have
(i)
Sc = SL = B0 (1 + β tc ) , κf = const. , η (i) = const. ,
Ic (0, µ) = B0 (1 + βµ)
(i) 1
If (0, µ) = B0 1 + βµ (i)
.
1 + κf η (i)
Substituting into Eqs. (9.144) and adopting the normalization of Eqs. (9.112), we
obtain
(i)
(i) Ic (0, µ) − I(0, µ) 1 βµ κf η (i)
RI (0, µ) = =
Ic (0, µ) 2 1 + βµ 1 + κ(i) η (i)
f
(i)
(i)
(i) Q(0, µ) 1 βµ αQ κf η (i)
RQ (0, µ) = =−
Ic (0, µ) 2 1 + βµ α(i) 1 + κ(i) η (i)
I f
(i) (i)
(i) U (0, µ) 1 βµ αU κf η (i)
RU (0, µ) = =−
Ic (0, µ) 2 1 + βµ α(i) 1 + κ(i) η (i)
I f
(i) (i)
(i) V (0, µ) 1 βµ αV κf η (i)
RV (0, µ) = =− (i) (i)
. (9.146)
Ic (0, µ) 2 1 + βµ α 1 + κf η (i)
I
These formulae can also be obtained by direct substitution into Eqs. (9.112) of
the following expressions (resulting from Eqs. (9.39), (9.139) and (9.143)) for the
elements of the propagation matrix
1 (i) (i)
kI = κ η
2 f
(i) (i)
1 αQ (i) (i) 1 αQ (i) (i)
kQ = κ η fQ = κ ρ
2 α(i) f 2 α(i) f
I I
(i) (i)
1 αU (i) (i) 1 αU (i) (i)
kU = κ η fU = κ ρ
2 α(i) f 2 α(i) f
I I
(i) (i)
1 αV (i) (i) 1 αV (i) (i)
kV = κ η fV = κ ρ . (9.147)
2 α(i) f 2 α(i) f
I I
LINE FORMATION IN A MAGNETIC FIELD 433
The solution to the radiative transfer equation is particularly simple when dealing
with optically thin, homogeneous slabs rather than semi-infinite atmospheres. To
be definite, let us consider a cold, absorbing slab superposed on a deeper layer which
emits continuum, unpolarized radiation. The slab is assumed to absorb only in the
line (not in the continuum), and to have zero emission and small optical thickness.
This schematic situation (the reversing layer model) was used in the early decades
of this century to represent a stellar atmosphere. Nowadays, it should be regarded
as a slab model rather than a real model atmosphere.
In the reversing layer – that we assume magnetized – the transfer of radiation can
be described using Eq. (9.36) with κc = 0 (no continuum absorption) and SL = 0
(b)
(no emission). Let ∆τL and Ic denote, respectively, the optical thickness of the
slab and the intensity of the continuum radiation (at the line wavelength) entering
the slab from the lower boundary. Since ∆τL 1, we can replace the Stokes
vector in the right-hand side of Eq. (9.36) by the Stokes vector at the boundary
(Ic , 0, 0, 0)† . Hence we obtain
(b)
(b)
∆I Ic − I ∆U U
≡ = hI Ic(b) ≡− = hU Ic(b)
∆τL ∆τL ∆τL ∆τL
∆Q Q ∆V V
≡− = hQ Ic(b) ≡− = hV Ic(b) ,
∆τL ∆τL ∆τL ∆τL
where I, Q, U , V are the emerging Stokes parameters. Use of Eqs. (9.32) yields
!
1 η + ηr
I = Ic(b) 1 − ∆τL ηp sin2 θ + b 1 + cos2 θ
2 2
1 η + ηr
Q = −Ic(b) ∆τL ηp − b sin2 θ cos 2χ
2 2
1 η + ηr
U = −Ic(b) ∆τL η − b sin2 θ sin 2χ
2 p 2
1
V = −Ic(b) ∆τL η − ηb cos θ . (9.148)
2 r
These are the so-called Seares formulae (Seares, 1913). They give Stokes parame-
ters profiles which match exactly the corresponding dichroic profiles.
It should be remarked that the same result is predicted by the Unno-Rachkovsky
solutions under the limit of weak spectral lines (κL 1). Recalling Eqs. (9.39),
we obtain by series expansion of Eqs. (9.110) up to first order in κL
I(0, µ) = B0 (1 + βµ) − βµ κL hI
Q(0, µ) = −B0 βµ κL hQ
U (0, µ) = −B0 βµ κL hU
V (0, µ) = −B0 βµ κL hV , (9.149)
434 CHAPTER 9
∞
−(τc − τc )
δI(τc ) = e κL (τc ) H(τc ) BP (τc ) − Ic (τc ) U dτc ,
τc
where Ic is the continuum intensity at the line wavelength. The emerging Stokes
parameters are therefore given by
I(0) = I0 (0) + δI(0)
1
We restrict attention to the LTE transfer equation rather than the more general equation (9.35).
This is justified by the fact that LTE is generally a very good approximation for weak lines.
LINE FORMATION IN A MAGNETIC FIELD 435
with
∞
−τc
I0 (0) = e BP (τc ) U dτc = Ic (0) U
0
∞
−τc
δI(0) = e κL (τc ) H(τc )
0
∞
−(τc − τc )
× BP (τc ) − e BP (τc ) dτc U dτc . (9.151)
τc
The problem of line formation in a magnetic field whose azimuth is variable with
optical depth can be reduced to the corresponding problem in a magnetic field with
constant azimuth via a simple transformation on the propagation matrix.
To illustrate this statement, we consider a radiation beam flowing in a stellar
atmosphere along the direction Ω, and take the transfer equation in the form (9.35).
We denote by I the Stokes vector in a fixed reference system – say, the system
of Fig. 9.1 – and by I the Stokes vector in the depth-dependent system
(ea , eb , Ω)
whose reference unit vector ea points in the direction of the transverse component
of the local magnetic field vector. It follows that the azimuth angle of the magnetic
field is χ(τc ) in the fixed system and χ = 0◦ in the depth-dependent system.
The Stokes vectors I and I are related by the transformation (see Eqs. (1.45))
I = R I , (9.152)
where ⎛ ⎞
1 0 0 0
⎜0
⎜ cos 2χ(τc ) sin 2χ(τc ) 0⎟⎟
R=⎜ ⎟ .
⎝0 − sin 2χ(τc ) cos 2χ(τc ) 0⎠
0 0 0 1
Writing Eq. (9.35) in the matrix form
dI
= 1 + κ L H I − Sc 1 + κ L SL H U (9.153)
dτc
436 CHAPTER 9
(with H and U given by Eqs. (9.50) and (9.49)), and deriving Eq. (9.152) with
respect to τc , we obtain
dI dR dI dR
= I +R = I + R 1 + κ L H I − R Sc 1 + κ L SL H U .
dτc dτc dτc dτc
The Stokes vector I in the right-hand side can be expressed in terms of I via the
inverse transformation of Eq. (9.152)
I = R−1 I , (9.154)
which leads to
dI dR −1
= R + 1 + κL R H R−1 I − Sc 1 + κL SL R H R−1 R U . (9.155)
dτc dτc
A direct calculation shows that
⎛ ⎞
1 0 0 0
⎜ 0 cos2χ(τ )
− sin 2χ(τc )
0⎟
⎜ ⎟
R−1 = ⎜
c
⎟ ,
⎝ 0 sin 2χ(τc ) cos 2χ(τc ) 0⎠
0 0 0 1
whence we easily obtain
dR −1 dχ
R =2 Z,
dτc dτc
with ⎛ ⎞
0 0 0 0
⎜0 0 1 0⎟
Z=⎝ ⎠ ,
0 −1 0 0
0 0 0 0
and, using Eqs. (9.32)
⎛ ⎞
hI hQ̃ 0 hV
⎜h 0 ⎟
⎜ hI rV ⎟
R H R−1 = ⎜ Q̃ ⎟ , (9.156)
⎝ 0 −rV hI rQ̃ ⎠
hV 0 −rQ̃ hI
where1
1 η + ηr
hQ̃ = hQ (χ = 0◦ ) = η − b sin2 θ
2 p 2
1 ρ + ρr
rQ̃ = rQ (χ = 0◦ ) = ρ − b sin2 θ . (9.157)
2 p 2
1
Note that the matrix in Eq. (9.156) is nothing but the propagation matrix for a magnetic
field having χ = 0◦ .
LINE FORMATION IN A MAGNETIC FIELD 437
Since, moreover
RU = U , ZU =0,
Eq. (9.155) can be rewritten in the form
dI
= 1 + κ L H I − Sc 1 + κ L SL H U , (9.158)
dτc
where
⎛ ⎞
hI hQ̃ 0 hV
⎜ rV 0 ⎟
2 dχ ⎜h hI ⎟
H = R H R−1 + Z = ⎜ Q̃ ⎟ , (9.159)
κL dτc ⎝ 0 −rV hI rQ̃ ⎠
hV 0 −rQ̃ hI
with
2 dχ dχ
rV = rV + = rV + 2 (9.160)
κL dτc dτL
(Eqs. (9.31) and (9.33) have been used to derive the last expression).
Comparison of Eqs. (9.158) and (9.153) shows that the transfer equation for I
is obtained from the transfer equation for I by substituting the matrix H for the
matrix H. As apparent from Eqs. (9.159) and (9.160), this means that the depth
dependence of the azimuth of the magnetic field can be taken into account by setting
χ = 0◦ at any optical depth and by adding the (wavelength-independent) term
2 (dχ /dτL ) to the Faraday rotation coefficient rV . This result was first derived
by Staude (1969). Obviously, once Eq. (9.158) is solved, the Stokes vector I is
obtained by applying the transformation (9.154).
The influence of the depth dependence of the azimuth angle on the Stokes pa-
rameters profiles is illustrated in Fig. 9.8, in the form of Unno-Rachkovsky solution
to the radiative transfer equation. Of course, such solution can only be used under
the assumption that (dχ /dτL ) is independent of optical depth (in addition to the
assumptions listed at the beginning of Sect. 9.8). As expected, the main effect of
azimuth variability is a decrease of linear polarization.
In the preceding sections of this chapter we have considered several cases where
the transfer equation can be solved in closed form thanks to certain assumptions
about the structure of the medium where propagation occurs. In a real stellar
atmosphere, or in detailed model atmospheres (like for instance the solar models
given by Gingerich et al., 1971, or Vernazza et al., 1981), these assumptions are
generally not satisfied, so that the approximate solutions presented so far cannot
be used reliably. This does not mean, however, that such solutions are pointless,
since it should be borne in mind that the sophisticated models which are nowa-
days available are themselves based on specific assumptions about the physics and
438 CHAPTER 9
Fig.9.8. Stokes parameters profiles for the normal Zeeman triplet 1 S0 − 1 P1 emerging from
an atmosphere with depth-dependent magnetic field’s azimuth. The profiles are computed using
Eqs.(9.110) with fV = κL rV and r given by Eq.(9.160). They are normalized to B and plotted
V 0
against the reduced wavelength. The reference direction is parallel to the transverse component
of the magnetic field at τc = 0. The values of the relevant parameters are those of the ‘standard
case’ (see Fig.9.5). The full line, drawn for reference, corresponds to dχ /dτL = 0. The dotted
line corresponds to the following values of dχ /dτL : −0.1 (a); +0.1 (b); −0.5 (c); +0.5 (d).
kc (ν0 ) kL
κr = , κR = , (9.162)
kc (νr ) kc (νr ) ∆νD
A = κr 1 + κR H
b = κ r Sc 1 + κ R SL H U , (9.164)
mentioned above. In the case of LTE, this is also true for the source functions
(Sc = SL = BP ) and for the quantity κR . For non-LTE situations, we assume here
that Sc , SL and κR (or the ratio Nu /N , see Eqs. (9.14)) are known, i.e. given
together with the model. (The problem of non-LTE will be considered in more
generality in Chap. 14).
Equation (9.163) can be formally solved by the evolution operator method. Sim-
ilarly to Eq. (9.100), we have
τr
I(τr ) = O(τr , τr ) b(τr ) dτr + O(τr , τr ) I(τr ) , (9.165)
τr
provided
lim O(0, τr ) b(τr ) = 0 .
τr →∞
The evolution operator O(τr , τr ) satisfies the analogues of Eqs. (9.94)-(9.97) and
can be written in the form
O(τr , τr ) =
the Stokes parameters at the optical depth τb are then chosen.1 The integration
starts at τb and proceeds outwards, step by step, the Stokes parameters at the k-th
grid-point being related to those at the (k + 1)-th point by the equations
1
I(τk ) = I(τk+1 ) − ∆I (1) + 2 ∆I (2) + 2 ∆I (3) + ∆I (4) , (9.168)
6
where
& '
∆I (1) = (τk+1 − τk ) A(τk+1 ) I(τk+1 ) − b(τk+1 )
& 1 '
∆I (2) = (τk+1 − τk ) A(τ̃k ) I(τk+1 ) − ∆I (1) − b(τ̃k )
2
& 1 '
∆I (3) = (τk+1 − τk ) A(τ̃k ) I(τk+1 ) − ∆I (2) − b(τ̃k )
2
& '
∆I (4) = (τk+1 − τk ) A(τk ) I(τk+1 ) − ∆I (3) − b(τk ) , (9.169)
τk + τk+1
τ̃k = .
2
Ik = pk + Πk Ik+1 , (9.170)
where
Ik = I(τk ) , Ik+1 = I(τk+1 ) .
1
One generally takes the Unno-Rachkovsky solutions resulting from the assumption that, for
τr > τb , the propagation matrix is constant and the source functions (coincident with the Planck
function at large optical depth) are linear in τr .
2
Note that this is a very qualitative statement. To the authors’ knowledge, a systematic
analysis of the influence of the τr -variation of A and b on the numerical accuracy of the Runge-
Kutta integration has never been carried out.
442 CHAPTER 9
We obtain
1
pk = δ b(τk+1 ) + 4 b(τ̃k ) + b(τk )
6 k
1
− δk2 A(τ̃k ) b(τk+1 ) + A(τ̃k ) b(τ̃k ) + A(τk ) b(τ̃k )
6
1 3
+ δ A2 (τ̃k ) b(τk+1 ) + A(τk ) A(τ̃k ) b(τ̃k )
12 k
1 4
− δ A(τk ) A2 (τ̃k ) b(τk+1 ) ,
24 k
1
Πk = 1 − δ A(τk+1 ) + 4 A(τ̃k ) + A(τk )
6 k
1 2
+ δ A(τ̃k ) A(τk+1 ) + A2 (τ̃k ) + A(τk ) A(τ̃k )
6 k
1 3
− δ A2 (τ̃k ) A(τk+1 ) + A(τk ) A2 (τ̃k )
12 k
1 4
+ δ A(τk ) A2 (τ̃k ) A(τk+1 ) , (9.171)
24 k
where δk is the length of the (k + 1)-th interval
δk = τk+1 − τk .
Comparison of Eqs. (9.170) and (9.165) shows that the matrix Πk provides a
numerical representation of the evolution operator associated with the interval
(τk , τk+1 ),
Πk ≈ O(τk , τk+1 ) .
To test the accuracy of the Runge-Kutta integration against the step length, we
can thus adopt the following criterion: assume the propagation matrix A to be
constant in the interval (τk , τk+1 ) and compare the exact analytical expression of
the evolution operator with its numerical representation Πk . Equations (9.103)
and (9.171) yield
−δk A
O(τk , τk+1 ) = e (9.172)
1 2 2 1 3 3 1 4 4
Π k = 1 − δk A + δ A − δ A + δ A . (9.173)
2! k 3! k 4! k
It follows that the 4-th order Runge-Kutta method gives a Taylor expansion of the
analytical result which is correct up to fourth-order terms in the quantity δk A .
Consider now the eigenvalues of the matrix A. From Eqs. (8.33) and (9.164) we
have
λ1 = κr + κR hI + Λ+ (h, r )
λ2 = κr + κR hI − Λ+ (h, r )
λ3 = κr + κR hI + i Λ− (h, r )
λ4 = κr + κR hI − i Λ− (h, r ) , (9.174)
LINE FORMATION IN A MAGNETIC FIELD 443
where Λ+ (h, r ) and Λ− (h, r ) are given by Eqs. (A5.18). Since each of the quantities
κr , κR , hI , Λ+ and Λ− is non-negative, the modulus of the eigenvalue having the
largest modulus (that will be simply referred to as maximum eigenvalue) is
λmax = max λ1 , |λ3 | = max λ1 , |λ4 |
& 2 2 '
= max κr + κR hI + Λ+ (h, r ) , κr + κR hI + κR Λ− (h, r ) . (9.175)
ln
≥ − 1/5
. (9.179)
(120 )
1
The quantity λmin is always positive (see footnote on p.421).
444 CHAPTER 9
and it is easily seen that this number may be very large at certain wavelengths,
especially for strong lines ( κR κr ). This is the main disadvantage of the Runge-
Kutta method.
Beside being used to solve the radiative transfer equation, this method can also
be used to compute numerically the evolution operator. In particular, Grossmann-
Doerth et al. (1988a) and Sánchez Almeida (1992) have applied the 4-th order
Runge-Kutta technique to evaluate the reduced evolution operator (see Sect. 8.9
and App. 7).
Another numerical technique that has been used to solve Eq. (9.163) is the so-
called Feautrier method. This is the direct generalization to the polarized case of
the method proposed by Feautrier (1964) to solve the ‘scalar’ transfer equation
for the intensity. It has been developed by Auer et al. (1977a) in the simplified
case where magneto-optical effects are neglected and later generalized by Rees
and Murphy (1987) to include these effects. The approach consists in writing the
transfer equation as a second-order differential equation with boundary conditions
at both ends of the integration interval.
Let us suppose the model atmosphere is given, and let us recall the property of
the radiative transfer equation considered at the end of Sect. 9.5 and summarized
by Eq. (9.61). Denoting by Ω an arbitrary outward direction, and by −Ω the
opposite (inward ) direction, we have from Eq. (9.163)
d
I(τr , ∆λ) = A(τr , ∆λ) I(τr , ∆λ) − b(τr , ∆λ)
dτr
d
− Ĩ (τr , −∆λ) = A(τr , ∆λ) Ĩ (τr , −∆λ) − b(τr , ∆λ) , (9.180)
dτr
1
F (τr , ∆λ) = I(τr , ∆λ) − Ĩ (τr , −∆λ)
2
1
G(τr , ∆λ) = I(τr , ∆λ) + Ĩ (τr , −∆λ) , (9.181)
2
and omitting the arguments of the various functions for conciseness, we have using
Eqs. (9.180)
dF dG
= AG − b , = AF , (9.182)
dτr dτr
LINE FORMATION IN A MAGNETIC FIELD 445
d dG
A−1 = AG − b . (9.183)
dτr dτr
so that Eqs. (9.181) yield F = G . Using the second of Eqs. (9.182) we get
dG
A−1 = G (τr = 0) . (9.185)
dτr
(6 )1/3 ln
δk ≤ , N≥ − ,
λmax (6 )1/3
advantage of the fact that the four diagonal elements of the propagation matrix A
are equal. This enables Eq. (9.163) to be cast into the form
d & '
I(τr ) = aI (τr ) 1 + A (τr ) I(τr ) − b (τr ) , (9.187)
dτr
where
aI = κ r + κ R h I
A(τr ) κ [ H − hI 1 ]
A (τr ) = −1 = R
aI (τr ) κr + κR hI
b(τr ) [ κ r Sc 1 + κ R SL H ] U
b (τr ) = = . (9.188)
aI (τr ) κr + κR hI
d
I(τr ) = aI (τr ) I(τr ) − s(τr ) , (9.189)
dτr
where
s(τr ) = b (τr ) − A (τr ) I(τr ) (9.190)
is a ‘modified source vector’ which is a function of the Stokes vector itself.
The formal solution to Eq. (9.189)
∞ : τr
− aI (τr ) dτr
I(τr ) = e τr aI (τr ) s(τr ) dτr (9.191)
τr
is an integral Volterra equation for I(τr ) . It can be solved via an iterative proce-
dure, called lambda-iteration,1 as done by Staude (1969). The numerical method
proposed by Rees and Murphy (1987), whose details are developed in App. 9, has
the advantage of avoiding the slow convergence of the lambda-iteration.
The results contained in App. 9 show that the integration grid must be locally
controlled also for the DELO method. The criterion to choose the step length
1
The concepts of lambda-operator and lambda-iteration are basic tools of the theory of
(scalar) radiative transfer (see e.g. Mihalas, 1978). Recalling Eq. (9.190), Eq. (9.191) can be
written in the form
I(τr ) = Λ( I ; τr ) + Λ (b ; τr ) ,
where Λ and Λ are integral operators acting, respectively, on the vectors I (unknown) and b
(known). The usual lambda-iteration consists in starting with the zero-order solution I (0) (τr ) = 0
and evaluating repeatedly the left-hand side of the equation by substituting in the right-hand side
the value I (k) (τr ) obtained in the previous iteration. The procedure is stopped when convergence
is reached. Lambda-iterations generally converge very slowly.
LINE FORMATION IN A MAGNETIC FIELD 447
Defining
λmax = max κR Λ+ (h, r ) , κR Λ− (h, r ) , (9.192)
the error introduced at each step by the DELO integration is of order
2
δk3 3 a
λmax 1+ I .
12 λmax
(12 )1/3
δk ≤ 2/3 . (9.193)
a
λmax 1 + I
λmax
The number of steps necessary for the integration is therefore (cf. the derivation
of Eq. (9.179))
ln
N ≥ − ,
(12 )1/3
where 2/3
λ aI
= max 1+ ,
λmin λmax
with λmin given by Eq. (9.176).
The performance of the DELO method against the Runge-Kutta method can
be examined by comparing the corresponding step lengths. Taking for instance
= 10−3 , we have from Eqs. (9.177) and (9.193)
1/3 2/3
(δk )R.K. λmax aI + λmax
≈ 2.9 ,
(δk )DELO λmax
with λmax given by Eq. (9.175). If the propagation matrix A is ‘almost diagonal’,
we have from Eqs. (9.174), (9.188) and (9.192)
λmax aI , λmax ≈ aI ,
thus the DELO method is more efficient. When λmax and aI are comparable (which
is often the case in most practical applications), it is difficult to establish which
method is more efficient.
448 CHAPTER 9
The last integration method that we discuss here is based on the direct application
of the concept of evolution operator. The method, already presented in a slightly
different form in Landi Degl’Innocenti (1987), will be referred to as the piecemeal
evolution operator (or PEO ) method . It consists in the following.
We introduce a grid of N points (τ1 , τ2 , . . . , τN ) not necessarily evenly spaced.
In each interval we assume that the matrix H is constant and equal to the average
of the values at the end points of the interval,
1
Hk = H(τk ) + H(τk+1 ) . (9.194)
2
According to Eqs. (9.164) and (9.104), the evolution operator associated with the
interval (τk , τk+1 ) is
where
τ τ
Kr (τ, τ ) = κr (τr ) dτr , KR (τ, τ ) = κR (τr ) dτr , (9.195)
τ τ
and where the exponential of the matrix can be conveniently expressed via Eqs.
(A5.20) or (A5.23).
Iterative application of Eq. (9.165) enables the emerging Stokes vector to be
written in the form
I(τ1 ) = W1 + O1 W2 + O2 W3 + O3 . . . WN −1 + ON −1 WN . . .
or
N
I(τ1 ) = W1 + O1 O2 · · · Ok−1 Wk ,
k=2
τk+1
−Kr (τk ,τr ) −KR (τk ,τr ) H k
= e e
τk
× κr (τr ) Sc (τr ) 1 + κR (τr ) SL (τr ) H k U dτr (9.196)
for k = 1, 2, . . . , N − 1 , and
WN = I(τN ) = I (b) ,
LINE FORMATION IN A MAGNETIC FIELD 449
where Eq. (9.194) has been used. Calculations similar to those in Sect. 8.7 show
that the correction to the zero-order solution can be written in the form
∞
δI(τ1 ) = O0 (τ1 , τr ) κR (τr ) δH(τr ) SL (τr ) U − I0 (τr ) dτr ,
τ1
where O0 and I0 are the evolution operator and the solution to the zero-order
equation, respectively. The integral in the right-hand side is rather involved, but
it can be easily performed under suitable assumptions. Since δI(τ1 ) is in general
a small correction to I0 (τ1 ) , even a crude approximation is justified. An adequate
approximation is to consider the ‘effective source vector’ [SL (τr )U − I0 (τr )] as a
linear function of τr in each interval (τk , τk+1 ) . This allows the integral to be
evaluated by standard numerical techniques.
It should be mentioned that a numerical method quite similar to the PEO method
has been presented by Van Ballegooijen (1987) to solve the transfer equation in the
form of Eq. (8.65).
with BP (τr ) the Planck function, and the transfer equation becomes
d
I(τr ) = A(τr ) I(τr ) − BP (τr ) U , (9.197)
dτr
where A(τr ) is given by Eqs. (9.164). For a semi-infinite atmosphere, the formal
solution to this equation is (cf. Eq. (9.165))
∞
I(τr ) = O(τr , τr ) A(τr ) BP (τr ) U dτr , (9.198)
τr
If the ‘increments’ δζi (τr ) are small, we can express the propagation matrix and
the Planck function by a first-order expansion
p
∂A
A (τr ) = A(τr ) + δA(τr ) = A(τr ) + δζ (τ )
i=1
∂ζi τ i r
r
p
∂BP
BP (τr ) = BP (τr ) + δBP (τr ) = BP (τr ) + δζi (τr ) . (9.199)
i=1
∂ζi τr
Some remarks are necessary here. A variation of the physical parameters entails, in
general, a variation of the continuum absorption coefficient. Therefore, the optical-
depth scales in the unperturbed and perturbed atmosphere are different. In the
following, the optical-depth scale of the unperturbed atmosphere will always be
used. This implies that the partial derivatives in Eqs. (9.199) should be performed
by keeping constant the value of the absorption coefficient kc (νr ) which enters
the definitions of κr and κR (see Eqs. (9.162)). The second remark concerns the
expression of δBP in Eqs. (9.199). Since BP depends only on temperature, δBP is
identically zero if the temperature is not contained in the set {ζi } .
Substituting Eqs. (9.199) into Eq. (9.197), writing the Stokes parameters for the
perturbed atmosphere in the form
I (τr ) = I(τr ) + δI(τr ) , (9.200)
and considering only the first-order terms in the resulting equation, we get for δI
the transfer equation
p
d ∂BP
δI(τr ) = A(τr ) δI(τr ) − A(τr ) δζ (τ ) U
dτr i=1
∂ζi τ i r
r
p
!
∂A
− δζi (τr ) I(τr ) − BP (τr ) U .
i=1
∂ζi τr
The correction to the Stokes parameters emerging from the perturbed atmosphere
can thus be written in the form
p ∞
∂BP
δI(0) = O(0, τr ) A(τr ) U
i=1
∂ζi τ
0 r
!
∂A
− I(τr ) − BP (τr ) U δζi (τr ) dτr , (9.201)
∂ζi τr
The physical meaning of response functions follows directly from Eq. (9.203). If
∆τr is a small optical-depth interval centered at τr , and δζi is a small variation of
the physical parameter ζi in that interval, one gets
δI(0)
RF (ζi ; τr ) = ,
δζi ∆τr
which shows that response functions can be regarded as a sort of partial derivatives
of the emerging Stokes parameters with respect to the physical parameters (Ruiz
Cobo and del Toro Iniesta, 1994). Equation (9.203) also gives an ‘operational def-
inition’ of response functions: in principle, they could be measured by changing
the value of a physical parameter in a small region of a stellar atmosphere and
by observing the corresponding variation of the emerging Stokes parameters. Al-
though this is obviously a Gedankenexperiment , it shows that response functions
are correctly defined from the physical point of view.
Response functions satisfy the important integral property
∞
∂I(0)
RF (ζi ; τr ) dτr = ,
∂ζi
0
which follows directly from Eq. (9.203) by considering an increment δζi indepen-
dent of optical depth.
Equation (9.202) shows that response functions can be expressed in analytical
form whenever the same property holds for the evolution operator. As an exam-
ple we can consider the case of the Milne-Eddington atmosphere that has been
developed in Sect. 9.8.
First of all, it is convenient to rewrite Eq. (9.202) in terms of the optical depth τc
rather than τr , so that the expressions derived in Sect. 9.8 can be applied. Starting
at Eq. (9.203) we obtain, with the help of Eqs. (9.33), (9.162), (9.164) and (9.31)1
!
∂BP ∂C
RF (ζi ; τc ) = O(0, τc ) C(τc ) U − I(τc ) − BP (τc ) U ,
∂ζi τ ∂ζi τ
c c
I(tc , µ), the Stokes parameters of the radiation flowing along the µ-direction at
optical depth tc = µ τc , by Eqs. (9.108). The response function for any physical
parameter ζi except temperature1 is therefore
t
− µc C ∂C
RF (ζi ; tc , µ) = −B0 βµ e C −1 U ,
∂ζi
and the correction to the emerging Stokes parameters due to the ‘perturbations’
δζi (tc ) is
p ∞
dt
δI(0, µ) = RF (ζi ; tc , µ) δζi (tc ) c .
i=1
µ
0
Explicit expressions for δI(0, µ) can be derived for different forms of the func-
tions δζi (tc ). Assuming, for instance, a linear behavior of the perturbation (Lan-
dolfi, 1987)
δζi (tc ) = ai + bi tc ,
we obtain with the help of Eqs. (9.106) and (9.107)
p
∂C −1 2 ∂C −1
δI(0, µ) = −B0 βµ ai C −1 C + µ bi C −1 C U,
i=1
∂ζi ∂ζi
p
∂
δI(0, µ) = ai 1 + µ bi C −1 I(0, µ) .
i=1
∂ζi
we get
p
−1 ∂C −1
δI(0, µ) = −B0 βµ Ai C + µ αi 1 C U,
i=1
∂ζi
1
In a Milne-Eddington atmosphere the partial derivative ∂BP / ∂T is meaningless because
of the parameterization adopted for BP .
454 CHAPTER 9
p
−1 ∂
= Ai 1 − µ αi C + µ αi 1 I(0, µ) .
i=1
∂ζi
∂ ∂
A(−∆λ) = −X A(∆λ) X −1 , (9.208)
∂ζb τr ∂ζb τr
where ζa is any physical parameter of group (a), and ζb = wA . Using Eqs. (9.206)-
(9.208), and observing that
∂BP
= 0, X −1 U = U ,
∂ζb τ
r
According to Eq. (9.203), the corrections δI(0, ∆λ) and δI(0, −∆λ) satisfy sym-
metry relations of the same form as Eqs. (9.209). It follows that only a variation
of the physical parameter wA with optical depth can alter the symmetry char-
acteristics of the Stokes parameters expressed by Eqs. (9.54).1 The (first-order)
corrections produced by such variation have the opposite symmetry of the corre-
sponding Stokes parameters,
Finally, we want to remark that the concept of response function can be easily
generalized to any observable quantity obtained by a mathematical transformation
on the emerging Stokes parameters I(0, λ). As an example, let us consider the
line-integrated Stokes parameters defined by2
I = (I, Q , U, V)† = I(0, λ) − Ic (0) U dλ , (9.210)
line
where Ic (0) is the emerging intensity in the continuum adjacent to the line ( Ic (0) U
= I(0, λc ) , with λc a wavelength sufficiently far in the line wing), and the integral
is over the line profile. Following our perturbative scheme, we have from Eq. (9.203)
p ∞ !
δI = RF (ζi ; τr , λ) − RF (ζi ; τr , λc ) δζi (τr ) dτr dλ .
line i=1 0
Inverting the order of the integrals, this equation can be cast into the form
p ∞
δI = RF (ζi ; τr ) δζi (τr ) dτr ,
i=1 0
del Toro Iniesta (1994). In this last paper the concept is precisely stated from the
mathematical point of view.
∞
I(0) = CF (τr ) dτr , (9.211)
0
provides the possibility of defining a contribution function for the Stokes parameters
in the form (Van Ballegooijen, 1985)
CF (τr ) = O(0, τr ) κr Sc 1 + κR SL H U .
This contribution function has a very intuitive physical meaning, but has the draw-
back of mixing the contributions of the continuum and of the spectral line. To over-
come this difficulty one can define the contribution function for the line depression
[Ic (0) U − I(0)], or the contribution function for the normalized line depression 1
defined by
I (0) U − I(0) I(0)
D(0) = c =U − . (9.212)
Ic (0) Ic (0)
1
The term ‘line depression’ is applied here to the four Stokes parameters, although, more
properly, it should only be applied to the intensity.
LINE FORMATION IN A MAGNETIC FIELD 457
dIc
= κr Ic − Sc (9.213)
dτr
one obtains, with the use of Eqs. (9.164), the transfer equation
dD S S
= κr c 1 + κR H D − κR 1 − L H U .
dτr Ic Ic
with A given by Eqs. (9.164), the value of AD at any optical depth can be evaluated
using the solution to Eq. (9.213). We can then calculate the evolution operator OD
corresponding to the propagation matrix AD , and finally write
∞
D(0) = CF (τr ) dτr , (9.214)
0
where CF (τr ) , the contribution function to the normalized line depression, is defined
by (Grossmann-Doerth et al., 1988a; Rees et al., 1989)
S
CF (τr ) = OD (0, τr ) κR 1 − L H U .
Ic
∞
g(τr ) dτr = 0 .
0
458 CHAPTER 9
The choice of the gauge is more or less arbitrary, although in some cases it can
be suggested by physical considerations. We believe that no real progress can be
achieved in this subject until a clear, operational definition of contribution function
is given.
A possible way out could be to replace the concept of contribution function by
an appropriate response function, like the response function to the logarithm of
the line oscillator strength, defined by (see Eq. (9.203))1
∞
δI(0) = RF ( lnf ; τr ) δ lnf (τr ) dτr .
0
According to Eq. (9.212), the correction to the normalized line depression produced
by the ‘perturbation’ δ lnf (τr ) is given by
δI(0)
δD(0) = − ,
Ic (0)
∞
δD(0) = RF ( lnf ; τr ) δ lnf (τr ) dτr ,
0
RF ( lnf ; τr )
RF ( lnf ; τr ) = − . (9.215)
Ic (0)
Since
∂BP
= 0
∂(lnf ) τ
r
1 & '
RF ( lnf ; τr ) = κR (τr ) O(0, τr ) H(τr ) I(τr ) − BP (τr ) U .
Ic (0)
1
We restrict ourselves to the LTE case since our definition of response function does not
apply to non-LTE lines – see the discussion in the previous section.
LINE FORMATION IN A MAGNETIC FIELD 459
Finally, it should be noticed that all the contribution functions defined in this
section, as well as the response functions defined in the former section, are referred
to the optical-depth scale τr . When a different depth indicator x is used, the new
contribution function is obtained through the obvious transformation
dτr
CF (x) = CF (τr ) ,
dx
with a similar relation for RF (x) . Assuming for instance x = Log τr , one has
The scale Log τr is generally convenient for plotting contribution and response
functions against optical depth.
9.18. Blends
Blends are a common feature of stellar spectra. They simply result from the fact
that, in a given section of the spectrum, two or more lines, generally belonging to
different elements, may overlap to give rise to a complicated spectral feature. The
most common case is the presence of a weak line in the blue or in the red wing of
a stronger line.
When the individual lines are independent , which means that they belong either
to different elements or to different multiplets of the same element, their cumulative
effect can be simply described by adding in the transfer equation the contributions
of each line. When the blending lines are the fine-structure or hyperfine-structure
components of a single ‘parent’ line, the situation is more complex and requires a
different approach (see Sect. 9.23).
(i)
For an ensemble of N independent lines, each characterized by its own κL (the
ratio between line and continuum absorption coefficient defined in Eq. (9.31)) and
(i)
SL (the line source function defined in Eq. (9.14)), Eq. (9.35) generalizes into the
following
⎛ ⎞ ⎛ ⎞⎛ ⎞
I 1 0 0 0 I − Sc
⎜ ⎟ ⎜
d ⎜Q⎟ ⎜0 1 0⎟ ⎜ ⎟
0 ⎟⎜ Q ⎟
⎜ ⎟=⎜ ⎟⎜ ⎟
dτc ⎝ U ⎠ ⎝ 0 0 1 0 ⎠⎝ U ⎠
V 0 0 0 1 V
⎛ (i) ⎞⎛ (i) ⎞
hI hQ
(i) (i)
hU hV
(i)
I − SL
⎜ (i) ⎟⎜ ⎟
N ⎜ (i) (i)
−rU ⎟
(i) ⎜ Q ⎟
(i) ⎜ hQ hI rV ⎟⎜ ⎟
+ κL ⎜ ⎟ ⎜ ⎟ , (9.216)
⎜ h(i) −rV
(i) (i)
hI rQ ⎟
(i) ⎜ U ⎟
i=1 ⎝ U ⎠⎝ ⎠
(i) (i) (i) (i)
hV rU −rQ hI V
460 CHAPTER 9
dI N
(i)
N
(i) (i)
= 1 + κL H (i) I − Sc 1 + κL SL H (i) U , (9.217)
dτc i=1 i=1
N
(i)
C=1 + κL H (i) .
i=1
Figure 9.9 shows the emerging Stokes parameters, calculated according to Eq.
(9.109), for a blend of two lines. The line on the left is the normal Zeeman triplet
of Fig. 9.5 (a), the other one is weaker but has a larger sensitivity to the magnetic
field (normal triplet with g = 2). Panels (a) to (d) are obtained by reducing
the wavelength separation of the lines. The Stokes parameters are normalized to
B0 and are plotted against the reduced wavelength v = (λ − λ̄0 ) / ∆λD , where
λ̄0 = (λ(1) (2)
0 + λ0 ) / 2 and ∆λD is the (common) Doppler width of the two lines.
The figure shows quite clearly that blends break the symmetry characteristics of
Stokes profiles about line center. The ‘distortion’ is particularly evident in the Q
profile, panels (a) and (b).
In the quite common case of a weak line lying on the wing of a stronger line,
the emerging Stokes parameters can be obtained by a perturbative solution of the
transfer equation (9.217). If the index 1 refers to the stronger line and the index 2
to the weaker, one can first solve the equation
dI0
L H
= 1 + κ(1) I0 − L SL H
Sc 1 + κ(1) U
(1) (1) (1)
dτc
to obtain the ‘zero-order’ Stokes vector I0 (τc ) . The correction δI is then found by
solving the approximate equation
d
L H
δI = 1 + κ(1) δI − κ(2)
L H SL(2) U − I0 .
(1) (2)
dτc
L Hij κL Hij .
Obviously, such perturbative solution is justified only if κ(2) (2) (1) (1)
Let us consider a spectral line formed in the radiation flowing along a particular
direction in a given stellar atmosphere. If no magnetic field is present in the
LINE FORMATION IN A MAGNETIC FIELD 461
Fig.9.9. Stokes parameters profiles for a blend of two lines, computed for a Milne-Eddington
atmosphere. The profiles are normalized to B0 and are plotted against the reduced wavelength
defined in the text. The lines have different strength (κ(1) L
= 20, κ(2)
L
= 10) and magnetic
(1) (2)
sensitivity (normal Zeeman triplets with g = 1, g = 2 ). The values of the remaining
parameters are those of the ‘standard case’ (see Fig.9.5). The four panels correspond to different
wavelength separation of the lines, parameterized through ∆v = (λ(2) (1)
0 − λ0 ) /∆λD with ∆λD
the (common) Doppler width. Panel (a) ∆v = 3 ; (b) ∆v = 1.5 ; (c) ∆v = 0.5 ; (d) ∆v = 0.
where Ic (0) is the emerging intensity in the nearby continuum and I(0) ¯ is the
zero-field, wavelength-dependent emerging intensity.
In a sort of Gedankenexperiment , let us now suppose a constant magnetic field
is switched on throughout the whole atmosphere, all the other physical parameters
remaining unchanged. The new atmosphere will be characterized by the same
continuum intensity1 but a different line intensity, and this will result in a different
1
See footnote on p.376.
462 CHAPTER 9
equivalent width for the line considered. Defining the new equivalent width as
1
W = Ic (0) − I(0) dλ , (9.219)
Ic (0)
line
where I(0) is the intensity emerging from the magnetic atmosphere, the influence of
the magnetic field on the equivalent width can be characterized by the dimensionless
parameter
W
M= , (9.220)
W0
that we will call the magnetic intensification parameter . In most stellar atmo-
spheres and for most spectral lines, this parameter is larger than 1 : in other
words, the equivalent width in a magnetic atmosphere is usually larger than the
equivalent width in the corresponding (in the sense specified above) non-magnetic
atmosphere.
This fact has a very simple physical explanation, which can be understood with
the help of the classical theory of the equivalent width developed, for instance, in
Aller (1963) or in Mihalas (1978). Consider a line in the flat part of the curve
of growth, where saturation phenomena are important. In the presence of a mag-
netic field the line splits into its Zeeman components and, as soon as the Zeeman
splitting becomes of the order of the Doppler width (∆λB ≈ ∆λD ), it broadens
– thus increasing its equivalent width – and desaturates. This desaturation process
increases with increasing magnetic field strength and reaches its maximum when
all the Zeeman components are well-separated (∆λB ∆λD ). In this process the
line intensifies, which justifies the name of magnetic intensification given to this
physical mechanism.
Ten Bruggencate and von Klüber (1939) were the first to suggest that such mech-
anism may be effective in sunspots. Further contributions to this subject were given
by Babcock (1949) and Boyarchuk et al. (1961).
Even in the simple case of an atmosphere with constant magnetic field, the mag-
netic intensification parameter defined in Eq. (9.220) depends on many quantities.
Besides the magnetic field intensity and direction, it depends on the spectral line
(the structure of the Zeeman pattern being of crucial importance), on the full
set of physical quantities specifying the properties of the atmosphere, and on the
direction of the emerging radiation.
We can obtain an approximate expression for M by assuming the atmosphere to
be described by the Milne-Eddington model. Although this may seem a very rough
approximation, it should be remarked that the equivalent width of a spectral line
has a rather weak dependence on the detailed structure of the atmosphere, and
this holds especially for the magnetic intensification parameter which is the ratio
of two equivalent widths. Recalling Eqs. (9.218)-(9.220) and Eq. (9.112), we obtain
∞&
'
1 − ∆−1 (1 + kI ) (1 + kI )2 + fQ 2
+ fU2 + fV2 dλ
M = −∞ ∞ & ' , (9.221)
−1
1 − (1 + κL η) dλ
−∞
LINE FORMATION IN A MAGNETIC FIELD 463
where ∆ is defined in Table 9.5, η is the normalized absorption profile for zero
magnetic field
1 λ − λ0
η = √ H ,a ,
π ∆λD
and the remaining quantities are given by Eqs. (9.31) and (9.39).
Despite the substantial simplification introduced by the Milne-Eddington model,
M still remains a complicated function of the Zeeman pattern and of the four
parameters1 vB , θ, κL , a . Only a few properties of M can be deduced analytically
from Eq. (9.221). These are the following:
a) Under the limit of weak, unsaturated lines, the magnetic intensification param-
eter is equal to 1, ∞
κL hI dλ
−∞
lim M = ∞ = 1.
κL →0
κL η dλ
−∞
In other words, the line-broadening mechanism due to the magnetic field is ineffec-
tive in this particular case. The proof is easily obtained by series expansion of the
integrands in Eq. (9.221) and use of the normalization (9.28).
b) Under the limit of weak magnetic field (∆λB ∆λD ), the numerator of
Eq. (9.221) can be expanded in power series of ∆λB with the help of Eqs. (9.69).
After some algebra we obtain, up to second-order terms
2
0 sin θ + G1 (1 + cos θ) I2 (κL , a) − G1
G(2) cos2 θ I3 (κL , a)
1 2 (2) 2 (1)
4
M = 1 + vB
2
,
I1 (κL , a)
∞
κL η
I1 (κL , a) = dv
1 + κL η
−∞
∞
κL η̈
I2 (κL , a) = dv
(1 + κL η)2
−∞
∞
κ2L η̇ 2
I3 (κL , a) = dv , (9.222)
(1 + κL η)3
−∞
1
The dependence on χ , the azimuth angle of the magnetic field vector, disappears for obvious
symmetry reasons.
464 CHAPTER 9
∂η ∂η ∂2η ∂2η
η̇ = = ∆λD , η̈ = 2
= ∆λ2D . (9.223)
∂v ∂λ ∂v ∂λ2
Integration by parts of I2 (κL , a) shows that
hence
I3 (κL , a)
M = 1 + vB
2
Γ (θ) , (9.225)
I1 (κL , a)
where
1 2
0 sin θ + G1 (1 + cos θ) − G1
G(2) 2 (2) 2 (1)
Γ (θ) = cos2 θ .
2
The function Γ (θ) can be transformed by substituting the expressions for the quan-
(n)
tities Gq contained in Table 3.4. With some easy algebra one gets
1 2
Γ (θ) = ḡ + δ sin2 θ + δ , (9.226)
2
where ḡ is the effective Landé factor defined in Eq. (3.44), δ is defined in Eq. (9.78)
and δ is given by
1 2
δ = g (8s − d2 − 12) . (9.227)
80 d
For electric-dipole transitions, the quantity δ is – like δ, see Sect. 9.6 – a non-
negative quantity that vanishes only for Zeeman triplets (and for transitions J =
1/2 → J = 1/2).
Since both integrals I1 and I3 are positive, substitution of Eq. (9.226) into
Eq. (9.225) shows that the following conclusions can be drawn about the mag-
netic intensification parameter in the weak field limit: i) it increases quadrati-
cally with the magnetic field strength; ii) it increases linearly with sin2 θ , reach-
ing its maximum value for transverse fields (θ = 90◦ ); iii) in a longitudinal field
(θ = 0◦ or 180◦ ), it is always equal to 1 for Zeeman triplets and for transitions
J = 1/2 → J = 1/2;1 iv) its dependence on κL and a is neither affected by the
magnetic field nor by the Zeeman pattern of the spectral line, and is fully described
by the function I3 / I1 . The functions I1 , I3 and the ratio I3 / I1 are plotted in
Fig. 9.10. Note that I1 is proportional to the equivalent width of a non-magnetic
line expressed in the Milne-Eddington approximation. From Eqs. (9.218), (9.112)
and (9.222) we have in fact
βµ
W0 (µ) = ∆λD I (κ , a) . (9.228)
1 + βµ 1 L
1
This property holds even if the magnetic field is not weak, as it can be easily realized from
the discussion presented in Sect. 9.10 (cf. Eqs. (9.126)).
LINE FORMATION IN A MAGNETIC FIELD 465
Fig.9.10. The functions I1 (κL , a) and I3 (κL , a) are plotted against κL on a Log-Log scale, for
different values of the damping constant: a = 10−3 (full line), a = 10−2 (short dash), a = 10−1
(long dash). Note that I1 (κL , a) represents the standard curve of growth for non-magnetic lines.
According to Eq.(9.225), the ratio I3 / I1 describes the dependence on κL and a of the magnetic
intensification parameter M.
c) Under the limit of very large magnetic field (∆λB ∆λD ), when the profiles
of the different Zeeman components are completely separated and do not overlap,
each component can be treated independently. Its contribution is easily obtained
with the help of the results derived in Sect. 9.12. Equations (9.220), (9.219), (9.228)
466 CHAPTER 9
N N ∞ (i) (i)
1 (i) 1 κf η
M= RI (0, µ) dλ = dv ,
W0 (µ) i=1 2 I1 (κL , a) i=1 (i) (i)
1 + κf η
line −∞
(i)
where κf is given by Eq. (9.143), η (i) by Eqs. (9.140), and N is the total number
of Zeeman components. Recalling the definition of I1 (Eqs. (9.222)) and dividing
the Zeeman components into π and σ, we obtain, with the use of the expressions
in Table 9.6
Nπ
, - Nσ
, 2 -
I1 κL si sin2 θ, a + I1 κL si 1 + cos
2
θ ,a
1 i=1 i=1
M= , (9.229)
2 I1 (κL , a)
where si is the strength of the i-component defined in Eqs. (9.140) and the two
summations refer to π and σ components, respectively (Nπ + Nσ = N ). It follows
from Eq. (9.229) that, when κL is increased, the magnetic intensification parameter
tends to a maximum asymptotic value given by1 (Babcock, 1949)
Nπ + Nσ N
M= = ,
2 2
which clearly shows the importance of the Zeeman pattern in determining the value
of the magnetic intensification parameter.
All the properties discussed above are confirmed by detailed numerical calcula-
tions. Sample results are shown in Fig. 9.11.
Saturation effects in magnetic spectral lines not only increase their equivalent width
(as shown in the former section) but also produce a net amount of linear polariza-
tion in the line.
It is easily seen that weak, unsaturated magnetic lines cannot produce net linear
polarization. According to the generalized Seares formulae (9.151), which are valid
for weak lines in an arbitrary atmosphere, the emerging Q Stokes parameter is
given by
∞ ∞
−τc −(τ − τ )
Q(0) = e κL (τc ) hQ (τc ) BP (τc ) − e c c BP (τc ) dτc dτc ,
0 τc
1
If θ is close to 0◦ or 180◦ , the ‘fictitious opacity’ in the π components (κL si sin2 θ) is small.
For such inclination angles the asymptotic value for M is Nσ / 2 instead of N / 2 . This shows
again that M = 1 for Zeeman triplets when the magnetic field is parallel or antiparallel to the
propagation direction (the magnetic intensification mechanism is ineffective).
LINE FORMATION IN A MAGNETIC FIELD 467
Fig.9.11. Numerical results for the magnetic intensification parameter M computed according to
Eq.(9.221). The solid line refers to a normal Zeeman triplet ( 1 S0 − 1 P1 ), while the dashed line
refers to a more complicated Zeeman pattern ( 5 P3 − 5 D4 , see Fig.3.2). Panel (a): M versus vB
for θ = 60◦ , κL = 20, a = 0.05. Panel (b): M versus θ for vB = 1.5, κL = 20, a = 0.05. Panel (c):
M versus Log κL for vB = 1.5, θ = 60◦ , a = 0.05.
with a similar expression for U (0). Since from Eqs. (9.32), (9.29) and (9.28)
hQ dλ = hU dλ = 0 ,
line line
468 CHAPTER 9
where I is defined in Eq. (9.210) and W is the equivalent width given by Eq. (9.219).
As in the case of the magnetic intensification parameter M, a good approximation
for q and u is obtained by assuming the Milne-Eddington model atmosphere. It is
also convenient to express the net linear polarization parameters in the preferred
reference frame (see Sect. 5.5). Setting
q̃ = q (χ = 0◦ ) , ũ = u (χ = 0◦ ) ,
where
∆ = (1 + kI )4 + (1 + kI )2 (fQ̃
2
+ fV2 − kQ̃
2
− kV2 ) − (kQ̃ fQ̃ + kV fV )2
LINE FORMATION IN A MAGNETIC FIELD 469
q̃ = Π cos 2Φ , ũ = Π sin 2Φ
with Φ ranging from 0 to π, are sometimes used in the place of q̃ and ũ . Inversion
of these relations gives (cf. Eqs. (1.8))
Π= q̃2 + ũ2
and, for q̃ = 0
1 ũ
Φ= arctan + Φ0
2 q̃
with ⎧
⎨0 if q̃ > 0 and ũ ≥ 0
Φ0 = π if q̃ > 0 and ũ < 0
⎩
π/2 if q̃ < 0 ,
while for q̃ = 0 1
4π if ũ > 0
Φ= 3
4π if ũ < 0 .
The relation between the net linear polarization parameters expressed in an arbi-
trary frame (rather than the preferred frame) and the set (Π, Φ) is easily obtained
from the transformation law of the Stokes parameters under rotation of the refer-
ence direction. Equations (1.45) yield
b) Under the limit of weak, unsaturated lines (κL 1), expansion of Eqs. (9.231)
into power series of κL yields, with the help of Eqs. (9.32), (9.29) and (9.28)
∞ ∞
q̃ = 2 κL hQ̃ hI dv , ũ = κL hV rQ̃ − hQ̃ rV dv ,
−∞ −∞
where v is the reduced wavelength defined in Eqs. (9.26). These equations show
that Π grows linearly with κL and that Φ attains a finite value for κL → 0.
c) Under the limit of weak magnetic field (∆λB ∆λD ) we can expand the
integrands in Eqs. (9.231) by using Eqs. (9.69). To the lowest order in vB we
obtain, with the help of Eqs. (9.76), (9.224) and Table 3.4
1 2 I (κ , a)
q̃ = vB Ḡ sin2 θ 3 L
2 I1 (κL , a)
1 3 I (κ , a)
ũ = vB ḡ Ḡ sin2 θ cos θ 4 L , (9.233)
4 I1 (κL , a)
∞
κ2L (η̇ ρ̈ − η̈ ρ̇)
I4 (κL , a) = dv ,
(1 + κL η)3
−∞
∂ρ ∂ρ ∂2ρ ∂2ρ
ρ̇ = = ∆λD , ρ̈ = 2
= ∆λ2D .
∂v ∂λ ∂v ∂λ2
Equations (9.233) show that, for weak fields, the net linear polarization grows
quadratically with the magnetic field strength and linearly with sin2 θ. The direc-
tion of maximum polarization is determined by the sign of q̃ , hence (since both I1
and I3 are positive) by the sign of Ḡ. Most spectral lines have Ḡ > 0, which im-
plies that the net linear polarization is parallel to the transverse component of the
magnetic field. The opposite holds for the lines with Ḡ < 0 (net linear polarization
perpendicular to the transverse component of the magnetic field). Comparison of
the expression for q̃ given by Eqs. (9.233) with Eqs. (9.83) and (9.85) shows that
the sign of q̃ is the same as the sign of the Q̃ profile in the wings and is opposite
to the sign of Q̃ in the line core. This means that, for weak fields, σ components
are always less saturated than π components.
d) Under the limit of very large magnetic field (∆λB ∆λD ), we can use the results
developed in Sect. 9.12. The following expressions are obtained (see Eq. (9.229)
LINE FORMATION IN A MAGNETIC FIELD 471
Nσ
, 1 + cos2 θ -
Nπ
, -
sin2 θ I1 κL si ,a − I1 κL si sin2 θ, a
1 + cos2 θ 2
i=1 i=1
q̃ = Nσ
, 1 + cos2 θ -
Nπ
, -
I1 κL si ,a + I1 κL si sin2 θ, a
i=1
2 i=1
ũ = 0 ,
sin2 θ Nσ N
q̃ = − π , (9.234)
1 + cos2 θ N N
where θV is the Van Vleck angle defined in Eqs. (5.100)-(5.101). Equation (9.234)
also shows that for θ = 90◦ (transverse field), q̃ reaches its maximum value
Nσ − Nπ
q̃ max = ,
N
or, for most Zeeman patterns
1
q̃ max = .
3
1
For electric-dipole transitions, the relation Nσ = 2Nπ is always valid except when the lower
and upper level have the same half-integral J quantum number. In that case Nπ = 2J + 1 and
Nσ = 4J, hence
.
2J − 1
θz = arccos .
6J + 1
472 CHAPTER 9
Fig.9.12. Numerical results for the differential saturation mechanism. The net linear polarization
parameters q̃ and ũ are calculated from Eqs.(9.231) for the normal Zeeman triplet 1 S0 − 1 P1 .
The big dot corresponds to the ‘standard case’ values vB = 1.5, θ = 60◦ , κL = 20, a = 0.05. The
four curves are obtained by varying one parameter at a time, namely: vB (a); θ (b); κL (c);
a (d). Values of the varying parameter are marked by small dots.
The analytical results presented above are only valid within a limited region of
the hyperspace of the four parameters vB , θ, κL , a. In general, the calculation
of the net linear polarization requires (under the Milne-Eddington assumptions)
a numerical evaluation of the integrals in Eqs. (9.231). Sample results are shown
in Fig. 9.12. It is seen that, except in the limiting cases of very weak and very
strong magnetic field, the parameter ũ makes an important contribution to the net
linear polarization. Since this parameter vanishes when magneto-optical effects are
neglected (see Eqs. (9.231)), it follows that these effects play a major role in the
mechanism of differential saturation. Numerical results on the net linear polar-
ization have been presented by Calamai et al. (1975), Landi Degl’Innocenti and
Calamai (1982), Calamai and Landi Degl’Innocenti (1983), and Bagnulo (1993). In
these works the line strength is parameterized
√ through the quantity η0 which is con-
nected with κL by the relation η0 = κL / π (see the remarks following Eqs. (9.32)).
Finally, it should be pointed out that differential saturation of π and σ compo-
LINE FORMATION IN A MAGNETIC FIELD 473
nents is the only mechanism that can introduce net linear polarization in spectral
lines when atomic polarization effects are neglected. We will see in the following
(Sect. 10.5) that if the alignment components of the atomic density matrix are
non-zero, there is an extra-contribution to net linear polarization. The definition
of alignment and its physical meaning are discussed in Sect. 3.7.
In the two previous sections we have seen that saturation effects in magnetic lines
produce an increase of equivalent width and the appearance of net linear polariza-
tion. For circular polarization the situation is quite different, because saturation
effects are necessary but not sufficient for the appearance of a net contribution
from a given spectral line.
The proof of this statement is straightforward. For the first part (saturation
effects are necessary) it is enough to consider the generalized Seares formulae
(Eqs. (9.151)) that we rewrite here for the emerging V Stokes parameter
∞ ∞
−τc −(τc − τc )
V (0) = e κL (τc ) hV (τc ) BP (τc ) − e BP (τc ) dτc dτc .
0 τc
The second part of the statement (saturation effects are not sufficient) is directly
proved by the wavelength symmetry properties of the Stokes parameters outlined
in Sect. 9.5 and summarized in Eqs. (9.54): for a static (or constant-velocity)
atmosphere we find again V = 0.
These arguments suggest that, in order to obtain net circular polarization in
magnetic spectral lines, it is necessary to have saturation effects combined with a
velocity gradient.
Similarly to Eqs. (9.230), we define the (dimensionless) net circular polarization
parameter to be
V V
v =− = , (9.235)
I Ic (0) W
with I and W given by Eqs. (9.210) and (9.219) respectively, and Ic (0) the emerg-
ing continuum intensity. This parameter has a very complicated dependence on
474 CHAPTER 9
the physical quantities specifying the stellar atmosphere (including the velocity
gradient) and on the Zeeman pattern of the line. This makes extremely difficult
to derive its general properties except in some special cases based on drastic ap-
proximations. Obviously, it is impossible to use the Milne-Eddington solution to
the transfer equation (as done in the former section for the net linear polarization)
because the Milne-Eddington model atmosphere is static.
In the following we consider three special cases where analytical expressions for
the parameter v can be derived:
i) a zero-order Milne-Eddington atmosphere is ‘perturbed’ by a (small) velocity
gradient;
ii) an arbitrary atmosphere with vertical magnetic field presents a velocity gradient
associated with a gradient of the magnetic field intensity;
iii) the atmosphere is composed of two superposed Milne-Eddington atmospheres
with a discontinuity of all the physical parameters (including velocity) at the tran-
sition layer.
In case i) the velocity gradient is small and continuous and it is not associated
with other gradients. In case ii) the velocity gradient is continuous and not nec-
essarily small, and it is associated with a gradient of the magnetic field intensity.
Finally, in case iii) the velocity gradient is discontinuous (being mathematically de-
scribed by a Dirac delta-function) and it is associated with discontinuous gradients
of the other physical quantities.
Before we analyze these special cases, some general remarks about the line-of-
sight velocity gradient are necessary. Given a stationary stellar atmosphere where
a macroscopic velocity field w A is present, we introduce the terms1
The reason for these terms is the following. Denoting by ρ the mass density of the
medium, its total (Eulerian) time derivative is given by
dρ
A · grad ρ ,
= w
dt
since the atmosphere is stationary (∂ρ/∂t = 0). Combining this relation with the
mass conservation equation
we get
dρ
= − ρ div w
A ,
dt
1 A has the same sign throughout the whole atmosphere.
For simplicity, we assume that div w
Note that a similar terminology was first introduced by Skumanich and Lites (1987).
LINE FORMATION IN A MAGNETIC FIELD 475
so that dρ/dt is negative for an ‘expanding’ atmosphere and positive for a ‘com-
pressing’ atmosphere. It follows that each plasma element goes across regions of
decreasing density in the former case and of increasing density in the latter.
In the particular case of a plane-parallel atmosphere with a vertical (upward or
A , depend only on
downward) velocity field, all the physical quantities, including w
the geometrical depth z defined in Fig. 9.2. For a radiation beam travelling along
the line-of-sight component of the velocity w
the direction Ω, A – which we keep on
denoting by wA – is, according to the sign astrophysical convention (see Sect. 5.4)
·w
wA = − Ω A .
If s is the coordinate measured along the ray path and µ = cos α, with α defined
in Fig. 9.2, we get from the two equations above
dwA
· dw A = − µ Ω
= −Ω · ∂w A = − µ2 div w
A .
ds ds ∂z
In terms of the optical depths τc or tc defined in Eqs. (9.33) and (9.34) we have
dwA µ2 dwA µ
= A ,
div w = A .
div w
dτc kc dtc kc
where tc is the optical depth measured along the vertical (Eqs. (9.34)).
We now make the further assumption that the perturbation is everywhere small,
that is (cf. Eqs. (9.26))
wA (tc ) λ0
vA (tc ) = 1
c ∆λD
or
(0)
wA (1)
, wA wT , (9.236)
where wT is the thermal velocity defined in Eqs. (5.42) and (5.46). This assump-
tion enables the first-order correction to the emerging V Stokes parameter to be
evaluated using the appropriate response function. Similarly to Eq. (9.200), we can
write
V (0, µ) = V (0, µ) + δV (0, µ) , (9.237)
where the zero-order term V (0, µ), corresponding to the static atmosphere, makes
no contribution to the net circular polarization parameter, which thus reduces to
(see Eqs. (9.235) and (9.210))
∞
1
v = δV (0, µ) dλ . (9.238)
Ic (0) W
−∞
On the other hand, Eqs. (9.39), (9.32), (9.29), (9.27) and (9.26) show that
∂ C −1 λ0 ∂ C −1 λ0 ∂ C −1 λ ∂ C −1
= =− =− 0 ,
∂wA c ∆λD ∂vA c ∆λD ∂v c ∂λ
thus we can apply Eq. (9.205) to obtain
λ0
δV (0, µ) = −B0 βµ
c
∂ −1 3
−1 ∂ −1
× wA
(0)
C 30
+ µ wA
(1)
C 3i
C i0
. (9.239)
∂λ i=0
∂λ
we have1
∞
∂ −1
C 30
dλ = 0 . (9.240)
∂λ
−∞
1 (0)
Equation (9.240) shows that the term wA does not affect the net circular polarization. The
(0)
condition wA wT in Eq. (9.236) is necessary only if we want to regard V (0, µ) in Eq. (9.237) as
the solution corresponding to the static atmosphere. However, V (0, µ) could as well be regarded
(0)
as the solution corresponding to a constant-velocity atmosphere with line-of-sight velocity wA ,
(0)
which again proves that the condition wA wT is indeed unnecessary for the following results
(in particular Eq. (9.241)) to hold.
LINE FORMATION IN A MAGNETIC FIELD 477
whence
∞ ∞
3
−1 ∂ −1 1 ∂ C −1
C 3i ∂λ
C i0
dλ = − , C −1 dλ ,
i=0
2 ∂λ 30
−∞ −∞
where the integrand in the right-hand side is the element (3, 0) of the commutator
of the two matrices. The denominator of Eq. (9.238) is easily evaluated using
Eqs. (9.219) and (9.109). To the lowest order we obtain
∞
Ic (0) W = B0 βµ 1 − C −1 00 dλ .
−∞
where ∆λD is the Doppler width, v the reduced wavelength defined in Eqs. (9.26),
and
&
A = (1 + kI )3 (kQ̃ kV + fQ̃ fV )
2 2 ' ∂kQ̃
+ (1 + kI ) −kQ̃ kV fQ̃ − fV2 + fQ̃ fV kQ̃ − kV2
∂v
∂k
− (1 + kI )3 kQ̃2 2
+ fQ̃ V
∂v
&
+ (1 + kI )3 (kV fQ̃ − kQ̃ fV )
2 ' ∂fQ̃
+ (1 + kI ) kQ̃ fV kV2 + fQ̃ 2
+ kV fQ̃ kQ̃ + fV2
∂v
2
2 ∂fV
− (1 + kI ) (kQ̃ fQ̃ + kV fV ) kQ̃ + fQ̃ ; (9.243)
∂v
478 CHAPTER 9
the quantities kQ̃ and fQ̃ are given by (cf. Eqs. (9.232))
Equations (9.242)-(9.243) show that the net circular polarization parameter de-
pends on the four quantities vB , θ, κL , a and on the structure of the Zeeman
(1)
pattern. Moreover, it is proportional to the line-of-sight velocity gradient µ wA ,
so that its sign changes if the velocity gradient is inverted (wA → − wA ). Be-
(1) (1)
cause of the involved structure of the expression, only a few results can be derived
analytically. These are the following:
a) It is easily seen, with the help of Eqs. (9.32), that v = 0 for θ = 0◦ and θ = 180◦ ,
and that
v (180◦ − θ) = − v (θ) ,
which implies in particular v (θ = 90◦ ) = 0. In other words, under the assumption
of weak velocity gradient considered here, the net circular polarization vanishes for
longitudinal and transverse magnetic fields.
b) Under the limit of weak magnetic field (∆λB ∆λD ), the expansions in
Eqs. (9.69) can be used. Recalling Eqs. (9.75) and (9.76) one obtains, to the
lowest order in vB
1 w(1) λ0 I (κ , a)
v =− µ A ḡ Ḡ2 vB
5
sin4 θ cos θ 5 L , (9.244)
16 c ∆λD I1 (κL , a)
where I1 (κL , a) is defined in Eqs. (9.222) and
∞ ... ...
κ5L (η̈ η̇ + ρ̈ ρ̇) η − (η̈ 2 + ρ̈ 2 ) η̈ + (η̇ ρ̈ − η̈ ρ̇) ρ
I5 (κL , a) = dv ,
(1 + κL η)5
−∞
with η and ρ given by Eqs. (9.65) and with the dots denoting derivatives with
respect to the reduced wavelength (cf. Eqs. (9.223)).
A numerical analysis of the integral I5 shows that, excluding weak lines (κL
less than about 1.5), it is a positive quantity. This implies that the net circular
polarization parameter has the same sign as the red wing of the V Stokes profile1
(1)
for ‘expanding’ atmospheres (wA > 0), while it has the sign of the blue wing
(1)
for ‘compressing’ atmospheres (wA < 0). It must be pointed out that magneto-
optical terms play a crucial role in determining the value of v : neglect of these
terms results in a sign switch of the integral I5 for practically all values of the
parameters κL and a .
Another important consequence of Eq. (9.244) is the explicit dependence of v on
the inclination angle θ. The net circular polarization is maximum for
1
θ1 = arccos √ 63◦. 4 , θ2 = 180◦ − θ1 116◦. 6 .
5
1
According to Eq. (9.80), the sign of the red wing of the V profile is the sign of the quantity
(− ḡ cos θ).
LINE FORMATION IN A MAGNETIC FIELD 479
Finally, the proportionality to the factor ḡ Ḡ2 shows that, for electric-dipole tran-
sitions of assigned effective Landé factor, the net circular polarization is maximum
for Zeeman triplets (see Sect. 9.6).
c) Under the limit of very large magnetic field (∆λB ∆λD ), direct substitution
of Eqs. (9.147) into Eq. (9.242) shows that v = 0.1
When the magnetic field is neither weak nor strong, one must resort to a numer-
ical evaluation of Eq. (9.242). Some results are shown in Fig. 9.13 for the quantity
v/v0 , where v0 = µ wA (1)
λ0 /(c ∆λD ) is the perturbative parameter (v0 1 for
Eq. (9.242) to hold).
Case ii) Given an arbitrary stellar atmosphere, let us consider the radiation flowing
along a given outward direction, and let us suppose that the magnetic field vector
at each point along the ray path is parallel to the propagation direction (θ = 0◦ )
or antiparallel (θ = 180◦). The results obtained in Sect. 9.10 and summarized by
Eqs. (9.124) and (9.125) allow the net circular polarization parameter to be written
in the form (see Eqs. (9.235), (9.210) and (9.219))
Wb − Wr
v = σ , (9.245)
Wb + Wr
where Wb and Wr are the equivalent widths of two ‘fictitious’ lines formed in the
same atmosphere, and having the same κL as the real line, and absorption profiles
ηb and ηr , respectively; σ is a sign factor given by
+1 for θ = 0◦
σ= (9.246)
−1 for θ = 180◦ .
1
ηb = √ H(v − vA + g vB , a)
π
1
ηr = √ H(v − vA − g vB , a) ,
π
which show that ηb and ηr are identical profiles centered, respectively, at the wave-
lengths λb and λr given by (see Eqs. (9.26))
λb = λ0 + ∆λA − g ∆λB
λr = λ0 + ∆λA + g ∆λB ,
1
Note that the two conditions in Eqs. (9.138) are satisfied since, by assumption, the magnetic
field is constant and large and the velocity gradient is small.
480 CHAPTER 9
Fig.9.13. The net circular polarization computed from Eq.(9.242) is plotted versus different
parameters. Panels (a), (b) and (c) refer to the normal Zeeman triplet 1 S0 − 1 P1 . Panel (d)
refers to three different transitions having ḡ = 1 : 1 S0 − 1 P1 (Ḡ = 1, full line), 3 P1 − 3 D1
(Ḡ = 0.250, short dash), 3 D2 − 3 F3 (Ḡ = 0.997, long dash).
where ∆λA = λ0 wA /c .
Let us now suppose that both the magnetic field intensity and the line-of-sight
component of the macroscopic velocity of the medium vary with optical depth. It
follows that the wavelengths λb and λr are themselves functions of optical depth,
LINE FORMATION IN A MAGNETIC FIELD 481
η η
τ λ λ λ λ λ
Fig.9.14. The combined effect of a velocity gradient and a magnetic field gradient on the absorp-
tion profiles ηb and ηr . For an ‘expanding’ atmosphere (∆λ(1)A
> 0), at each τc the profiles would
be centered at the wavelengths λb and λr (dashed lines) if the magnetic field were constant. If,
on the contrary, the intensity of the magnetic field increases with τc (∆λ(1) B
> 0), the profiles
are centered at the wavelengths λb and λr (solid lines). The ηr profiles at different τc are less
overlapped than the ηb profiles, thus Wr > Wb .
and this affects the equivalent widths of the two ‘fictitious’ lines. The effect will in
general be different for Wb and Wr , which eventually results in a net amount of
circular polarization.
General properties of v cannot be established without a knowledge of the depen-
dence of ∆λA and ∆λB on optical depth. Here we restrict attention to the simplest
case of linear variations1
∆λA (τc ) = ∆λ(0) (1)
A + ∆λA τc
⎧
⎪ Wr > Wb if | ∆λ(1)
A + g ∆λB | > | ∆λA − g ∆λB |
(1) (1) (1)
⎪
⎨
Wr = Wb if | ∆λ(1)
A + g ∆λB | = | ∆λA − g ∆λB |
(1) (1) (1)
(9.249)
⎪
⎪
⎩
Wr < Wb if | ∆λ(1)
A + g ∆λB | < | ∆λA − g ∆λB | .
(1) (1) (1)
(1) (1)
have the same sign; (c) Wr < Wb if ∆λA and g ∆λB have opposite signs.
Equations (9.249) can be rewritten in the form1
sign Wr − Wb = sign g ∆λ(1) (1)
A ∆λB ,
or, in terms of v
sign (v ) = sign − σ g ∆λ(1) (1)
A ∆λB , (9.250)
much smaller than the Doppler width ∆λD ), so that the transfer equation can
be solved by a perturbative expansion. By further assuming a Milne-Eddington
model atmosphere, one obtains a closed expression for the equivalent width. These
calculations are carried out in App. 10 and show that, up to the second order, the
equivalent widths Wr and Wb can be written in the form (see Eqs. (A10.8), (9.248)
and (A10.7))
(1) 2
β ∆λA ± g ∆λ(1)
Wr,b = ∆λD I1 (κL , a) + B
I6 (κL , a) ,
1+β ∆λD
where the plus and minus signs refer to the red and blue component respectively,
I1 (κL , a) is given by Eqs. (9.222) and I6 (κL , a) by Eq. (A10.9). Substitution into
Eq. (9.245) yields, to the lowest order
∆λ(1)
A ∆λB
(1)
I6 (κL , a)
v = −2 σg ,
∆λ2D I1 (κL , a)
which confirms Eq. (9.250) since both I1 (κL , a) and I6 (κL , a) are positive. Note
that this expression is valid irrespective of the magnetic field strength: only the
gradients (∆λ(1) (1)
A and ∆λB ) are required to be small. A similar expression has
been given by Pavlov and Shibanov (1978).
Case iii) Let us consider the following discontinuous model atmosphere. A thin
slab with a vertical velocity field is superposed on an arbitrary, static atmosphere;
1
A similar expression has been derived by Solanki and Pahlke (1988).
LINE FORMATION IN A MAGNETIC FIELD 483
both the slab and the underlying atmosphere are magnetized, the magnetic field
vectors being in general different.1 We restrict attention to the radiation travelling
in the vertical, outward direction, and denote by ts the continuum optical thickness
(s)
of the slab (ts 1), by wA the line-of-sight component of the velocity field (de-
≡ (B , θ , χ )
fined, as usual, according to the astrophysical convention), and by B s s s s
and B ≡ (B, θ, χ), respectively, the magnetic field vectors in the slab and in the
underlying atmosphere.
The transfer of radiation across the slab is described by Eq. (9.90), where the
propagation matrix C (s) and the emission vector j (s) can be considered constant
because ts 1. The emerging Stokes parameters can be easily calculated by
integration of Eq. (9.90)
I(0) = 1 − ts C (s) I (b) + ts j (s) , (9.251)
where I (b) is the boundary Stokes vector resulting from the lower atmosphere.
As far as the V Stokes parameter is concerned, the first and third term in
Eq. (9.251) give rise to antisymmetrical profiles that make no contribution to the
line-integrated circular polarization, which reduces to
∞
3
(s) (b)
V = − ts C3i Ii dλ .
−∞ i=0
To the lowest order in ts , the net circular polarization parameter is therefore (see
Eqs. (9.235), (9.219), (9.91) and (9.39))
∞
(s) (s) (s) (s)
hV I (b) + rU Q(b) − rQ U (b) + hI V (b) dλ
(s) −∞
v = − ts κ L ∞ , (9.252)
Ic(b) − I (b) dλ
−∞
(b)
where Ic is the continuum intensity emerging from the underlying atmosphere.
Note that the Stokes profiles I (b) , Q(b) , U (b) , V (b) are centered at the rest wave-
(s) (s) (s) (s)
length λ0 , while the profiles hV , rU , rQ , hI are centered at the wavelength
(s)
(s) wA
λ0 + ∆λA = λ0 1 + .
c
(s)
The symmetry properties (9.47) and (9.54) show that v is zero both when ∆λA is
(s) (s)
zero and when it is so large that the profiles I (b) , . . . , V (b) and hV , . . . , hI do
1
This model was proposed by Illing et al. (1975) to explain the broad-band circular polarization
observed in sunspots, and it is often referred to as the Illing (or Illing, Landman and Mickey)
model. Some quantitative estimates of the net circular polarization produced by the model have
been given by Auer and Heasley (1978).
484 CHAPTER 9
(s) ∞
t κ (s) (s) (s) (s)
v = s L hV hI + rU hQ − rQ hU + hI hV dλ
∆λD
−∞
where the quantities hI , . . . , hV and the Doppler width ∆λD refer to the lower
atmosphere. Use of Eqs. (9.32) yields, with evident notations
(s)
1 ts κ L
v =
4 ∆λD
∞
1 (s)
× cos θs (1 + cos2 θ) + cos θ (1 + cos2 θs ) ηr(s) ηr − ηb ηb dλ
2
−∞
∞ ∞
(s)
+ cos θs sin θ 2
ηr(s) − ηb ηp dλ + cos θ sin θs ηp(s) ηr − ηb dλ
2
−∞ −∞
∞
1 (s)
+ cos θs (1 + cos2 θ) − cos θ (1 + cos2 θs ) ηr(s) ηb − ηb ηr dλ
2
−∞
∞ (s) (s) !
ρb + ρr ηb + ηr
− sin θs sin θ sin 2(χ − χs )
2 2
ρ(s)
p − ηp − dλ . (9.253)
2 2
−∞
Two general properties of v follow from this expression. The first is that magneto-
optical effects are important only if the azimuth angles of the magnetic field in the
slab and in the underlying atmosphere are different. The second, which can be
easily deduced with the help of Eqs. (9.46), is that v is zero if the magnetic field
gradient is zero (B =B ), even in the presence of a velocity gradient (w(s) = 0).
s A
This could seem inconsistent with the results of Case i) considered above, but is
really quite logical because saturation effects are neglected in the present model:
the slab is by assumption optically thin and the Seares formulae describe a weak,
unsaturated spectral line.
In the following we analyze separately the effects of the velocity gradient com-
bined with a discontinuity at the boundary layer of: (a) the modulus of the
LINE FORMATION IN A MAGNETIC FIELD 485
magnetic field (∆B-effect ); (b) the inclination angle (∆θ-effect ); (c) the azimuth
angle (∆χ-effect ).
In case (a), being θs = θ and χs = χ , only the first two lines of Eq. (9.253) are
non-zero. Thus we obtain
(s) ∞
1 ts κ L (s)
v = cos θ 2
(1 + cos θ) ηr(s) ηr − ηb ηb dλ
4 ∆λD
−∞
∞ !
(s)
+ sin2 θ ηr(s) ηp + ηp(s) ηr − ηb ηp − ηp(s) ηb dλ . (9.254)
−∞
For simplicity, we restrict attention to Zeeman triplets and we assume that the
Doppler width and the damping constant in the slab and in the lower atmosphere
are the same. The integrals in Eq. (9.254) can be performed analytically via
(s)
Eqs. (5.64). Using Eqs. (9.29) and (9.27), and denoting by ∆λB and ∆λB the
Zeeman splittings in the slab and in the atmosphere, respectively, we get
1 (s)
v = √ ts κL cos θ
4 2π
&
× (1 + cos2 θ) H(v̂rr , â) − H(v̂bb , â)
'
+ sin2 θ H(v̂rp , â) + H(v̂pr , â) − H(v̂bp , â) − H(v̂pb , â) , (9.255)
1 (s) (s)
v̂rr = √ ∆λA + g ∆λB − ∆λB
2 ∆λD
1 (s) (s)
v̂bb =√ ∆λA − g ∆λB − ∆λB
2 ∆λD
1 (s) (s)
v̂rp = √ ∆λA + g ∆λB
2 ∆λD
1 (s) (s)
v̂bp = √ ∆λA − g ∆λB
2 ∆λD
1 (s)
v̂pr = √ ∆λA − g ∆λB
2 ∆λD
1 (s)
v̂pb = √ ∆λA + g ∆λB
2 ∆λD
1 (s) (s)
v̂rb =√ ∆λA + g ∆λB + ∆λB
2 ∆λD
486 CHAPTER 9
1 (s) (s)
v̂br = √ ∆λA − g ∆λB + ∆λB
2 ∆λD
1 (s)
v̂pp = √ ∆λA
2 ∆λD
√
â = 2 a .
If we separate the contributions of the first and second line of Eq. (9.255) by
writing
v = (1 + cos2 θ) v(1) + sin2 θ v(2) , (9.256)
the sign of v(1) can be easily deduced. Since H(v, a) is a monotonically decreasing
function of |v| , we have
(s) (s)
sign v(1) = sign cos θ g ∆λA ∆λB − ∆λB , (9.257)
which agrees with the result obtained previously for continuous gradients and
vertical magnetic field (cf. Eq. (9.250)).1 For slightly inclined magnetic field
(sin2 θ 1), Eq. (9.257) gives the sign of the net circular polarization. By contrast,
(s)
no general rule can be established for the sign of v(2) . When either ∆λB = 0 (non-
magnetic slab superposed on a magnetic atmosphere) or ∆λB = 0 (magnetic slab
superposed on a non-magnetic atmosphere), it can be easily shown that v(2) = v(1) .
However, there are cases where the signs of v(2) and v(1) are opposite.
Let us now consider the ∆θ-effect. Being now Bs = B and χs = χ , only the
second and third line of Eq. (9.253) are non-zero. Under the same assumptions as
before (Zeeman triplet, same Doppler width and damping constant in the slab and
in the atmosphere), one obtains
1 (s)
v = √ ts κ L
4 2π
&
× cos θs sin2 θ − cos θ sin2 θs H(v̂rp , â) − H(v̂bp , â)
1 '
+ cos θs (1 + cos2 θ) − cos θ (1 + cos2 θs ) H(v̂rb , â) − H(v̂br , â) .
2
It can be easily shown that both the factors containing θs and θ are positive when
θ > θs and negative when θ < θs , and that both the factors containing the Voigt
(s) (s)
functions are positive when g ∆λA < 0 and negative when g ∆λA > 0. Therefore,
we have the sign rule
(s)
sign (v ) = sign − g ∆λA θ − θs . (9.258)
1
Note that positive values of ∆λ(1) and ∆λ(1)
(s)
B A
in Eq. (9.250) are equivalent to ∆λB > ∆λB
(s)
and ∆λA < 0 in Eq. (9.257).
LINE FORMATION IN A MAGNETIC FIELD 487
Fig.9.15. Contour plot of the function in square brackets of Eq.(9.259) in the plane (vA , vB ) =
(s)
(∆λA /∆λD , ∆λB /∆λD ). The plus and minus signs denote the regions where the function is
positive or negative. The value of the damping constant is a = 0 (the contour plot depends very
slightly on a).
It is interesting to compare this relation with Eq. (9.257). It appears that a mag-
netic field of constant orientation whose modulus increases with optical depth, and
a magnetic field of constant modulus whose longitudinal component increases (in
absolute value) with optical depth, produce the same effect on the net circular
polarization parameter. This result has been deduced by Solanki and Montavon
(1993) with numerical computations. Sánchez Almeida and Lites (1992), who pro-
posed for the θ-gradient effect the name ∆θ-mechanism, pointed out that it is more
effective than the ∆B-effect when the magnetic field is highly inclined, while the
opposite holds when the field is slightly inclined.
Finally, we consider the ∆χ-effect.1 In this case, being Bs = B and θs = θ, only
the fourth line of Eq. (9.253) is non-zero. Under the same assumptions as before,
and using Eqs. (5.64) to evaluate the convolutions of Voigt functions and dispersion
profiles, we obtain the expression
1 (s)
v = √ ts κL sin4 θ sin 2(χ − χs )
4 2π
3 1 1
× L(v̂pp , â) − L(v̂rp , â) − L(v̂bp , â) + L(v̂rb , â) + L(v̂br , â) . (9.259)
2 4 4
It is important to remark that the presence of the factor sin4 θ in this equation
makes the ∆χ-effect particularly sensitive to transverse magnetic fields.
A numerical analysis of the term in square brackets of Eq. (9.259) is presented
in Fig. 9.15, which shows that no simple rule can be established for the sign of this
1
The possibility that this mechanism might be effective in sunspots was first suggested by
Makita (1986).
488 CHAPTER 9
(s)
term. However, if we exclude values of |∆λA | larger than approximately 1.5 ∆λD ,
we get the rule
(s)
sign (v ) = sign ∆λA χ − χs . (9.260)
Contrary to Eqs. (9.257) and (9.258), the sign rule (9.260) is independent of the
Landé factor. This property enables in principle the ∆χ-effect to be disentangled
from the two previous effects by using a couple of spectral lines having a positive
and a negative Landé factor respectively. According to Eq. (9.260), the sign of v
depends on the sign of the velocity gradient and on the twisting direction of the
magnetic field. Thus we have the following rule for the ∆χ-effect: an expanding
(s)
atmosphere (∆λA < 0) and a magnetic field winding as the thread of a right-
handed screw (χ < χs ) give rise to positive net circular polarization.
The results presented in this section show that the phenomenon of net circular
polarization in spectral lines is extremely complex. Furthermore, it should be
stressed that the net circular polarization discussed above is due to velocity and/or
magnetic field gradients only, while blends and atomic orientation have not been
considered.
Blends between independent lines, or between different components of a multi-
plet, can indeed produce net circular polarization (cf. Figs. 9.9 and 9.18). Further
remarks on this topic are presented in App. 11. The effect of atomic orientation
will be discussed in Sect. 10.5. We just point out that atomic orientation can
be set up both in isolated atomic levels (through a mechanism which involves
the simultaneous presence of magnetic fields and velocity gradients; see Landi
Degl’Innocenti, 1985b) and in fine-structured or hyperfine-structured atomic levels
(through a mechanism of ‘alignment-to-orientation conversion’ due to anisotropic
illumination; see Sect. 10.20). Net circular polarization in spectral lines can also
be produced by velocity-magnetic field correlations in stochastic media. This phe-
nomenon will be discussed in Sect. 9.24.
Fig.9.16. The Stokes parameters profiles, as given by the Unno-Rachkovsky solutions (9.110),
are plotted against reduced wavelength for the normal Zeeman triplet 1 S0 − 1 P1 . The profiles
are normalized to B0 and the relevant parameters have the same values as in the ‘standard case’
(see Fig.9.5) except for vB = 2. Full line: with magneto-optical effects; dashed line: without
magneto-optical effects (U is identically zero).
reduce – for χ = const. – to a straight line through the origin, partially covered
twice, having angular coefficient tan 2χ. These diagrams, introduced by Kawakami
(1983), clearly show that the importance of magneto-optical effects decreases in
the wings.
iii) Finally, magneto-optical effects have also a (small) influence on the intensity
profile. By scrambling the polarization among Q, U , and V , they reduce the
importance of saturation, which eventually leads to a deeper profile and to a slight
increase of the equivalent width.
The influence of magneto-optical effects is even more pronounced on the net (line-
integrated) polarization. As evident from our thorough discussion in Sect. 9.20, net
linear polarization is dramatically affected by magneto-optical effects. Referring
for instance to Fig. 9.12, all the representative points would lie on the q̃-axis if
magneto-optical effects were neglected. Similar conclusions can be drawn for the
net circular polarization. In the absence of magneto-optical effects, the net circular
LINE FORMATION IN A MAGNETIC FIELD 491
Fig.9.17. Polarization diagram ( U (λ) vs. Q(λ) ) for the normal Zeeman triplet 1 S0 − 1 P1 . The
Stokes parameters are computed from the Unno-Rachkovsky solutions (9.110) with the ‘standard’
values of the relevant parameters (see Fig.9.5). The dots indicate distance from line center in
Doppler width units.
polarization obtained for small velocity gradient and weak magnetic field would
have the opposite sign (see the comments following Eq. (9.244)), and the ∆χ-effect
discussed at the end of Sect. 9.21 would simply disappear.
The transfer equation that we have considered throughout this chapter refers to
an isolated spectral line originating in the transition between two atomic levels
whose magnetic splitting is described by the Zeeman effect regime. For a line
multiplet composed of different fine-structure or hyperfine-structure components,
the equation must be modified. The modifications do not concern, of course, the
form of the transfer equation – which is still given by Eq. (6.85) – but only its
coefficients: in other words, we have to find the new expressions for the elements
of the propagation matrix and of the emission vector.
Let us consider the multi-term model atom discussed in Sect. 7.5 embedded
in an arbitrary magnetic field. The radiative transfer coefficients, expressed in
the energy-eigenvector representation of the atomic density operator, are given by
Eqs. (7.35). Similarly to Sect. 9.1, we assume that only two atomic terms contribute
to absorption and emission at frequency ν (the analogue of the isolated spectral
line hypothesis), and that no atomic polarization is present in either term. The
former assumption implies that the summation over the terms in Eqs. (7.35) is
limited to one lower term (β L S) and one upper term (βu Lu S); the latter implies
492 CHAPTER 9
the substitutions
N
N (2L + 1) ρβ L S (j M , j M ) → δ δ
2S + 1 j j M M
Nu
N (2Lu + 1) ρβ L S (ju Mu , ju Mu ) → δ δ ,
u u 2S + 1 ju ju Mu Mu
where N and Nu are the overall number of atoms per unit volume in the lower and
upper term, respectively.
Using these assumptions, and recalling Eq. (5.148), Eq. (7.35a) reduces to the
following
= hν N
ηiA (ν, Ω) B(β L S → βu Lu S)
4π 2S + 1
j j
× 3 CJ (β L S, M ) CJ (β L S, M )
j J J ju Ju Ju M Mu q
j j
× CJu (βu Lu S, Mu ) CJu (βu Lu S, Mu ) (2J + 1)(2J + 1)(2Ju + 1)(2Ju + 1)
u u
! !
Ju J 1 Ju J 1 Lu L 1 Lu L 1
×
−Mu M −q −Mu M −q J Ju S J Ju S
φ(ν
× Tqq (i, Ω) − ν) ,
β L Sju Mu , β L Sj M
u u
which can be cast into a simpler form by introducing the normalized strengths of
the multiplet components defined in Eq. (3.64). Taking into account the symmetry
properties of the 3-j and 6-j coefficients, we have
= hν
ηiA (ν, Ω) N B(β L S → βu Lu S)
4π
j M ,j M
× φ(ν
Sq u u Tqq (i, Ω) − ν) .
β L Sju Mu , β L Sj M
u u
j ju M Mu q
Comparison with Eq. (9.2) – see also Eq. (3.16) – shows a remarkable similarity.
Since the same similarities are also found for the other radiative transfer coefficients,
we conclude that the transfer equation for polarized radiation in a fine-structure
multiplet can be obtained from the corresponding equation for an isolated spectral
line by performing the following formal substitutions
i) for the absorption coefficient:
kL , the frequency-integrated line absorption coefficient defined in Eqs. (9.14) and
(9.5), must be replaced by kM , the frequency-integrated multiplet absorption co-
efficient defined by
hν hν
A
kM = kM − kM
S
= N B(β L S → βu Lu S) − N B(βu Lu S → β L S)
4π 4π u
hν Nu ω̃
= N B(β L S → βu Lu S) 1 − ,
4π N ω̃u
LINE FORMATION IN A MAGNETIC FIELD 493
where ω̃ = (2L + 1)(2S + 1) is the degeneracy of the lower term and ω̃u =
(2Lu + 1)(2S + 1) the degeneracy of the upper term.
ii) for the source function:
SL , the line source function defined in Eqs. (9.14), must be replaced by SM , the
multiplet source function defined by
−1
2hν03 N ω̃u
SM = −1 ,
c2 Nu ω̃
N
N (2J + 1) ρα J I (i f , i f ) → δ δ
2I + 1 i i f f
Nu
N (2Ju + 1) ρα J I (iu fu , iu fu ) → δ δ , (9.261)
u u 2I + 1 iu iu fu fu
where N and Nu are the overall number densities of atoms in the lower and upper
level, respectively, and where the density-matrix elements are defined in Eq. (7.63).
Using Eqs. (9.261), one easily finds that the transfer equation for polarized radi-
ation in a hyperfine-structure multiplet can be obtained from the corresponding
494 CHAPTER 9
Fig.9.18. The Stokes parameters profiles of the line λ 5420.362 of MnI are plotted against wave-
length distance from line center in m Å (solid line). They are computed according to Eqs.(9.110)
and are normalized to B0 . The relevant parameters are the following: ∆λD = 40 mÅ, B = 3000 G
(whence vB = 1.029), θ = 60◦ , χ = 0◦ , κL = 7, a = 0.05, βµ = 5. The ‘equivalent Zee-
man pattern’ is computed using the values of the hyperfine-structure constants given by Brix and
Kopfermann (1952). The dashed line refers to the Stokes parameters profiles computed neglecting
hyperfine structure.
Figure 9.18 shows the Stokes parameters profiles, as deduced from the Unno-
Rachkovsky solution, for a typical hyperfine-structure multiplet. Note the charac-
teristic broadening of the profiles and the ‘asymmetries’ introduced by hyperfine
structure.
In this chapter we have treated several aspects of the problem of line formation in a
magnetic field. In all cases, it was understood that the physical parameters affect-
ing the transfer of radiation through the ambient medium were deterministic: in
other words, they were assumed to take well-defined values in any volume element
496 CHAPTER 9
of the medium. Here we assume, on the contrary, that the medium is stochastic.
By this we mean that the physical parameters are random variables having, locally,
a certain probability distribution (with possible correlations between different pa-
rameters) and, non-locally, a typical autocorrelation length.
The idea of a non-deterministic ambient medium is indeed realistic. For instance,
the analysis of spectral lines in stellar atmospheres has shown for a long time
the existence of turbulent velocity fields: this suggests quite naturally that the
magnetic field (and other physical variables) may also be present in a turbulent
form. Furthermore, one can easily imagine the existence of correlation effects in
this turbulent medium. As an example, one might consider the possibility that
the magnetic field is larger-than-average when the turbulent velocity is directed
outward and smaller-than-average when it is directed inward (or vice versa).
Writing the transfer equation in its general form (9.163)
dI
= A(τr ) I − b(τr ) , (9.264)
dτr
one is faced with the problem of solving a linear stochastic equation, that is a linear
equation with random coefficients.1 The general problem is extremely complex;
however, it is possible to obtain a quite simple analytical solution under a set of
restrictive assumptions. Following Auvergne et al. (1973), who solved the simpler
problem of the transfer of unpolarized radiation in a turbulent velocity field, we
make the following hypotheses:
a) We assume the line is formed in LTE, with a Planck function linear in τr . This
allows us to write the emission coefficient b(τr ), defined in Eqs. (9.164), in the
form2
b(τr ) = BP (τr ) A(τr ) U = B0 (1 + β τr ) A(τr ) U . (9.265)
b) We assume the physical parameters {ζi } (with i = 1, . . . , p), on which the prop-
agation matrix A(τr ) depends – see the discussion at the beginning of Sect. 9.16 –
to have a stochastic distribution described by the function f (ζ1 , ζ2 , . . . , ζp ). The
probability dP of finding the first parameter in the interval (ζ1 , ζ1 +dζ1 ), the second
in the interval (ζ2 , ζ2 + dζ2 ), etc., is given by
with
dζ1 dζ2 · · · dζp f (ζ1 , ζ2 , . . . , ζp ) = 1 .
1
The optical depth τr in Eq. (9.264) should be considered a deterministic variable.
For a stochastic medium where the continuum absorption coefficient at the reference frequency,
kc (νr ), is also a stochastic variable, the definition of τr given in Eqs. (9.33) must be replaced by
dτr = − kc (νr ) ds, where the meaning of the symbol · · · is explained below (see Eq. (9.271)).
2
The slope of the Planck function is here denoted by β to distinguish it from the
quantity β defined in Eq. (9.105). For a plane-parallel atmosphere with a constant ratio of the
absorption coefficients kc (νr ) and kc (ν0 ), the two quantities are related by the simple equation
β = βµ kc (ν0 ) / kc (νr ).
LINE FORMATION IN A MAGNETIC FIELD 497
c) We assume that each given ray path crosses a sequence of independent ‘eddies’.
The n-th eddy covers the optical depth interval (τr = τn−1 , τr = τn ), with τ0 = 0.
In each interval the physical parameters are constant and their values are drawn
at random from the probability distribution f (ζ1 , ζ2 , . . . , ζp ). Accordingly, the
propagation matrix in the n-th interval is constant and will be denoted by An .
Obviously there is no correlation between the matrix Ai in the i-th interval and
the matrix Aj in the j-th interval.
d) We finally assume the ‘jumping points’ τ1 , τ2 , . . . , τn , . . . to be distributed
according to a Poisson law characterized by the frequency 1/τe or, in other words,
by the mean length of the eddies τe . The probability dQ that the first jumping
point is found in the interval (τ1 , τ1 + dτ1 ), the second in the interval (τ2 , τ2 + dτ2 ),
etc., is given by
τ1 − τ0 τ2 − τ1 τn − τn−1
− − − dτ1 dτ2 dτ
dQ = e τe
e τe
··· e τe
··· ··· n ··· , (9.267)
τe τe τe
with
∞ ∞ ∞ τ −τ τ −τ τ −τ
dτ1 dτ2 dτn − 1τ 0 − 2τ 1 − n τ n−1
dQ = ··· ··· e e e e ··· e e ··· = 1 .
τe τe τe
τ0 τ1 τn−1
∞
I(0) = O(0, τr ) A(τr ) BP (τr ) U dτr . (9.268)
0
Using the analogues of Eqs. (9.97)-(9.94) for the evolution operator O(τr , τr ) and
integrating by parts, we get
∞
dBP (τr )
I(0) = BP (0) 1 + O(0, τr ) dτr U,
dτr
0
∞
I(0) = B0 1 + β O(0, τr ) dτr U. (9.269)
0
498 CHAPTER 9
∞
τn
−(τ1 − τ0 ) A1 −(τ2 − τ1 ) A2 −(τn−1 − τn−2 ) An−1 −(τr − τn−1 ) An
= e e ··· e e dτr .
n=1 τn−1
The integral in the right-hand side can be evaluated via Eq. (9.106). Substitution
into Eq. (9.269) yields
∞
−(τ − τ ) A −(τ − τ ) A −(τ −τ )A
I(0) = B0 1 + β e 1 0 1 e 2 1 2 · · · e n−1 n−2 n−1
n=1
!
−(τn − τn−1 ) An
× 1 − e A−1
n U. (9.270)
We now multiply both sides by the probability dQ defined in Eq. (9.267) and inte-
grate over all possible realizations of the jumping points. Using again Eq. (9.106),
we obtain
∞
I(0) dQ = B0 1 + β 1 + τe A1 )−1 (1
(1 1 + τe A2 )−1 · · · (1
1 + τe An−1 )−1
n=1
!
1 + τe An )−1 A−1
× 1 − (1 n U.
Finally, we average over the distribution of the physical parameters {ζi }. Let
us denote by g the mean value of any given function g (ζ1 , ζ2 , . . . , ζp ) over the
distribution function f (ζ1 , ζ2 , . . . , ζp ) defined in Eq. (9.266),
g = dζ1 dζ2 · · · dζp g (ζ1 , ζ2 , . . . , ζp ) f (ζ1 , ζ2 , . . . , ζp ) . (9.271)
Since by assumption c) the values of the physical parameters in different eddies are
uncorrelated, we obtain for the statistical average of the emerging Stokes parame-
ters1
∞
n−1
I(0) av = B0 1 + β 1 + τe A)−1
(1
n=1
!
× A−1 − (1
1 + τe A)−1 A−1 U. (9.272)
1
Note that the term corresponding to n = 1 in the sum over n is the identity matrix 1 .
LINE FORMATION IN A MAGNETIC FIELD 499
1 − B)−1 (1
(1 1 − B) = 1
yields
∞
1 − B)−1 =
(1 Bk .
k=0
Hence the sum over n in Eq. (9.272) can be written in the form
∞
n−1 −1
1 + τe A)−1
(1 = 1− 1 + τe A)−1
(1
n=1
−1 1 −1
1 + τe A)−1
= 1 − (1 = 1 + τe A)−1
A (1 . (9.273)
τe
On the other hand, we have
−1
A − (11 + τe A)−1 A−1 = A−1 − A−1 (1
1 + τe A)−1
= A−1 1 − (1
1 + τe A)−1 1 + τe A)−1 .
= τe (1
ii) Microturbulent limit: under the limit τe → 0 (microscopic eddies) one gets
−1
I(0) av
= B0 1 + β A U, (9.276)
or, in other words, the Unno-Rachkovsky solution of the deterministic case with
the matrix A replaced by A.
iii) Macroturbulent limit: under the limit τe → ∞ (very large eddies) one obtains
I(0) av
= B0 1 + β A−1 U , (9.277)
1
The connection with the Unno-Rachkovsky solution will be discussed more precisely in the
following.
500 CHAPTER 9
which is again the Unno-Rachkovsky solution with the matrix A−1 replaced by
A−1 .
iv) Finally, for non-magnetic media the matrix A is proportional to the unit matrix,
A = (κr + κR η) 1 = κ 1 , (9.278)
with η defined in Eqs. (9.65). The intensity (the other Stokes parameters are
obviously zero) is given by
8 9<8 9
1 κ
I(0) av = B0 1 + β , (9.279)
1 + τe κ 1 + τe κ
a formula derived by Auvergne et al. (1973) for the case of turbulent velocity fields.
For a plane-parallel atmosphere, we can rewrite Eq. (9.274) in a form more similar
to the Unno-Rachkovsky solution (9.109) by identifying the reference frequency νr
with the line frequency ν0 . Let us denote by te the mean length of the eddies
measured along the vertical to the atmosphere. Bearing in mind footnote 2 on
p. 496, the emerging Stokes profiles for a radiation beam travelling in the µ-direction
are given by
8 , -−1 9−1 8, -−1 9
t t
I(0) av
= B0 1 + βµ C 1 + e C 1 + e C U , (9.280)
µ µ
In these equations, vB and vA are defined by Eqs. (9.26), the Doppler width ∆λD
being due to the thermal velocity only. The quantity ρ is the correlation parameter
1
Note, however, that stochastic variations of the continuum absorption coefficient cannot be
considered if the matrix C has the form of Eq. (9.91). Such variations can be allowed for if C in
Eq. (9.280) is defined to be kc (ν0 )
C = 1 + κL H .
kc (ν0 )
LINE FORMATION IN A MAGNETIC FIELD 501
Fig.9.19. Stokes parameters profiles computed from Eq.(9.280) for the Zeeman triplet 1 S0 − 1 P1
and the bimodal distribution (9.281). The profiles are normalized to B0 . The values of the
(0) (0)
parameters specifying the distribution are: ρ = 0, vB = 1.5, ∆vB = 0.5, vA = 0, ∆vA = 0.5.
◦ ◦
The values of the remaining parameters are: κL = 20, θ = 60 , χ = 0 , a = 0.05, βµ = 5. Full
line: microturbulent limit (te = 0); dashed line: macroturbulent limit (te = 1000).
(−1 < ρ < 1). When ρ = 0 the distribution function reduces to the product of two
(0)
Gaussian distributions, one for vB (centered at vB and having standard deviation
(0)
∆vB ) and the other for vA (centered at vA and having standard deviation ∆vA ).
By simple algebra it can be proved that the distribution (9.281) is normalized to
unity,
∞ ∞
dvB dvA f (vB , vA ) = 1 .
−∞ −∞
Both figures refer to the normal Zeeman triplet 1 S0 − 1 P1 . Figure 9.19 gives the
emerging Stokes profiles computed for ρ = 0 (absence of correlations). Comparison
with panel (a) of Fig. 9.5 clearly shows the broadening of the profiles introduced
by the Gaussian distributions of magnetic field intensity and of turbulent velocity.
The remarkable effects of correlations are shown in Fig. 9.20, computed for ρ = 0.7.
Correlations between magnetic field intensity and line-of-sight velocity are able to
502 CHAPTER 9
where ∆BP satisfies assumptions b), c) and d) above – in other words, is itself a
Kubo-Anderson process.
Let us denote by (∆B1 , ∆B2 , . . . , ∆Bn , . . .) a particular realization of ∆BP in
the intervals (τ0 , τ1 ), (τ1 , τ2 ), . . . , (τn−1 , τn ), . . . . Following the same line of
reasoning, it is easily seen that the additional term ∆BP in Eq. (9.282) makes an
LINE FORMATION IN A MAGNETIC FIELD 503
∞
τn
Ia.c. (0) = O(0, τr ) A(τr ) ∆Bn U dτr .
n=1 τ
n−1
∞
−(τ1 − τ0 ) A1 −(τ2 − τ1 ) A2 −(τn−1 − τn−2 ) An−1
Ia.c. (0) = ∆Bn e e ··· e
n=1
−(τn − τn−1 ) An
× 1 − e U,
which is the additional contribution to Eq. (9.270). Integration over the Poisson
distribution of the jumping points and over the distribution of the physical param-
eters yields
∞
n−1
Ia.c. (0) av
= ∆B (11 + τe A)−1
n=1
n−1
1 + τe A)−1
− (1 1 + τe A)−1 U .
∆B (1
Since
1 + τe A)−1 = τe ∆B A (1
∆B 1 − ∆B (1 1 + τe A)−1 ,
we finally obtain, with the help of Eq. (9.273)
−1
Ia.c. (0) av
1 + τe A)−1
= A (1 1 + τe A)−1 U .
∆B A (1 (9.283)
This formula contains the following special cases (cf. Eqs. (9.275)-(9.279)):
i) Deterministic limit
Ia.c. (0) av
= ∆B U .
which generalizes Eq. (9.274) by allowing for the stochastic behavior of the Planck
function.1
The procedure followed in this section to find an analytical solution to the transfer
equation (9.264) for a Kubo-Anderson process can also be applied to solve the
homogeneous transfer equation
dI
= A(τr ) I . (9.285)
dτr
The calculations are developed in App. 12 and lead to a closed analytical form for
the stochastic average of the evolution operator.
It should be mentioned that the problem of line formation in a stochastic magnetic
field has been tackled, with a completely different approach, by Faulstich (1980).
This approach is a generalization of the method developed by Gail et al. (1980)
for the non-polarized case and is based on the assumption that the intensity of the
magnetic field is constant while its direction changes along the ray path according
to a Markov process. The stochastic average of the emerging Stokes parameters is
I) of finding,
found by solving a Fokker-Planck equation for the probability P (x, Ω,
at point x along the ray path, the magnetic field in the solid angle dΩ about the
and the Stokes parameters in the interval (I, I + dI).
direction Ω
In the former section and in App. 12 we have found that, under the microturbulent
limit, radiative transfer for polarized radiation can be simply described by replac-
ing the propagation matrix A with its stochastic average A (see Eqs. (9.276) and
(A12.8)). This result is valid well beyond the set of assumptions outlined in the
1
Note that a stochastic behavior of the Planck function implies a stochastic behavior of
the temperature, which obviously affects the propagation matrix A. In other words, the symbols
· · · in Eq. (9.284) mean an average over a distribution function f (ζ1 , ζ2 , . . . , ζp ) which includes
temperature among its arguments.
LINE FORMATION IN A MAGNETIC FIELD 505
former section. In fact, it holds whenever the correlation lengths of the stochas-
tic physical parameters are much smaller than the typical scales of the radiative
transfer problem (determined by the eigenvalues of the propagation matrix), irre-
spective of the type of process – Kubo-Anderson, Markov, etc. – characterizing the
stochastic behavior (see e.g. Stenflo, 1971).
Here we want to consider the effect of a microturbulent, isotropic magnetic field
on the propagation matrix, under the assumption that the field modulus B, char-
acterized by a normalized distribution f (B), is the only stochastic variable (except
for microturbulent velocity, assumed uncorrelated with it).1
Since the continuum and line absorption coefficients are not affected by the mag-
netic field, the average propagation matrix reduces to (see Eqs. (9.164))
A = κr 1 + κR H .
On the other hand, the off-diagonal elements of the matrix H vanish because of
the isotropy of the magnetic field.2 From Eqs. (9.50) and (9.32), being
cos θ = sin2 θ cos 2χ = sin2 θ sin 2χ = 0
and
2 2 2 1
sin θ = , cos θ = ,
3 3
we get
1
H = hI 1 = ηb + ηp + ηr 1 , (9.286)
3
where
∞
ηb,p,r = ηb,p,r (B) f (B) dB . (9.287)
0
If we now take for f (B) a Gaussian distribution characterized by the r.m.s. value Bt
(the ‘turbulent’ field),
. 2
2 B 2 − 21 B
Bt
f (B) = e , (9.288)
π Bt3
(t) λ20 e0 Bt
∆λB ≡ ∆λD , (9.289)
4π mc2
1
More complicated magnetic field distributions have been considered by Dolginov and Pavlov
(1972).
2
It follows that the radiation emerging from a semi-infinite atmosphere where a microturbulent,
isotropic magnetic field is present, is unpolarized.
506 CHAPTER 9
with ∆λD the Doppler width. From Eqs. (9.64) and (9.68) we have, up to second-
order terms in ∆λB
1 (2)
q η − Gq ∆λB η +
ηq = G(0) (1)
G ∆λ2B η (q = −1, 0, +1) , (9.290)
2 q
λ20 e0 B
∆λB = ∆λD .
4π mc2
Recalling Eqs. (9.29), and using Eqs. (9.290) and (9.288), the integrals in Eq.
(n)
(9.287) can be performed. Substituting the expressions for Gq given in Table 3.4
we finally obtain
(t) 2
hI = η + Ḡt ∆λB η , (9.291)
where Ḡt , the second-order effective Landé factor for turbulent fields, is given by
1 (2)
Ḡt = G + G(2) 2
1 = ḡ + δ ,
2 0
with1
1 2
δ = g (4s − d2 − 4) .
16 d
It can be easily shown that δ is a non-negative quantity that vanishes only for
Zeeman triplets (normal or anomalous). Consequently, Ḡt is also a non-negative
quantity.
Equation (9.291) can be cast into a more significant form. Let us define a ‘total
Doppler width’ ∆λT as
(t) 2
∆λT = ∆λ2D + 4 Ḡt ∆λB ,
(t)
where ∆λB satisfies the condition (9.289). Using the relation
which follows directly from Eqs. (5.58), it can be easily proved that, up to second-
(t)
order terms in ∆λB , we can write
∆λD 1 λ − λ0 a ∆λD
hI = √ H , ,
∆λT π ∆λT ∆λT
1
Note that δ = (δ + 3 δ ) / 2, where δ and δ are defined in Eqs. (9.78) and (9.227),
respectively.
LINE FORMATION IN A MAGNETIC FIELD 507
where λ0 is defined in Eq. (9.44). We conclude that a weak, turbulent magnetic
field described by the distribution (9.288) produces a broadening of the absorption
profile which is the same as the broadening produced by a velocity field of amplitude
λ0 e0 Bt
weq = Ḡt ,
2π mc
or, numerically
−3 λ0
weq = 1.4 × 10 Ḡt Bt ,
5000
where λ0 is expressed in Å, Bt in G, and weq in km s−1 .
For arbitrary values of Bt , when inequality (9.289) is no longer satisfied, it is
still possible to find an analytical expression for hI . Let us consider the case of
a Zeeman triplet. Bearing in mind Eqs. (9.29), (9.27a) and (9.26), we have from
Eqs. (9.286)-(9.288)
1 λ − λ0
hI = √ H ,a
3 π ∆λD
√ ∞
2 λ − λ0 + g ∆λB λ − λ0 − g ∆λB
+ H , a +H ,a
3π ∆λD ∆λD
0
2
(t) 2
∆λB −1
2 ∆λB / ∆λB
d (∆λB )
× (t)
e (t)
. (9.292)
∆λB ∆λB
Since the integrand is an even function of ∆λB , the integral can be evaluated by
extending it from −∞ to +∞ and dividing by 2. Then we make use of the relation
y
2 1 y 2
−2 y ∂2 y−x
e y0
= H √ 2
, 0 + y0 lim H √ ,0 ,
y0 2 y0 x→0 ∂x2 2 y0
which can be proved with the help of Eqs. (5.55). The evaluation of the integral
in Eq. (9.292) is thus reduced to the evaluation of convolutions of Voigt functions
with different widths, which can be easily done – using the symmetry property
(5.59) – via Eqs. (5.65). The final result is
1 λ − λ0 2 ∆λD λ − λ0 a ∆λD
hI = √ H ,a + √ H ,
3 π ∆λD 3 π ∆λT ∆λT ∆λT
2 ∆λD 2 (t) 2 ∂
2
λ − λ0 a ∆λD
+ √ g ∆λ H , ,
3 π ∆λT B
∂λ2 ∆λT ∆λT
where
(t) 2
∆λT = ∆λ2D + 2 g 2 ∆λB .
This formula could be generalized to more complicated Zeeman patterns, but we
will not develop the relevant calculations.
CHAPTER 10
Let us consider a typical astrophysical plasma (like, for instance, a stellar atmo-
sphere) and let us focus our attention on a particular atom belonging to a given
volume element of the plasma. The atom experiences a continuous series of in-
teractions with the photons propagating inside the plasma, with the surrounding
particles and, if the plasma is magnetized, with the magnetic field. Except for a
few very special cases, the atom will be in a typical non-equilibrium situation: the
distribution of level populations will differ from the Saha-Boltzmann distribution,
and each individual level will have a definite amount of atomic polarization. The
aim of this chapter is to describe the main aspects of these non-equilibrium phe-
nomena, by showing how the most important atomic properties – like the emissivity
and the absorptivity – depend in a critical way on the physical characteristics of
the environment.
Once such physical characteristics are specified, the state of the atom is obtained
by solving the statistical equilibrium equations. In stationary situations, these
equations form a system of linear, algebraic equations whose unknowns are the
atomic density-matrix elements (or those linear combinations of density-matrix
elements representing the multipole moments or spherical statistical tensors). The
number of equations (and of unknowns) depends on the model atom considered,
the coefficients depend both on the model atom and on the physical environment
(characteristics of the incident radiation field, presence or absence of a magnetic
field and/or of collisions). In practice, one is faced with very complicated equations
even for relatively simple model atoms and physical situations. Therefore, although
the calculation of the atomic state under given physical conditions is in principle a
trivial task (it just requires the solution of a system of linear, algebraic equations),
it is in fact extremely difficult to understand the role played by the different factors
in determining such state.
In the general case, there is no way out of this situation: once the model atom
and the physical environment are given, the statistical equilibrium equations have
to be solved numerically. This is however of little help to understand the physics
of the different phenomena. Therefore, rather than illustrating a variety of sample
cases obtained by numerical solutions to the equations, we consider in this chapter
a number of simplified physical situations, by introducing certain approximations
which, in many cases, lead to analytical expressions for the atomic density-matrix
elements.
The basic approximation used throughout this chapter is to consider model atoms
having just two energy states, an upper and a lower state, between which transi-
tions take place. All the three model atoms described in Chap. 7 (multi-level,
multi-term, and multi-level with hyperfine structure) will be studied, but they will
510 CHAPTER 10
We consider a simple atomic model consisting of two levels: a lower and an upper
level characterized by the quantum numbers1 (α , J ) and (αu , Ju ), respectively.
1
The spectroscopic notation is the same as that used throughout the whole book. J is the
NON-EQUILIBRIUM ATOMIC PHYSICS 511
2J + 1 (K)
+ C (αu Ju , α J ) ρK
Q (α J )
2Ju + 1 I
+ TE (α J KQ, αu Ju K Q ) + TS (α J KQ, αu Ju K Q ) ρK
Q (αu Ju )
K Q
− RA (α J KQK Q ) ρK
Q (α J )
K Q
2Ju + 1 (K)
+ C (α J , αu Ju ) ρK
Q (αu Ju )
2J + 1 S
total (orbital + spin) angular momentum quantum number of the electronic cloud, while α is a
collection of inner quantum numbers (see Sect. 3.1).
1
Throughout this chapter the Stokes vector will be denoted by the symbol Ii (with i = 0, 1, 2, 3)
rather than Si used in preceding chapters.
512 CHAPTER 10
All the rates appearing in these equations have been introduced and discussed
in Chap. 7 (see Sect. 7.2 for the radiative rates and Sect. 7.13 for the collisional
rates). Here we just recall that TA , TE , and TS are the radiative transfer rates due
to absorption, spontaneous emission and stimulated emission, respectively, while
RA , RE , and RS are the corresponding relaxation rates. CI and CS are the rates due
to inelastic and superelastic collisions, respectively. Finally, D are the relaxation
rates due to depolarizing elastic collisions.
Equations (10.1) and (10.2) form a system of linear, homogeneous, differential
equations in the unknowns ρK K
Q (α J ) and ρQ (αu Ju ). In stationary situations, the
left-hand sides of both equations are zero and the system reduces to a set of linear,
homogeneous, algebraic equations. The number of equations (and of unknowns) is
given by
Neq = (2J + 1)2 + (2Ju + 1)2 . (10.3)
On the other hand, the determinant of the system is zero,1 so that the multipole
moments can be determined up to a multiplicative factor. This factor can be
found by considering the trace equation, which for the two-level atom reads (see
Eqs. (3.84) and (3.108))
2J + 1 ρ00 (α J ) + 2Ju + 1 ρ00 (αu Ju ) = 1 , (10.4)
N + Nu = N , (10.5)
where
N = N 2J + 1 ρ00 (α J ) , Nu = N 2Ju + 1 ρ00 (αu Ju ) . (10.6)
Once the statistical equilibrium equations are solved, the radiative transfer coeffi-
cients can be found from Eqs. (7.15).
In general, it is impossible to obtain a closed analytical expression for the solution
to Eqs. (10.1)-(10.2). An important exception is provided by the case where the
lower level is unpolarized (or naturally populated) so that its multipole moments
are given by
ρK 0
Q (α J ) = δK0 δQ0 ρ0 (α J ) . (10.7)
This relation is approximately satisfied when the lower-level radiative rates are
much smaller than the lower-level collisional rates, which is often true in the pres-
ence of a very weak (or ‘diluted’) radiation field (see the discussion in Sect. 10.7).
Obviously, Eq. (10.7) is identically satisfied in the special case where the lower level
1
It can easily be shown, with the help of Eqs. (7.14), (2.36a) and (2.49), that the equations
for the time derivatives of
2Ju + 1 ρ00 (αu Ju ) and − 2J + 1 ρ00 (α J )
are identical.
NON-EQUILIBRIUM ATOMIC PHYSICS 513
has J = 0, since in that case ρ00 (α J ) is the only non-zero multipole moment (cf.
Eq. (3.101)).
Consider the statistical equilibrium equations for the upper level (Eq. (10.1))
under the assumption that Eq. (10.7) – the unpolarized lower level hypothesis –
is valid. In stationary situations, and neglecting the rate for stimulated emission,
we immediately get an analytical expression for ρK Q (αu Ju ). Bearing in mind the
expression for RE (Eq. (7.14e)) we obtain
2J +1
TA (αu Ju KQ, α J 0 0) + 2Ju +1 CI(0) (αu Ju , α J ) δK0 δQ0
ρK
Q (αu Ju ) =
2πi νL gα J Q + A(αu Ju → α J ) + CS(0) (α J , αu Ju ) + D(K) (αu Ju )
u u
The ideal case of resonance polarization is obtained by neglecting in Eq. (10.8) both
the effect of the magnetic field and the effect of collisions. Setting in Eq. (10.8)
νL = CI(0) (αu Ju , α J ) = CS(0) (α J , αu Ju ) = D(K) (αu Ju ) = 0 ,
and substituting Eq. (10.9), we have1
B(α J → αu Ju )
ρK
Q (αu Ju ) = 3(2J + 1)
A(αu Ju → α J )
!
1 1 K
× (−1)1+J +Ju +Q K
J−Q (ν0 ) ρ00 (α J ) . (10.10)
Ju Ju J
It is convenient to introduce the symbol wJ(K)J defined by2
u
!
(K) 1+J +Ju 1 1 K
wJ J = (−1) 3(2Ju + 1) , (10.11)
u Ju Ju J
1
Note that Eq. (10.10) is valid in an arbitrary reference system, since the magnetic field is
zero.
2
The symbol was first introduced by Landi Degl’Innocenti (1984).
514 CHAPTER 10
With this new symbol, Eq. (10.10) takes the more compact form
This expression shows that the upper-level multipole moment ρK Q (αu Ju ) is propor-
tional to the corresponding multipole component of the radiation field tensor.1 As
the latter has a limited number of components (K ≤ 2, see Eq. (5.155)), it follows
that only the multipole moments ρK Q (αu Ju ) with K ≤ 2 can be excited by the
radiation field.
Equation (10.13) also points out the meaning of the symbol wJ(K)J . It is a sort
u
of ‘efficiency factor’ characterizing the transfer of the K-th order multipole from
the radiation field to the atomic density matrix in the absorption process from the
lower level (α J ) to the upper level (αu Ju ). The symbol wJ(K)J is defined for any
u
dipole transition (Ju − J = 0, ±1 ; 00) with K ranging from 0 to 2. For K = 0
one obviously has, from Eq. (10.12)
wJ(0)J = 1 . (10.14)
u
Numerical values of wJ(K)J for K = 1 and K = 2 are given in Table 10.1, together
u
with the values of other symbols that will be introduced later on.
Using the expression of the upper-level statistical tensors given in Eq. (10.13),
the emission coefficient can be easily deduced. Since by assumption the magnetic
field is zero, the emission coefficient is given by Eq. (7.16e), which can be rewritten
in the form2
= hν N 2J + 1 A(α J → α J )
εi (ν, Ω) u u u
4π
(K)
× wJ J T Q
K ρK (α J ) φ(ν − ν) ,
(i, Ω) (10.15)
u
Q u u 0
KQ
1
The sign difference in the subscript Q and the factor (−1)Q arise from the fact that JQ K
is a true irreducible tensor (which, under rotation of the reference system, changes according to
Eq. (2.78)), while ρK K ∗
Q is not (it changes according to Eq. (3.98)). The conjugate (ρQ ) – and
not ρKQ itself – is an irreducible tensor. The conjugation property (3.102) explains the above
differences.
2
The right-hand side should indeed be multiplied by the factor ν 3 /ν03 (cf. footnote on p.283).
NON-EQUILIBRIUM ATOMIC PHYSICS 515
TABLE 10.1
3 3
J Ju wJ(1)J W1 (J , Ju ) wJ(2)J W2 (J , Ju ) pQ 3p V 3
u u max max
0 1 −1. 1. 1. 1. 1. 1.
1/2 1/2 −0.816 0.667 0. 0. 0. 1.
1/2 3/2 −0.913 0.833 0.707 0.5 0.429 1.
1 0 0. 0. 0. 0. 0. 0.
1 1 −0.5 0.25 −0.5 0.25 0.2 0.333
1 2 −0.866 0.75 0.592 0.35 0.288 0.957
3/2 1/2 0.408 0.167 0. 0. 0. 0.25
3/2 3/2 −0.365 0.133 −0.566 0.32 0.261 0.172
3/2 5/2 −0.837 0.7 0.529 0.28 0.226 0.921
2 1 0.5 0.25 0.1 0.01 0.008 0.373
2 2 −0.289 0.083 −0.592 0.35 0.288 0.106
2 3 −0.816 0.667 0.490 0.24 0.191 0.893
5/2 3/2 0.548 0.3 0.141 0.02 0.015 0.446
5/2 5/2 −0.239 0.057 −0.605 0.366 0.302 0.072
5/2 7/2 −0.802 0.643 0.463 0.214 0.170 0.871
3 2 0.577 0.333 0.169 0.029 0.022 0.493
3 3 −0.204 0.042 −0.612 0.375 0.310 0.053
3 4 −0.791 0.625 0.443 0.196 0.155 0.854
7/2 5/2 0.598 0.357 0.189 0.036 0.027 0.526
7/2 7/2 −0.178 0.032 −0.617 0.381 0.316 0.04
7/2 9/2 −0.782 0.611 0.428 0.183 0.144 0.840
4 3 0.612 0.375 0.204 0.042 0.032 0.551
4 4 −0.158 0.025 −0.620 0.385 0.320 0.031
4 5 −0.775 0.6 0.416 0.173 0.136 0.828
9/2 7/2 0.624 0.389 0.216 0.047 0.035 0.570
9/2 9/2 −0.142 0.020 −0.623 0.388 0.322 0.025
9/2 11/2 −0.769 0.591 0.407 0.165 0.129 0.819
5 4 0.632 0.4 0.226 0.051 0.039 0.585
5 5 −0.129 0.017 −0.624 0.39 0.324 0.021
5 6 −0.764 0.583 0.399 0.159 0.124 0.811
11/2 9/2 0.640 0.409 0.234 0.055 0.041 0.597
11/2 11/2 −0.118 0.014 −0.626 0.392 0.326 0.018
11/2 13/2 −0.760 0.577 0.392 0.154 0.12 0.804
6 5 0.645 0.417 0.240 0.058 0.044 0.607
6 6 −0.109 0.012 −0.627 0.393 0.327 0.015
6 7 −0.756 0.571 0.387 0.149 0.116 0.798
516 CHAPTER 10
TABLE 10.1
(continued)
3 3
J Ju wJ(1)J W1 (J , Ju ) wJ(2)J W2 (J , Ju ) pQ 3p V 3
u u max max
where Eqs. (10.11) and (7.8) have been used. We recall that N is the number
density of atoms, and that the tensor T Q K is defined in Eq. (5.159) – see
(i, Ω)
Table 5.6 for the explicit expressions of its components. Substitution of Eq. (10.13)
into Eq. (10.15) leads, with the help of Eqs. (10.6), to the expression
= hν N B(α J → α J ) φ(ν − ν)
εi (ν, Ω)
4π u u 0
× WK (J , Ju ) (−1)Q T Q
K J K (ν ) ,
(i, Ω) (10.16)
−Q 0
KQ
Numerical values of WK (J , Ju ) for K = 1 and K = 2 are given in Table 10.1. For
K = 0, we simply have
W0 (J , Ju ) = 1 . (10.18)
Equation (10.16) gives, for the two-level atom under the approximations consid-
ered in this section, the Stokes parameters of the radiation emitted in a scattering
process as a function of the incident radiation field. A still more significant expres-
sion can be obtained by writing the radiation field tensor in terms of the Stokes
parameters of the incoming radiation. Bearing in mind the definition of kLA , the
NON-EQUILIBRIUM ATOMIC PHYSICS 517
Ω
where Pij (Ω, ), the (quantum-mechanical) scattering phase matrix, is given by
Ω
Pij (Ω, ) = WK (J , Ju ) (−1)Q T Q
K
(i, Ω) ) .
T K (j, Ω (10.20)
−Q
KQ
This result, first derived by Hamilton (1947) – see also Chandrasekhar (1950)1 –
generalizes to Quantum Mechanics the expression for the scattering phase matrix
Ω
Rij (Ω, ) that we have obtained by a ‘classical’ derivation (see Sects. 5.7 and
5.10). The relation between Pij (Ω, ) and R (Ω,
Ω ) can easily be established.
Ω
ij
From Eqs. (5.137)-(5.138) we have
(K)
Ω
Rij (Ω, ) ≡ R (Ω, ; 0) =
Ω Rij (Ω, Ω ; 0)
ij
K
= (−1)Q T Q
K
(i, Ω) ) ,
T K (j, Ω (10.21)
−Q
K Q
hence
Ω
Pij (Ω, ) = (K)
WK (J , Ju ) Rij (Ω, Ω ; 0) . (10.22)
K
1
Chandrasekhar introduces the symbols E1 , E2 , and E3 instead of W1 and W2 . The
relationship is the following
E1 = W 2 , E2 = 1 − W 2 , E3 = W 1 .
518 CHAPTER 10
Ω Θ
scattering plane
Fig.10.1. Geometry of a scattering process. The unit vectors ea and ea , which define the positive Q
direction for the incident and the scattered beam, respectively, are perpendicular to the scattering
plane (Ω , Ω).
which generalizes Eq. (5.95). Finally, from Eq. (10.22), using the second expression
in Eq. (5.138) – with H = 0 – and Eq. (2.74), we obtain
Ω ) = ∗
P (i) tP (j) DP P (R) ,
WK (J , Ju ) tK K K
Pij (Ω, (10.23)
KP P
TABLE 10.2
3 3
W1 (J , Ju ) W2 (J , Ju ) pQ 3p V 3
max max
Since W2 values range from 0 to 1, pQ is always positive, which means that the
polarization is perpendicular to the scattering plane. Its maximum value is found
for Θ = 90◦ and is given by
3 W2
pQ = . (10.26)
max 4 − W2
This is an increasing function of W2 . The factor W2 can thus be regarded as an
efficiency factor for the generation of linear polarization from unpolarized radiation
in scattering processes.
The parameter W1 , on the contrary, is only related to circular polarization. In
the geometry of Fig. 10.1, assuming that the incident radiation is totally circu-
larly polarized (say positively), the fractional circular polarization of the scattered
radiation is given by
εV (ν, Ω) P (Ω, Ω ) 6 W1 cos Θ
pV ≡ = 33 = .
εI (ν, Ω)
P00 (Ω, Ω ) 4 − W2 + 3 W2 cos2 Θ
The basic characteristics of the Hanle effect are obtained from Eq. (10.8) by retain-
ing the ‘magnetic’ term (first term in the denominator, which was set to zero in
the preceding section).1 In order to further simplify our treatment, we still neglect
– as in the preceding section – the effects of collisions (this matter is deferred until
Sect. 10.6). Using Eqs. (10.9) and (10.11), we can rewrite Eq. (10.8) in the form
2J + 1 B(α J → αu Ju )
ρK
Q (αu Ju ) =
2Ju + 1 A(αu Ju → α J ) + 2πi νL gα J Q
u u
We recall that the statistical tensors are defined in a reference system having the
z-axis in the magnetic field direction. Comparison with the corresponding expres-
sion obtained for the non-magnetic regime (Eq. (10.13)) shows that the effect of the
magnetic field is to reduce and dephase, via the imaginary term in the denominator,
all the multipole moments with Q = 0. The efficiency of this relaxation process is
controlled by the dimensionless parameter Hu defined by
2πνL gα J
u u
Hu = . (10.28)
A(αu Ju → α J )
1
ρK
Q (αu Ju ) = ρK (α J ) , (10.30)
1 + i QHu Q u u B=0
where [ρKQ (αu Ju )]B=0 are the multipole moments for the non-magnetic case (in
the presence of the same radiation field and defined in the same reference sys-
tem). Equation (10.30) summarizes the ‘essence’ of the Hanle effect. It shows
that coherences with Q = 0 (those connected with the populations of the Zeeman
sublevels, see Eq. (3.101)) are unaffected by the magnetic field, while those with
1
Note that we are still using the unpolarized lower level assumption and we are neglecting,
accordingly, stimulated emission effects.
2
The classical damping constant γ, defined in Eq. (5.28), has its obvious quantum-mechanical
counterpart in the Einstein coefficient A.
NON-EQUILIBRIUM ATOMIC PHYSICS 521
Q = 0 are reduced, with respect to the non-magnetic case, by a factor 1 + Q2 Hu2 .
The magnetic field has the additional effect of introducing a phase factor, given
by arctan(QHu ), in the latter. Such conclusions were anticipated in Sect. 5.13
where an intuitive description of the Hanle effect, based on classical physics, was
presented.
The radiation emitted by the atom along a given direction is obtained by substi-
tution of Eq. (10.27) into Eq. (7.15e) – adapted to the case of our two-level atom.
The resulting expression is quite involved, and will be studied in detail in the next
section. Here we disregard the spectral details of the emitted radiation, and we
restrict attention to the frequency-integrated emission coefficient, defined by
=
ε̃i (Ω) dν ,
εi (ν, Ω)
∆ν
where the interval ∆ν is sufficiently broad to fully cover all the Zeeman components
of the line. Taking into account that all the profiles Φ(να J M , α J M − ν) are
u u u
normalized to unity in frequency (see Eqs. (6.59a-c)), we obtain, along the same
lines leading from Eq. (7.15e) to Eq. (7.16e)
2 4
= h ν N (2J + 1) B(α J → α J )
ε̃i (Ω) u u u
2πc2
√ !
1 1 K ρK (α J ) ,
× 3 (−1)1+J +Ju
TQ
K
(i, Ω) Q u u
Ju Ju J
KQ
hν (K)
=
ε̃i (Ω) N 2Ju + 1 A(αu Ju → α J ) wJ J T Q
K ρK (α J ) .
(i, Ω) (10.31)
Q u u
4π u
KQ
Substitution of Eq. (10.27) into Eq. (10.31) yields, with the help of Eqs. (5.157),
(9.5), (10.6), (10.17) and (10.28)
5
dΩ
3
= kA
ε̃i (Ω) P (Ω, ; B)
Ω ) ,
I (ν , Ω (10.32)
L
4π j=0 ij j 0
Ω
where Pij (Ω, ; B),
the quantum-mechanical scattering phase matrix in the pres-
ence of a magnetic field, is given by1
1
Ω
Pij (Ω, ; B)
= WK (J , Ju ) (−1)Q T Q
K
(i, Ω) )
T K (j, Ω . (10.33)
−Q
1 + i QHu
KQ
1
The expression in Eq. (10.33) was first derived by Landi Degl’Innocenti (1985c).
522 CHAPTER 10
This expression generalizes to Quantum Mechanics the result that we have derived
in Chap. 5 from the classical theory of the electron (Eq. (5.133)). The difference
from the classical result is twofold: the factor WK comes into play, and the classical
parameter H is replaced by Hu (note that both WK and Hu depend on the atomic
transition). As expected, the quantum-mechanical and the classical phase matrix
coincide for the transition (J = 0, Ju = 1) with gα J = 1 (normal Zeeman triplet),
u u
provided the classical damping constant γ is identified with the Einstein coefficient
A(αu Ju → α J ) – we recall that WK (0, 1) = 1 for K = 0, 1, 2 (see Table 10.1).
In general, the quantum-mechanical scattering matrix is formally related to the
multipole components of the classical matrix by the expression
Ω
Pij (Ω, ; B)
= (K)
WK (J , Ju ) Rij (Ω, Ω ; B) , (10.34)
K
(K)
where Rij (Ω, Ω ; B) is defined as in Eq. (5.138) with H replaced by Hu .
Several properties proved in Chap. 5 for the classical scattering phase matrix
are valid for the quantum-mechanical matrix as well. It can easily be shown that
Ω
Pij (Ω, ; B)
is real and satisfies the relation1
Ω
Pij (Ω, ; −B)
= P (Ω , Ω;
B)
, (10.35)
ji
analogous to Eq. (5.94). It can also be shown, using Eq. (10.34), that the value of
the Van Vleck angle (Eq. (5.100)) is independent of the atomic transition – except
of course for Ju = 0 or Ju = 1/2 since in those cases the linear polarization of
the scattered radiation is zero. This property is basically due to the fact that the
values of W2 are limited between 0 and 1. The other properties proved in Sect. 5.12
for the classical matrix are easily extended to the quantum-mechanical matrix. In
particular, we can consider the average of the scattering matrix over an isotropic
distribution of magnetic fields, defined by
2π π
Ω
Pij (Ω, = 1
; B) Ω
dχB sin θB Pij (Ω, ; B)
dθ ,
B
4π
0 0
where θB and χB are the polar and azimuth angle, respectively, of the magnetic
field vector in a given reference frame (see Fig. 5.9). Along the same lines of
Sect. 5.12 we obtain
Ω
Pij (Ω, ; B)
= (K)
WK (J , Ju ) µK Rij (Ω, Ω ; 0) , (10.36)
K
(K)
where the multipole components Rij (Ω, Ω ; 0) are defined in Eq. (10.21) and the
quantities µK are given by Eq. (5.172) with the substitution H → Hu . Equation
(10.36) is the quantum analogue of Eq. (5.174); as expected, the two expressions
1
Equation (10.35) can be proved with the help of Eqs. (5.137), (5.138), (5.165) and (5.169).
NON-EQUILIBRIUM ATOMIC PHYSICS 523
Fig.10.2. Polarization (or Hanle) diagram for the transition (J = 1, Ju = 2), relative to the scat-
tering process illustrated in Fig.5.11. Full lines correspond to β = const., dashed lines to constant
magnetic field strength, parameterized through the angle α2 (see Eqs.(10.28) and (10.38)). Note
the scale difference from the classical Hanle diagram of Fig.5.12.
coincide for the transition (J = 0, Ju = 1) with gα J = 1 (see Table 10.1 and
u u
footnote 2 on p. 520).
In order to illustrate the Hanle effect in a specific case, let us consider the scat-
tering geometry of Fig. 5.11. The polarization of the scattered radiation is given
by Eq. (10.32) with the phase matrix of Eq. (10.34). The multipole components
(K)
Rij (Ω, Ω ; B) are given by Eqs. (5.139).1 The resulting fractional (frequency-
integrated) linear polarization is found to be
ε̃Q (Ω) 3 W2 sin2 β + (1 + cos2 β) cos2 α2
p̃Q ≡ =
ε̃ (Ω)
I
8 + W2 (1 − 3 cos2 β − 3 sin2 β cos2 α2 )
ε̃U (Ω) 6 W2 cos β sin α2 cos α2
p̃U ≡ = , (10.37)
ε̃I (Ω) 8 + W2 (1 − 3 cos2 β − 3 sin2 β cos2 α2 )
where
tan α2 = 2Hu . (10.38)
Figure 10.2 shows the polarization diagram predicted by Eqs. (10.37) for the tran-
sition (J = 1, Ju = 2), which has a W2 value of 0.35 (see Table 10.1). Note that
1
Note that the scattering geometry of Fig. 5.11 is a special case of the geometry of Fig. 5.10
(which is implied by Eqs. (5.139)) corresponding to µ = cos β, µ = 0, χ − χ = 90◦ . The values
of µ, µ , C1 , S1 , C2 , S2 are given by Eqs. (5.97) and (5.98) – in the latter, H is replaced by Hu .
524 CHAPTER 10
the diagram is quite similar to the classical Hanle diagram shown in Fig. 5.12 (cf.
Eqs. (5.102)), apart from a scale factor due to the smaller polarizing efficiency of
the transition (the maximum fractional polarization, corresponding to zero mag-
netic field, is 21/73 0.288 – see Table 10.2 – instead of 1). The shape of the Hanle
diagrams can be appropriately characterized by the relative height, r, of the ‘cen-
ter’ of the diagram (defined as the point corresponding to α2 = 90◦ , β = 90◦ ). The
parameter r is just the ratio of the scattered polarization for infinitely large mag-
netic field to the polarization for zero field, in the 90◦ scattering of an unpolarized
beam with the magnetic field perpendicular to the scattering plane,
thus r varies from 1/3 for W2 = 1 to 1/2 for W2 → 0 (minimum polarizing effi-
ciency). For the transition (J = 1, Ju = 2) we get r 0.437.
In this section we analyze in some detail the frequency dependence of the emitted
radiation in the Hanle effect. It is important to recall that the results presented
in the following are valid under the flat-spectrum approximation for the incident
radiation field. As in the preceding section, we assume that the lower level is
unpolarized and we neglect collisions.
The emission coefficient in the four Stokes parameters, as a function of frequency,
is obtained by substitution of Eq. (10.27) into Eq. (7.15e). Before performing the
substitution, it is convenient to rewrite the latter in the form
= hν N 2J + 1 A(α J → α J )
εi (ν, Ω) u u u
4π
× TQ
K ρK (α J ) ΦKK (J , J ; ν) ,
(i, Ω) (10.39)
Q u u Q u
KK Q
where the generalized profile ΦKK
Q (J , Ju ; ν) is given by1
1
The symbol was introduced by Landi Degl’Innocenti et al. (1991).
NON-EQUILIBRIUM ATOMIC PHYSICS 525
ΦKK
Q (J , Ju ; ν) = 3(2Ju + 1)(2K + 1)(2K + 1)
1+Ju −Mu + q Ju J 1 Ju J 1
× (−1)
−M u M −q −M u M −q
Mu Mu M qq
Ju Ju K 1 1 K
×
Mu −Mu −Q q −q −Q
1
× Φ(να J M , α J M − ν) + Φ(να J M , α J M − ν)∗ . (10.40)
2 u u u u u u
Obviously, Eq. (10.41) reduces to Eq. (10.32) after integration over frequency (see
Eqs. (A13.3) and (10.17)).
A remarkable result can be deduced from Eq. (10.41): provided the width of
the lower level is much smaller than the width of the upper level (which requires a
sufficiently weak incident radiation field), the Hanle effect vanishes in the far wings
of the line. In other words, resonance scattering in the line wings is unaffected by
the presence of the magnetic field.
To prove this property, we go back to Eq. (6.59a). Neglecting frequency shifts,
we can write
1 1
Φ(να J M , α J M − ν) = ,
u u u π Γ − i (να J M , α J M − ν)
u u u
It follows that
1
Φ(να J M , α J M − ν) + Φ(να J M , α J M − ν)∗ =
2 u u u u u u
1 2Γ + i νL gα J (Mu − Mu )
=
u u
. (10.42)
2π Γ − i (να J Mu ,α J M − ν) Γ + i (να J M , α J M − ν)
u u u u u
526 CHAPTER 10
1
Φ(να J M , α J M − ν) + Φ(να J M , α J M − ν)∗ ≈
2 u u u u u u
νL gα J 1 Γ
≈ 1+i u u
(Mu − Mu ) .
2Γ π Γ 2 + (ν0 − ν)2
Substitution into Eq. (10.40) yields, owing to the presence of the third 3-j symbol
νL gα J 1 Γ
ΦKK (J , Ju ; ν) ≈ 1 + i Q u u
Q
2Γ π Γ 2 + (ν0 − ν)2
The summation can easily be evaluated (see the derivation of Eq. (A13.3)). If the
width of the lower level is negligible compared to the width of the upper level, we
have from Eqs. (6.59b) – see footnote 2 on p. 520
A(αu Ju → α J )
Γ = , (10.43)
4π
ΦKK
Q (J , Ju ; ν) ≈ δKK wJ(K)J (1 + i QHu ) φ(ν0 − ν) .
u
Substitution into Eq. (10.41) shows that the dependence on the magnetic field
disappears – in fact, Eq. (10.41) reduces to Eq. (10.19). This proves that the Hanle
effect vanishes in the line wings.
To get a deeper insight into Eq. (10.41), let us consider a specific scattering
process. We refer again to the geometry of Fig. 5.11, and we set β = 0◦ (the
magnetic field points in the direction of the outgoing radiation) and denote by I
and ∆Ω , respectively, the intensity and the (infinitesimal) angular spread of the
incoming, unpolarized beam. The tensors T Q K
can be evaluated from Table 5.6.1
For the sake of simplicity we consider the transition (J = 0, Ju = 1) with gα J = 1.
u u
1
A possible choice for the angles appearing in Table 5.6 is: θ = 0◦ , χ, γ = 90◦ for the emitted
radiation; θ = 90◦ , χ = χ for the incident radiation. An equivalent choice – cf. footnote on
p.523 – is: θ = 0◦ , χ, γ = 0◦ ; θ = 90◦ , χ = χ − 90◦ .
NON-EQUILIBRIUM ATOMIC PHYSICS 527
After some algebra we obtain, with the help of Eqs. (A13.11) and Table 10.1
= 3 k A ∆Ω I φ + φ
ε0 (ν, Ω) −1
8 L 1
= 3 k A ∆Ω I
ε1 (ν, Ω)
1
(φ + φ1 ) −
2Hu
(ψ − ψ1 )
8 L 1 + 4Hu2 −1 1 + 4Hu2 −1
= 3 k A ∆Ω I
ε2 (ν, Ω)
2Hu
(φ + φ1 ) +
1
(ψ − ψ1 )
8 L 1 + 4Hu2 −1 1 + 4Hu2 −1
= − 3 k A ∆Ω I φ − φ ,
ε3 (ν, Ω) (10.44)
−1
8 L 1
Fig.10.4. The linear polarization degree pL and the rotation angle α (in degrees) corresponding
to the emission coefficients of Fig.10.3. The dashed lines have the same meaning as in Fig.10.3.
to the Zeeman effect – and the complex shape of the linear polarization profiles.
From Eqs. (10.44) it can easily be shown that the scattered radiation is totally
polarized at every single frequency. Some features of the linear polarization are
better appreciated in Fig. 10.4, where the polarization degree pL and the rotation
angle α, given by
,ε -
ε21 + ε22 1
pL = , α = arctan 2 + α0
ε0 2 ε1
(with α0 defined as in Eqs. (1.8b,c)) are plotted against frequency. At line center
the polarization degree is unity while the direction of maximum polarization is
rotated through a large angle ( 63◦ ) from the zero-field direction. In the far
wings, pL tends again to unity while α tends to zero, according to the property
proved earlier that the Hanle effect vanishes. In fact, the asymptotic values of the
fractional emission coefficients derived from Eqs. (10.44) for |ν0 − ν| → ∞ are
pQ = 1 , pU = p V = 0 .
In this section we consider the interaction of a two-level atom with the radiation
field in the presence of a strong magnetic field, in order to illustrate an important,
general concept: atomic polarization can be produced not only by the anisotropy
and/or polarization of the incident radiation, but also by the spectral structure of
the radiation itself. For the sake of simplicity we assume (as in the former sections)
the lower level to be unpolarized and we neglect collisions.
NON-EQUILIBRIUM ATOMIC PHYSICS 529
In the strong field regime, the Zeeman splitting is much larger than the natural
line width. This implies Hu 1 (see Eq. (10.28)) so that, according to Eq. (10.30),
the upper-level statistical tensors with Q = 0 can be neglected.1 In other words, all
coherences are zero and atomic polarization can only involve differences between
the populations of Zeeman sublevels (cf. Eq. (3.101)). Under this approximation
Eq. (10.27) reduces to
Substitution into Eq. (10.39) gives the corresponding expression for the emission
coefficient. Using Eqs. (5.157), (9.5) and (10.6) we obtain
= kA
εi (ν, Ω) ΦKK (J , Ju ; ν)
L 0
KK
5
dΩ (K)
3
× ) I (ν , Ω
T K (j, Ω
wJ J T 0K (i, Ω) ) ,
0 j 0
4π j=0 u
1
Obviously, this statement holds only in a reference system having the z-axis along the magnetic
field direction. It should be remarked that neglecting the statistical tensors with Q = 0 is, in any
case, an approximation which makes impossible to account for certain phenomena typical of line
wings (like the disappearance of the Hanle effect described in the previous section).
530 CHAPTER 10
5
dΩ
3
× ) I (ν
T (j, Ω
j αu Ju Mu , α J M , Ω ) . (10.46)
4π j=0 qq
This expression gives the populations of the Zeeman sublevels of the upper level as
determined by a radiation field which can be anisotropic, polarized, and frequency-
dependent (within the limitations specified above). Its implications are best illus-
trated by comparison with the extreme case where the radiation field is isotropic,
unpolarized, and flat across the whole frequency interval spanned by the spectral
line,
) = δ J ,
Ij (να J M , α J M , Ω
u u u
j0
where J is the field intensity. In such case the integral over the solid angle can be
evaluated with the help of Eqs. (5.146), (5.128) and (5.143),
5
dΩ ) = 1 ,
T (0, Ω
4π qq 3
and the summation over M and q can be performed via Eq. (2.23a). The result is
N 1 c2
ρα (Mu ) = J,
u u
J
N 2J + 1 2hν03
which shows that all the Zeeman sublevels of the upper level are evenly populated
(unpolarized upper level).
If any of the above conditions is not met, a certain amount of atomic polariza-
tion (population unbalances) will form in the upper level, because of a mechanism
of selective pumping of the different Zeeman sublevels. There may exist selec-
tive pumping due to anisotropy (and/or polarization) of the incident radiation,
or selective pumping due to spectral structure of the incident radiation, or both.
Obviously, the second mechanism is effective only when the Zeeman sublevels are
split because of the presence of a magnetic field. In the preceding sections of this
chapter only the first mechanism has been considered.1
1
It should be remarked that the strong-field regime analyzed in this section is one of the
few cases where a frequency-structured radiation field can be treated within the flat-spectrum
approximation (such possibility is related to the fact that the atomic density matrix is diagonal).
If the Zeeman separation is comparable with the natural line width and the radiation field is
significantly structured across that width, the theory developed in this book cannot be applied
(see Sect. 6.5).
NON-EQUILIBRIUM ATOMIC PHYSICS 531
χ Ω
θ Ω
θ scattered
Ω radiation
Ω
incident
radiation
, is scattered into the direction Ω.
Fig.10.5. An unpolarized radiation beam, having direction Ω
The direction of the magnetic field B is specified by the angles θ and χ in the reference frame
e (Ω),
(ea (Ω), Ω).
The angle between Ω and B
is θ .
b
3 N c2 ∆Ω
ρ∓1 ≡ ρα J (∓1) = (1 + cos2 θ ) I (ν0 ∓ νL )
u u 4 N 2hν03 4π
3 N c2 ∆Ω
ρ0 ≡ ρα J (0) = 2 sin2 θ I (ν0 ) . (10.47)
u u 4 N 2hν03 4π
These formulae show that by suitably choosing the angle θ and the spectrum of the
incident radiation, the three Zeeman sublevels can be arbitrarily populated. Note,
however, that if the spectrum of the incident radiation is symmetrical about line
center – in particular, if it is flat across the whole spectral line – the populations
of the sublevels Mu = 1 and Mu = −1 are the same (the atom is ‘aligned’ but
not ‘oriented’, see Sect. 3.7 and Table 3.6): atomic orientation can arise only if
I (ν0 − νL ) and I (ν0 + νL ) are different.
Next we evaluate the radiation scattered by the atom in the direction Ω (see
Fig. 10.5). The emission coefficient can be obtained from Eq. (7.19e), with the
given by Table 5.3 (with θ → θ, γ → 180◦ − χ). Using Eqs. (7.8),
tensors Tqq (i, Ω)
(3.107) and (10.6), and the shorthand notations for ρα J (Mu ) of Eqs. (10.47), we
u u
obtain
3 1 ρ φ + ρ−1 φ1
= ε
ε0 (ν, Ω) L ρ0 φ0 sin2 θ + 1 −1 1 + cos2 θ
ρ−1 + ρ0 + ρ1 2 2
3 1 ρ φ + ρ−1 φ1
= ε
ε1 (ν, Ω) L ρ φ − 1 −1 sin2 θ cos 2χ
ρ−1 + ρ0 + ρ1 2 0 0 2
3 1 ρ φ + ρ−1 φ1
= ε
ε2 (ν, Ω) L ρ φ − 1 −1 sin2 θ sin 2χ
ρ−1 + ρ0 + ρ1 2 0 0 2
532 CHAPTER 10
= ε 3 1
ε3 (ν, Ω) ρ φ − ρ1 φ−1 cos θ , (10.48)
L
ρ−1 + ρ0 + ρ1 2 −1 1
where εL is defined in Eq. (9.11) and the profiles φ−1 , φ0 , φ1 in Eq. (A13.10).
If the atom is unpolarized (ρ−1 = ρ0 = ρ1 ), Eqs. (10.48) reduce to Eqs. (9.10)
written for the transition (J = 0, Ju = 1). In such case two remarkable spectral
properties hold: a) the frequency profile of ε0 , ε1 , ε2 is symmetrical about line cen-
ter, the frequency profile of ε3 is antisymmetrical; b) the integral over frequency of
ε1 , ε2 and ε3 is zero. As obvious from Eqs. (10.48), none of these properties is any
longer valid if atomic polarization is present. In particular, the symmetry character-
istics of the profiles about line center are broken if the atom is oriented (ρ−1 = ρ1 ).
The frequency-integrated linear polarization turns out to be proportional to the
amount of atomic alignment, the frequency-integrated circular polarization to the
amount of atomic orientation (cf. Table 3.6).
It should be remarked that the formal invariance of Eqs. (10.48) under the trans-
formation χ → π + χ (rotation through 180◦ of the magnetic field vector about the
direction of the scattered radiation) does not mean that the scattered radiation is
unaffected by such transformation – this is only true when the atom is unpolarized.
In general, the populations ρ−1 , ρ0 , ρ1 depend on the angle θ (see Eqs. (10.47))
which is not invariant for χ → π + χ.
Resonance polarization and the Hanle effect are deeply modified by the presence
of collisions. In this section we illustrate their effects for the two-level atom, under
the restrictive assumption that the lower level is unpolarized. Thus we refer again
to Eq. (10.8), and we recall that the collisional rates for inelastic and superelastic
collisions are connected by the Einstein-Milne relation (Eq. (7.98))
hν
(0) 2J + 1 − k T0
CI (αu Ju , α J ) = u e B c CS(0) (α J , αu Ju ) , (10.49)
2J + 1
2J + 1 B(α J → αu Ju )
ρK
Q (αu Ju ) =
2Ju + 1 A(αu Ju → α J )
where
CS(0) (α J , αu Ju ) D(K) (αu Ju )
= , δu(K) = , (10.51)
A(αu Ju → α J ) A(αu Ju → α J )
and where
hν
2hν03 − k T0
BP (Tc ) = e B c
c2
is the Planck function (in the Wien limit, where stimulated emission is neglected)
at the temperature Tc .
The physical meaning of the quantities introduced in Eqs. (10.51) is quite obvious.
represents the ratio of collisional to radiative de-excitation rates of the upper level,
whereas δu(K) – with δu(0) = 0, see Eq. (7.102) – represents the effective number of
depolarizing collisions (for the statistical tensors of rank K) taking place during the
lifetime of the excited level. It is worth noticing that, under the limit δu(K) → ∞,
Eq. (10.50) predicts that all the ρK Q (αu Ju ) with K = 0 vanish. The only non-zero
statistical tensor is ρ00 (αu Ju ), and Eq. (10.50) can be cast into the form
This relation is easily recognized as the usual statistical equilibrium equation of the
‘scalar’ (unpolarized) case, once the left-hand side is interpreted as the line source
function (see Eqs. (9.5), (9.11), (9.13) and (10.6)).1
To obtain the radiation emitted by the atom along a given direction Ω, we have to
substitute Eq. (10.50) into Eq. (10.39) or, if one is only interested in the frequency-
integrated radiation, into Eq. (10.31). Let us consider the latter case first. Use of
Eqs. (9.5), (10.6), (10.17), (5.157) and Table 5.6 yields
= kA
ε̃i (Ω) L B (T ) δ
1 + P c i0
5
dΩ
3
+ Ω
P (Ω, ; B, ) ,
, δ (K) ) I (ν , Ω (10.53)
4π j=0 ij u j 0
where the generalized scattering phase matrix (i.e. the phase matrix in the presence
of a magnetic field and collisions) is given by
Ω
Pij (Ω, ; B,
, δ (K) ) = WK (J , Ju ) (−1)Q T Q
K )
T K (j, Ω
(i, Ω)
u −Q
KQ
1
× . (10.54)
1 + i QHu + + δu(K)
1
The right-hand side is often written in the slightly different form
BP (Tc ) + (1 − ) J00 (ν0 ) ,
where = /(1 + ).
534 CHAPTER 10
Equations (10.53) and (10.54) show the double role played by collisions in resonance
scattering. On the one hand, inelastic (and superelastic) collisions introduce an
additional term in the emission coefficient. This term affects only intensity, not
polarization, and is proportional to /(1 + ) or, in other words, to the branching
ratio
CS(0) (α J , αu Ju )
,
A(αu Ju → α J ) + CS(0) (α J , αu Ju )
which represents the collisional de-excitation probability divided by the total de-
excitation probability of the upper level. The unpolarized character of the emitted
radiation is a consequence of the fact that we have assumed an isotropic distribution
of colliders, which prevents the possibility of describing impact polarization.
On the other hand, collisional rates affect the scattering phase matrix via the
factor
1
, (10.55)
1 + i QHu + + δu(K)
which involves both inelastic and elastic collisions. This factor describes the so-
called quenching of resonance polarization (see Mitchell and Zemanski, 1934). Its
effects are easily understood by rewriting Eq. (10.55) in the form (cf. Eqs. (5.193),
(5.194) for a similar ‘classical’ expression)
1 1 1
= , (10.56)
1 + i QHu + + δu(K) 1 + + δu(K) 1 + i QHu (K)
Hu 2πνL gα J
Hu (K) = = u u
.
(K)
1 + + δu A(αu Ju → α J ) + CS(0) (α J , αu Ju ) + D(K) (αu Ju )
Equations (10.54) and (10.56) show that the generalized phase matrix can be for-
mally deduced from the Hanle phase matrix (Eq. (10.33)) by the substitutions
WK (J , Ju )
WK (J , Ju ) → , Hu → Hu (K) .
1 + + δu(K)
Thus the quenching of resonance polarization consists both in a net reduction of
the scattered polarization and in a reduction of the efficiency of the magnetic field
as depolarizing agent in the Hanle effect.
If the dependence of the scattered radiation on frequency is considered, a further
effect of collisions comes into play, i.e., the broadening of the individual profiles
Φ(να J M , α J M − ν). Neglecting frequency shifts, these profiles are given by
u u u
1 Γ
Φ(να − ν) =
J M , α J M
u u u π Γ 2 + (να J Mu , α J M − ν)2
u u
i να J M , α J M − ν
u u u
+
,
π Γ + (να J M , α J M − ν)2
2
u u u
NON-EQUILIBRIUM ATOMIC PHYSICS 535
γu + γ f
Γ = Γ + 2Γc = + ,
4π 2π
γu and γ being the inverse lifetimes of the upper and lower level, respectively, and
f the frequency of elastic collisions. For the two-level atom, neglecting the width
of the lower level, we have
= kA
0 (J , Ju ; ν) T 0 (i, Ω)
Φ0K K
εi (ν, Ω) L BP (Tc )
1+
K
5
dΩ (K)
3
KK K )
T K (j, Ω
+ ΦQ (J , Ju ; ν) wJ J (−1)Q T Q (i, Ω) −Q
4π j=0
u
KK Q
1 ) .
× I (ν , Ω (10.57)
1 + i QHu + + δu(K) j 0
The broadening of the profiles due to collisions has important consequences and,
in particular, it breaks down a property proved in Sect. 10.4 for the collisionless
regime: while in the latter resonance scattering in the far wings of a spectral line is
unaffected by the magnetic field, this is no longer true in the presence of collisions.
For → ∞ only the first term of Eq. (10.57) is non-zero. It can easily be shown
that under this limit Eq. (10.57) reduces to Eqs. (9.10) written for LTE.
Up to now we have systematically ignored the possibility that some atomic polar-
ization may form in the lower level of a two-level atom under the action of the
incident radiation field: all the results derived in Sects. 10.2 to 10.6 are based on
the unpolarized lower level approximation. Here we release this approximation,
in order to illustrate how the former results are modified when lower-level atomic
polarization is allowed for.
536 CHAPTER 10
in the expressions of the rates TA and RA – on the other hand, the rates TE and RE
are proportional to the Kronecker symbol δQQ . Thus the overall set of equations
is decoupled into separate subsets characterized by a fixed Q value. As the trace
equation (10.4) – the only non-homogeneous equation – involves only statistical
tensors with Q = 0, all the other ones are identically zero. Note, incidentally, that
this means that all coherences between magnetic sublevels are zero (see Eq. (3.101)):
under the assumptions adopted in this section, atomic polarization can only consist
in population differences of the magnetic sublevels. On the other side, since the
1
As obvious from the following argument, the property is valid even if inelastic and superelastic
collisions and/or stimulated emission are taken into account. It also holds for the multi-level (not
just the two-level) model atom.
NON-EQUILIBRIUM ATOMIC PHYSICS 537
K
multipole components JQ r (ν0 ) with Kr = 1 are zero, all the radiative rates vanish
r
unless (K − K ) = 0 or (K − K ) = ±2 : as far as TA and RA are concerned, this
is again due to the 3-j symbol in Eq. (10.58) which, for Q = Q = Qr = 0, is
zero unless (K + K + Kr ) is even (see Eq. (2.25)); as for TE and RE , both are
proportional to the Kronecker symbol δKK . It follows that the remaining set of
equations (involving the statistical tensors with Q = 0 ) is in turn decoupled into
two separate subsets relating statistical tensors of even and odd rank, respectively.
As the latter do not appear in the trace equation, the corresponding subset is
identically solved by the null solution.
Because of the above property, the number of equations to be solved is largely
reduced. It can be shown that such number is
J + Ju + 2 for J , Ju integers
Neq = (10.59)
J + Ju + 1 for J , Ju half-integers ,
d K
ρ (α J ) = − A(αu Ju → α J ) + D(K) (αu Ju ) ρK 0 (αu Ju )
dt 0 u u
+ TA (αu Ju K 0, α J K 0) ρK
0 (α J )
K even
!
d K Ju Ju K
ρ0 (α J ) = (2Ju + 1) A(αu Ju → α J ) (−1)1+J +Ju ρK
0 (αu Ju )
dt J J 1
− RA (α J K 0 K 0) + δKK D(K) (α J ) ρK
0 (α J ) , (10.60)
K even
where K is even. The rates TA and RA , given by Eqs. (7.14a) and (7.14d) respec-
tively, reduce, with the help of Eqs. (2.26a), (2.36a) and (2.49), to the following
(2J + 1) B(α J → αu Ju ) 0 c2
J0 (ν0 ) = J 0 (ν ) = n̄
(2Ju + 1) A(αu Ju → α J ) 2hν03 0 0
J02 (ν0 ) w
=√
J00 (ν0 ) 2
D(K) (αu Ju )
= δu(K)
A(αu Ju → α J )
D(K) (α J )
= δ(K) (10.62)
B(α J → αu Ju ) J00 (ν0 )
(Eqs. (7.8) have been used in the first relation). Their meaning is the following. The
parameter n̄ represents the solid-angle average of the number of photons per mode
at frequency ν0 (see Eqs. (4.36), (5.157) and Table 5.7); in the present case it is
much less than unity, since we have assumed stimulated emission to be negligible.
The parameter w, satisfying the condition −1/2 ≤ w ≤ 1 under the conditions
considered in this section, is the so-called anisotropy factor . Its value is unity for a
unidirectional (unpolarized) radiation beam, and zero for an isotropic (unpolarized)
radiation field – see Eqs. (5.164). Finally, δu(K) , already introduced in Eqs. (10.51),
represents the effective number of depolarizing collisions (for the statistical tensors
of rank K) taking place during the lifetime of the upper level, while δ(K) is the
corresponding quantity for the lower level. Note that, by combining the first and
the last of Eqs. (10.62), we can write
since D(K) (α J ) and D(K) (αu Ju ) are of the same order of magnitude and n̄ 1,
it follows that
δ(K) δu(K) .
In the following we apply Eqs. (10.60) to a few simple cases. We begin with the
transition (J = 1, Ju = 0) which is the simplest transition where lower-level polar-
ization plays a role. After some calculations, involving the numerical evaluation of
NON-EQUILIBRIUM ATOMIC PHYSICS 539
several 3-j, 6-j, and 9-j symbols, one gets the following independent equations1
1 d 0 1 1
ρ0 (αu Ju ) = − ρ00 (αu Ju ) + √ n̄ ρ00 (α J ) + √ n̄w ρ20 (α J )
A(αu Ju → α J ) dt 3 6
1 d 2 1 1 1
ρ (α J ) = − n̄ 1 − w + δ(2) ρ20 (α J ) − √ n̄w ρ00 (α J ) ,
A(αu Ju → α J ) dt 0 3 2 3 2
where the parameters defined in Eqs. (10.62) have been introduced. In stationary
situations, the solution can be written in the form2
ρ20 (α J ) √ w
σ02 (α J ) ≡ =− 2
ρ00 (α J ) 2 − w + 2 δ(2)
ρ00 (αu Ju ) 1 w2
= √ n̄ 1 − . (10.64)
ρ00 (α J ) 3 2 − w + 2 δ(2)
These expressions exhibit several interesting aspects. First of all, atomic polar-
ization, as well as the ratio of the upper to the lower level population, depend
in a non-linear way on the anisotropy factor w: this is generally the case when
lower-level polarization is allowed for.
Secondly, atomic polarization is zero if w = 0, which is quite obvious because in
that case the physical environment is completely isotropic (we recall that in our
treatment collisions are isotropic by assumption). Apart from this special case,
atomic polarization tends to zero for δ(2) → ∞: in general, the approximation
of the unpolarized lower level is strictly justified only under such limit. On the
other hand, according to Eq. (10.63), δ(2) is inversely proportional to n̄, or to
the intensity of the radiation field. Thus we can also say that lower-level atomic
polarization increases as the intensity of the radiation field is increased. This is
easily understood if we consider that the radiative rates (which are responsible for
atomic polarization) are proportional to the intensity of the radiation field, while
the collisional rates are independent of it (they only depend on the density of the
colliders).
Another peculiar aspect of Eqs. (10.64) is that for w = 1 and δ(2) = 0 (unidirec-
tional, unpolarized radiation beam and no collisions) we have
√
σ02 (α J ) = − 2 , ρ00 (αu Ju ) = 0 . (10.65)
The second result means that the upper level is completely depopulated (the atom
is perfectly transparent to the incident radiation), which looks rather surprising
1
We recall that the equations for the time derivatives of ρ00 (α J ) and ρ00 (αu Ju ) are not
independent: see footnote on p.512.
2 K (αJ) = ρK (αJ)/ρ0 (αJ), sometimes referred to as reduced statistical tensor,
The symbol σQ Q 0
will be often used in the following.
540 CHAPTER 10
at first sight. However, one should bear in mind that a unidirectional, unpo-
larized radiation beam induces only transitions with ∆M = ±1.1 The Zeeman
sublevels M = ±1 of the lower level are therefore depopulated by the incident
radiation, while the sublevel M = 0 is not. On the other hand, the upper level
(Ju = 0, Mu = 0) de-excites with the same probability to each of the three sub-
levels of the lower level. The result of this cycle is that all the atoms are eventually
pumped
√ into the sublevel M = 0 of the lower level, which explains both the factor
− 2 for the relative amount of alignment (cf. Table 3.6) and the total absence of
population in the upper level.
Finally, Eqs. (10.64) show that depolarizing collisions affect the ratio of the upper
level to the lower level population, which may look surprising because such colli-
sions are elastic (see Sect. 7.13). The reason for this is that depolarizing collisions
tend to mix up the populations of the Zeeman sublevels of each (αJ)-level, thus
affecting (indirectly) the absorption and emission of radiation. Such effect is clearly
illustrated by the example just considered of a unidirectional, unpolarized incident
beam: in stationary situations and under the limit δ(2) = 0, all the atoms are in the
sublevel M = 0 of the lower level whence absorption can no longer take place, so
that the upper level is completely depopulated (Eqs. (10.65)). But if depolarizing
collisions are present, some of the atoms are forced to move from the M = 0 to
the M = ±1 sublevels whence absorption is possible, which eventually leads to a
non-null value for the population of the upper level.
The transition (J = 1, Ju = 0), though interesting under several aspects, is not
so illuminating under the point of view of resonance polarization: since Ju = 0, the
emitted radiation is unpolarized irrespective of the presence of lower-level atomic
polarization. The latter can only affect the absorption properties of the atom, an
argument that we are not going to pursue here but that might be easily handled
by substituting the expressions of the statistical tensors of the lower level into
Eqs. (7.15a,c) applied to the two-level atom.
We now analyze the simplest transition where atomic polarization can be present
both in the lower and the upper level, namely the transition (J = 1, Ju = 1). Appli-
cation of Eqs. (10.60) to this case yields the following set of independent equations
1 d 0 1
ρ (α J ) = − ρ00 (αu Ju ) + n̄ ρ00 (α J ) − √ n̄w ρ20 (α J )
A(αu Ju → α J ) dt 0 u u 2 2
1 d 2 1
ρ (α J ) = − 1 + δu(2) ρ20 (αu Ju ) − √ n̄w ρ00 (α J )
A(αu Ju → α J ) dt 0 u u 2 2
1
− n̄ 1 + w ρ20 (α J )
2
1
This fact can be easily verified by considering the statistical equilibrium equations in the
standard representation and the corresponding expressions of the radiative rates (Eqs. (7.5) and
(7.9), respectively). It is sufficient to recall that for a unidirectional, unpolarized radiation beam
the only non-zero components of the (reducible) radiation field tensor are J11 (ν0 ) and J−1−1 (ν0 )
– cf. Eqs. (5.153) and Table 5.4.
NON-EQUILIBRIUM ATOMIC PHYSICS 541
1 d 2 1 1
ρ (α J ) = − ρ20 (αu Ju ) + √ n̄w ρ00 (α J )
A(αu Ju → α J ) dt 0 2 2 2
1
− n̄ 1 + w + δ(2) ρ20 (α J ) ,
4
where the notations of Eqs. (10.62) have been used. In stationary situations, the
solution can be written in the form
√ 6 + 3w + 4 δ(2)
σ02 (αu Ju ) = − 2 w
4 (1 + δu )(4 + w + 4 δ(2) ) − (1 + w) − w2 (3 + 2 δu(2) )
(2)
√ 3 + 2 δu(2)
σ02 (α J ) = 2w
2 (1 + δu )(4 + w + 4 δ(2) ) − (1 + w)
(2)
ρ00 (αu Ju ) 4 (1 + δu(2) )(4 + w + 4 δ(2) ) − (1 + w) − w2 (3 + 2 δu(2) )
= n̄ . (10.66)
ρ00 (α J ) 4 (1 + δu(2) )(4 + w + 4 δ(2) ) − (1 + w)
These expressions are rather involved and clearly show the intricate dependence of
atomic polarization on the different parameters. However, it is easily seen that the
basic characteristics found for the transition (J = 1, Ju = 0) are confirmed: atomic
polarization (of both the upper and the lower level) depends non-linearly on the
anisotropy factor w, and vanishes for w = 0; the ratio of the upper to the lower
level population also depends non-linearly on w, and is affected by the presence of
depolarizing collisions.
Consider now the limit δ(2) → ∞. Equations (10.66) reduce to
√
2 w ρ00 (αu Ju )
σ02 (αu Ju ) =− , σ02 (α J ) = 0 , = n̄ . (10.67)
4 1 + δu(2) ρ00 (α J )
These can be compared with the corresponding expressions derived in Sect. 10.6
for the two-level atom under the unpolarized lower level approximation. Setting
= Hu = 0 (the conditions considered in the present section) into Eq. (10.50) we
obtain, for an arbitrary transition (J , Ju )
where ‘u.l.l.’ stands for ‘unpolarized lower level’, and where we have used the
notations of Eqs. (10.62). For the transition (J = 1, Ju = 1), Eqs. (10.68) give
the same results as Eqs. (10.67) – see Table 10.1. Once again we see that the
unpolarized lower level approximation is correct under the limit δ(K) → ∞ – a
statement which, on the other hand, is suggested by the structure itself of the
statistical equilibrium equations. It is important to remark that the condition
δ(K) 1 occurs not only when D(K) A(αu Ju → α J ) – in which case δu(K) 1, so
that the polarization of the upper level tends to zero as well – but also when n̄ 1
(see Eq. (10.63)). Therefore the unpolarized lower level approximation is especially
542 CHAPTER 10
√ 1 ρ00 (αu Ju ) 3
σ02 (αu Ju ) = − 2 , σ02 (α J ) = √ , = n̄ .
2 ρ00 (α J ) 4
According to Table 3.6, this means that the Zeeman sublevels of the upper level
with Mu = ±1, and the Zeeman sublevel of the lower level with M = 0 are
completely depopulated. The populations of the remaining sublevels, normalized
to unity, are
3 n̄ 2
ρα J (0) = , ρα J (±1) = .
u u 4 + 3 n̄ 4 + 3 n̄
The analytical results obtained for the two simple cases (J = 1, Ju = 0) and
(J = 1, Ju = 1) might in principle be extended to transitions involving larger J
values. However, the algebraic system to be solved becomes of course more and
more complicated; at the same time, the dependence of atomic polarization on the
various parameters becomes so intricate that no real insight into the physics of the
scattering process can be gained. In order to give an idea of the atomic polarization
arising in different transitions, we collect in Table 10.3 the values of the reduced
statistical tensors
ρK
0 (αu Ju ) ρK
0 (α J )
σ0K (αu Ju ) = , σ0K (α J ) =
ρ00 (αu Ju ) ρ00 (α J )
2 wJ(2)J
σ0 (αu Ju ) u.l.l. = √u
2
obtained for the same physical conditions plus the assumption of the unpolarized
lower level (cf. Eqs. (10.68)). The last two columns of Table 10.3 show, respectively,
the fractional linear polarization pQ expected in a 90◦ scattering process, and the
same quantity evaluated under the additional assumption of the unpolarized lower
level, [pQ ]u.l.l.. The expression of the former can be deduced from Eq. (10.15).
Defining the positive Q direction as the perpendicular to the scattering plane, we
obtain, with the help of Table 5.6 (with θ = 90◦ , γ = 90◦ ) and Eq. (10.14)
TABLE 10.3
Atomic polarization for different transitions in the collisionless, maximum anisotropy regime.
The letters u and denote the upper and lower level, respectively. The last two columns give
the fractional linear polarization in a 90 ◦ scattering. The label u.l.l. means ‘unpolarized lower
level’.
σ02 (u) σ04 (u) σ06 (u) σ08 (u) σ010 (u) [σ02 (u)]u.l.l.
J Ju pQ [pQ ]u.l.l.
σ02 () σ04 () σ06 () σ08 () σ010 ()
0.707 − − − − 0.707
0 1 1. 1.
− − − − −
− − − − − −
1/2 1/2 0. 0.
− − − − −
0.5 − − − − 0.5
1/2 3/2 0.429 0.429
− − − − −
− − − − − −
1 0 0. 0.
−1.414 − − − −
−1.414 − − − − −0.354
1 1 1. 0.2
0.707 − − − −
0.622 0.214 − − − 0.418
1 2 0.448 0.288
0.321 − − − −
− − − − − −
3/2 1/2 0. 0.
−0.5 − − − −
−0.526 − − − − −0.4
3/2 3/2 0.353 0.261
0.308 − − − −
0.764 0.265 − − − 0.374
3/2 5/2 0.5 0.226
0.5 − − − −
−0.202 − − − − 0.071
2 1 −0.021 0.008
−0.489 0.146 − − −
−0.460 −0.247 − − − −0.418
2 2 0.319 0.288
0.272 0.146 − − −
0.903 0.354 0.089 − − 0.346
2 3 0.556 0.191
0.659 0.137 − − −
−0.357 − − − − 0.1
5/2 3/2 −0.053 0.015
−0.541 0.141 − − −
−0.408 −0.109 − − − −0.428
5/2 5/2 0.287 0.302
0.244 0.065 − − −
1.035 0.458 0.132 − − 0.327
5/2 7/2 0.611 0.170
0.809 0.242 − − −
544 CHAPTER 10
TABLE 10.3
(continued)
σ02 (u) σ04 (u) σ06 (u) σ08 (u) σ010 (u) [σ02 (u)]u.l.l.
J Ju pQ [pQ ]u.l.l.
σ02 () σ04 () σ06 () σ08 () σ010 ()
3 W2 (J , Ju )
pQ u.l.l. = , (10.70)
4 − W2 (J , Ju )
where Eqs. (10.68) and (10.17) have been used. Note that expression (10.70) has
already been derived in Sect. 10.2 (Eq. (10.26)).
It is interesting to observe that, with some exceptions, the linear polarization of
the scattered radiation is larger (in absolute value) when the atomic polarization
of the lower level is taken into account. The exceptions are provided by transitions
having ∆J = Ju − J = 0 (and J > 2), for which the difference between the last
two columns of Table 10.3 is, however, rather small. It is also interesting to notice
that for transitions with ∆J = −1 the polarization of the scattered radiation is
negative (polarization direction parallel to the scattering plane) when the atomic
polarization of the lower level is taken into account.
The results in Table 10.3 can also be used to reconstruct the relative popu-
lations of the Zeeman sublevels via Eq. (3.99). Referring for instance to the
transition (J = 1, Ju = 2), one can easily find that the relative populations are
9/22 (M = ±1) and 4/22 (M = 0) for the lower level, and 9/25 (Mu = ±2), 2/25
(Mu = ±1), 3/25 (Mu = 0) for the upper level.1
The results of the previous section can be generalized by allowing for the presence
of a magnetic field: this leads to an investigation of the Hanle effect in more gen-
eral terms than in Sect. 10.3, since lower-level atomic polarization is now taken into
account. In order to simplify the problem, we adopt exactly the same assumptions
as in the preceding section (except of course for the presence of the magnetic field):
we neglect stimulated emission as well as inelastic and superelastic collisions, and
we assume the incoming radiation to be unpolarized and cylindrically symmetri-
cal about a given direction. The flat-spectrum approximation requires now the
radiation to be unstructured across a frequency interval centered at the transition
frequency ν0 and larger than the Zeeman splitting of the line.
Since there are two preferred directions (the magnetic field vector and the symme-
try axis of the radiation field), it is convenient to write the statistical equilibrium
equations in a reference system whose z-axis is parallel to one of them. Either
choice has its own advantages and disadvantages. In the frame with the z-axis
1
In terms of rational numbers,
√ one has √ √
2 5 2 13 70 2 14
σ0 (α J ) = , σ02 (αu Ju ) = , σ04 (αu Ju ) = .
22 175 35
546 CHAPTER 10
along the magnetic field direction, one can apply directly Eq. (7.11). The magnetic
contribution to the equations is very simple (being diagonal), but the expressions
of the transfer and relaxation rates are rather complicated because the radiation
field has no special symmetry characteristics about the magnetic field’s direction.
In the frame with the z-axis parallel to the symmetry axis of the radiation field,
the expressions of the rates are much simpler (because the radiation field tensor
has only two non-zero components, J00 (ν0 ) and J02 (ν0 )), whereas the magnetic term
involves a kernel connecting different statistical tensors (see Eqs. (7.78) and (7.79)).
In the following we choose the latter reference frame. It can easily be proved
that, in stationary situations, the only non-null statistical tensors are those with
K even (see Sect. 10.7). Owing to the presence of the magnetic kernel, which mixes
up statistical tensors of the same rank but with different Q values, the statistical
tensors with Q = 0 are in general non-zero. Therefore, contrary to the previous
section, atomic polarization involves both population unbalances and coherences
between Zeeman sublevels. The number of equations (and of unknowns) is now
given by (cf. Eq. (10.59))
(J + 1)(2J + 1) + (Ju + 1)(2Ju + 1) for J , Ju integers
Neq = (10.71)
J (2J + 1) + Ju (2Ju + 1) for J , Ju half-integers .
The statistical equilibrium equations for the two-level atom, under the physical
conditions specified above, can be obtained from Eq. (7.78) by adding the contribu-
tion of depolarizing collisions (Eq. (7.101)). Bearing in mind the formal invariance
of the radiative rates proved in Sect. 7.12 – which implies that such rates are still
given by Eqs. (7.14) – we get, with the help of Eqs. (2.26a), (2.36a) and (2.49)1
d K
ρQ (αu Ju ) = −2πi νL gα J KQQ
K K
ρQ (αu Ju )
dt u u
Q
+ TA (αu Ju KQ, α J K Q) ρK
Q (α J )
K even
d K
ρQ (α J ) = −2πi νL gα J KQQ
K K
ρQ (α J )
dt
Q
!
Ju Ju K
+ (2Ju + 1) A(αu Ju → α J ) (−1) 1+J +Ju
ρK
Q (αu Ju )
J J 1
− RA (α J KQK Q) + δKK D(K) (α J ) ρK
Q (α J ) , (10.72)
K even
θ
γ
symmetry axis
of the incident radiation
Fig.10.6. The rotation R appearing in Eqs.(10.73) and (10.74) carries the reference system (XY Z)
into (xyz).
and
∗
KQQ
K
= DQ
K
Q (R) Q DQ
K
Q (R) , (10.73)
Q
R being the rotation which carries the reference system with the z-axis directed
along the magnetic field into the reference system with the z-axis directed along
the symmetry axis of the radiation field.
The expression of the kernel KQQ K
can be somewhat simplified. Referring to
Fig. 10.6, the rotation R is given, in terms of Euler angles, by
548 CHAPTER 10
and the product of two rotation matrices via Eq. (2.77). With the help of Eqs.
(2.23a) and (2.73) we obtain the simpler expression
K K 1
KQQ
K
= K(K + 1)(2K + 1) (−1)K−Q D0q
1
(R) . (10.74)
−Q
Q q
Even for the simplest transitions, Eqs. (10.72) are so involved that it isn’t worth
trying to solve them (in stationary situations) analytically: for instance, according
to Eq. (10.71), the transition (J = 1, Ju = 1) is described by a system of 12 equa-
tions in 12 unknowns. It is therefore necessary to resort to a numerical solution.
Once the solution is found, the radiation emitted in a given direction can be
evaluated via Eq. (10.39), or via Eq. (10.31) if only the frequency-integrated radi-
ation is required. To give an example, we refer to the geometry of Fig. 10.6 and we
evaluate the frequency-integrated fractional polarization of the radiation emitted
in the x-direction (i.e., at 90◦ from the symmetry axis of the incoming radiation),
taking the y-axis as the positive Q direction. Using the expressions of T Q K in
(i, Ω)
◦ ◦ ◦
Table 5.6 (with θ = 90 , χ = 0 , γ = 90 ) we obtain from Eq. (10.31), with the
help of Eqs. (3.102) and (10.14)
√
ε̃Q wJ(2)J 3 σ02 + 6 Re(σ22 )
p̃Q = = √ u
√
ε̃I 2 2 − wJ(2)J σ02 − 6 Re(σ22 )
u
√
ε̃ 2 6 wJ(2)J Im(σ12 )
p̃U = U = − √ u √ , (10.75)
ε̃I 2 2 − wJ(2)J σ02 − 6 Re(σ22 )
u
K
where σQ are the reduced statistical tensors of the upper level.
Figures 10.7 and 10.8 show the Hanle diagrams for the transitions (J = 1, Ju = 1)
and (J = 1, Ju = 2), respectively.1 They are obtained from Eqs. (10.75), with
the values of the statistical tensors derived by solving numerically Eqs. (10.72)
– written for stationary situations – together with the trace equation (10.4), under
the following conditions: the magnetic field vector lies in the x-y plane (θ = 90◦ in
Fig. 10.6); the incoming radiation is a unidirectional, unpolarized beam travelling
in the z-direction; its intensity is very weak; depolarizing collisions are negligible.
1
Hanle diagrams for other transitions have been presented by Landolfi and Landi Degl’Innocenti
(1986).
NON-EQUILIBRIUM ATOMIC PHYSICS 549
In terms of the parameters defined in Eqs. (10.62) the last three conditions read:
w = 1, n̄ 1, δu(K) = δ(K) = 0.
If the diagrams in Fig. 10.7 or 10.8 are compared with a ‘usual’ Hanle diagram
(see e.g. Fig. 10.2), an obvious difference comes out: the latter consists of a unique
‘lobe’, while the former are split into two distinct lobes. Such lobes correspond to
the relaxation of the lower-level coherences (lower-level Hanle effect) and of the
upper-level coherences (‘usual’ Hanle effect), respectively. The order of magnitude
of the magnetic field strength in either lobe is different, because – as suggested by
Eqs. (10.72) – the typical field which depolarizes the lower level is such that
2πνL gα J ≈ B(α J → αu Ju ) J00 (ν0 ) ,
whereas the typical field which depolarizes the upper level is such that
2πνL gα J ≈ A(αu Ju → α J ) .
u u
it follows that the former is approximately n̄ times smaller than the latter. The
lower-level and the ‘usual’ Hanle effect are clearly separated in Figs. 10.7 and 10.8
because the incident radiation is extremely diluted (n̄ = 10−6 ). The shape of the
diagrams will however remain the same provided n̄ 1. For n̄ 1 the two regimes
tend to overlap; but a correct treatment of this case requires that the stimulated
emission rates be included in the statistical equilibrium equations (see the next
section).
Let us compare the diagrams in Figs. 10.8 and 10.2 more closely. They refer to
the same transition and the same geometrical and physical conditions: the only
difference is the atomic polarization of the lower level, which is allowed for in
Fig. 10.8 and neglected in Fig. 10.2 (unpolarized lower level approximation). For
zero magnetic field, the value of p̃Q in Fig. 10.8 is larger than the value in Fig. 10.2
(approximately 0.45 and 0.29, respectively: see Table 10.3), consistently with the
results presented at the end of Sect. 10.7. The value of p̃Q corresponding to the
relaxation of the lower-level coherences (the ‘transition region’ between the two
lobes of Fig. 10.8) depends on the value of the angle χ, ranging approximately
from 0.30 for χ = 0◦ or 180◦ to 0.33 for χ = 90◦ . In any case, this value is larger
than p̃Q 0.29 of Fig. 10.2, where lower-level atomic polarization is neglected a
priori. The same characteristic appears from Fig. 10.7, where the value of p̃Q cor-
responding to the transition between the two lobes (independent of χ in this case)
is about 0.33, to be compared with p̃Q = 0.2 given in Table 10.3 for the transition
(J = 1, Ju = 1) under the unpolarized lower level approximation. These examples
NON-EQUILIBRIUM ATOMIC PHYSICS 551
show that atomic polarization of the lower level is reduced, but not completely re-
moved, by the magnetic field . The complete cancellation of lower-level polarization
can only occur in the presence of depolarizing collisions.
d K
ρ (α J ) = − A(αu Ju → α J ) + D(K) (αu Ju ) ρK 0 (αu Ju )
dt 0 u u
+ TA (αu Ju K 0, α J K 0) ρK
0 (α J )
K even
− RS (αu Ju K 0 K 0) ρK
0 (αu Ju )
K even
!
d K Ju Ju K
ρ (α J ) = (2Ju + 1) A(αu Ju → α J ) (−1) 1+J +Ju
ρK0 (αu Ju )
dt 0 J J 1
− RA (α J K 0 K 0) + δKK D(K) (α J ) ρK
0 (α J )
K even
+ TS (α J K 0, αu Ju K 0) ρK
0 (αu Ju ) , (10.76)
K even
where K is even and the rates TA and RA are given by Eqs. (10.61), while the
rates TS and RS can be obtained from Eqs. (7.14c,f) with the help of Eqs. (2.26a),
(2.36a) and (2.49)
1
The precise meaning of the flat-spectrum approximation is different in the two cases: see
Sects. 10.7 and 10.8.
552 CHAPTER 10
1 d 0 √
ρ0 (αu Ju ) = − 1 + n̄ ρ00 (αu Ju ) + 3 n̄ ρ00 (α J )
A(αu Ju → α J ) dt
1
− √ n̄w ρ20 (αu Ju )
2
1 d 2 1
ρ0 (αu Ju ) = − 1 + n̄ − n̄w + δu(2) ρ20 (αu Ju )
A(αu Ju → α J ) dt 2
.
1 3
− √ n̄w ρ00 (αu Ju ) + n̄w ρ00 (α J ) ,
2 2
1
The same simplification occurs for the transitions (J = 1/2, Ju = 1/2) and (J = 1/2, Ju = 3/2),
where lower-level polarization is zero in the presence of an unpolarized, cylindrically symmetrical
radiation field. In fact, the only statistical tensors that can be defined for J = 1/2 are ρ00 and
ρ1Q , and the latter are zero because K is odd.
NON-EQUILIBRIUM ATOMIC PHYSICS 553
where the notations of Eqs. (10.62) have been used. In stationary situations, the
solution can be written in the form
1 w
σ02 (αu Ju ) = √
2 1 + n̄ 1 − w − 12 w2 + δu(2)
1
2
ρ00 (αu Ju ) √ 1 + n̄ 1 − 12 w − 12 w2 + δu(2)
= 3 n̄ . (10.77)
ρ00 (α J ) 1 + n̄ 1 + n̄ − 12 n̄w + δu(2) − 12 n̄2 w2
1 d 0 1
ρ0 (αu Ju ) = − 1 + n̄ ρ00 (αu Ju ) + √ n̄ ρ00 (α J )
A(αu Ju → α J ) dt 3
1
+ √ n̄w ρ20 (α J )
6
1 d 2 1 1
ρ0 (α J ) = − n̄ 1 − w + δ(2) ρ20 (α J )
A(αu Ju → α J ) dt 3 2
1 1
− √ n̄w ρ00 (α J ) + √ n̄w ρ00 (αu Ju ) .
3 2 6
In stationary situations, the solution can be written in the form
√ w
σ02 (α J ) = − 2
2 − w + 2 δ(2) + n̄ 2 − w − w2 + 2 δ(2)
ρ00 (αu Ju ) 1 2 − w − w2 + 2 δ(2)
= √ n̄ . (10.78)
0
ρ0 (α J ) 3 2 − w + 2 δ(2) + n̄ 2 − w − w2 + 2 δ(2)
554 CHAPTER 10
Fig.10.9. Fractional polarization of the radiation scattered at 90 ◦ from the incident (unpolarized)
beam as a function of the beam intensity, for different atomic transitions (J , Ju ) involving integral
J values.
Comparison with Eqs. (10.64) shows that stimulated emission reduces the polariza-
tion of the lower level (except in the extreme case w = 1 and δ(2) = 0 ). The ratio
of the upper to the lower level population depends again in a complicated way on
the different parameters (anisotropy factor, intensity, depolarizing collisions). Only
for an isotropic, unpolarized radiation field (w = 0), or for strong collisional
√ rates
(δ(2) → ∞), the second of Eqs. (10.78) yields the classical value (1/ 3) n̄ /(1 + n̄).
For more complicated transitions, the influence of stimulation effects on atomic
polarization can be evaluated numerically, by solving Eqs. (10.76) – in stationary
situations – together with the trace equation (10.4). Rather than atomic polar-
ization itself, it is probably more instructive to examine the polarization of the
radiation scattered by the atom in a given direction. If we refer for instance to the
geometry of Fig. 10.6 (with B = 0) and consider the frequency-integrated fractional
polarization scattered in the x-direction (i.e., at 90◦ from the symmetry axis of the
incident radiation), taking the y-axis as reference direction for Q, we have from
Eqs. (10.75)
3 wJ(2)J σ02 (αu Ju )
p̃Q = √ u
, p̃U = 0 .
2 2 − wJ(2)J σ02 (αu Ju )
u
Figures 10.9 and 10.10 show the results of such calculations for several atomic
NON-EQUILIBRIUM ATOMIC PHYSICS 555
in the second. The expressions of the rates TS and RS can be derived from
Eqs. (7.14c,f) with the help of Eqs. (2.26a), (2.36a) and (2.49), bearing in mind
that the index K is an even integer and that the only non-zero components of the
radiation field tensor are J00 (ν0 ) and J02 (ν0 )
Fig.10.11. Same as Fig.10.8 for n̄ = 0.5 (intense irradiation of the model atom).
For model atoms composed of more than two levels, the statistical equilibrium
equations become more and more involved and depend on a growing number of
physical parameters. However, according to the specific model atom and to the
characteristics of the incoming radiation, it is sometimes possible to simplify con-
siderably the problem. One of such cases, which can even be treated analytically,
is illustrated in this section.
Consider the three-level model atom of Fig. 10.12, where, by assumption, tran-
sitions between levels a,b and between levels b,c are allowed, while the transition
between levels a,c is forbidden. We assume the incident radiation field at each of
the Bohr frequencies νba , νcb to be very diluted,
where n̄ba , n̄cb are the average numbers of photons per mode at frequencies νba ,
νcb respectively. This situation is often referred to as stepwise excitation. For
simplicity, we assume that there is no magnetic field and we neglect inelastic and
superelastic collisions, while we retain depolarizing collisions.
An order-of-magnitude estimate shows that the statistical tensors of levels a, b,
c are of order
1, n̄ba , n̄ba n̄cb ,
558 CHAPTER 10
respectively. Let us consider the statistical equilibrium equation for the statistical
tensors of the intermediate level (level b). From Eqs. (7.11) and (7.101) we have,
using shorthand notations
d
ρ = TA ρa + TE + TS ρc − RA + RE + RS ρb − D ρb . (10.80)
dt b
The order of magnitude of the different terms is the following (see Eqs. (7.14) and
(10.62))
where Aba and Acb are the Einstein coefficients for spontaneous emission of the
transitions (a, b) and (b, c), respectively. Equations (10.80)-(10.81) show that, to
first order in the number of photons per mode, only the rates TA , RE and D make
a contribution to the time derivative of ρb : it follows that level b is only coupled
with the lower level a.1 The same reasoning shows that stimulated emission can
be disregarded in both transitions (a, b) and (b, c). The three-level model atom
considered can therefore be regarded as the combination of two ‘superposed’ two-
level atoms. In stationary situations, the statistical tensors ρK
Q (b) are obtained as
the solution to the statistical equilibrium equations for the two-level atom (a, b).
The statistical tensors ρKQ (c) are then obtained as the solution to the statistical
equilibrium equations for the two-level atom (b, c), with ρK Q (b) known from the
previous solution.
As an illustration, we consider a three-level model atom with Ja = 0, Jb = 1,
Jc = 2. We assume for simplicity that the incoming radiation (frequency-inde-
pendent around the Bohr frequencies νba and νcb ) is unpolarized and cylindrically
1
This approximation implies indeed the further condition Acb n̄cb Aba , which follows from
Eq. (10.79) provided the order of magnitude of the two Einstein coefficients is the same.
NON-EQUILIBRIUM ATOMIC PHYSICS 559
symmetrical about a fixed direction: this implies, as shown in Sect. 10.7, that only
the statistical tensors with K even and Q = 0 are non-zero. Following Eqs. (10.62),
we introduce the anisotropy factors in the two lines
wba J 2 (ν ) wcb J 2 (ν )
√ = 00 ba , √ = 00 cb .
2 J0 (νba ) 2 J0 (νcb )
(2)
where we have used the value w10 = 1 given by Table 10.1 and where, according
to Eqs. (10.62)
D(2) (αb Jb )
δb(2) = .
Aba
The statistical equilibrium equations for level c – considered as the upper level of
the two-level atom (b, c) – are given by the first of Eqs. (10.60). Using Eqs. (10.61)
and (10.62) we obtain, after some algebra
.
1 d 0 5 1
ρ (α J ) = − ρ00 (αc Jc ) + n̄cb ρ00 (αb Jb ) + √ n̄cb wcb ρ20 (αb Jb )
Acb dt 0 c c 3 2 30
.
1 d 2 1 7
ρ (α J ) = − 1 + δc(2) ρ20 (αc Jc ) + n̄ w ρ0 (α J )
Acb dt 0 c c 2 6 cb cb 0 b b
.
1 7 1
+ n̄cb 1 + wcb ρ20 (αb Jb )
2 3 7
.
1 d 4 3
ρ0 (αc Jc ) = − 1 + δc ρ0 (αc Jc ) + 3
(4) 4
n̄ w ρ2 (α J ) ,
Acb dt 35 cb cb 0 b b
where
D(K) (αc Jc )
δc(K) = .
Acb
In stationary situations, eliminating ρ00 (αb Jb ) and ρ20 (αb Jb ) via Eqs. (10.82), one
finally gets
. (2)
2 1 7 wcb 1 + δb + wba 1 + 17 wcb 1
σ0 (αc Jc ) = (2)
2 10 1
1 + δb + 20 wcb wba 1 + δc(2)
9 wcb wba 1
σ04 (αc Jc ) = √ (2)
5 14 1 + δ b + 1
w w
20 cb ba 1 + δc(4)
ρ00 (αc Jc ) √ 1 wcb wba
= 5 n̄cb n̄ba 1 + . (10.83)
0
ρ0 (αa Ja ) 20 1 + δb(2)
560 CHAPTER 10
These expressions show that the amount of atomic polarization of level c depends
on the anisotropy factors in both transitions and on the depolarizing rates of levels
b and c, while it is independent of the intensity of the radiation field in either tran-
sition (this is due to the basic characteristics of stepwise excitation n̄ba , n̄cb 1,
Acb n̄cb Aba ). Note that σ02 (αc Jc ) – contrary to σ04 (αc Jc ) – is non-zero even if the
radiation field at the frequency νcb of the upper transition is isotropic (wcb = 0).
In the extreme case of maximum anisotropy (wba = wcb = 1) and no depolarizing
collisions (δb(2) = δc(2) = δc(4) = 0), Eqs. (10.83) give
. .
5 10 6 2
σ02 (αc Jc ) = , σ04 (αc Jc ) = . (10.84)
7 7 7 7
pQ 0.652 .
This figure can be compared with the value 0.448, corresponding to the two-level
atom (J = 1, Ju = 2), and with the value 0.288 corresponding to the same model
atom with unpolarized lower level (see Table 10.3). The basic result of stepwise
excitation for the three-level atom considered is to increase the polarization of the
radiation scattered in the upper transition.
Another special case of three-level model atom leading to simple and interesting
results is illustrated in Fig. 10.13. There are two ‘lower’ levels, a and b, but
absorption processes can only take place between one of them (say, level a) and the
upper level c. Level c can then decay to both levels a and b, giving rise to resonance
1 √
This is consistent with the value σ02 (αb Jb ) = 1/ 2 deduced from Eqs. (10.82) for wba = 1
(2)
and δb = 0. According to Table 3.6, such value means that the sublevel Mb = 0 is depopulated.
NON-EQUILIBRIUM ATOMIC PHYSICS 561
d K
ρ (α J ) = TA (αc Jc KQ, αa Ja 0 0) ρ00 (αa Ja )
dt Q c c
− RE (αc Jc KQK Q ) ρKQ (αc Jc ) .
K Q
In stationary situations we obtain, with the use of Eqs. (7.14e), (10.9) and (10.11)1
2Ja + 1 B(αa Ja → αc Jc )
ρK
Q (αc Jc ) = w(K)
2Jc + 1 A(αc Jc → αa Ja ) + A(αc Jc → αb Jb ) Jc Ja
× (−1)Q J−Q
K
(νca ) ρ00 (αa Ja ) . (10.85)
1
Obviously, the schematic model of Fig. 10.13 is not self-consistent: in stationary situations,
all the atoms would be in level b. Here it is understood that level b is coupled (for instance
collisionally) to level a.
562 CHAPTER 10
The emission coefficient in the Raman line is given by Eq. (10.15), which now reads
= hν N 2J + 1 A(α J → α J )
εi (ν, Ω) c c c b b
4π
(K)
× wJ J T Q
K ρK (α J ) φ(ν − ν) .
(i, Ω) (10.86)
c b
Q c c cb
KQ
Substitution of Eq. (10.85) into Eq. (10.86) yields, with the help of Eq. (10.6)
= hν N B(α J → α J ) b φ(ν − ν)
εi (ν, Ω)
4π a a a c c r cb
× WK (Ja , Jc , Jb ) (−1)Q T Q
K J K (ν ) ,
(i, Ω) (10.87)
−Q ca
KQ
A(αc Jc → αb Jb )
br = ,
A(αc Jc → αa Ja ) + A(αc Jc → αb Jb )
and the new symbol WK (J , Ju , J ) is defined by (see Eq. (10.11))1
Comparison of Eqs. (10.87) and (10.16) shows that, apart from the obvious presence
of the branching ratio and the frequency change of the emitted radiation, Raman
scattering is described by an expression strictly similar to that for resonance scat-
tering. The only difference is that the symbol WK (J , Ju ) of resonance scattering
is replaced by the symbol WK (J , Ju , J ). This difference is non-trivial because for
certain combinations of the three J quantum numbers, W2 (J , Ju , J ) turns out to
be negative. In such cases, the direction of Raman scattering polarization produced
by unpolarized incident radiation is parallel to the scattering plane (although the
lower level a is by assumption unpolarized).2
Numerical values of the symbol WK (J , Ju , J ) can be obtained from Eq. (10.88)
and Table 10.1. It is worth noticing that wJ(2)J is negative for J = Ju (except the
u
special case J = Ju = 1/2 for which wJ(2)J = 0) and positive for J = Ju : therefore
u
W2 (J , Ju , J ) is negative when J = J and either Ju = J or Ju = J . From
Eqs. (10.14) and (10.17) it follows that
ν
ν
where K is even and where the rates TA and RA are given by Eqs. (10.61); the equa-
tion for dρK K
0 (αb Jb )/dt is the same as that for dρ0 (αa Ja )/dt with the substitutions
αa → αb , Ja → Jb .
We wish to illustrate the typical results arising from the solution of the statis-
tical equilibrium equations on a particularly simple example. Assuming Ja = 1/2,
Jb = 3/2, Jc = 1/2, the atom is described by the four statistical tensors ρ00 (Ja ),
ρ00 (Jb ), ρ20 (Jb ), ρ00 (Jc ). After some algebra we obtain the following equations
1 d 0
ρ (α J ) = ρ00 (αc Jc ) − n̄ca ρ00 (αa Ja )
A(αc Jc → αa Ja ) dt 0 a a
1 d 0 1 1
ρ (α J ) = √ ρ00 (αc Jc ) − n̄cb ρ00 (αb Jb )
A(αc Jc → αb Jb ) dt 0 b b 2 2
1
− n̄cb wcb ρ20 (αb Jb )
4
1 d 2 1 1
ρ (α J ) = − n̄cb wcb ρ00 (αb Jb ) − n̄cb 1 + δb(2) ρ20 (αb Jb ) ,
A(αc Jc → αb Jb ) dt 0 b b 4 2
where, according to Eqs. (10.62) and (10.63), we have introduced the anisotropy
factor
wcb J 2 (ν )
√ = 00 cb ,
2 J0 (νcb )
the average numbers of photons per mode (much less than unity by assumption)
B(αa Ja → αc Jc ) 0 B(αb Jb → αc Jc ) 0
n̄ca = J (ν ) , n̄cb = 2 J (ν ) ,
A(αc Jc → αa Ja ) 0 ca A(αc Jc → αb Jb ) 0 cb
and the quantity
D(2) (αb Jb )
δb(2) = 2 .
n̄cb A(αc Jc → αb Jb )
In stationary situations, the solution to these equations can be written in the form
√
ρ00 (αb Jb ) 4 2 1 + δb(2) n̄ca
= (10.89a)
0
ρ0 (αa Ja ) 4 1 + δb − wcb n̄cb
(2) 2
w
σ02 (αb Jb ) = − cb (2) (10.89b)
2 1 + δb
ρ00 (αc Jc )
= n̄ca . (10.89c)
ρ00 (αa Ja )
These expressions are very interesting since they predict that, depending on the
physical situation, different ‘kinds’ of population inversion between the lower levels
a and b may occur.
From Eqs. (10.89a) and (3.108), the ratio of the overall populations of levels a
and b is given by
nα J 8 1 + δb(2) n̄ca
b b
= . (10.90)
nα J
a a
4 1 + δb − wcb n̄cb
(2) 2
NON-EQUILIBRIUM ATOMIC PHYSICS 565
If the incident radiation is isotropic (wcb = 0), the populations of the two levels
are proportional to the corresponding statistical weights when n̄ca = n̄cb ,
nα Jb
b
=2;
nα Ja
a
ρ(αb Jb , Mb = ± 3/2) w 4 1 + δb(2) n̄ca
= 1 − cb (2) . (10.92)
ρ(αa Ja , Ma = ± 1/2) 2 1 + δb 4 1 + δb(2) − wcb2 n̄
cb
566 CHAPTER 10
Fig.10.15. The Zeeman sublevels (artificially split) corresponding to levels a and b of the model
atom of Fig.10.14, for Ja = 1/2, Jb = 3/2, Jc = 1/2. Under the conditions specified in the text,
their populations are proportional to the number of dots drawn on each of them.
, n̄ -2 , -, .,
n̄ - n̄ -, n̄ -
ca
+ 4 1 + δb(2)
1 − ca − ca < wcb < 2 1 + δb(2) 1 − ca ,
n̄cb n̄cb n̄cb n̄cb
hνba hνba
2 < wcb < 2 .
kB T kB T
are negligible, the results are independent of the values of the Einstein coefficients
of the transitions connecting the lower levels a,b with the upper level c. Under
the same condition, the results are not affected by the radiation intensity at the
individual frequencies νca and νcb , they only depend on their ratio. It follows that,
provided this ratio is not too far from unity, selective population inversion can take
place irrespective of the distance between the atom and the radiating source.
It should be remarked that selective population inversion (without overall popu-
lation inversion) can easily arise also in the ‘inverted’ three-level model atom having
Ja = 3/2, Jb = 1/2, Jc = 1/2. In the case of maximum anisotropy, neglecting de-
polarizing collisions and assuming n̄ca /n̄cb = 1, one obtains the same result as in
Fig. 10.15 except for the exchanged energy position of the two levels. The sublevels
M = ± 1/2 of the (upper) level J = 1/2 show population inversion with respect to
the levels M = ± 3/2 of the (lower) level J = 3/2.
where [ρK
Q (αJ)]
(n)
is the n-th order contribution to the statistical tensor ρK Q (αJ).
Substitution of Eq. (10.95) into the statistical equilibrium equations leads to an
equality between two power series in λ. Since λ is arbitrary, the coefficients of each
power must separately be equal.
To the lowest order, the radiative rates TA , TS , RA , RS (the only rates connecting
statistical tensors with different K, Q values) contain only the J00 component of
568 CHAPTER 10
the radiation field tensor. Bearing in mind their expressions (Eqs. (7.14)), it is
easily seen that the statistical equilibrium equations split into separate systems of
equations, each one involving statistical tensors with the same value for K and Q.
Since the trace equation involves only populations (proportional to ρ00 ), in station-
ary situations all the statistical tensors with K = 0 are zero. For K = 0 we obtain,
with the use of Eqs. (7.14), (2.26a), (2.36a) and (2.49)
d √ (0) (0)
2J + 1 ρ00 (αJ) = B(α J → αJ) J00 (ναJ, α J ) 2J + 1 ρ00 (α J )
dt
α J
(0)
+ A(αu Ju → αJ) + B(αu Ju → αJ) J00 (να J , αJ ) 2Ju + 1 ρ00 (αu Ju )
u u
αu Ju
− B(αJ → αu Ju ) J00 (να J , αJ ) + A(αJ → α J )
u u
αu Ju α J
√ (0)
+ B(αJ → α J ) J00 (ναJ, α J ) 2J + 1 ρ00 (αJ)
α J
(0)
+ CI(0) (αJ, α J ) 2J + 1 ρ00 (α J )
α J
(0)
+ CS(0) (αJ, αu Ju ) 2Ju + 1 ρ00 (αu Ju )
αu Ju
√ (0)
− CI(0) (αu Ju , αJ) + CS(0) (α J , αJ) 2J + 1 ρ00 (αJ) , (10.96)
αu Ju α J
and the trace equation reads (see Eqs. (3.84) and (3.108))
(0) √ (0)
nαJ = 2J + 1 ρ00 (αJ) =1.
αJ αJ
Thus the zero-order equations coincide with the usual statistical equilibrium equa-
tions for the level populations of the scalar (unpolarized) case – see e.g. Mihalas,
1978).
Passing to the first order, it is convenient to consider separately the equations
Q (αJ)] /dt with K = 0. The former turn out to
for d [ρ00 (αJ)](1) /dt and for d [ρK (1)
This means that, in stationary situations, the first-order correction to ρ00 (αJ) is
identically zero. For K = 0 we obtain, using again Eqs. (7.14), (2.26a), (2.36a) and
(2.49)
NON-EQUILIBRIUM ATOMIC PHYSICS 569
d K (1) (1)
ρQ (αJ) = −2πi νL gαJ Q ρKQ (αJ)
dt .
2J + 1 & (1)
+
B(α J → αJ) yJJ (K)
J00 (ναJ, α J ) ρK Q (α J )
2J + 1
α J
(K)
(0) '
+ wJJ (−1)Q J−Q K
(ναJ, α J ) ρ00 (α J )
.
2J + 1 (1)
+ u
A(αu Ju → αJ) yJ(K)J ρK Q (αu Ju )
2J + 1 u
αu Ju
. &
2J + 1 (1)
+ u
B(αu Ju → αJ) yJJ (K)
J00 (να J , αJ ) ρK Q (αu Ju )
2J + 1 u u u
αu Ju
(K)
(0) '
+ wJJ (−1)K+Q J−Q K
(να J , αJ ) ρ00 (αu Ju )
u u u
& K (1)
− B(αJ → αu Ju ) J0 (να J , αJ ) ρQ (αJ)
0
u u
αu Ju
(K)
(0) '
+ wJJ (−1)K+Q J−Q
K
(να J , αJ ) ρ00 (αJ)
u u u
K (1)
− A(αJ → α J ) ρQ (αJ)
α J
& (1)
− B(αJ → α J ) J00 (ναJ, α J ) ρK
Q (αJ)
α J
(K)
(0) '
+ wJJ (−1)Q J−Q
K
(ναJ, α J ) ρ00 (αJ)
.
2J + 1 (K) (1)
+ C (αJ, α J ) ρK
Q (α J )
2J + 1 I
α J
.
2Ju + 1 (K) (1)
+ C (αJ, αu Ju ) ρK
Q (αu Ju )
2J + 1 S
αu Ju
& (0) ' (1)
− CI(0) (αu Ju , αJ) + CS (α J , αJ) + D(K) (αJ) ρKQ (αJ) . (10.97)
αu Ju α J
(K)
In this equation, the symbol wJJ is defined as in Eq. (10.11),
!
(K) 1+J+J 1 1 K
wJJ = (−1) 3(2J + 1) , (10.98)
J J J
(K)
while the symbol yJJ , invariant under interchange of J and J , is defined by
!
1+J+J +K
J J K
y (K)
JJ =y (K)
JJ = (−1) (2J + 1)(2J + 1) , (10.99)
J J 1
570 CHAPTER 10
d K (1) (1)
ρQ (αi Ji ) = aij ρKQ (αj Jj )
dt j
(0)
+ bij ρ00 (αj Jj ) (K = 0) , (10.100)
j
where the indices i and j run over all the atomic levels (excluding, obviously, those
for which the multipole moment ρK Q is undefined). The coefficients aij contain the
contributions of the magnetic field, of collisions, and of the radiative rates: the
last one, however, is restricted to terms involving the only component J00 of the
radiation field tensor (evaluated at the Bohr frequency connecting levels i and j).
The coefficients bij contain only contributions of the radiative rates (absorption
K
and stimulated emission) restricted to terms involving the components J−Q of the
radiation field tensor with K = 0.
The structure of Eq. (10.100) immediately suggests the method to solve (in sta-
tionary situations, and up to the first order) the statistical equilibrium equations
under the weak anisotropy approximation. First, the values of [ρ00 (αj Jj )](0) are
obtained by solving Eq. (10.96) – the statistical equilibrium equations for the
level populations. These values are then substituted into the right-hand side of
Eq. (10.100) which reduces to an algebraic, inhomogeneous system in the unknowns
[ρK
Q (αi Ji )]
(1)
with K = 0. The substantial simplification introduced by the weak
anisotropy approximation is that this system is formed by a set of uncoupled sub-
systems, each characterized by its own K, Q values.
As apparent from the structure of Eq. (10.100), the solutions [ρK
Q (αi Ji )]
(1)
will be
K
linear combinations of the multipole components J−Q (evaluated at the different
Bohr frequencies of the model atom considered) with the same K, Q values. An
obvious consequence of this fact is that in the weak anisotropy approximation only
statistical tensors with K ≤ 2 can be non-zero (to first order).
In this section we go back to the two-level model atom, already analyzed from
several points of view, and we solve the statistical equilibrium equations under the
assumption of weak anisotropy of the radiation field. Following the method outlined
NON-EQUILIBRIUM ATOMIC PHYSICS 571
in the former section, we will first determine (from the zero-order equations) the
statistical tensors ρ00 , then (from the first-order equations) the statistical tensors
Q with K = 0, both for the upper and the lower level.
ρK
From Eqs. (10.96) we obtain, in stationary situations
where ν0 is the transition frequency and where, for conciseness, the index (0) of
the statistical tensors has been omitted. This expression can be simplified by
dividing numerator and denominator by the Einstein coefficient A(αu Ju → α J ).
Using Eqs. (7.8) and the Einstein-Milne relation between inelastic and superelastic
collisional rates (Eq. (7.98), valid for a Maxwellian distribution of velocities), we
get
ρ00 (αu Ju ) 2Ju + 1 n̄ + b(Tc)
0 = , (10.101)
ρ0 (α J ) 2J + 1 1 + n̄ +
where the dimensionless parameters n̄ and are given by Eqs. (10.62) and (10.51),
respectively, and where b(Tc ) is the Boltzmann factor
hν0
−
kB Tc
b(Tc ) = e ,
n̄ 1 + − b(Tc) 1
K
σQ (α J ) = wJ(K)J yJ(K)J 1 + n̄ + (K)
u
1 + n̄ + ∆ u u
!
− (−1)K wJ(K)J 1 + n̄ + + δu(K) + i Hu Q (−1)Q sK
−Q (ν0 ) , (10.102)
u
1 (K) (K)
We recall that yJ J = yJ J (see Eq. (10.99)).
u u
572 CHAPTER 10
where K = 1, 2 and
∆ = 1 + n̄ + + δu(K) + i Hu Q n̄ 1 + δ(K) + i H Q + b(Tc )
− n̄ yJ(K)J 1 + n̄ + (K) u yJ(K)J + (K)
.
u u
K
σQ (α J ) = (−1)1+K x(K) Q K
J J (−1) s−Q (ν0 ) , (10.105)
u
NON-EQUILIBRIUM ATOMIC PHYSICS 573
− (−1)
(K) K (K) (K)
wJJ wJ J yJJ
x(K)
JJ = (K) 2 = (−1)1+J+J 3(2J + 1)
1 − yJJ
! ! !
1 1 K J+J 1 1 K J J K
+ (−1) (2J + 1)
J J J J J J J J 1
× !2 . (10.106)
J J K
1 − (2J + 1)(2J + 1)
J J 1
Using Eqs. (10.104), (10.62) and (10.93), the first of Eqs. (10.105) can be rewritten
in the form
This is identical with Eq. (10.13) except for the presence of the quantity x(K)
J J in the u
place of wJ(K)J . It follows that for the two-level atom under the weak anisotropy
u
approximation, resonance scattering is still described by Eq. (10.19), where the
scattering phase matrix is now given by
Pij (Ω, ) =
Ω XK (J , Ju ) (−1)Q T Q
K
(i, Ω) ) ,
T K (j, Ω (10.107)
−Q
KQ
with
!2 ! ! !
1 1 K 1 1 K 1 1 K J J K
+ (−1)J+J (2J + 1)
J J J J J J J J J J J 1
× !2
J J K
1 − (2J + 1)(2J + 1)
J J 1
for K = 1, 2, and
X0 (J, J ) ≡ 1 . (10.108)
Obviously, it should be kept in mind that the incident radiation field in the right-
hand side of Eq. (10.19) has now to satisfy the weak anisotropy approximation
described by Eq. (10.94). For instance, we cannot use Eq. (10.19) with the phase
matrix of Eq. (10.107) to describe the scattering of a radiation beam, because the
weak anisotropy approximation does not hold in that case.
1
Note that for K = 0 the symbol is indeterminate (being of the form 0/0). This is
not surprising, since Eqs. (10.105) are a special case of Eqs. (10.102), which hold for K = 1, 2.
(0)
Equations (10.105) are correct also for K = 0 if we set by definition xJ J = 1 in the former and
(0) u
xJ J = −1 in the latter.
u
574 CHAPTER 10
Numerical values of the symbols yJ(K) (K)
J , xJ J and XK (J, J ) are given, for different
transitions, in Table 10.4. Table 10.5 collects the analytical expressions of X1 (J, J )
and X2 (J, J ) obtained via Eqs. (2.36d) and (2.36h). Note that, contrary to the
symbols W1 and W2 , which satisfy the inequality 0 ≤ WK ≤ 1, the symbols X1 and
X2 can be negative as well as larger than unity. In particular, X2 is negative for
all transitions having Ju = J − 1 (except (J = 1, Ju = 0) and (J = 3/2, Ju = 1/2)
for which X2 is zero). As illustrated at the end of Sect. 10.7, this is due to the
presence of lower-level atomic polarization.
K
σQ (α J ) = (−1)1+K
wJ(K)J 1 + δu(K) − (−1)K wJ(K)J yJ(K)J
× u
u
(K) 2
u
(−1)Q sK
−Q (ν0 ) . (10.109)
(K)
1 + δu (K)
1 + δ − yJ J
u
Substitution of Eqs. (10.104), (10.62) and (10.93) into the first of Eqs. (10.109)
yields, for K = 0
2J + 1 B(α J → αu Ju )
ρK
Q (αu Ju ) =
2Ju + 1 A(αu Ju → α J )
wJ(K)J 1 + δ(K) − (−1)K wJ(K)J yJ(K)J
× u
u
2
u
(−1)Q J−Q
K
(ν0 ) ρ00 (α J ) .
1 + δu(K) 1 + δ(K) − yJ(K)J
u
Comparison with Eq. (10.13) shows that resonance scattering is still described by
Eq. (10.19), with a phase matrix given by
Ω
Pij (Ω, ) = XK (J , Ju ) d.c. (−1)Q T Q
K
(i, Ω) ) ,
T K (j, Ω
−Q
KQ
where the new symbol (the symbol XK (J , Ju ) ‘modified by depolarizing collisions’)
is defined to be
(K) 2
wJ J 1 + δ(K) − (−1)K wJ(K)J wJ(K)J yJ(K)J
u u u u
XK (J , Ju ) d.c. = 2 (K = 1, 2)
1 + δu(K) 1 + δ(K) − yJ(K)J
u
X0 (J , Ju ) d.c. ≡ 1 . (10.110)
NON-EQUILIBRIUM ATOMIC PHYSICS 575
TABLE 10.4
J J yJ(1)
J x(1)
J J
X1 (J, J ) yJ(2)
J x(2)
J J
X2 (J, J )
0 1 0. −1. 1. 0. 1. 1.
1/2 1/2 −0.333 −0.612 0.5 0. 0. 0.
1/2 3/2 0.745 −1.369 1.25 0. 0.707 0.5
1 0 0. 0. 0. 0. 0. 0.
1 1 0.5 −1. 0.5 −0.5 −1. 0.5
1 2 0.866 −1.732 1.5 0.592 0.819 0.485
3/2 1/2 0.745 −0.612 −0.25 0. 0. 0.
3/2 3/2 0.733 −1.369 0.5 0.2 −0.471 0.267
3/2 5/2 0.917 −2.092 1.75 0.748 0.962 0.509
2 1 0.866 −1. −0.5 0.592 −0.385 −0.038
2 2 0.833 −1.732 0.5 0.5 −0.394 0.233
2 3 0.943 −2.449 2. 0.828 1.113 0.545
5/2 3/2 0.917 −1.369 −0.75 0.748 −0.579 −0.082
5/2 5/2 0.886 −2.092 0.5 0.657 −0.365 0.221
5/2 7/2 0.958 −2.806 2.25 0.875 1.268 0.587
3 2 0.943 −1.732 −1. 0.828 −0.753 −0.127
3 3 0.917 −2.449 0.5 0.75 −0.350 0.214
3 4 0.968 −3.162 2.5 0.905 1.424 0.631
7/2 5/2 0.958 −2.092 −1.25 0.875 −0.920 −0.174
7/2 7/2 0.937 −2.806 0.5 0.810 −0.341 0.211
7/2 9/2 0.975 −3.518 2.75 0.925 1.581 0.677
4 3 0.968 −2.449 −1.5 0.905 −1.084 −0.221
4 4 0.95 −3.162 0.5 0.85 −0.335 0.208
4 5 0.980 −3.873 3. 0.939 1.738 0.724
9/2 7/2 0.975 −2.806 −1.75 0.925 −1.246 −0.269
9/2 9/2 0.960 −3.518 0.5 0.879 −0.331 0.206
9/2 11/2 0.983 −4.228 3.25 0.950 1.896 0.771
5 4 0.980 −3.162 −2. 0.939 −1.407 −0.318
5 5 0.967 −3.873 0.5 0.9 −0.329 0.205
5 6 0.986 −4.583 3.5 0.958 2.054 0.819
11/2 9/2 0.983 −3.518 −2.25 0.950 −1.568 −0.366
11/2 11/2 0.972 −4.228 0.5 0.916 −0.327 0.204
11/2 13/2 0.988 −4.937 3.75 0.964 2.212 0.867
6 5 0.986 −3.873 −2.5 0.958 −1.727 −0.415
6 6 0.976 −4.583 0.5 0.929 −0.325 0.204
6 7 0.990 −5.292 4. 0.969 2.370 0.916
576 CHAPTER 10
TABLE 10.4
(continued)
TABLE 10.5
X1 (J, J ) X2 (J, J )
J +2 (J + 2)(2J + 1)(2J + 5)
J = J + 1 2 10 (4J 2 + 8J + 1)
1 (2J − 1)(2J + 3)
J = J 2 10 (2J 2 + 2J − 3)
(J − 1)(2J + 1)(2J − 3)
J = J − 1 −J − 1 −
2 10 (4J 2 − 3)
This symbol satisfies the obvious relations (cf. Eqs. (10.106), (10.108), (10.17),
(10.54))
lim XK (J , Ju ) d.c. = XK (J , Ju )
(K)
δ →0
(K)
δu →0
lim XK (J , Ju ) d.c. = WK (J , Ju )
(K)
δ →∞
(K)
δu →0
W (J , J )
lim XK (J , Ju ) d.c. = K (K)u .
(K)
δ →∞ 1 + δu
K
σQ (α J ) = (−1)1+K
wJ(K)J 1 + δu(K) + i Hu Q − (−1)K wJ(K)J yJ(K)J
× u
u
u Q K
2 (−1) s−Q (ν0 ) . (10.111)
1 + δu(K) + i Hu Q 1 + δ(K) + i H Q − yJ(K)J
u
Using Eqs. (10.104), (10.62) and (10.93), the first of Eqs. (10.111) can be rewritten
in the form
2J + 1 B(α J → αu Ju )
ρK
Q (αu Ju ) =
2Ju + 1 A(αu Ju → α J )
wJ(K)J 1 + δ(K) + i H Q − (−1)K wJ(K)J yJ(K)J
× u (K)
u
u Q K 0
2 (−1) J−Q (ν0 ) ρ0 (α J ) .
1 + δu + i Hu Q 1 + δ(K) + i H Q − yJ(K)J
u
Comparison with Eq. (10.27) shows that the frequency-integrated radiation emitted
in a scattering process is still given by Eq. (10.32), where the phase matrix is now
Ω
Pij (Ω, ; B
)= XKQ (J , Ju ; B) d.c. (−1)Q T Q
K ) , (10.112)
T K (j, Ω
(i, Ω) −Q
KQ
1
The comparison must of course be restricted to statistical tensors of the form ρ00 or ρ20 .
We recall that, when the radiation field is characterized by the only components J00 and J02 and
the magnetic field is zero, the only non-null statistical tensors are those with K even and Q = 0;
by contrast, under the weak anisotropy approximation, the only non-null statistical tensors are
those with K ≤ 2 and any Q.
578 CHAPTER 10
with the symbol [XKQ (J , Ju ; B)]d.c. – which, contrary to the symbols XK (J , Ju )
and [XK (J , Ju )]d.c. , depends also on the quantum number Q – defined by
XKQ (J , Ju ; B) d.c. =
(K) 2
wJ J 1 + δ(K) + i H Q − (−1)K wJ(K)J wJ(K)J yJ(K)J
u u u u
= 2 (K = 1, 2)
1 + δu(K) + i Hu Q 1 + δ(K) + i H Q − yJ(K)J
u
X00 (J , Ju ; B) d.c. ≡ 1 . (10.113)
As an application of Eq. (10.112), we can evaluate the average of the scattering
phase matrix over an isotropic distribution of magnetic fields. Along the lines of
Sect. 5.12 we obtain (cf. Eq. (10.36))
Ω
Pij (Ω, ; B
) = (K)
MK (B) Rij (Ω, Ω ; 0) ,
K
(K)
where the matrix Rij (Ω, Ω ; 0) is defined in Eq. (10.21) and the quantity MK (B)
– the quantum analogue of the factor µK defined in Eq. (5.172) – is the average of
[XKQ (J , Ju ; B)]d.c. over all possible values of Q
1
MK (B) = XKQ (J , Ju ; B) d.c. .
2K + 1
Q
d = 1 + δu(K) H + 1 + δ(K) Hu
e = −H Hu .
After some algebra we obtain
M0 (B) = 1
1 a a(c + e) + bd
M1 (B) = +2
3 c (c + e)2 + d2
1 a a(c + e) + bd a(c + 4e) + 4bd
M2 (B) = +2 +2 . (10.114)
5 c (c + e)2 + d2 (c + 4e)2 + 4d2
1
Note that the quantities a, b, c, d depend on K.
NON-EQUILIBRIUM ATOMIC PHYSICS 579
We can consider some interesting limits of these expressions. For the transition
(J = 0, Ju = 1), neglecting depolarizing collisions, Eqs. (10.114) give the ‘classical’
result of Eqs. (5.173) with Hu in the place of H (see Eqs. (10.103) and Tables 10.1
and 10.4). For zero magnetic field, Eqs. (10.114) obviously reduce to Eqs. (10.110),
lim MK (B) = XK (J , Ju ) d.c. .
B→0
K
σQ (α J ) = (−1)1+K
wJ(K)J 1 + δu (K) + i Hu Q − (−1)K wJ(K)J yJ(K)J (−1)Q sK
−Q (ν0 )
× u
u
u
,
1 + δu (K) + i Hu Q 1 + δ(K) + i H Q − yJ(K)J
2 1 + n̄
u
1
We recall that Eq. (10.112) is valid for n̄ 1, which implies H Hu .
580 CHAPTER 10
where
δu(K) Hu
δu (K) = , Hu = .
1 + n̄ 1 + n̄
Comparison with Eqs. (10.111) shows that the effect of stimulated emission is
twofold. On the one hand, the efficiency of collisions and of the magnetic field in
depolarizing the upper level is reduced:1 this is a consequence of the fact that the
lifetime of the upper level is now shorter. On the other hand, atomic polarization
is globally decreased by a factor (1 + n̄). This is consistent with the results derived
in Sect. 10.9.
In the previous sections of this chapter we have studied the interaction of an atomic
system with the radiation field by referring to the simplest model atom, that we
denoted as multi-level. In that model the atomic energy states are ‘uncorrelated’
J-levels, and atomic transitions take place between two such levels. Now we extend
this study to the so-called multi-term atom considered in Sects. 7.5-7.7, which
describes the atomic structure by taking into account (under the assumption of
L-S coupling) the spin-orbit interaction. In this model, transitions involve an
upper and a lower term, i.e., two ‘groups’ of J-levels, each characterized by the
same values of the quantum numbers L and S. Transitions are now associated with
a set of spectral lines (the components of the multiplet) rather than with a single
line. Atomic polarization involves coherences both between magnetic sublevels of
each J-level and between magnetic sublevels of the different J-levels of each term.
Coherences of the latter kind (which do not exist, by definition, in the multi-level
atom) are responsible for several remarkable phenomena, both in the presence and
in the absence of a magnetic field.
The following treatment of the multi-term atom is strictly analogous to our pre-
vious treatment of the multi-level atom. In particular, we restrict attention to
a simplified atomic model composed of just two terms. However, two important
points should be emphasized. The first one concerns collisions, which – contrary
to the previous case – will not be considered. This is because the theory of col-
lisions developed in Chap. 7 is only valid for the multi-level atom. Extension of
such theory to the multi-term atom is rather complicated and is outside the aims
of this book. The second point concerns the flat-spectrum approximation. We still
assume that the radiation impinging on the atomic system satisfies such approx-
imation – this is indeed required by the general equations (statistical equilibrium
and radiative transfer) established in Chap. 6: from this point of view, there is no
difference from the multi-level case. However, the range over which the radiation
field must be frequency-independent is much larger in the present case: neglecting
the magnetic field, it basically coincides with the entire frequency range spanned by
1 (K)
A similar effect is also present in the lower level, since the quantities δ and H are
inversely proportional to n̄ (cf. Eqs. (10.63) and (10.103)).
NON-EQUILIBRIUM ATOMIC PHYSICS 581
the multiplet, to be compared with the (natural plus collisional) frequency width
of a single line in the multi-level case. For instance, the relevant wavelength range
for the CaII doublet is about 100 Å (see Sect. 10.17). These two points (exclusion
of collisions and flat-spectrum approximation) obviously set severe constraints to
the applicability of the following results to real cases.
As in the multi-level case, we will often introduce certain approximations (con-
cerning either the atomic system or the radiation field) in order to simplify the
equations and to obtain analytical expressions for the atomic polarization and for
the radiation emitted in scattering processes. These approximations are briefly
recalled by subheadings at the beginning of each section.
Let us consider a two-term model atom, and let us denote by (β L S) and (βu Lu S)
the quantum numbers characterizing the lower and upper term, respectively (both
in L-S coupling).1 The atom is interacting with an anisotropic, polarized radiation
field specified by the Stokes vector Ii (ν, Ω). A magnetic field is present, with asso-
ciated Larmor frequency νL (see Eq. (3.10)) and direction Ω . The flat-spectrum
B
approximation requires that the radiation field is frequency-independent across an
interval ∆ν centered at νβ L S, β L S (the Bohr frequency corresponding to the
u u
transition between the two terms, see Eq. (7.22)) and encompassing the Bohr fre-
quencies of all the transitions between the different magnetic sublevels originating,
in the Paschen-Back effect regime, from the two terms.2 We fix a reference sys-
tem with the z-axis along the magnetic field direction, and we describe the atom
through the multipole moments of the density matrix. From Eq. (7.38) we get, for
the upper term
d βL SK
u u ρ (J ,
Q u u J ) = −2πi Nβ L S (KQJu Ju , K Q Ju Ju ) βu Lu SρK
Q (Ju , Ju )
dt
u u
K Q Ju Ju
+ TA (βu Lu SKQJu Ju , β L SK Q J J ) β L SρK
Q (J , J )
K Q J J
− RE (βu Lu SKQJu Ju K Q Ju Ju )
K Q Ju Ju
βu Lu S K
+ RS (βu Lu SKQJu Ju K Q Ju Ju ) ρQ (Ju , Ju ) , (10.115)
1
The spectroscopic notation is the same as that employed in the whole book: L is the orbital
angular momentum quantum number of the electronic cloud and S is the spin, while β describes
the electronic configuration. The spin of the lower and upper term is the same, so that a dipole
transition is allowed between them.
2
Such frequencies, denoted by νβ L Sju Mu , β L Sj M , are defined in Eq. (7.26).
u u
582 CHAPTER 10
βu Lu S K
+ TS (β L SKQJ J , βu Lu SK Q Ju Ju ) ρQ (Ju , Ju )
− RA (β L SKQJ J K Q J J ) β L SρK
Q (J , J ) . (10.116)
K Q J J
The expressions of the rates are given in Sect. 7.6. We remind the reader that TA ,
TE , and TS are the radiative transfer rates due to absorption, spontaneous emission
and stimulated emission, respectively, while RA , RE , and RS are the corresponding
relaxation rates. N is the kernel containing the contributions of fine structure and
of the magnetic field.
Equations (10.115) and (10.116) form a system of linear, differential equations in
the unknowns β L SρKQ (J , J ) and ρQ (Ju , Ju ). In stationary situations the left-
βu Lu S K
hand sides are zero, and the equations reduce to an algebraic, linear, homogeneous
system. The number of equations (and of unknowns) is given by
Neq = (2S + 1)2 (2L + 1)2 + (2Lu + 1)2 , (10.117)
and the determinant of the system is zero.1 The normalization condition is provided
by the trace equation, which reads (see Eqs. (3.84), (3.101), (3.108))
β L S 0 βu Lu S 0
2J + 1 ρ0 (J , J ) + 2Ju + 1 ρ0 (Ju , Ju ) =1, (10.118)
J Ju
N + Nu = N ,
with
N = N 2J + 1 β L S 0
ρ0 (J , J )
J
Nu = N 2Ju + 1 βu Lu S 0
ρ0 (Ju , Ju ) . (10.119)
Ju
Once the statistical equilibrium equations are solved, the radiative transfer coeffi-
cients can be found from Eqs. (7.47).
As obvious from Eq. (10.117), the system contains a large number of equations
even for the simplest transitions. Analytical solutions are therefore impracticable
unless some approximation is introduced. One of the most natural and fruitful
1
Using the expressions of the rates and of the kernel N , it can be shown directly that the
equations for the time derivatives of the linear combinations
% β L S 0
% βu Lu S 0
J
2J + 1 ρ0 (J ,J ) and Ju
2Ju + 1 ρ0 (Ju , Ju )
(the populations of the lower and upper term) are identical, except for the opposite sign.
NON-EQUILIBRIUM ATOMIC PHYSICS 583
approximations, that will often be used in the following sections, is to neglect the
atomic polarization of the lower term. Although collisions are not included in
Eqs. (10.115)-(10.116), it is reasonable to believe, by analogy with the two-level
case (cf. Sect. 10.7), that such unpolarized lower term approximation is roughly
satisfied in the presence of depolarizing collisions and a weak radiation field (which
also implies that stimulation effects should be neglected).
Let us examine how the statistical equilibrium equations are modified by this
approximation. The condition that all the sublevels of the lower term are evenly
populated and that no coherence is present between them can be written in the
form
ρβ L S (j M , j M ) = δj j δM M C , (10.120)
The constant C is proportional to the overall population of the lower term. Sub-
stitution of Eq. (10.121) into the first of Eqs. (10.119) yields
1 N
C= . (10.122)
(2S + 1)(2L + 1) N
The equations for the statistical tensors of the upper term can now be derived
from Eq. (10.115) by neglecting the relaxation rate due to stimulated emission and
by making use of Eqs. (10.121) and (10.122). Recalling Eq. (7.46b) we obtain, in
stationary situations
A(βu Lu S → β L S) βu Lu SρK
Q (Ju , Ju ) +
βu Lu S K
+ 2πi Nβ L S (KQJu Ju , K Q Ju Ju ) ρQ (Ju , Ju ) =
u u
K Q Ju Ju
2J + 1 N
= TA (βu Lu SKQJu Ju , β L S 0 0J J ) , (10.123)
(2S + 1)(2L + 1) N
J
where the kernel N is given by Eq. (7.41) and the transfer rate TA by Eq. (7.45a).
Taking into account Eqs. (2.26a) and (2.49), the latter can be rewritten in the form
In the following sections we will analyze the consequences of Eq. (10.123) both in
the non-magnetic and in the magnetic regime.
× (−1)1+J +Ju +Q (2J + 1)
J
! ! !
1 1 K Lu L 1 Lu L 1
× K
J−Q (νβ L S, β L S ) ,
Ju Ju J J Ju S J Ju S u u
To obtain the radiation emitted by the atom along a given direction Ω, we have to
substitute Eq. (10.126) into the expression of the emission coefficient for the multi-
term atom. This is given by Eq. (7.47e) for the general case where a magnetic field
is present. In the non-magnetic case it becomes much simpler, since the following
substitutions are to be performed
CJj (βLS, M ) → δjJ , Φ(νβ L Sju Mu , β L Sj M − ν) → Φ(νβ L SJu , β L SJ − ν) ,
u u u u
NON-EQUILIBRIUM ATOMIC PHYSICS 585
where
Eβ L S (Ju ) − Eβ L S (J )
u u
νβL SJu , β L SJ = ,
u u h
with EβLS (J) the eigenvalue of the spin-orbit Hamiltonian corresponding to the
eigenvector |βLSJM (see Eq. (3.61a)). With the help of Eqs. (2.34), (3.100) and
(5.158) we obtain1
2 4
= h ν N (2L + 1) B(β L S → β L S)
εi (ν, Ω) u u u
2πc2
× (−1)1+J +Ju (2J + 1) 3(2Ju + 1)(2Ju + 1)
KQ Ju Ju J
! ! !
Lu L 1 Lu L 1 1 1 K
×
J Ju S J Ju S Ju Ju J
× TQ
K
(i, Ω) ρQ (Ju , Ju )
βu Lu S K
1
× Φ(νβ L SJ , β L SJ − ν) + Φ(νβ L SJ , β L SJ − ν)∗ . (10.127)
2 u u u u u u
1
Similar calculations have been performed to derive Eq. (7.48g) from Eq. (7.47e).
2 A A
The quantity kM – the analogue of kL of Eq. (9.5) for a multiplet – is defined as the integral
over frequency of the absorption coefficient η0A (ν, Ω)
for a two-term atom with unpolarized lower
term. The integral is easily evaluated using Eq. (7.47a) – or equivalently Eq. (7.48b) – and taking
into account Eqs. (10.121) and (10.122). The expression in Eq. (10.128) implies the approximation
ν ≈ νβ L S, β L S .
u u
586 CHAPTER 10
where the interval ∆ν is sufficiently broad to fully cover all the lines of the multiplet.
Since the Φ profiles are normalized to unity in frequency (Eqs. (6.59a-c)), we can
perform the summation over J via Eq. (2.41). Writing the radiation field tensor in
terms of the Stokes parameters via Eqs. (5.157), we obtain the expression (see the
analogous Eq. (10.19) valid for the two-level atom with unpolarized lower level)
5
dΩ
3
= kA
ε̃i (Ω) Ω
Pij (Ω, ) I (ν ) ,
,Ω (10.131)
M fs j βu Lu S, β L S
4π j=0
Ω
where [Pij (Ω, )] , the fine-structure scattering phase matrix, is given by
fs
Ω
Pij (Ω, ) = WK (β L Sβu Lu ) fs (−1)Q T Q
K
(i, Ω) ) ,
T K (j, Ω (10.132)
fs −Q
KQ
with
3(2Lu + 1)
WK (β L Sβu Lu ) fs =
2S + 1
!2 !2
1 1 K Lu Lu K
× (2Ju + 1)(2Ju + 1)
Lu Lu L Ju Ju S
Ju Ju
1
× . (10.133)
1 + 2πi νβ L SJu , βu Lu SJu A(βu Lu S → β L S)
u u
Comparison of Eqs. (10.132) and (10.20) shows that the new coefficient is the direct
equivalent, for fine-structure multiplets, of the coefficient WK (J , Ju ) introduced for
the two-level atom. It can easily be proved that, like WK (J , Ju ), it is real
∗
WK (β L Sβu Lu ) fs = WK (β L Sβu Lu ) fs , (10.134)
Equation (10.133) can be cast in a more significant form. Recalling Eq. (10.17)
one has
WK (β L Sβu Lu ) fs = WK (L , Lu ) DK (βu Lu S) fs , (10.135)
NON-EQUILIBRIUM ATOMIC PHYSICS 587
where [DK (βLS)]fs , the depolarizing factor due to fine structure,1 is given by
!2
1 L L K
DK (βLS) fs = (2J + 1)(2J + 1)
2S + 1
J J S
JJ
1
× . (10.136)
1 + 2πi νβLSJ , βLSJ A(βLS → β L S)
which shows that the two-term atom behaves as a two-level atom having J = L ,
Ju = Lu . This is an obvious manifestation of the principle of spectroscopic stability:
if the spin induces negligible splittings, it can just be disregarded.
b) In the opposite case where the fine-structure intervals are much larger than the
Einstein coefficient, all the terms with J = J make a negligible contribution to
the double summation in Eq. (10.136), so that
!2
∞ 1 L L K
DK (βLS) fs = DK (LS) fs = (2J + 1)2 . (10.139)
2S + 1 J J S
J
∞
Values of DK are collected in Table 10.6 for K = 1, 2 2 and for all terms having
S ≤ 9/2, L ≤ 7. Terms with S = 0 or L = 0 are not included since for such terms
∞
the definition of DK is meaningless.
c) When the J-levels satisfy the Landé interval rule (see Eq. (3.60)), one has
1
νβLSJ , βLSJ = ζ(βLS) J (J + 1) − J(J + 1) , (10.140)
2h
and the depolarizing factor can be written in the form
!2
1 L L K
DK (βLS) fs = (2J + 1)(2J + 1)
2S + 1
J J S
JJ
1
× 2 , (10.141)
1+ x2
J (J + 1) − J(J + 1)
1
It is important to note that the depolarizing factor depends only on the fine-structure
properties and on the Einstein coefficient for spontaneous de-excitation of the term.
2
Use of Eq. (2.36a) shows that [D0∞ (LS)]fs = [D0 (βLS)]fs = 1.
588 CHAPTER 10
TABLE 10.6
∞ (LS)] for different terms
Values of the depolarizing factors [DK fs
where
π ζ(βLS)
x= . (10.142)
h A(βLS → β L S)
The dimensionless parameter x represents the ratio between a typical fine-structure
splitting of the J-levels of the term and the natural width of the same levels.
Obviously, Eq. (10.141) reduces to Eq. (10.137) for x = 0, and to Eq. (10.139) for
x → ∞. As an illustration of Eq. (10.141) we plot in Fig. 10.16, as functions of x,
the depolarizing factors [D1 ]fs and [D2 ]fs for the term 3 P.
Consider now the polarization of the radiation emitted in a scattering process.
Because of the close similarity of Eqs. (10.131) and (10.19), the phase matrix cor-
responding to the geometry of Fig. 10.1 is given by Eq. (10.24) with the factor
[WK (β L Sβu Lu )]fs in the place of WK (J , Ju ). For the 90◦ scattering of an unpo-
larized radiation beam, the fractional frequency-integrated polarization is therefore
given by (cf. Eqs. (10.25))
3 W2 (L , Lu ) D2 (βu Lu S) fs
p̃Q =
4 − W2 (L , Lu ) D2 (βu Lu S) fs
p̃U = 0 . (10.143)
NON-EQUILIBRIUM ATOMIC PHYSICS 589
Fig.10.16. The depolarizing factors [D1 (βLS)]fs and [D2 (βLS)]fs for the term 3 P against the
parameter x defined in Eq.(10.142). The asymptotic values for x → ∞ (0.5 for D1 and 0.278 for
D2 ) are also shown.
1 1
Φ(νβ L SJu , β L SJ − ν) = ,
u u π Γ − i νβ L SJu , β L SJ −ν
u u
1 A(βu Lu S → β L S)
Γ = A(βu Lu SJu → β L SJ ) = .
4π 4π
J
1 Γ
= . (10.144)
π Γ − i (νβ L SJu , β L SJ − ν) Γ + i (νβ L SJu , β L SJ − ν)
u u u u
!2
≈ kA 2Lu + 1 Lu L 1
εi (ν, Ω) M 3(2Ju + 1)2 (2J + 1)
2S + 1 J Ju S
! ! !
S−L −J +K+Q 1 1 K 1 1 K Lu Lu K
× (−1)
Ju Ju J Lu Lu L Ju Ju S
KQ
× TQ
K J K (ν
(i, Ω) − ν) , (10.145)
−Q β L S, β L S )
u u
φ(νβ
u u
L SJu , β L SJ
where
1 Γ
φ(νβ − ν) = .
L SJu , β L SJ
u u π Γ 2 + (νβ L SJu , β L SJ − ν)2
u u
1
This is no longer true for hyperfine-structure multiplets. See Sect. 10.22.
NON-EQUILIBRIUM ATOMIC PHYSICS 591
≈ k A S J ,Ju φ(ν
εi (ν, Ω) M βu Lu SJu , β L SJ − ν)
× WK (L Lu S, J Ju ) (−1)Q T Q K J K (ν
(i, Ω) −Q β L S, β L S )
u u
, (10.146)
KQ
Equation (10.146), when compared with Eq. (10.16), shows that a line belonging
to a multiplet behaves in resonance scattering as an isolated line, with the only
difference that the symbol WK (J , Ju ) defined in Eq. (10.17) is replaced by the more
general symbol WK (L Lu S, J Ju ) defined above. This new symbol satisfies several
important properties:
a) When S = 0 the multiplet reduces to a single line. This implies Lu = Ju and
L = J . Using Eq. (2.36a) one obtains the obvious result
WK (J Ju 0, J Ju ) = WK (J , Ju ) .
W0 (L Lu S, J Ju ) = 1 .
c) Apart from the trivial case S = 0, the only multiplet for which the symbols
WK (L Lu S, J Ju ) and WK (J , Ju ) coincide is the simplest of the multiplets, namely
2
S − 2 P (L = 0, Lu = 1, S = 1/2). Using Eq. (2.36a) we have in fact
WK 0 1 12 , 12 Ju = WK 12 , Ju .
d) Owing to the properties of the 6-j symbols, all lines originating from an S upper
term (Lu = 0) have W1 and W2 symbols equal to zero. The same holds for all lines
having Ju = 0, whereas all lines with Ju = 1/2 have W2 = 0, but W1 = 0.
592 CHAPTER 10
e) For several lines the value of W2 is negative, which means that the direction
of the scattered linear polarization produced by an unpolarized radiation beam is
parallel to the scattering plane (see comments about Eq. (10.143)).
Table 10.7 collects the numerical values of WK (L Lu S, J Ju ) for all multiplets (ex-
cept singlets) satisfying the electric-dipole selection rules (cf. Sect. 7.11), up to
S = 5/2, L = 2.
For frequencies that are not in the neighborhood of single spectral lines, the
approximation of restricting the summation over Ju , Ju and J in Eq. (10.129) to
specific values is not justified. All the terms contribute, and the influence of the
‘off-diagonal’ terms (those having Ju = Ju ) is fundamental.
Let us consider a frequency ν which is well outside the frequency range of the
multiplet. The last term in Eq. (10.129) can approximately be written in the form
(see Eq. (10.144))
1 Γ
φ(ν0 − ν) ≈ ,
π (ν0 − ν)2
where ν0 is a sort of center of gravity of the whole multiplet. As this term does not
depend on the quantum numbers Ju , Ju , J , the summation over these numbers
can be performed via Eqs. (2.41) and (2.39). One gets
≈ k A φ(ν − ν)
εi (ν, Ω) M 0
× WK (L , Lu ) (−1)Q T Q
K J K (ν
(i, Ω) −Q β L S, β L S ) ,
u u
(10.148)
KQ
where WK (L , Lu ) is the symbol introduced for the two-level atom (Eq. (10.17)).
This is an interesting result: at large distance from the ‘center of gravity’, the mul-
tiplet behaves in resonance scattering as a simple transition between two spinless
levels L and Lu .
To find the frequency dependence of the scattered radiation outside the limited
ranges considered above (neighborhood of single lines, ‘far wings’ of the multiplet)
one should resort to numerical calculations, by specifying the full set of physical
parameters entering Eq. (10.129). These include – in addition to the scattering
geometry – the quantum numbers of the lower and upper term, the energies of the
different J-levels, and the value of the Einstein A coefficient.
Two examples are shown in Figs. 10.17 and 10.18, where the fractional linear po-
larization pQ = εQ /εI resulting from the 90◦ scattering of an unpolarized radiation
beam is plotted as a function of wavelength for two different multiplets, multiplet
n. 1 of CaII (2 S − 2 P) and multiplet UV n. 1 of FeII (6 D − 6 D). The curves have
been computed from Eq. (10.129) by neglecting the width of the lower levels, so
that the frequency-dependent term is given by Eq. (10.144). Both figures refer to
the scattering geometry of Fig. 10.1 (with Θ = 90◦ ),1 therefore a positive value for
1
We recall that, for the scattering conditions√specified above, the only non-zero components
of the radiation field tensor are J00 and J02 = J00 / 2 (cf. Eqs. (5.164)), and that the components
K are obtained from Table 5.6 setting θ = γ = 90◦ .
of the tensor T Q
NON-EQUILIBRIUM ATOMIC PHYSICS 593
TABLE 10.7
TABLE 10.7
(continued)
TABLE 10.7
(continued)
Fig.10.17. Fractional polarization pQ against vacuum wavelength λv (in Å) in the 90◦ scattering
of an unpolarized radiation beam, for multiplet n.1 of CaII. The vertical lines mark the wavelength
positions of the two lines of the multiplet according to the code: 1) J = 1/2, Ju = 3/2 (K line);
2) J = 1/2, Ju = 1/2 (H line). Atomic data are from Moore (1949). The value of the Einstein
coefficient is A = 1.48 × 108 s−1 .
On the other hand, the asymptotic value for the multiplet of Fig. 10.18 can be
derived from Eq. (10.148) with the use of Table 10.1,
3 W2 (2, 2)
pQ = 0.288 .
4 − W2 (2, 2)
It should be realized that the εI profile, as given by Eq. (10.129), basically consists
of a set of extremely narrow (∆λ < 0.1 mÅ) Lorentzian profiles centered at the
wavelengths of the individual lines (J , Ju ). The effect of interferences on the ratio
NON-EQUILIBRIUM ATOMIC PHYSICS 597
Fig.10.18. Same as Fig.10.17 for multiplet UV n.1 of FeII. The J , Ju values of the lines are the
following: 1) 9/2, 7/2; 2) 7/2, 5/2; 3) 9/2, 9/2; 4) 5/2, 3/2; 5) 7/2, 7/2; 6) 3/2, 1/2; 7) 5/2, 5/2;
8) 3/2, 3/2; 9) 1/2, 1/2; 10) 7/2, 9/2; 11) 1/2, 3/2; 12) 3/2, 5/2; 13) 5/2, 7/2. Atomic data are
from Moore (1952). The value of the Einstein coefficient is A = 3.1 × 108 s−1 .
pQ shows up just across those wavelength intervals where the emissivity is extremely
weak . This is the reason why it is difficult to detect such interference phenomena
in laboratory experiments. The situation may drastically change in astrophysical
plasmas, where optical thickness effects can compensate for the low emissivity.
These phenomena have indeed been observed, for the first time, in the radiation
scattered by the CaII doublet at the extreme limb of the sun (Stenflo, 1980).
The ‘fragile’ nature of interference phenomena can also be realized by assuming
that the plasma, in addition to the resonantly scattered radiation in a given mul-
tiplet, also emits a certain amount of continuum radiation (which, for the sake of
simplicity, we suppose to be unpolarized). Denoting by ε(c) the continuum emis-
sivity, the total emission coefficient is given by
= ε(M) (ν, Ω)
εi (ν, Ω) + ε(c) δ , (10.149)
i i0
where ε(M)
i (ν, Ω), the emissivity in the multiplet, is still given by Eq. (10.129).
Figure 10.19 illustrates, in a specific case, the strong reduction of interference
effects due to the presence of some continuum emissivity.
598 CHAPTER 10
Fig.10.19. Solid line: same as Fig.10.18 in the presence of a given amount of continuum, unpo-
larized radiation (see Eq.(10.149)). The value of ε(c) is 3 × 10−9 times the maximum emissivity
in the multiplet (at the center of line 3 of Fig.10.18). The dotted curve (identical with the curve
of Fig.10.18) corresponds to ε(c) = 0.
A closed expression for the statistical tensors of the upper term can be found by
solving (in stationary situations) the statistical equilibrium equations in the energy-
eigenvector representation, so to obtain the density-matrix elements of the upper
term; these can then be combined, by the appropriate transformation, in order to
get the statistical tensors. This procedure does not avoid the need for numerical
calculations: the problem of solving a linear algebraic system is changed into the
problem of diagonalizing the spin-orbit plus magnetic Hamiltonian (see Sect. 3.4),
which in most cases must be performed numerically. However, the latter is a much
easier task, because the matrix involved is of lower rank and block-diagonal.
The statistical equilibrium equations in the energy-eigenvector representation
are given by Eq. (7.29). We are interested in the statistical tensors of the upper
term of a two-term atom whose lower term is, by assumption, unpolarized. Using
Eqs. (7.34e), (10.120) and (10.122), and neglecting stimulated emission, we obtain,
in stationary situations
N 1
ρβ L S (ju Mu , ju Mu ) =
u u N (2S + 1)(2L + 1)
TA (βu Lu Sju Mu ju Mu , β L Sj M j M )
× ,
A(βu Lu S → β L S) + 2πi νβ L S (ju Mu , ju Mu )
j M u u
where the rate TA is given by Eq. (7.34a) and the Bohr frequency νβ L S by
u u
Eq. (7.30). The statistical tensors of the upper term can now be obtained by
applying the transformation (7.37). Via a long but direct calculation involving the
use of Eqs. (3.62b), (5.156), (2.42), and (2.41), one obtains
N 3(2Ju + 1)(2Ju + 1)
βu Lu S K
ρQ (Ju , Ju ) = K
N 2S + 1
Ju Ju K
! !
1+L −S+Ju +Ju +Ju +K +Q 1 1 K Lu Lu K
× (−1)
Lu Lu L Ju Ju S
× J−Q
K
(νβ L S, β L S ) , (10.150)
u u
B(β L S → βu Lu S)
× .
A(βu Lu S → β L S) + 2πi νβ L S (ju Mu , ju Mu )
u u
Obviously, for zero magnetic field we get back previous results. For B = 0 we have
in fact
j
CJu → δj J , νβ L S (ju Mu , ju Mu ) → νβ L SJ , β L SJ ,
u u u u u u u u u u u
600 CHAPTER 10
where the last Bohr frequency is given by Eq. (7.43). Use of Eq. (2.23a) yields
B(β L S → βu Lu S)
lim K = δKK δJ J δJ J ,
B→0 u u u u A(βu Lu S → β L S) + 2πi νβ L SJ , βu Lu SJu
u u u
Ω
where [Pij (Ω, ; B)]
, the scattering phase matrix for a fine-structure multiplet in
fs
the presence of a magnetic field, is given by1
Pij (Ω, Ω ; B)
= WKK Q (β L Sβu Lu ; B) fs
fs
KK Q
× (−1)Q T Q
K
(i, Ω) ) ,
T K (j, Ω (10.152)
−Q
with
! !
3(2Lu + 1) 1 1 K 1 1 K
WKK Q (β L Sβu Lu ; B) fs =
2S + 1 Lu Lu L Lu Lu L
× (2K + 1)(2K + 1)(2Ju + 1)(2Ju + 1)(2Ju + 1)(2Ju + 1)
Ju Ju Ju Ju Mu Mu
! !
Lu Lu K Lu Lu K Ju Ju K Ju Ju K
×
Ju Ju S Ju Ju S −Mu Mu −Q −Mu Mu −Q
j j j j
× CJu (βu Lu S, Mu ) CJu (βu Lu S, Mu ) CJu (βu Lu S, Mu ) CJu (βu Lu S, Mu )
u u u u
ju ju
1
× . (10.153)
S (ju Mu , ju Mu ) A(βu Lu S → β L S)
1 + 2πi νβ
L
u u
The symbol now introduced depends on the quantum numbers specifying the mul-
tiplet (L , Lu , S), on the fine-structure intervals of the upper term, on the Einstein
1
The following expression was first derived by Landi Degl’Innocenti (1990).
NON-EQUILIBRIUM ATOMIC PHYSICS 601
coefficient A(βu Lu S → β L S), and on the magnetic field modulus B. Its main
properties are collected in App. 14.
In the following we apply Eq. (10.151) to a particularly simple case, i.e., the
transition 3 S − 3 P (L = 0, Lu = 1, S = 1). We assume that the upper term
satisfies the Landé interval rule, so that it can be fully characterized by the quantity
ζ(βu Lu S) – see Eqs. (3.60), (10.140). Bearing in mind Eqs. (10.153) and (3.61), it
is immediately seen that the Stokes parameters of the scattered radiation depend
on A(βu Lu S → β L S), ζ(βu Lu S) and B via the two ratios
ζ(βu Lu S) µ0 B
and .
A(βu Lu S → β L S) A(βu Lu S → β L S)
π ζ(βu Lu S) µ0 B
x= , γ= , (10.154)
h A(βu Lu S → β L S) ζ(βu Lu S)
3 sin2 β W220 + (1 + cos2 β) Re(W222 )
p̃Q =
8 + (1 − 3 cos2 β) W220 − 3 sin2 β Re(W222 )
−6 cos β Im(W222 )
p̃U =
8 + (1 − 3 cos2 β) W220 − 3 sin2 β Re(W222 )
√
−2 3 cos β W120
p̃V = . (10.155)
8 + (1 − 3 cos2 β) W220 − 3 sin2 β Re(W222 )
The last expression shows that the scattered radiation contains, in general, a def-
inite amount of circular polarization. This phenomenon is closely associated with
the existence of fine structure (it has no counterpart in the multi-level atom) and
will be discussed extensively in Sect. 10.20. The expressions for the linear po-
larization still bear a clear resemblance to the corresponding expressions for the
two-level atom (Eqs. (10.37)). For zero magnetic field the circular polarization van-
ishes and the expressions for the linear polarization reduce to Eqs. (10.143) – see
Eqs. (A14.6), (10.134) and (10.135).
602 CHAPTER 10
Fig.10.20. Polarization (or Hanle) diagrams for the transition between the lower term 3 S and the
upper term 3 P, corresponding to the scattering geometry of Fig.5.11 (only the curves for β = 0◦
and β = 180◦ are shown). The left panel is obtained for x = 0.25, the right panel for x = 0.5.
The curve β = 0◦ is labelled by the values of γ.
Fig.10.21. Same as Fig.10.20 for x = 1 (left panel) and x = 3 (right panel). The curve corre-
sponding to β = 180◦ (symmetrical to the curve β = 0◦ about the line p̃U = 0 ) is not drawn.
Note the scale difference with Fig.10.20.
Figures 10.20-10.21 show the Hanle diagrams obtained from Eqs. (10.155)1 by
assigning different values to the parameter x. The first thing to notice is the overall
1
Note that the evaluation of the symbols WKK Q requires the numerical calculation, for
each value of x and γ , of the eigenvalues and eigenvectors of the matrix defined in Eqs. (3.61).
Alternatively, one can determine (for each value of x and γ ) the upper-level statistical tensors by
solving numerically Eqs. (10.123); the frequency-integrated scattered polarization is then obtained
from Eq. (7.48g) – with the profile φ replaced by unity.
NON-EQUILIBRIUM ATOMIC PHYSICS 603
Fig.10.22. Absolute value of the quantity W222 for the transition 3 S − 3 P as a function of the
magnetic field strength (parameterized through γ), for different values of x. Note the appearance
of sharp peaks around the critical γ values 0.616, 0.707, and 1.217 for large x values.
decrease of the scattered polarization with increasing x (i.e., with decreasing the
Einstein A coefficient): this effect has already been pointed out in our discussion
of resonance polarization in the absence of magnetic fields (see Sect. 10.16).
The second phenomenon is the appearance of characteristic curly shapes which
become more and more prominent as x is increased. This phenomenon is due
to level-crossing interferences (see Bommier, 1980), and is associated with the
overlapping of different M -sublevels in the (incomplete) Paschen-Back effect – see
Sect. 3.4, and in particular Fig. 3.8 which refers to the term 3 P. Whereas in the
Zeeman effect of an ‘isolated’ J-level the M -sublevels spread out linearly in energy
with increasing magnetic field – with a monotonic decrease of the corresponding
coherences – the M -sublevels of a given term may get closer, or even overlap, as
the field is increased. Table 3.5 shows that for γ = 0.616, 0.707, and 1.217 there are
overlappings between pairs of M -sublevels of the term 3 P corresponding to ∆M = 2
(these are the only overlappings that affect scattering polarization in the particular
geometry considered). Around such critical γ values the coherence between the two
overlapping sublevels gets larger (in absolute value) and the scattering polarization
approaches its zero-field value, which explains the presence of curls in the diagrams.
On the other hand, the Einstein A coefficient controls the natural width of the
sublevels. Thus the larger is A, the larger is the γ-interval (around each critical
value) in which the sublevels effectively overlap. This explains why the curls in
the diagrams clearly show up for small values of A (large values of x) while they
practically disappear for larger values of A (smaller values of x).
As a further illustration of these concepts, we plot in Fig. 10.22 the absolute value
of W222 as a function of γ for three different values of x. It is clearly seen that the
604 CHAPTER 10
three peaks are almost ‘unresolved’ for small x values, while they get sharper and
sharper as x is increased.
Apart from the loopy structure of the diagrams, the Hanle effect in two-term
atoms involves another interesting phenomenon which shows up most clearly when
the magnetic field is perpendicular to the scattering plane. For β = 90◦ or 270◦ ,
Eqs. (10.155) yield
3 W220 + Re(W222 )
p̃Q =
8 + W220 − 3 Re(W222 )
p̃U = p̃V = 0 .
Let us compare the values of p̃Q corresponding to zero magnetic field and to very
strong magnetic field (in the sense specified in App. 14). From Eqs. (A14.6),
(10.135), and (A14.8) we have
3 W2 (L , Lu ) D2 (βu Lu S) fs
lim p̃Q =
B→0 4 − W2 (L , Lu ) D2 (βu Lu S) fs
3 W2 (L , Lu )
lim p̃Q = ,
B→∞ 8 + W2 (L , Lu )
and it may well happen that the second value is larger than the first. Consider
for instance the transition 3 S − 3 P. If the fine-structure separations of the upper
term are much larger than the Einstein A coefficient (x → ∞), one obtains from
Tables 10.1 and 10.6
where
1
MK (B) fs = WKKQ (β L Sβu Lu ; B) fs . (10.156b)
2K + 1
Q
1
This name has been suggested by Bommier (1980) who discusses the importance of the
effect on the polarization of the HeI D3 line at 5876 Å.
NON-EQUILIBRIUM ATOMIC PHYSICS 605
The quantity [MK (B)]fs generally decreases with increasing magnetic field. In
particular we have, using Eqs. (A14.6), (10.135), and (A14.8)
MK (B → ∞) fs 1
= .
MK (B = 0) fs (2K + 1) DK (βu Lu S) fs
If the fine-structure intervals of the upper term are much smaller than the Einstein
A coefficient, the depolarizing factor is unity (see Eq. (10.137)), thus the above
ratio is 1/3 for K = 1 and 1/5 for K = 2. This is the classical result predicted
by Eqs. (5.173). On the other hand, if the fine-structure intervals are comparable
with A, the depolarizing factor is less than unity, hence the above ratio is larger
than 1/3 and 1/5, respectively. For instance, for an upper term 3 P in the limit of
very large fine-structure intervals (x → ∞), one gets from Table 10.6
M1 (B → ∞) fs M2 (B → ∞) fs
0.667 , 0.719 .
M1 (B = 0) fs M2 (B = 0) fs
This is a further manifestation of the anti-level-crossing effect discussed above.
unpolarized scattered
beam δ beam
lamp detector
cell
Fig.10.23. Schematic experimental set-up for measuring fine-structure constants via level-crossing
interferences (Franken effect). The cell contains a vapor of the atomic species to be investigated.
In practice, it is necessary to introduce a small tilt angle between the scattered and the incident
beam to avoid contamination of the signal by direct incoming light.
Fig.10.24. The signal SD at the photodetector (in units of k) as a function of the magnetic
field strength, for the scattering geometry of Fig.10.23. The resonant transition is 3 S − 3 P. The
relevant parameters are x = 10, δ = 45◦ .
The Hanle phase matrix for the two-term atom (Eq. (10.152)) differs radically from
the corresponding matrix for the two-level atom (Eq. (10.33)) because of the double
summation over K and K . The presence of this double summation, together with
the fact that the only non-zero components of the tensor T Q K are those with
(3, Ω)
K = 1 (see Table 5.6), has important consequences on the circular polarization of
the scattered radiation.
In a two-level atom, the only way to get frequency-integrated, circularly polarized
scattered radiation is to irradiate the atom with circularly polarized light. This
property is already contained in the classical expression of the scattering phase
matrix (Eqs. (5.99)) and passes directly to the quantum-mechanical expression
(Eq. (10.34)). It can be understood by observing that frequency-integrated circular
polarization can be emitted in a given transition only if some orientation is present
in the upper level (see Eq. (10.31)); on the other hand, orientation in the upper level
of a two-level atom can only be produced by illuminating the atom with circularly
polarized radiation.1
These properties are no longer valid for the two-term atom. Equation (10.150)
shows, because of the presence of the summation over K , that orientation can be
produced in the upper term even if no circular polarization is present in the incident
radiation. This phenomenon has properly been called the alignment-to-orientation
conversion mechanism 2 since the upper term has, so to say, the possibility of
converting some ‘alignment’ present in the radiation field (the K = 2 components
of the radiation field tensor) into atomic orientation (the K = 1 components of the
statistical tensors).3
1
An exception to this rule occurs in the presence of strong magnetic fields, provided
the spectrum of the incident radiation satisfies suitable conditions (see Sect. 10.5 and Landi
Degl’Innocenti, 1985b). Such peculiar situation is not considered in this section.
2
This name was proposed by Kemp et al. (1984).
3
Obviously, the component J00 makes no contribution to the atomic orientation of the upper
608 CHAPTER 10
scattered
radiation
unpolarized
radiation
Figure 10.26 shows p̃V as a function of γ, for x = 10 (see Eqs. (10.154)) and
δ = 40◦ . It is clear from the figure that the conversion mechanism is quite efficient
especially at those magnetic field values corresponding to level crossings between
pairs of M -sublevels with ∆M = 1 (cf. Table 3.5).
The importance of this mechanism for broad-band circular polarization in stellar
atmospheres has been discussed by Kemp et al. (1984). According to these au-
thors, Lehmann (1964) was apparently the first to realize that excitation of atoms
in a magnetic field by non-circularly polarized light could lead to atomic orien-
tation. Some circular polarization profiles observed in the radiation from solar
prominences have been interpreted by Landi Degl’Innocenti (1982b) on the basis
of this mechanism (to obtain the line profile of the scattered radiation in the four
Stokes parameters one should substitute the statistical tensors of Eq. (10.150) into
the frequency-dependent expression of the emission coefficient, Eq. (7.47e)).
It should be remarked that the ‘opposite’ of the alignment-to-orientation conver-
sion mechanism exists as well. Such mechanism, that can be named orientation-
to-alignment conversion, is responsible for the appearance of an additional contri-
bution to the linear polarization of scattered radiation when the atom is irradiated
by circularly polarized light.
The results obtained in Sects. 10.16-10.20 are based on the hypothesis that no
atomic polarization is present in the lower term. If we want to release this hy-
pothesis, we have to go back to the statistical equilibrium equations (10.115) and
(10.116) established in Sect. 10.15. In the general case (arbitrary radiation field,
presence of a magnetic field) these equations are rather involved. In this section
we confine attention to a particularly simple, yet illuminating case.
Similarly to Sect. 10.7, we assume that the incident radiation field – besides
satisfying the flat-spectrum approximation, see Sect. 10.15 – is unpolarized and
cylindrically symmetrical about a fixed direction, which we take as the z-axis of our
reference system: this means that the only non-zero components of the radiation
field tensor are J00 (νβ L S, β L S ) and J02 (νβ L S, β L S ). We also assume that the
u u u u
610 CHAPTER 10
where n(L, S), the number of statistical tensors of the form ρQ (J, J )
βLS K
with K
even and Q = 0, can be written in the form
1 2
n(L, S) = J − J< 2J> + 2J> J< − 4J<
2
+ 6J> + 6J< + 1 + J> + ξ ,
6 >
where
1 if S is an integer
J> = L + S , J< = |L − S| , ξ= 1
2 if S is a half-integer .
Under the above assumptions, the statistical equilibrium equations (10.115) and
(10.116) reduce to the following (see Eqs. (7.41), (7.45b) and (7.46b))
d
ρ0 (Ju , Ju )
βu Lu S K
= − A(βu Lu S → β L S) + 2πi νβ L SJu , βu Lu SJu ρ0 (Ju , Ju )
βu Lu S K
dt u u
+ TA (βu Lu SK0Ju Ju , β L SK 0J J ) β L SρK
0 (J , J )
K even J J
d
ρ0 (J , J )
β L S K
= −2πi νβ L SJ , β L SJ ρ0 (J , J )
β L S K
dt
+ TE (β L SK0J J , βu Lu SK0Ju Ju ) βu Lu SρK
0 (Ju , Ju )
Ju Ju
− RA (β L SK0J J K 0J J ) β L SρK
0 (J , J ) , (10.159)
K even J J
where K is even and the Bohr frequencies are defined in Eq. (7.43). The expressions
of the rates TA , TE , RA are easily derived from Eqs. (7.45a,b) and (7.46a) with the
1
Because of the conjugation property (3.100), Neq also represents the number of real unknowns
of the algebraic system.
NON-EQUILIBRIUM ATOMIC PHYSICS 611
Equation (10.159) can be cast in dimensionless form by dividing both sides by the
Einstein coefficient A(βu Lu S → β L S) and by introducing the parameters
(2L + 1) B(β L S → βu Lu S) 0
J (ν )=
(2Lu + 1) A(βu Lu S → β L S) 0 βu Lu S, β L S
c2
= J00 (νβ L S, β L S ) = n̄ ,
2h νβ3 L S, β L S
u u
u u
612 CHAPTER 10
J02 (νβL S, β L S ) w
0
u u
=√ (10.160)
J0 (νβ L S, β L S ) 2
u u
(Eqs. (7.33) have been used in the first relation). The quantities n̄ and w – already
introduced in Sect. 10.7 – represent the solid-angle average of the number of photons
per mode and the anisotropy factor at the transition frequency, respectively. If both
the upper and lower term satisfy the Landé interval rule (Eq. (10.140)), we can
also introduce the parameters
π ζ(βu Lu S)
= xu ,
h A(βu Lu S → β L S)
π ζ(β L S)
=
h B(β L S → βu Lu S) J00 (νβ L S, β L S )
u u
2L + 1 π ζ(β L S)
= = x , (10.161)
2Lu + 1 h n̄ A(βu Lu S → β L S)
which represent the ratios of the typical fine-structure splitting of the upper and
lower term, respectively, to the corresponding inverse lifetime. The parameter xu
has already been used in former sections (see Eq. (10.142)).
Let us apply Eqs. (10.159) to the simplest transition where lower-term atomic
polarization (including coherences between different J-levels) plays a role: this is
the transition between a lower term 2 P and an upper term 2 S. The statistical
tensors involved are six; introducing shorthand notations, they can be written in
the form
, - , - , -
u 0 1 1 0 3 3 2 1 3
ρ0 , =x ρ0 , =z ρ0 , = u + iv
2 2 2 2 2 2
, - , - , -
0 1 1 2 3 3 2 3 1
ρ0 , =y ρ0 , =t ρ0 , = −u + i v ,
2 2 2 2 2 2
where x, y, z, t, u, v are real. We assume that the lower term satisfies the Landé
interval rule, so that the quantity x defined in Eq. (10.161) can be used. After
some calculations, involving the numerical evaluation of several 3-j, 6-j, and 9-j
symbols, we obtain the set of equations
√ √ √
1 d 1 2 2 2
x = −x + n̄ y + n̄ z + n̄w t − n̄w u
A(u → ) dt 3 3 6 3
√
1 d 1 1 2
y = x − n̄ y + n̄w u
A(u → ) dt 3 3 6
√
1 d 2 1 1 1
z= x − n̄ z − n̄w t + n̄w u
A(u → ) dt 3 3 6 6
1 d 1 1 1
t = − n̄w z − n̄ t − n̄w u
A(u → ) dt 6 3 6
NON-EQUILIBRIUM ATOMIC PHYSICS 613
√
1 d 2 1 1 1
u= n̄w y + n̄w z − n̄w t − n̄(4 − w) u − n̄ x v
A(u → ) dt 12 12 12 12
1 d 1
v = n̄ x u − n̄(4 − w) v , (10.162)
A(u → ) dt 12
where
These equations show that the interference term, ρ20 (1/2, 3/2), decreases with in-
creasing x and tends to zero for x → ∞. Under this limit, the two-term atom con-
sidered here reduces to a special case of the three-level atom examined in Sect. 10.12
(two lower ‘uncorrelated’ levels with J = 1/2 and J = 3/2 respectively, one up-
per level with J = 1/2). It can be easily seen that the results there obtained
(Eqs. (10.89a-c)), once adapted to the present case by the substitutions
On the other hand, for x → 0 the interference term is comparable with the other
statistical tensors. Under this limit, the J-levels of the lower term are degenerate
and one can apply the principle of spectroscopic stability (see Sect. 7.10). The
statistical tensors of the lower and upper term (now considered as spinless terms)
can be derived from Eq. (3.105). Bearing in mind Eq. (3.100), we obtain
√
0 3 (2 − w)
ρ0 (L = 1) =
3(2 − w) + n̄(1 − w)(2 + w)
√
6w
ρ0 (L = 1) = −
2
3(2 − w) + n̄(1 − w)(2 + w)
u 0 n̄(1 − w)(2 + w)
ρ0 (L = 0) = .
3(2 − w) + n̄(1 − w)(2 + w)
These results are in agreement with those obtained in Sect. 10.7 for the two-level
atom with J = 1, Ju = 0 (cf. Eqs. (10.64) with δ(2) = 0).
Equations (10.163) show that the statistical tensors of both the lower and the
upper term are strongly sensitive to the value of x . It should be remarked, how-
ever, that the quantity x is fairly large for most fine-structure multiplets under
typical astrophysical or laboratory conditions (where the radiation field is very di-
luted, n̄ 1). As an example, we can consider the MgI triplet at λ 5170 Å.
The lower term is a 3 P whereas the upper term is a 3 S. The three lines of the
multiplet – named b1 , b2 , and b4 , respectively – lie at 5183.6 Å (J = 2, Ju = 1),
5172.7 Å (J = 1, Ju = 1), and 5167.3 Å (J = 0, Ju = 1). The quantity ζ/h for the
lower term is approximately 6 × 1011 s−1 (see Eq. (10.140)), and the Einstein A
coefficient is 1.04 × 108 s−1 . Assuming n̄ 10−2 (typical of the solar atmosphere),
we get x 5 × 106 , which means that coherences between different J-levels of the
lower term are 6 to 7 orders of magnitude smaller than the ‘diagonal’ statistical
tensors. In particular, the atomic polarization of the upper term (hence scattering
polarization) is practically unaffected by lower-term interferences.
The simple transition 2 P−2 S discussed above requires the solution of an algebraic
system of 6 equations. For more complicated transitions, it is almost mandatory to
resort to a numerical solution of the statistical equilibrium equations (10.159). The
Stokes parameters of the scattered radiation can then be obtained by substituting
the values of the statistical tensors of the upper term into Eq. (10.127).
In the following we illustrate the results of such numerical calculations for two
different multiplets, multiplet UV n. 1 of FeII (6 D − 6 D) – already considered in
Sect. 10.17 – and multiplet n. 2 of MgI (3 P − 3 S). According to Eq. (10.158),
the corresponding algebraic systems contain 110 and 12 equations, respectively. In
both cases we refer to the 90◦ scattering of an unpolarized radiation beam (Fig. 10.1
with Θ = 90◦ ): the anisotropy factor w is unity, and the fractional polarization of
the scattered radiation is independent of the average number of photons n̄ provided
n̄ 1 (which is implicit in Eqs. (10.159) since stimulated emission is neglected).1
1
In fact it can be shown, bearing in mind the structure of Eqs. (10.159) and the expressions of
NON-EQUILIBRIUM ATOMIC PHYSICS 615
Fig.10.27. Fractional polarization pQ against vacuum wavelength λv (in Å) in the 90◦ scattering
of an unpolarized radiation beam, for multiplet UV n.1 of FeII. The reference direction for linear
polarization is the perpendicular to the scattering plane. The solid line is obtained by taking
lower-term atomic polarization into account, the dashed line (identical to the curve in Fig.10.18)
by neglecting it.
Figures 10.27 and 10.28 show the fractional linear polarization pQ as a function
of wavelength in the two multiplets. For the multiplet of FeII the results are
not very different from those obtained in the absence of lower-term polarization.
On the contrary, there is a dramatic effect for the multiplet of MgI. The fractional
polarization has a complicated wavelength dependence, with a peak value of almost
40% (in absolute value): if lower-term atomic polarization were neglected, pQ would
be zero at all wavelengths.1
the radiative rates, that the statistical tensors of the upper term are proportional to n̄ provided n̄
is much smaller than the ratio between the typical fine-structure splitting of the lower term and
the Einstein A coefficient.
1
This is easily verified from Eq. (10.129). The only non-zero components of the tensor
TQ have K = 2 (see Table 5.6), and the 6-j symbol
K (1, Ω)
!
1 1 K
Lu Lu L
is zero for Lu = 0 and K = 2. Note that the same symbol appears in Eq. (10.126), which means
that there is no atomic polarization in the upper term 3 S.
616 CHAPTER 10
Fig.10.28. Same as Fig.10.27 for multiplet n.2 of MgI. The vertical lines mark the wavelength
positions of the three lines of the multiplet according to the code: 1) J = 0, Ju = 1 (b4 line);
2) J = 1, Ju = 1 (b2 line); 3) J = 2, Ju = 1 (b1 line). Atomic data are from Moore (1949). The
value of the Einstein A coefficient is 1.04 × 108 s−1 .
As remarked in Sect. 7.9, the statistical equilibrium equations for the multi-level
atom with hyperfine structure bear a very strong resemblance to the corresponding
equations for the multi-term atom, and the same holds for the expressions of the
radiative transfer coefficients. Most of the results obtained in this chapter for the
two-term atom can thus be extended to the two-level atom with hyperfine structure
by means of simple, formal transformations.
Let us consider a simplified atomic model consisting of two levels, a lower level
having quantum numbers (α , J ) and an upper level having quantum numbers
(αu , Ju ). If the atom has nuclear spin I, both levels are split into a collection
of hyperfine components characterized by the total angular momentum F (for
the lower level) and Fu (for the upper level). When a magnetic field is present,
the F -levels are further split into their magnetic components as fully explained in
Sect. 3.5.
For this model atom, we can study the problem of resonance scattering under
the flat-spectrum approximation and the unpolarized lower level assumption. The
relevant equations are readily obtained by carrying out the formal substitutions
NON-EQUILIBRIUM ATOMIC PHYSICS 617
β → α βu → αu S→I
L → J Lu → Ju
J → F Ju → Fu (10.164)
5
dΩ
3
= kA
ε̃i (Ω) Ω
P (Ω, ) I (ν ) ,
,Ω
L
4π j=0 ij hfs j αu Ju , α J
Ω
where [Pij (Ω, )] , the hyperfine-structure scattering matrix, is given by
hfs
Ω
Pij (Ω, ) = WK (α J Iαu Ju ) hfs (−1)Q T Q
K
(i, Ω) ) ,
T K (j, Ω
hfs −Q
KQ
with
3(2Ju + 1)
WK (α J Iαu Ju ) hfs =
2I + 1
!2 !2
1 1 K Ju Ju K
× (2Fu + 1)(2Fu + 1)
Ju Ju J Fu Fu I
Fu Fu
1
× ,
1 + 2πi να J IFu , αu Ju IFu A(αu Ju → α J )
u u
1
Following the formalism of Sect. 7.9, we omit the index I when inessential.
618 CHAPTER 10
where
!2
1 J J K
DK (αJI) hfs = (2F + 1)(2F + 1)
2I + 1
F F I
FF
1
× .
1 + 2πi ναJIF , αJIF A(αJ → α J )
The properties of the depolarizing factors are strictly similar to those of their fine-
structure analogues (cf. Eqs. (10.137)-(10.142)):
a) If the hyperfine-structure intervals of the upper level are much smaller than the
Einstein A coefficient, the depolarizing factors reduce to unity.
b) If such intervals are much larger than A, they tend to the asymptotic values
!2
∞
1 2 J J K
DK (JI) hfs = (2F + 1) .
2I + 1 F F I
F
Numerical values of [D1∞ ]hfs and [D2∞ ]hfs for half-integral values of J are given
in Table 10.8 (the table is restricted to I ≤ 9/2, J ≤ 15/2). For integral values
of J one can directly use Table 10.6, taking into account the formal identity of
∞ ∞
[DK (JI)]hfs and [DK (LS)]fs under the substitution J → L, I → S.
c) If the hyperfine-structure intervals of the upper level are simply described by
the magnetic-dipole term, one has (see Eqs. (3.70))
1
ναJIF , αJIF = A(α, J, I) F (F + 1) − F (F + 1) ,
2h
!2
1 J J K
DK (αJI) hfs = (2F + 1)(2F + 1)
2I + 1
F F I
FF
1
× 2 ,
1+ x2 F (F + 1) − F (F + 1)
where x is given by
π A(α, J, I)
x= . (10.165)
h A(αJ → α J )
TABLE 10.8
∞ (JI)]
Values of the depolarizing factors [DK hfs for levels with half-integral J quantum number
= k A 2Ju + 1
εi (ν, Ω)
(−1)I−J +Fu +Fu +F +K+Q
L
2I + 1
KQ Fu Fu F
! !
Ju J 1 Ju J 1
× 3(2Fu + 1)(2Fu + 1)(2F + 1)
F Fu I F Fu I
! ! !
1 1 K 1 1 K Ju Ju K J K (ν
× TQ
K
(i, Ω) −Q αu Ju , α J )
Fu Fu F Ju Ju J Fu Fu I
∗
1 Φ(ναu Ju IFu , α J IF − ν) + Φ(ναu Ju IFu , α J IF − ν)
× . (10.166)
2 1 + 2πi να J IF , α J IF A(αu Ju → α J )
u u u u u u
If the width of the F -sublevels of the lower level is negligible compared to the width
of the F -sublevels of the upper level, the last term can be written in the simpler
620 CHAPTER 10
1 Γ
,
π Γ − i (να J IFu , α J IF − ν) Γ + i (να J IFu , α J IF − ν)
u u u u
where
A(αu Ju → α J )
Γ = .
4π
It should be remarked that the frequency distance between the individual lines
of a hyperfine-structure multiplet is usually comparable with their natural width.
Therefore, the value of the emission coefficient at the frequency of a specific line
(F̄u , F̄ ) cannot be deduced by restricting the summation in Eq. (10.166) to the
values Fu = Fu = F̄u , F = F̄ : in other words, the analogue of Eq. (10.145) for
hyperfine-structure multiplets is not valid. On the contrary, the asymptotic ex-
pression given in Eq. (10.148) has its direct hyperfine-structure counterpart. At
large distance from the single lines we have
1 Γ
≈ kA
εi (ν, Ω) WK (J , Ju ) (−1)Q T Q
K J K (ν
(i, Ω) −Q αu Ju , α J ) ,
L
π (ν0 − ν)2
KQ
ju → iu , Mu → fu ,
NON-EQUILIBRIUM ATOMIC PHYSICS 621
Fig.10.29. Fractional polarization pQ against wavelength distance from line center (in m Å) in
the 90◦ scattering of an unpolarized radiation beam, for the NaI D 2 line. Hyperfine-structure
constants are from Fuller and Cohen (1969), Hartmann (1970), Figger and Walther (1974).
connected with the presence of the magnetic field. Neglecting stimulation ef-
fects, we can write the frequency-integrated emission coefficient in the form (cf.
Eqs. (10.151)-(10.153))
5
dΩ
3
= kA
ε̃i (Ω) Ω
Pij (Ω, ; B)
I (ν ) ,
,Ω
L hfs j αu Ju , α J
4π j=0
with
! !
3(2Ju + 1) 1 1 K 1 1 K
WKK Q (α J Iαu Ju ; B) hfs =
2I + 1 Ju Ju J Ju Ju J
× (2K + 1)(2K + 1)(2Fu + 1)(2Fu + 1)(2Fu + 1)(2Fu + 1)
Fu Fu Fu Fu fu fu
! !
Ju Ju K Ju Ju K Fu Fu K Fu Fu K
× ×
Fu Fu I Fu Fu I −fu fu −Q −fu fu −Q
622 CHAPTER 10
Fig.10.30. Polarization (or Hanle) diagram for the D 2 line of NaI, corresponding to the scattering
process illustrated in Fig.5.11. The curve β = 0◦ is labelled by the field intensity in G.
i i i i
× CFu (αu Ju I, fu ) CFu (αu Ju I, fu ) CFu (αu Ju I, fu ) CFu (αu Ju I, fu )
u u u u
iu iu
1
× . (10.167)
1 + 2πi να J
u u
I (iu fu , iu fu ) A(αu Ju → α J )
The Bohr frequency occurring in the last term is defined by (cf. the analogous
definition (7.30) for fine structure)
Fig.10.31. Depolarizing effect due to an isotropic distribution of magnetic fields on the NaI D 2
line. The quantities [MK (B)]hfs are defined in Eq.(10.168).
– described in Sects. 10.18, 10.19 and 10.20, respectively – also take place in
hyperfine-structure multiplets. All these phenomena are conveniently described
in terms of the symbol WKK Q defined in Eq. (10.167). For instance, the Hanle
phase matrix averaged over an isotropic distribution of magnetic fields is given by
(cf. Eqs. (10.156a,b))
Ω
Pij (Ω, ; B)
= (K)
MK (B) hfs Rij (Ω, Ω ; 0) ,
hfs
K
with
1
MK (B) hfs = WKKQ (α J Iαu Ju ; B) hfs . (10.168)
2K + 1
Q
Figure 10.31 shows the behavior of the quantities [MK (B)]hfs with magnetic field
intensity for the NaI D2 line. Note the conspicuous anti-level-crossing effect on this
particular line. The ratio [MK (B → ∞)]hfs /[MK (B = 0)]hfs is found to be 0.60 for
K = 1 and 0.67 for K = 2, rather than 1/3 and 1/5, respectively, as it would if
hyperfine separations were negligible compared to the Einstein A coefficient.
CHAPTER 11
ASTROPHYSICAL APPLICATIONS:
SOLAR MAGNETOMETRY
Solar magnetometry, or the ‘art’ of measuring magnetic fields in the solar atmo-
sphere, started almost one century ago with the first application of polarimetry
to astrophysical research. By means of a Fresnel prism (acting as a quarter-wave
plate) and a Nicol prism (acting as a polarizer), in 1908 Hale succeeded in taking
two separate spectra of a sunspot in right and left circular polarization. The line
shifts between the two spectra, due to the Zeeman effect, led to the first measure-
ment of a solar magnetic field (Hale, 1908).
Solar magnetometry has enormously evolved since the pioneering work of Hale
and has now become a mature science which employs sophisticated technologies
and modern astronomical instrumentation. The photographic plate used by Hale
has been replaced, during the years, by photomultipliers, diode arrays, and CCD
cameras. At the same time, new polarimeters have been conceived, capable of
attaining higher and higher polarimetric accuracy. The advent of techniques of
image stabilization and the construction of new instruments in proper sites (or
operating from space) has also opened the possibility of observing the sun at higher
and higher spatial and temporal resolution.
All these technological improvements have led to a better understanding of solar
magnetism but, at the same time, have opened new questions about the meaning
itself of the process of measuring magnetic fields in a highly structured plasma such
as the solar atmosphere. There is today strong evidence that the solar magnetic
field outside sunspots is often concentrated in structures whose lateral dimensions
are at the limit of the resolving power that can be attained with present instru-
mentation, which is of the order of a few tenths of arcsec1 (see for instance Stenflo,
1994). Even more complex is the picture emerging from some recent ideas, which
point to the existence of magnetic fields organized in fibrils of some few km in di-
ameter (the MISMA hypothesis, Sánchez Almeida et al., 1996), and of ‘turbulent’
fields of a few G permeating the full solar atmosphere outside active regions (Sten-
flo, 1994). Although these ideas have not yet been fully confirmed by observations,
they clearly suggest the difficulties inherent in the measurement of a quantity, the
magnetic field, that may show sharp variations over length-scales smaller than the
resolution element of the instrument.
Most of the methods and the techniques that have been developed for solar
magnetometry ignore, however, this kind of difficulties, and are based on the simple
assumption that the magnetic field is uniform across the resolution element. In
some cases a less restrictive assumption is made, i.e., that the magnetic field is
uniform but covers only a fraction f – the so-called filling factor – of the observed
1
One arcsec corresponds to about 730 km on the solar surface.
626 CHAPTER 11
First introduced in solar research in the early 1950s (Babcock, 1953), the lon-
gitudinal magnetograph is probably the most renowned and certainly the most
widespread instrument for the study of solar magnetism.
A typical longitudinal magnetograph performs a polarimetric and spectral analy-
sis of the solar radiation and records, for each resolution element, a signal SV given
by
V (λ) p(λ) dλ
SV = , (11.1)
I(λ) p(λ) dλ
where I(λ), V (λ) are the Stokes parameters and p(λ) is a narrow profile centered
over one of the wings (the blue or the red) of a magnetically sensitive line. The
signal is then encoded on a grey-scale to produce a map of the full sun or of an
active region called a magnetogram.
The spectral analysis can be performed by a Lyot-type birefringent filter or by an
interference filter or by a combination of both. In this case p(λ) is nothing but the
filter profile. Alternatively, one can use a spectrograph and feed the detector by
means of a slit in its focal plane. In this case p(λ) is (nominally) a rectangle function
that differs from zero in the wavelength interval corresponding to the slit. The
polarimetric analysis, which must take place before the spectral analysis, can be
carried out in a variety of different ways. In the original Babcock’s magnetograph,
for instance, it was performed by the combination of an ADP crystal (see Sect. 1.5)
and a polarizer. In some magnetographs two signals are recorded, one in the blue
wing and the other (of opposite sign) in the red wing. The two individual signals
are then combined to obtain a composite signal with less noise.
The signal SV can be related in a simple way to the longitudinal component of
the magnetic field vector in the observed area under a number of assumptions, the
most important of which is the weak field approximation. We have seen in Sect. 9.6
that, under this approximation, the V Stokes parameter is given by the expression1
1
We recall here that the only condition for this formula to hold – besides the weak field
SOLAR MAGNETOMETRY 627
(Eq. (9.80))
∂I(λ)
V (λ) = −∆λB ḡ cos θ , (11.2)
∂λ
where ∆λB is the Zeeman splitting defined in Eq. (3.13), ḡ is the effective Landé
factor of the selected line (defined in Eq. (3.44)), θ is the angle between the magnetic
field direction and the line of sight (see Fig. 9.1), and I(λ) is the intensity profile.
Since V (λ) is linear in the product B cos θ and I(λ) is unaffected by the magnetic
field to lowest order, under the weak field approximation we have
SV = C B , (11.3)
where B = B cos θ is the longitudinal component of the magnetic field and where
C is the so-called calibration constant of the longitudinal magnetograph, that can
be deduced via more or less complicated procedures involving the knowledge of the
filter profile p(λ) and of the line profile I(λ).
As a first-order approximation, one can assume that the line profile has a Gaus-
sian shape of the form (cf. Eq. (9.86))
λ−λ 2
− 0
∆λp
I(λ) = Ic 1 − dc e , (11.4)
assumption – is the constancy with depth of the longitudinal component of the magnetic field;
see Table 9.1.
2
1
In a profile of the form e−(x/a) √, the quantity a is connected with the full width at half
maximum, ∆x, by the relation ∆x = 2 ln 2 a 1.665 a.
628 CHAPTER 11
Recalling Eqs. (11.3) and (3.14), the calibration constant of the longitudinal mag-
netograph can thus be written in the form
−1
where C is in G−1 , λ0 in Å and X in mÅ .
It is often in the observer’s interest to increase the signal SV (for an assigned line
and a given magnetic field) as much as possible. This can be obtained by carefully
selecting the filter position λF so as to maximize the absolute value of the quantity
X. As an example, consider the FeI line λ 6302.5, a Zeeman triplet having ḡ = 2.5.1
From the atlas of the solar spectrum by Delbouille et al. (1973) one can estimate
the quantities dc and ∆λp of Eq. (11.4). Their values, which refer to disk center,
are found to be dc 0.66 and ∆λp 60 mÅ. If the filter passband is ∆λF = 75 mÅ
(a typical value for filter-type magnetographs), the extrema of X are located at
−1
(λF − λ0 ) = ± 57.3 mÅ, where X = ± 5.05 × 10−3 mÅ . The corresponding value
of the calibration constant (when the filter is positioned in the blue wing) is
Fig.11.1. The magnetograph signal SV as a function of the longitudinal component of the mag-
netic field B for a specific line – a Zeeman triplet with ḡ = 2.5 – formed in a Milne-Eddington
atmosphere (see text for details). The curves are labelled by the value of the inclination angle θ
defined in Fig.9.1, and are drawn up to a field intensity of 3500 G. The dotted line represents the
linear relation implicit in the magnetograph calibration.
velocity fields, or to differences – with respect to the average sun – of the thermo-
dynamic parameters characterizing the various structures of the solar atmosphere
(granules, intergranular lanes, faculae, pores, sunspots’ umbrae and penumbrae,
etc.).
The saturation effect and the inaccuracies just mentioned are important lim-
itations which prevent one from regarding the longitudinal magnetograph as a
fully quantitative instrument for the measurement of solar magnetic fields. Yet its
importance in solar research can hardly be overestimated. It is just through mag-
netographs that the general properties of solar magnetism have been determined
(large-scale topology, fine structure, cyclic variations, etc.) and that several cor-
relations between magnetic fields and other indicators of solar activity have been
established. It is also important to mention that the results obtained by magne-
tographs are currently used for the reconstruction of coronal magnetic fields by
means of suitable numerical techniques.
Q(λ) p (λ) dλ U (λ) p (λ) dλ
SQ = , SU = , (11.6)
I(λ) p (λ) dλ I(λ) p (λ) dλ
where p (λ) is a profile that may differ from p(λ). The three signals SQ , SU , SV may
be recorded either simultaneously or sequentially. The first alternative implies the
use of a sophisticated polarimeter, capable of measuring the four Stokes parameters
at the same time. In this case, the spectral analysis is the same for the various
Stokes parameters and p (λ) is identical with p(λ). In the second alternative it
is convenient to use different profiles in order to maximize the signals. Since the
extrema of Q and U do not occur at the same wavelength positions as those of V ,
one can adapt the spectral analysis to the observed Stokes parameter by tuning
the filter (in filter-type magnetographs) or by moving the slit (in spectrograph-type
magnetographs).
Similarly to the preceding section, one can easily relate the polarization signals
SQ , SU to the modulus of the transverse component of the magnetic field and to
its orientation in the plane perpendicular to the line of sight provided a number
of conditions – basically the weak field approximation – are met. We found in
Sect. 9.6 that, under this approximation, the U Stokes parameter defined in the
preferred frame is given by Ũ(λ) = 0, which implies
Ũ (λ) p (λ) dλ
SŨ = =0, (11.7)
I(λ) p (λ) dλ
whereas for the Q Stokes parameter, defined in the same frame, we obtained various
expressions (see Eqs. (9.83), (9.84) and (9.85)) each valid in different wavelength
intervals and under different assumptions on the depth dependence of the physical
parameters (see Table 9.1).1 For our purposes it is convenient to use Eq. (9.84)
1 η ∂I(λ)
Q̃(λ) = − ∆λ2B Ḡ sin2 θ , (11.8)
4 η ∂λ
which is valid at any λ, even if it implies rather restrictive assumptions on the
process of line formation. The meaning of the symbols ∆λB , θ, and I(λ) has
already been recalled after Eq. (11.2). Ḡ is defined in Eq. (9.76), while η and η
are defined in Eqs. (9.65) and (9.70).
Since Q̃(λ) is proportional to the product B 2 sin2 θ, and I(λ) is unaffected by the
magnetic field to lowest order, under the weak field approximation we have
Q̃(λ) p (λ) dλ
SQ̃ = = C⊥ B⊥ 2
, (11.9)
I(λ) p (λ) dλ
1
A common assumption is the constancy with depth of the azimuth of the magnetic field
vector, which enables a preferred frame to be defined.
SOLAR MAGNETOMETRY 631
where B⊥ 2
= B 2 sin2 θ is the square modulus of the transverse component of the
magnetic field vector and C⊥ is the second calibration constant of the vector mag-
netograph. To derive an expression for C⊥ , we follow a procedure quite similar
to that followed for C in the previous section. Using Eqs. (11.4) and (11.5) for
the line and filter profile, respectively, and assuming that the profile of the line
absorption coefficient for zero magnetic field, η, has also a Gaussian shape
λ−λ 2
0
1 −
η(λ) = √ e ∆λD
,
π
1
SQ̃ = − ∆λ2B Ḡ sin2 θ Y ,
4
where
(λF −λ0 )2
− ∆λ2p ∆λ2F (λ −λ )2 ∆λ4
∆λ2p + ∆λ2F
2 dc 1 e 1− (∆λ2p + ∆λ2F ) ∆λ2D
− 2 (∆λ2F+ ∆λ
0 p
2 )2 ∆λ2
∆λp ∆λ2p + ∆λ2F p F D
Y= .
(λF −λ0 )2
−
∆λp ∆λ2p + ∆λ2F
1 − dc e
∆λ2p + ∆λ2F
From Eqs. (11.9) and (3.14), the second calibration constant of the vector magne-
tograph can thus be written in the form
−2
where C⊥ is in G−2 , λ0 in Å and Y in mÅ .
To give an example, let us refer to the line FeI λ 6302.5, already considered in
the former section, with ∆λp = 60 mÅ, dc = 0.66, ∆λD = 40 mÅ, and ∆λF =
75 mÅ. It can easily be seen that the quantity Y is negative for all values of
(λF − λ0 ), and that its extrema are located at (λF − λ0 ) = ± 71.3 mÅ, where
−2
Y = −2.32 × 10−4 mÅ .This would be the optimum filter position for measuring
weak transverse fields. However, in actual measurements, it is convenient to shift
the filter position to larger distances from line center in order to avoid, as much as
possible, the contamination due to magneto-optical effects (that will be analyzed
below),1 whose importance strongly decreases in the line wings (see Sect. 9.22 and,
in particular, Fig. 9.17). In the present example, a good compromise is to position
the filter at (λF − λ0 ) = ± 120 mÅ (three Doppler widths), where the quantity Y
−2
is −1.64 × 10−4 mÅ , which corresponds to a reduction of sensitivity of only 30%
1
Note that Eq. (11.8), which is obtained from an expansion of the radiative transfer equations
to the second order in ∆λB /∆λD , does not contain terms related to magneto-optical effects. Such
terms appear in the expansion from the third order on (see Sect. 9.6).
632 CHAPTER 11
relative to its maximum value. Since for the FeI λ 6302.5 line is Ḡ = 6.25 (see
Table 9.3), we obtain from Eq. (11.10)
On the other hand, Eq. (11.7) yields SŨ = 0. Assuming SQ̃ > 0,1 the angle χ,
defined in the interval (−π/2, π/2), can be found from the following expressions
(cf. Eqs. (1.8)): for SQ = 0
1 S
χ= arctan U + χ0 (11.12a)
2 SQ
where ⎧
⎪
⎨0 if SQ > 0
π
χ0 = if SQ < 0 and SU > 0 (11.12b)
⎪
⎩−π
2
and for SQ = 0
1
χ= 4π if SU > 0
(11.12c)
− 14 π if SU < 0 .
Finally, since from Eqs. (11.11) we have SQ̃ = 2 + S 2 , Eq. (11.9) can be
SQ U
rewritten in the form
SL ≡ SQ 2 + S2 = C B2 .
U ⊥ ⊥ (11.13)
1
This is the case for the example just mentioned of the FeI λ 6302.5 line, where the calibration
constant C⊥ was found to be positive. In the (less frequent) opposite case, all the > signs in
Eqs. (11.12) should be changed into < signs, and vice versa.
SOLAR MAGNETOMETRY 633
Fig.11.2. The magnetograph signal SL as a function of the transverse component of the magnetic
field B⊥ for the same line and atmospheric parameters as in Fig.11.1. The curves are labelled
by the value of the inclination angle θ defined in Fig.9.1, and are drawn up to a field intensity of
3500 G. The dotted line represents the quadratic relation implicit in the magnetograph calibration.
The expressions just derived allow one to recover the direction and the modulus
of the transverse component of the magnetic field vector from the linear polariza-
tion signals SQ and SU . Such expressions are however valid only in the limit of
weak magnetic fields. When the field intensity increases, saturation effects come
into play, similarly to the case of the longitudinal magnetograph discussed in the
previous section.
For linear polarization there are two distinct effects due to saturation: on the
one hand, the signal SL turns out to be much smaller than what predicted by the
quadratic relationship of Eq. (11.13); on the other hand, the azimuth χ is found to
be affected by an ‘error’, ∆χ, due to magneto-optical effects.
To give a quantitative idea of these two effects, we assume – as in the former
section – the process of line formation in the solar atmosphere to be described by
the Milne-Eddington model. Once the atmospheric and line parameters, and the
width and position of the filter are specified, the magnetograph signals SQ and SU
can be computed from Eqs. (11.6) using the Unno-Rachkovsky expressions (9.110)
for I(λ), Q(λ) and U (λ). The behavior of SL and ∆χ against B⊥ for a set of
parameters (the same as in Fig. 11.1) simulating the λ 6302.5 FeI line is shown in
Figs. 11.2 and 11.3, respectively. The profile p (λ) in Eqs. (11.6) was assumed to be
a Gaussian function centered at 120 mÅ from line center and having ∆λF = 75 mÅ.
The sign convention for ∆χ in Fig. 11.3 is such that it has to be added to the χ
value of Eqs. (11.12) in order to retrieve the true azimuth of the magnetic field
vector.
Apart from saturation effects, the vector magnetograph is subjected to the same
634 CHAPTER 11
Fig.11.3. The ‘error’ angle ∆χ due to magneto-optical effects is plotted against B⊥ for the same
line and atmospheric parameters as in Figs.11.1 and 11.2 and the same filter profile as in Fig.11.2.
The curves are labelled by the values of θ. For the sign convention of ∆χ refer to the text.
limitations pointed out for the longitudinal magnetograph at the end of Sect. 11.1.
Notwithstanding such limitations, this instrument remains one of the main tools of
solar physics. A particularly long lasting and successful example is the tower vector
magnetograph of the University of Alabama in Huntsville and of the Marshall Space
Flight Center that has been operating since the mid-1970s (Hagyard et al., 1982)
and that has provided important results on the shear of magnetic fields in flare sites.
It has to be remarked that in the early period of operation of this instrument, the
error ∆χ introduced by saturation and magneto-optical effects was not taken into
account in the data-reduction analysis, which led Hagyard et al. (1977) to interpret
their observations of isolated sunspots in terms of a twisted magnetic field. It
was shown later (Landi Degl’Innocenti, 1979a) that such observations were fully
consistent with a pure radial magnetic configuration once the ∆χ correction was
properly included.
To overcome the limitations of magnetographs, it was early realized that the mea-
surement of solar magnetic fields could be substantially improved by recording the
full profiles of the four Stokes parameters across a wavelength interval encompass-
ing one or more suitable spectral lines. An instrument capable of such recordings
is currently called a Stokesmeter , though the name of spectropolarimeter is used
as well. The data obtained with a Stokesmeter contain a wealth of information,
whereas a typical vector magnetograph ‘compresses’ all this information into just
SOLAR MAGNETOMETRY 635
three quantities (the signals SV , SQ , and SU defined in Eqs. (11.1) and (11.6)).
Hence the obvious superiority of Stokesmeters over magnetographs.1
Several techniques have been developed in the past for recovering the magnetic
field vector from the Stokes parameters’ profiles recorded, as a function of wave-
length, by a Stokesmeter. In this section we describe in some detail one of such
techniques, the so-called Unno-fit technique. Pioneered by Harvey et al. (1972) and
Auer et al. (1977b), and improved by Landolfi and Landi Degl’Innocenti (1982)
and Landolfi et al. (1984) to allow for magneto-optical and damping effects, this
technique has been brought to a high level of sophistication by the group of the
High Altitude Observatory working on the Advanced Stokes Polarimeter data (see
for instance Skumanich and Lites, 1987; Lites and Skumanich, 1990).
The technique consists in comparing the observed Stokes profiles with the Unno-
Rachkovsky solution to the radiative transfer equation for polarized radiation (see
Sect. 9.8) and in varying the set of parameters on which such solution depends
until a best fit is obtained. In practice, a kind of ‘merit function’ is constructed,
typically of the form
3
N
(obs) (thr) 2
M {ζi } = Wjα Ijα − Ijα {ζi } , (11.14)
j=0 α=1
(obs) (thr)
where Ijα is the j-th Stokes parameter measured at wavelength λα ,2 Ijα is
the corresponding theoretical value dependent on a set {ζi } of physical parameters
(thr)
(i = 1, 2, . . . , p), and Wjα is a matrix of weights. The quantities Ijα can be
computed from Eq. (9.109) by taking into account the spectral smearing profile of
the instrument, which depends on the resolving power of the spectrograph, on the
geometrical width of the entrance slit, on the pixel width and, in many cases, on
the data reduction procedure. If P (∆λ) is such a profile (normalized to unity in
wavelength), one has from Eq. (9.109)
!
−1
Ijα = B0 δj0 + β
(thr)
(C )j0 λ=λ −∆λ P (∆λ) d(∆λ) , (11.15)
α
where C is the matrix defined in Eq. (9.91) and β = βµ is the slope of the Planck
function along the propagation direction in the solar atmosphere.
The dependence of Ijα on the two physical parameters B0 and β is explicitly
(thr)
expressed in Eq. (11.15). The dependence on the other seven physical parameters
(κL , B, θ, χ, ∆λD , Γ , wA – see Sect. 9.5 for their definition) is implicitly con-
tained in the matrix C. The merit function M ({ζi }) thus depends, in total, on
1
It should not be forgotten that magnetographs may be preferred to Stokesmeters for scientific
purposes which do not require high precision magnetometry. Just because they involve a much
smaller number of observational data, magnetographs are generally faster and more agile than
Stokesmeters. For this reason they are often preferred in space research.
2
We assume that the Stokes parameters are sampled at different wavelength points (pixels)
along the dispersion axis, as typical of modern observations. N represents the number of pixels
covered by the observation.
636 CHAPTER 11
nine physical parameters which, for later reference, can be conveniently divided
into magnetic parameters (B, θ, χ), velocity parameter (wA ), and thermodynamic
parameters (B0 , β , κL , ∆λD , Γ ).1 As to the weights Wjα , they can be used for
two different purposes. On one side they allow one to remove from the fit – by
setting to zero the corresponding weight – certain wavelength intervals where the
observed Stokes profiles are contaminated either by experimental errors or by the
presence of a blending line (like, for instance, the telluric line in the red wing of
FeI λ 6302.5, see footnote on p. 628). On the other side, they can be used to in-
crease the importance of the fit to one (or more) Stokes parameter(s) relative to
the remaining ones. In many cases, for instance, the intensity profile is one to two
orders of magnitude larger (in absolute value) than the other Stokes profiles. A fit
with equal weights will eventually favor just that profile which is less sensitive to
the magnetic parameters. Since in general the intensity profile is also affected by
other sources of contamination – like stray light in the spectrograph – it turns out
that a good choice is to assign weight unity to the Stokes profiles Q, U , and V and
a weight ranging from 0.01 to 0.1 to the I profile.2
In order to obtain the best fit to the observed data, one needs to find the absolute
minimum of the merit function in the hyperspace of the parameters ζi . This is not
a trivial problem because the dependence of the merit function on the parameters
is non-linear and because the merit function has, in general, several secondary
minima. An appropriate mathematical procedure that can be followed in such cases
is due to Marquardt (1963) and is fully described in Bevington (1969). Starting
from some ‘guess values’ of the parameters ζi , the Marquardt algorithm looks for
the minima of the merit function by following a path in the parameters’ hyperspace.
The path consists in a succession of steps whose length and direction is determined
by evaluating the quantities βk (k = 1, . . . , p) and αkm (k, m = 1, . . . , p) defined
by
∂M {ζi } ∂ 2 M {ζi } ∂βk ∂βm
βk = , αkm = = = .
∂ζk ∂ζk ∂ζm ∂ζm ∂ζk
From Eq. (11.14) one has
3
N
(obs) (thr)
(thr)
∂Ijα
βk = −2 Wjα Ijα − Ijα
j=0 α=1
∂ζk
3
N (thr) (thr) 2 (thr) !
∂Ijα ∂Ijα (obs) (thr) ∂ Ijα
αkm = 2 Wjα − Ijα − Ijα ,
j=0 α=1
∂ζk ∂ζm ∂ζk ∂ζm
1
The number of parameters can be reduced to 8 by defining the merit function in terms of the
Stokes parameters normalized as in Eqs. (9.112). By so doing, the dependence on B0 disappears
and β is replaced by β /(1 + β ).
2
This statement is based on the experience gathered mostly with the data of the Advanced
Stokes Polarimeter of the High Altitude Observatory. For an ideal Stokesmeter, with observed
Stokes profiles affected by Gaussian errors with variance σjα , the weights should indeed be given
by Wjα = 1/σjα . In that case the merit function of Eq. (11.14) would reduce to a standard
chi-square function (see e.g. Bevington, 1969).
SOLAR MAGNETOMETRY 637
the last expression being usually approximated by neglecting the second term in
braces which gets much smaller that the first when approaching the minimum
of the merit function. The Marquardt algorithm thus requires the evaluation of
(thr)
derivatives of the form ∂Ijα /∂ζk . As apparent from Eq. (11.15), the derivatives
with respect to B0 and β are trivial. The derivatives with respect to the other
parameters (implicitly contained in the matrix C) can be obtained with the help
of Eq. (9.204). One has
,
∂C −1 -
(thr)
∂Ijα
= −B0 β C −1 C P (∆λ) d(∆λ) ,
∂ζk ∂ζk j0 λ=λα−∆λ
and the derivatives of the matrix C with respect to the various parameters are
easily evaluated using Eqs. (9.91), (9.39), (9.32), (9.29), (9.27), (9.26) and (5.58).
As stated above, the Marquardt algorithm eventually finds a minimum of the
merit function in the hyperspace of the parameters, but there is no guarantee that
this is the absolute minimum. To overcome such problem, one is forced to iterate
the procedure starting from a different set of guess values of the parameters, and
only after a sufficient number of iterations one can be reasonably confident that
the absolute minimum has indeed been reached. To make the search easier, it is
advisable to start the Marquardt algorithm with ‘plausible’ initial values (as close
as possible to those corresponding to the absolute minimum).1 By so doing, the
possibility of finding a secondary minimum is largely reduced. In the case of raster-
type observations, the best fit values of the parameters found for a given point of
the solar surface can serve as initial values for the surrounding points.
When applied to data showing a reasonable
amount of polarization (roughly
speaking, when the maximum value of Q2 + U 2 + V 2 / I across the line profile
is larger than 1%), the Unno-fit technique provides reliable values of the magnetic
parameters. Such values are practically independent of the weights Wjα introduced
in Eq. (11.14). The same is true for the velocity parameter, whereas the thermo-
dynamic parameters may show larger variations when varying the weights and,
in some cases, even somewhat unrealistic values may be found for them. This is
due to the fact that there is a strong ‘trade-off’ in the Unno-Rachkovsky solutions
among the parameters B0 , β , κL , ∆λD , and Γ , in the sense that a given set
of Stokes profiles can be reproduced, approximately with the same accuracy, by
several different combinations of these parameters. However, this drawback of the
Unno-fit technique does not represent a serious limitation for the measurement of
the magnetic field – which is the main object of the method.
Two examples of the application of the Unno-fit technique are shown in Figs. 11.4
and 11.5. The data were collected at the Advanced Stokes Polarimeter on June
19, 1992 in the penumbra of the large round sunspot of Active Region NOAA 7201
(see Skumanich et al., 1997, for a description of the observations). One can notice
1
The thermodynamic parameters of a line are known fairly well and do not show very
large variations over the solar surface. The velocity parameter can be simply set to zero. The
real problem is with the magnetic parameters. Suitable first-guess values can be obtained, for
instance, from a preliminary data analysis based on the weak field approximation.
638 CHAPTER 11
Fig.11.4. Unno-fit to ASP data for the FeI λ 6302.5 line observed in the penumbra of a sunspot.
The Stokes parameters are normalized to the continuum intensity and are plotted against pixel
number. The dispersion is 12.74 mÅ/pixel. The data points drawn as open circles are due to a
telluric line and were excluded from the fit. Note the reversal in the V profile due to magneto-
optical effects (see Sect. 9.22). The magnetic parameters are B = 2130 G, θ = 128◦ , χ = 22◦ (see
Fig.9.1 for the definitions of θ and χ ).
the quite remarkable fit to the observed Q, U , and V profiles in Fig. 11.4 (the fit
to the intensity profile is worse because, in these illustrative examples, we have not
taken into account the spectral smearing profile of the instrument, P (∆λ) ). On
the contrary, the observed polarization profiles shown in Fig. 11.5 are reproduced
only marginally by the fit. This is probably due to the presence of strong velocity
gradients (Evershed flow) which introduce considerable asymmetries in the profiles
(note, for instance, the presence of three distinct lobes in the V profile). Obviously,
such asymmetries cannot be reproduced by the fit, based on a model which does
not allow for any depth dependence of the physical parameters.
SOLAR MAGNETOMETRY 639
Fig.11.5. Same as Fig.11.4 for a different region of the sunspot. The magnetic parameters are
B = 1070 G, θ = 100◦ , χ = 31◦ . Note that the quality of the fit is much worse than in Fig.11.4.
This is probably due to the Evershed effect (see text).
The bisector technique and the center-of-gravity technique were proposed in the
late 1960s to overcome the saturation problems inherent in the magnetographic
measurements of the longitudinal component of the magnetic field vector. The two
techniques were developed almost simultaneously by Rayrole (1967) and Semel
(1967), respectively.
Both techniques are based on the observation of the right and left (positive and
negative) circular polarization in a magnetically sensitive spectral line. Such quan-
tities are connected with the Stokes parameters I and V by the simple relation
1
I± (λ) = I(λ) ± V (λ) . (11.16)
2
In the bisector technique the two profiles I+ (λ) and I− (λ) are observed simultane-
(b) (b)
ously, and their ‘central’ wavelengths or bisectors, λ+ and λ− respectively, are
640 CHAPTER 11
recovered via a procedure described in more detail below. The longitudinal com-
ponent of the magnetic field is then obtained from the equation (see Eq. (3.13) for
the definition of ∆λB )
(b) (b)
where λ0 is in Å, (λ+ − λ− ) in mÅ, and B in G.
The center-of-gravity technique is based, rather than on the profiles I± (λ), on
the direct measurement of their first-order moments defined by1
1
I − I± (λ) λ dλ
2 c
λ± =
(g)
, (11.19)
1
Ic − I± (λ) dλ
2
where Ic is the intensity of the continuum (assumed unpolarized), and where the
integrals are extended to the full line profile. The longitudinal component of the
magnetic field vector is obtained from equations strictly similar to Eqs. (11.17)-
(11.18)
(g) (g)
λ+ − λ− = 2 ḡ ∆λB cos θ , (11.20)
or, numerically
1.071 × 109 (g) (g)
B = B cos θ = 2 λ+ − λ− . (11.21)
ḡ λ0
Equations (11.17) and (11.20) are indeed exact for any Zeeman triplet (normal
or anomalous) formed in an arbitrary atmosphere with a constant magnetic field
parallel or antiparallel to the line of sight (θ = 0◦ or 180◦ referring to the geometry
of Fig. 9.1). This particular case has been analyzed in Sect. 9.10. From Eqs. (9.126)
we have, for θ = 0◦
1 ¯
I± (λ) = I(λ ∓ g ∆λB ) ,
2
¯
where I(λ) is the intensity that would be observed if the magnetic field were
switched off leaving all the other physical parameters (including the source func-
tions) unchanged. This equation shows that the profiles I+ (λ) and I− (λ) have
exactly the same shape, and are just shifted in wavelength by 2g∆λB . It follows
(b) (b)
that, irrespective of the procedure followed to deduce λ+ and λ− from the profiles
1 (g)
An instrumental set-up for measuring the quantities λ± , sometimes called a lambdameter,
is described in Semel (1967).
SOLAR MAGNETOMETRY 641
∂I
V (λ) = − ḡ ∆λB cos θ ,
∂λ
1 ∂I 1
I± (λ) = I(λ) ∓ ḡ ∆λB cos θ ≈ I(λ ∓ ḡ ∆λB cos θ) .
2 ∂λ 2
This expression shows that the right and left circular polarization profiles are again
the same except for an overall shift, hence1
Such convolution has the double purpose of reducing the noise contained in the
data and of transforming the original profiles I± (λ), often characterized by two or
1
Note that this result is independent of the actual shape of the profile and is also independent
(b) (b)
of the procedure followed to deduce λ+ and λ− from the profile itself.
642 CHAPTER 11
Fig.11.6. The relative error introduced in the measurement of B by the bisector technique is
plotted as a function of |B | for different values of the inclination angle θ, defined as in Fig.9.1.
The curves are drawn up to a field strength of 3500 G. The figure refers to the line FeI λ 5250.21
formed in a Milne-Eddington atmosphere (see text for further details).
three minima,1 into profiles having just one minimum. Referring to one of the two
profiles, for instance I+ (λ), one then looks for the two solutions, λ1 and λ2 , of the
equation
I+ (λ) = Im + r Iw − Im ,
where Im is the value of I+ corresponding to the minimum of the profile, Iw is
its value in the far wings, and r is a parameter ranging from 0 to 1. From λ1
(b) (b)
and λ2 one finally obtains the bisector λ+ as (λ1 + λ2 )/2. The bisector λ− is
obtained by applying the same procedure to the profile I− (λ). Obviously, the two
parameters ∆λR and r are in principle arbitrary, and the value of B derived from
Eq. (11.18) depends on them. This dependence is weak if the magnetic field is
only mildly inclined with respect to the line of sight but can drastically increase
for large inclinations.
We have checked the reliability of Eqs. (11.18) and (11.21) by selecting a specific
spectral line and by varying the modulus and inclination of the magnetic field. The
line is FeI λ 5250.21, a Zeeman triplet having ḡ = 3 that is currently used in solar
magnetometry for its high sensitivity (see Tables 9.2 and 9.3). In our simulation
the line is formed in a Milne-Eddington model atmosphere with a suitable choice
of atmospheric and line parameters (κL = 6.1, ∆λD = 32 mÅ, a = 0.04, βµ = 12).
(b)
Figure 11.6 shows the relative error, δB , introduced by the bisector technique
1
When the magnetic field is sufficiently large and inclined with respect to the line of sight,
the profiles I± (λ) show three minima, corresponding to the Zeeman components σb , π, and σr .
SOLAR MAGNETOMETRY 643
Fig.11.7. Same as Fig.11.6 for the relative error introduced in the measurement of B by the
center-of-gravity technique.
when the two parameters ∆λR and r are set to 140 mÅ and 0.65, respectively. The
(b)
error δB is defined by
(b)
(b)
B −B
δB = ,
B
(b)
where B is evaluated via Eq. (11.18) and B is the value of the longitudinal
magnetic field employed in the simulation. The figure shows that the relative error
is always less than about 10% for magnetic field’s inclinations smaller than 60◦ .
(g)
Similarly, Fig. 11.7 gives the relative error δB introduced by the center-of-gravity
technique
(g)
(g)
B −B
δB = ,
B
(g)
where B is obtained from Eq. (11.21). In this case the relative error is always
less than about 10%, with a definite tendency to underestimate the actual value of
B .
The bisector technique and the center-of-gravity technique provide only the lon-
gitudinal component of the magnetic field. They have to be complemented with
other techniques in order to obtain the full measurement of the magnetic field
vector (see e.g. Rayrole, 1967, for the possibility of recovering the transverse com-
ponent from the maximum amplitude of the V Stokes parameter). Moreover, both
techniques are limited by systematic errors which, however, can be fairly well es-
timated by means of correction tables adapted to the specific observation, and
constructed via procedures similar to that followed above for obtaining Figs. 11.6
644 CHAPTER 11
and 11.7. The main advantage of these techniques is the possibility of obtaining a
fairly accurate estimate of the longitudinal component of the magnetic field (not
affected by saturation effects) either by a simple procedure of data handling on the
profiles I± (λ) (bisector technique) or by a direct measurement of the quantities
(g)
λ± (center-of-gravity technique).
One of the major problems of solar magnetometry is the fact that, in many cases,
the magnetic field is not uniform across the resolution element of the instrument
employed in the observations. A second problem, intimately related to the first,
is the fact that the local values of the thermodynamic parameters of the solar
atmosphere, being strongly correlated with the amplitude and direction of the
magnetic field vector, may also show large spatial variations over the same area.
In this situation, any method or technique that can be envisaged to deduce the
magnetic field vector – or one of its components – from observations is obviously
bound to yield a sort of ill-defined ‘average’ value.
In this section we will analyze in some detail this particular aspect of solar mag-
netometry, by illustrating how the different techniques that have been described in
the previous sections are affected by the lack of resolution in the observations.
where the integrals in dS are extended to the solar region Σ covered by the in-
strument. A more significant expression can be obtained by introducing the local
value of the magnetograph signal, SV (P), defined by
V (λ; P) p(λ) dλ
SV (P) = ,
I(λ; P) p(λ) dλ
which shows that the ‘unresolved’ magnetograph signal (SV )Σ is an average over
the observed area of the ‘resolved’ magnetograph signal SV (P) weighted by the
filtered intensity.
As shown in Sect. 11.1, the magnetograph signal is proportional to the longitu-
dinal component of the magnetic field vector under the weak field approximation.
Therefore, if the magnetic field is everywhere weak across the surface Σ, one has,
recalling Eq. (11.3)
C (P) B (P) I(P) dS
SV Σ = Σ ,
I(P) dS
Σ
Two special cases of this equation deserve some attention. The first is the case
where the intensity profile I(λ; P) is the same as the profile corresponding to the
quiet photosphere. This implies that the quantity I(P) is independent of P and
that C (P) = (C )q , thus Eq. (11.24) reduces to
1
(B )app = B = B (P) dS , (11.25)
A Σ
where A is the area of the surface Σ. This result is at the basis of the state-
ment – often quoted in solar magnetometry – according to which the longitudinal
magnetograph measures the flux of the magnetic field vector across the observed
area. Apart from an obvious dimensional factor (needed to transform the average
longitudinal field into the magnetic flux), and from an equally obvious geometrical
646 CHAPTER 11
distinction (the longitudinal component coincides with the vertical component only
at disk center), the above derivation shows that this statement is only valid under
a heavy set of assumptions (field everywhere weak, intensity profile independent of
point P and coincident with the average photospheric profile).
The second special case of Eq. (11.24) occurs when one assumes that a fraction
f of the observed area is covered by a uniform magnetic field having longitudinal
component (B )m , whereas the remaining fraction (1 − f ) is just the quiet (non-
magnetic) solar atmosphere. The quantity f is often referred to as the filling
factor . The intensity profile is assumed to be constant throughout the ‘magnetic
fraction’ of the observed area and may differ from the profile corresponding to the
quiet atmosphere. This implies different values for the filtered intensities of the
‘magnetic’ and ‘quiet’ fractions (Im and Iq , respectively) as well as different values
for the calibration constants (C )m and (C )q . Under these assumptions we obtain
from Eq. (11.24)
f (C )m Im
(B )app = (B )m . (11.26)
(C )q (1 − f ) Iq + f Im
If, for instance, the intensity profile in the magnetic region is simply scaled by a
factor Θ (a sort of thermodynamic parameter) with respect to the profile corre-
sponding to the quiet atmosphere, one has
Im = Θ Iq , (C )m = (C )q , (11.27)
so that
fΘ
(B )app = (B )m . (11.28)
1−f +fΘ
Equations (11.26) and (11.28), though valid under a number of simplifying as-
sumptions, are able to account for a phenomenon known since a long time in solar
magnetometry, namely the differences that are commonly found when comparing
the values of B derived from longitudinal magnetographs operating simultane-
ously on different spectral lines (Harvey and Livingston, 1969). Such differences
cannot be interpreted as due only to magnetic saturation effects (see Sect. 11.1)
and imply that each spectral line has its own peculiar behavior in magnetic regions
(its own Θ value, for instance, in the simplified model leading to Eq. (11.28)). In
other words, since the thermodynamic structure of magnetic regions is different
from that of the quiet atmosphere, each line is characterized by a different amount
of weakening (or strengthening) which eventually affects the apparent value of B
measured by a longitudinal magnetograph.
In order to analyze the influence of unresolved fields on the results obtained by
vector magnetographs, Eq. (11.23) has to be complemented with the corresponding
equations for the Stokes parameters Q and U (see Eqs. (11.6))
dS Q(λ; P) p (λ) dλ
SQ Σ = Σ
dS I(λ; P) p (λ) dλ
Σ
SOLAR MAGNETOMETRY 647
dS U (λ; P) p (λ) dλ
SU Σ = Σ . (11.29)
dS I(λ; P) p (λ) dλ
Σ
Denoting by χ(P) the azimuth of the magnetic field at point P reckoned in the ob-
server’s frame, Eqs. (11.29) can be rewritten in the following form (cf. Eqs. (11.11))
SQ̃ (P) cos 2χ(P) − SŨ (P) sin 2χ(P) I (P) dS
SQ Σ = Σ
I (P) dS
Σ
As shown in Sect. 11.2, when the magnetic field is weak, the signal SŨ is zero
(Eq. (11.7)), whereas the signal SQ̃ is proportional to the square of the transverse
component of the magnetic field vector (Eq. (11.9)). Thus, if the magnetic field is
everywhere weak across the surface Σ, one has
2
C ⊥ (P) B⊥ (P) cos 2χ(P) I (P) dS
SQ Σ = Σ
I (P) dS
Σ
2
C ⊥ (P) B ⊥ (P) sin 2χ(P) I (P) dS
SU Σ = Σ , (11.30)
I (P) dS
Σ
where the symbol · · · means an average over the surface Σ as in Eq. (11.25), and
the apparent transverse component is given by
2
(B⊥ )app = B⊥ (P) cos 2 χ(P) − χapp .
The second is the case of the filling factor hypothesis. Here, denoting by the
index ‘m’ all quantities referring to the magnetic region, and by the index ‘q’ all
quantities referring to the quiet atmosphere, one has
χapp = χm
and
f (C⊥ )m Im
(B⊥ )app = (B⊥ )m , (11.33)
(C⊥ )q (1 − f ) Iq + f Im
fΘ
(B⊥ )app = (B⊥ )m . (11.34)
1−f +fΘ
It is also interesting to combine these last expressions with Eqs. (11.26) or (11.28)
in order to find the influence of the lack of resolution on the inclination angle θ of
1
Note that Eq. (11.31) leaves an ambiguity of 90 ◦ in the value of χapp . This ambiguity can
be removed by requiring that the argument of the square root in Eq. (11.32) be positive.
SOLAR MAGNETOMETRY 649
Fig.11.8. The ‘apparent’ inclination of the magnetic field vector is plotted as a function of the
true inclination angle of the field in an unresolved magnetic region. The parameter x is, as an
order of magnitude, equal to the filling factor f (see text).
the magnetic field vector with respect to the line of sight. The ‘apparent’ value of
the inclination angle, θapp , is connected with the value in the unresolved magnetic
region, θm , by the relation
where1
fΘ
x= .
1−f +fΘ
The relation between θapp and θm is shown in Fig. 11.8 for different values of the
parameter x. The figure clearly shows that when the filling factor f is very small
(hence x is also very small), the magnetic field recovered by a vector magnetograph
tends to appear as a highly inclined field (θ close to 90◦ ), no matter what the true
inclination of the field in the unresolved region is.
Up to now we have basically restricted our discussion to weak fields. When strong,
unresolved fields are present, things get more complicated because saturation effects
come into play and it is difficult, in general, to disentangle such effects from those
1
This expression for x is obtained by combining Eqs. (11.28) and (11.34). The combination
of Eqs. (11.26) and (11.33) leads to a more complicated expression for x which, however, is still
proportional to the filling factor f under the limit f → 0.
650 CHAPTER 11
(i)
where Iq (λ) is the intensity of the solar radiation coming from the quiet atmo-
(i) (i)
sphere, Im (λ) and Vm (λ) are the intensity and circular polarization emerging from
the magnetic region, and p1 (λ), p2 (λ) are two identical profiles, the first centered
in a wing of line 1 and the second in the corresponding wing of line 2, at the same
distance from line center. If the two lines are identical from the point of view of
their thermodynamic behavior, this implies that
Iq(1) (λ) p1 (λ) dλ = Iq(2) (λ) p2 (λ) dλ , (11.36)
and if the magnetic field in the magnetic region is weak we also have
(1) (2)
Im (λ) p1 (λ) dλ = Im (λ) p2 (λ) dλ
where ḡ1 and ḡ2 are the effective Landé factors of the two lines. Substituting these
relations into Eq. (11.35) and evaluating the ratio R of the two signals one has,
irrespective of the filling factor and of any possible weakening (or enhancement) of
the lines in the magnetic region
SV 1 ḡ
R= = 1 .
SV 2 ḡ2
If, on the contrary, the magnetic field falls outside the weak field regime, it can be
expected that the line having the larger Landé factor (line 1, say) saturates more
than the other, so that
ḡ
R< 1 .
ḡ2
SOLAR MAGNETOMETRY 651
Fig.11.9. A simulation of the ratio R of the circular polarization signals observed by a longitudinal
magnetograph in the two lines λ 5250.21 and λ 5247.05, plotted as a function of the magnetic field
intensity for different values of the inclination angle θ (full line: θ = 0◦ or 180◦ ; dotted line:
θ = 30◦ or 150◦ ; dashed line: θ = 60◦ or 120◦ ).
Since the ratio R is a function of the magnetic field vector that can be evaluated
through model calculations, it follows that from the observed value of R it is possi-
ble to obtain an order-of-magnitude estimate of the magnetic field intensity in the
unresolved structure.
By a fortunate coincidence, the solar spectrum does indeed present a pair of
lines particularly suitable for the application of the line-ratio technique. These are
the two FeI lines at 5250.21 Å (line 1) and 5247.05 Å (line 2). Both lines belong to
multiplet n. 1 of FeI 1 and happen to have practically the same value for the quantity
g̃ f (α J → αu Ju ), g̃ being the degeneracy of the lower level of the transition
and f (α J → αu Ju ) the oscillator strength defined in Eq. (9.20). Laboratory
measurements by Blackwell et al. (1979) give in fact
Log(g̃f ) line 1 = −4.938 , Log(g̃f ) line 2 = −4.946 ,
with a relative difference less than 2%. This property, together with the fact that
the lower levels of the two lines are very close in energy (the difference is 0.034 eV)
and with the further property of the closeness in the spectrum (which implies
the same Doppler broadening) assures that the two lines have, to a very good
approximation, the same absorption coefficient and hence the same behavior for
any stratification of the thermodynamic parameters with depth.
Figure 11.9 shows the behavior of the ratio R for the two lines mentioned as
a function of the magnetic field intensity, for different field inclinations. R was
1
Line 1 corresponds to the transition a5 D0 − z 7 Do1 . It is a normal Zeeman triplet having
ḡ = 3. Line 2 corresponds to the transition a5 D2 − z 7 Do3 . It is a Zeeman multiplet with ḡ = 2.
652 CHAPTER 11
computed under the limit of very small filling factor which yields, taking into
account Eqs. (11.35) and (11.36)
Vm(1) (λ) p1 (λ) dλ
R= .
Vm(2) (λ) p2 (λ) dλ
The V Stokes profiles were computed assuming the two lines to be formed in a
Milne-Eddington atmosphere with the same set of atmospheric and line parameters
(the same values as in Fig. 11.6 – which refers to the λ 5250.21 line – were used,
namely κL = 6.1, ∆λD = 32 mÅ, a = 0.04, βµ = 12). The profiles p1 (λ) and p2 (λ)
were supposed to have a Gaussian shape as in Eq. (11.5) with ∆λF = 75 mÅ, and
to be positioned at 60 mÅ from line center for both lines. As expected from our
previous discussion, the ratio R approaches asymptotically the value 1.5 (= ḡ1 /ḡ2 )
under the limit B → 0, and decreases monotonically with increasing B.
Through the line-ratio technique applied to this specific pair of lines it has been
possible to give a quantitative estimate of the dimensions and of the intrinsic field
intensity of magnetic regions observed at low spatial resolution by magnetographic
techniques (Howard and Stenflo, 1972; Frazier and Stenflo, 1972; Stenflo, 1973).
The magnetic field outside sunspots was found to be concentrated in small areas,
with typical dimensions of the order of 100-300 km, mostly located in the super-
granular network and having intrinsic magnetic field intensities in the kG range.
It should be remarked, however, that in low resolution observations weak fields
tend to be substantially underestimated, because they have a larger probability of
reconnecting at short distances, i.e., within the resolution element of the instru-
ment. This implies that the contribution of weak fields to both the observed signals
(SV )1 and (SV )2 , and hence to their ratio R, is likely to cancel out in many cases.
(thr)
where Ijα (in the right-hand side) is defined in Eq. (11.15) and where (Iα )q , the
intensity of the quiet atmosphere at wavelength point α, is assumed to be known
a priori.1 The filling factor f can now be considered as an additional parameter
that can be deduced, together with the other ones, from the observational data.
1
In raster-type observations, (Iα )q can be obtained by taking the average of the intensity
spectra over all the points of the raster (excluding sunspots and active regions, if necessary).
SOLAR MAGNETOMETRY 653
This procedure, when applied to active region observations, generally shows that
the filling factor f recovered from the fit is much less than unity outside sunspots, a
further indication of the existence of unresolved fields (Lites and Skumanich, 1990).
which shows that the ‘unresolved’ center of gravity of the right (left) circular po-
larization profile is an average over the observed region of the ‘resolved’ center of
gravity of the same profile weighted by the corresponding integrated line depression.
As shown in Sect. 11.4, for any Zeeman triplet formed in a constant magnetic
field parallel to the line of sight (θ = 0◦ in the geometry of Fig. 9.1), one has (see
Eq. (11.22))
(g) (g)
λ+ (P) − λ− (P) = 2 g ∆λB (P)
and
J+ (P) = J− (P) = J (P) ,
654 CHAPTER 11
where A is the area of the surface Σ. This equation shows that, in the case of
unresolved fields, the center-of-gravity technique measures the flux (see comments
after Eq. (11.25)) of the magnetic field vector across the observed area, though
it should be stressed that this statement is only valid under the restrictive set of
assumptions specified above (Zeeman triplet, field parallel or antiparallel to the
line of sight, integrated line depression independent of P).
The second particular case is that of the filling factor hypothesis; one gets (cf.
del Toro Iniesta et al., 1990)
f Jm
(B )app = (B )m ,
(1 − f ) Jq + f Jm
where the indices ‘q’ and ‘m’ refer to the quiet atmosphere and to the magnetic
atmosphere, respectively.
In the previous sections of this chapter we have analyzed in some detail the most
traditional inversion techniques of solar magnetometry, with the aim of providing
the reader with a firm background on the physical phenomena that underlie the
problem of measuring magnetic fields in the solar atmosphere. In this section we
give a brief review and some bibliographical references on more recent techniques,
without however pretending to be exhaustive. It should be realized that the amount
1
The same line of reasoning can be applied to θ = 180◦ and leads to the same expression for
(B )app . Equation (11.38) is thus valid for an arbitrary mixture of fields parallel or antiparallel
to the line of sight.
SOLAR MAGNETOMETRY 655
of work carried out on this subject during almost one century is very large, and a
full treatment is outside the aims of this book.
Fig.11.10. The peak asymmetry and the area asymmetry are defined in terms of the quantities
ab , ar , Ab , and Ar (see Eqs.(11.39)). Ab and Ar (where b and r stand, as customary, for ‘blue’
and ‘red’) are the absolute values of the areas contained between the V profile and the wavelength
axis, ab and ar the absolute values of the peak heights. The figure refers to a positive polarity
observation. For negative polarity it would be reversed about the wavelength axis.
ab − ar Ab − Ar
δa = , δA = , (11.39)
ab + ar Ab + Ar
where the meaning of the different symbols is specified in Fig. 11.10. It turns
out that both the peak asymmetry and the area asymmetry are generally positive
(irrespective of the prevailing polarity of the observed active region), and widely
vary from line to line. Typical values at disk center are of the order of 10% for the
peak asymmetry and 5% for the area asymmetry. Such values generally decrease
with increasing distance from disk center and become negative for many lines at
the extreme limb.
1
Stokes profiles asymmetries had already been observed by the so-called Stokes-I and Stokes-II
polarimeters of the High Altitude Observatory (Baur et al., 1981).
SOLAR MAGNETOMETRY 657
ray path
walls of the
magnetic flux-tube
velocity flow
Fig.11.11. In a funnel-shaped magnetic flux-tube a ray path like (a) gives rise to a V profile with
positive peak asymmetry and positive area asymmetry.
The interpretation of these results has basically been given in terms of the filling
factor hypothesis, by supposing that the polarization signal contained in the obser-
vations (the spectrum of V , or of Q and U ) is due to a collection of unresolved and
ubiquitous magnetic structures, the so-called flux-tubes. In an extensive series of
papers, the geometrical and thermodynamic properties of a kind of ‘average’ flux-
tube have been established by comparing the observed polarization profiles with
numerical solutions to the radiative transfer equations for polarized radiation (see
Solanki, 1993, for a detailed review of the subject). The model that has emerged
from these investigations is that of a slender, funnel-shaped, static magnetic tube
having typical horizontal dimensions of a few hundreds of km, in hydrostatic equi-
librium with the surrounding unperturbed atmosphere. The asymmetries of the V
profiles are interpreted in this model as due to the presence of a macroscopic flow of
material directed downward in the non-magnetic atmosphere surrounding the tube
(Grossman-Doerth et al., 1988b). It can be proved that the ray paths crossing
the walls of the flux-tube, such as that schematized in Fig. 11.11, lead to positive
values for the peak and area asymmetries defined in Eqs. (11.39).1 It should be
remarked that the empirical flux-tube model described above is consistent with a
dynamical model obtained by two-dimensional magnetohydrodynamic simulations
of a collapsing magnetic structure (Knölker et al., 1988).
Notwithstanding the success obtained by the flux-tube hypothesis in explaining
the FTS data, the intrinsic lack of spatial resolution of the data themselves in-
evitably raises some doubts on whether the model described above may indeed
1
The positive sign of the area asymmetry is in fact predicted by the simple slab-model
developed as Case iii of Sect. 9.21. That model can be adapted to the present case by setting
(s) (s)
∆λA < 0 and (∆λB − ∆λB ) < 0. For a line with g > 0, Eqs. (9.256) and (9.257) yield v > 0
for θ = 0 and v < 0 for θ = 180◦ , which are consistent with a positive area asymmetry (cf.
◦
Eq. (9.238)).
658 CHAPTER 11
sin2 θ cos 2χ
Q(λ) = Ic pb (λ)
1 + cos2 θ
1
This result is obtained from Eq. (5.49) with T = 5800 K, ξ = 1 km s−1 , µ = 55.8 (atomic
weight of iron).
2
It should be recalled that Eqs. (9.144) are valid under the restriction of a constant magnetic
field direction. Gradients of the magnetic field intensity and of the line-of-sight velocity are in
principle allowed, provided that no overlapping of the different Zeeman components occurs (cf.
Eqs. (9.138)).
SOLAR MAGNETOMETRY 659
sin2 θ sin 2χ
U (λ) = Ic pb (λ)
1 + cos2 θ
cos θ
V (λ) = 2Ic pb (λ) , (11.40)
1 + cos2 θ
- in the π component
I(λ) = Ic 1 − pp (λ)
Q(λ) = −Ic pp (λ) cos 2χ
U (λ) = −Ic pp (λ) sin 2χ
V (λ) = 0 , (11.41)
- in the σr component
I(λ) = Ic 1 − pr (λ)
sin2 θ cos 2χ
Q(λ) = Ic pr (λ)
1 + cos2 θ
sin2 θ sin 2χ
U (λ) = Ic pr (λ)
1 + cos2 θ
cos θ
V (λ) = −2Ic pr (λ) , (11.42)
1 + cos2 θ
where Ic is the continuum intensity, θ and χ are the inclination and azimuth an-
gles defined in Fig. 9.1, and pb (λ), pp (λ), pr (λ) are three profiles centered at the
wavelengths (λ0 − g ∆λB ), λ0 , and (λ0 + g ∆λB ) respectively, g being the Landé
factor and ∆λB the Zeeman splitting defined in Eq. (3.14). The two profiles pb (λ)
and pr (λ) have the same shape and coincide with the profile of a fictitious, non-
magnetic line formed in the same atmosphere and having a ratio between line and
continuum absorption coefficient given by
1
κ= 1 + cos2 θ κL ,
2
where κL is the corresponding quantity for the full line. The profile pp (λ) has
a different shape and coincides with the profile of a fictitious, non-magnetic line
having
κ = sin2 θ κL .
Equations (11.40)-(11.42) can be used to derive the magnetic field vector from
the observed Stokes profiles of an infrared triplet with well-separated σ and π
components. The field intensity is directly obtained by measuring the wavelength
separation between any two components (σ or π) in any of the four Stokes param-
eters and by using Eq. (3.14).1 The azimuth χ is obtained by considering the ratio
1
Note that to perform this operation there is no need for polarimetric observations, it
660 CHAPTER 11
U (λ)
tan 2χ = .
Q(λ)
Finally, the inclination θ is obtained from the ratio between V and Q2 + U 2 in
either the σb or the σr component. For instance, in the σb component
2 cos θ V (λ)
= ,
sin2 θ Q (λ) + U 2 (λ)
2
whence √
1 + x2 − 1
θ = arccos ,
x
where
V (λ)
x= .
Q2 (λ) + U 2 (λ)
Obviously, in the case of noisy data this simple procedure can be replaced by a
more sophisticated one involving the minimization of an appropriate merit function,
similarly to what discussed in Sect. 11.3 about the Unno-fit technique.
is sufficient to have the intensity spectrum. Moreover, this operation is so simple as to justify
the statement, often heard about infrared lines, ‘A ruler is all that is needed to measure the
magnetic field’. However, Eqs. (11.40)-(11.42) show that the intensity spectrum carries very little
information about θ and no information at all about χ .
SOLAR MAGNETOMETRY 661
where B0 and β are the parameters specifying the source function, A is the prop-
agation matrix defined in Eqs. (9.164), and where the meaning of the symbol · · ·
is defined in Eq. (9.271).
In the MISMA hypothesis, the stochastic average of the propagation matrix, A,
is assumed to be represented by an expression of the form
n
A = fi Ai . (11.44)
i=1
This is a special case of Eq. (9.271) corresponding to the assumption that the
atmosphere consists of n independent components, each one characterized by its
own propagation matrix, Ai , and its own probability (or occupation fraction), fi .
By means of Eqs. (11.43) and (11.44) it is possible to reproduce rather well the
typical asymmetries of the Stokes profiles observed either in sunspots, or in facular
regions or on the quiet sun, even if this requires, in some cases, the introduction
of up to three components (Sánchez Almeida et al., 1996). For explaining facular
observations, for instance, one needs one prevailing, non-magnetic component with
an inclined flow, a static magnetic component with a field slightly inclined with
respect to the vertical, and a supplementary magnetic component having a field
again slightly inclined with respect to the vertical and a flow of material antiparallel
to the magnetic field.
The MISMA hypothesis has also been generalized to allow for a depth depen-
dence of the matrices Ai of Eq. (11.44) and an arbitrary behavior with depth of
the source function BP . An inversion technique for this case has been proposed by
Sánchez Almeida (1997). Broadly speaking, this technique resembles, in its math-
ematical details, the one that we have already described in Subsect. 11.6.a, with
the difference that the radiative transfer equation that is solved to find a fit to the
observations is the ‘stochastic’ equation in the microturbulent limit, namely
dI
= A I − BP U ,
dτ
where the matrix A is given by Eq. (11.44) and BP is the local value of the Planck
function. This generalized version of the MISMA hypothesis can lead to a very large
number of degrees of freedom. To narrow down the number of possibilities, the
inversion code uses several restrictive assumptions based on magnetohydrodynamic
considerations. This inversion procedure seems very promising and has proved
particularly successful for interpreting the asymmetries observed in the V profile
in quiet sun observations (Sánchez Almeida and Lites, 2000).
11.7. Disambiguation
We have already remarked in Sect. 9.5 that the radiative transfer equation for po-
larized radiation is invariant under a rotation of 180◦ of the magnetic field vector
662 CHAPTER 11
about the line of sight, provided atomic polarization phenomena are neglected (see
the introduction to Chap. 9). In most of the spectral lines that are commonly
employed in solar magnetometry, atomic polarization is believed to play an in-
significant role. It follows that the magnetic field vector recovered from standard
observations suffers from an intrinsic 180◦ ambiguity in its azimuth. This raises the
fundamental problem – often referred to as the disambiguation problem – of find-
ing the true orientation of the transverse component of the magnetic field vector
between the two alternatives which are left undecided by the observations.
In the case of an isolated observation the problem cannot be solved. On the
contrary, in the case of raster-type observations, several different strategies can be
followed. Important contributions to this matter have been given by Cuperman et
al. (1990), Canfield et al. (1993), Lites et al. (1995), and Semel and Skumanich
(1998). Just to give an example, we describe in some detail the procedure followed
by Lites et al. (1995): one starts by considering the sunspots as ‘azimuth centers’
from which the transverse component of the magnetic field must diverge for positive
polarity or to which must converge for negative polarity. This criterion solves the
ambiguity for the regions surrounding the sunspots. In plage areas, and away from
regions of newly emerging flux, one selects the azimuth which gives a field direction
closer to the local vertical.1 Finally, when the previous criteria are not effective,
the selection is made by minimizing the point-by-point discontinuity of the azimuth
in the observer’s frame. By so doing, a ‘map’ of the magnetic field is recovered and
this also allows one to determine one component of the electric current density jz
(with z the vertical axis) through the equation
∂By ∂Bx
4π jz = − .
∂x ∂y
1
This criterion is based on the idea that in such regions the magnetic field is nearly always
oriented close to the vertical direction. Obviously, the criterion does not work for observations
performed at disk center.
CHAPTER 12
ASTROPHYSICAL APPLICATIONS:
RADIATION ANISOTROPY IN STELLAR ATMOSPHERES
As shown in Chaps. 5, 7, and 10, the anisotropy of the radiation field, suitably de-
K
scribed by the irreducible tensor JQ (ν) defined in Eqs. (5.157), is the basic physical
quantity controlling the amount of atomic polarization in atomic energy levels. Re-
ferring for instance to the idealized case of a two-level atom with unpolarized lower
level, Eq. (10.10) shows that in the non-magnetic, collisionless regime, the statisti-
cal tensors of the upper level are proportional to the corresponding components of
the radiation field tensor. In the following chapters we will discuss the phenomena
of resonance polarization and the Hanle effect – which are intimately connected
with the presence of atomic polarization – in the astrophysical context. In view of
this analysis, it is useful to study in some detail the properties of the radiation field
anisotropy in some of the most common astrophysical scenarios, like the interior
or the outer layers of a ‘classical’ stellar atmosphere.1
The study presented in this chapter is restricted to schematic situations where
the atmospheric structure is described by analytical, plane-parallel models. In
particular, we will consider the Milne-Eddington and the grey model atmospheres,
and we will find the behavior of the radiation field anisotropy both ‘inside’ the
atmosphere (as a function of optical depth) and ‘outside’ the atmosphere (as a
function of height over the stellar surface).
It should be remarked from the very beginning that a plane-parallel atmosphere
is a cylindrically symmetrical environment, the symmetry axis being the vertical
to the atmosphere itself. It is readily seen from Eqs. (5.157) and Table 5.6 that,
in such an environment, the only non-zero components of the irreducible radiation
field tensor (in a reference system with the z-axis directed as the vertical) are J00 and
J02 . A single parameter is therefore sufficient to fully characterize the anisotropy
of the radiation field. This parameter is the so-called anisotropy factor, already
defined in Chap. 10 (cf. Eqs. (10.62) and (10.160))
√ J02 (ν)
wν = 2 , (12.1)
J00 (ν)
where
√ the expressions of J00 (ν) and J02 (ν) are given by Eqs. (5.164) and the factor
2 is introduced for convenience. We remind the reader (cf. Sect. 10.7) that the
value of the anisotropy factor is always contained in the range (−1/2, 1), being 0 for
an isotropic radiation field, 1 for an infinitely sharp beam parallel or antiparallel to
the z-axis, and −1/2 for an azimuth-independent radiation field in the x-y plane.
1
By ‘classical’ we mean an atmosphere where the radiation field is unpolarized and therefore
described by the only intensity.
664 CHAPTER 12
The main body of this chapter is devoted to the analysis of the anisotropy factor
in stellar atmospheres. In Sect. 12.4 we will consider some typical phenomena
of ‘symmetry breaking’ which entail the appearance of further components of the
K
tensor JQ (ν).
tν t −t
− ν ν dtν
Iν (tν , µ) = Sν (tν ) e |µ| (µ < 0) , (12.4)
|µ|
0
where we have assumed that the atmosphere is not irradiated by any external
source.
We now evaluate the moments of the intensity, defined by (cf. Chandrasekhar,
1950)
1
1
Jν (tν ) = Iν (tν , µ) dµ
2
−1
1
The absorption coefficient considered here is the ‘scalar’ absorption coefficient of the
traditional (non polarized) theory of stellar atmospheres. For our purposes it is not necessary to
specify the physical processes contributing to kν .
RADIATION ANISOTROPY IN STELLAR ATMOSPHERES 665
1
Fν (tν ) = 2 Iν (tν , µ) µ dµ
−1
1
1
Kν (tν ) = Iν (tν , µ) µ2 dµ . (12.5)
2
−1
∞ −xy 1
e x
−µ
En (x) = dy = e µn−2 dµ (x > 0 ; n = 0, 1, 2, . . .) , (12.6)
yn
1 0
where
tν − tν
σ(tν − tν ) = sign (tν − tν ) = . (12.8)
|tν − tν |
The anisotropy factor at optical depth tν can be expressed in terms of the mo-
ments Jν (tν ) and Kν (tν ). From Eqs. (12.1), (5.164) and (12.5) we have
1
The flux is indeed π Fν .
666 CHAPTER 12
This expression is valid under the only assumption that the atmosphere is plane-
parallel (hence cylindrically symmetrical).
Now we consider the special case of the Milne-Eddington model, which implies
that the source function coincides with the local Planck function (LTE hypothesis)
and that the latter varies linearly with the optical depth tν ,1
Substituting into Eq. (12.10), and performing elementary integrals,2 one finds the
following expression
1 βν
wν (0) = . (12.13)
4 βν + 2
Figure 12.1 shows the behavior of the anisotropy factor with optical depth as de-
rived from Eq. (12.12), for several values of βν (only positive values are considered,
which implies a source function increasing with optical depth). Such behavior can
be understood via the following arguments. First of all it should be noticed that,
as apparent from Eqs. (5.164), the component J02 (ν) – hence the anisotropy fac-
tor – tends to be positive when the radiation flowing ‘almost vertically’ (θ 0◦ or
θ 180◦ ) is more intense than the radiation flowing ‘almost horizontally’ (θ 90◦ ),
and negative in the opposite case.3 Next we write down the expressions of the inten-
sity Iν (tν , µ) for the Milne-Eddington model atmosphere. From Eqs. (12.3)-(12.4)
we obtain, using Eq. (12.11)4
1
Apart from slightly different notations, these assumptions are the same as those introduced
in Sect. 9.8 when discussing the Unno-Rachkovsky solution.
2
It is useful to recall that the exponential integrals satisfy the relations
d 1
E (x) = −En−1 (x) (n > 0) , En (0) = (n > 1) .
dx n n−1
3 √
The boundary between the two cases involves the Van Vleck angle θV = arccos (1/ 3), cf.
Eq. (5.100).
4
Equations (12.14) can also be derived as a special case of the Unno-Rachkovsky solutions
(Eqs. (9.108)) corresponding to no magnetic field. It should be kept in mind, however, that tc is
the continuum optical depth, while tν is the total (line plus continuum) optical depth.
RADIATION ANISOTROPY IN STELLAR ATMOSPHERES 667
Iν (tν , µ) = B0 1 + βν µ + βν tν (µ > 0)
−
tν , t -
− ν
Iν (tν , µ) = B0 1 + βν tν − e |µ| − βν |µ| 1 − e |µ| (µ < 0) . (12.14)
These expressions show that, for the outward-directed radiation (µ > 0), the in-
tensity increases – provided βν > 0 – with increasing µ, whatever the value of tν .
The opposite occurs for the radiation flowing inward (µ < 0): at any given tν , the
intensity decreases as |µ| is increased.
The former effect is responsible for limb darkening and will be referred to as the
source-function gradient effect: it gives a positive contribution to the anisotropy
factor. The latter, which may be called the surface effect , gives a negative con-
tribution to the anisotropy factor. For βν = 0 the radiation flowing outward is
isotropic, thus only the surface effect is at work and, as shown in Fig. 12.1, the
anisotropy factor is always negative. By contrast, for large values of βν the source-
function gradient effect takes over and the anisotropy factor is always positive. For
intermediate values of βν , the anisotropy factor is positive at tν = 0 (as shown by
Eq. (12.13)) and then decreases with increasing tν . If βν is smaller than a thresh-
old value (that is numerically found to be 1.1098) the anisotropy factor becomes
negative beyond a certain value of tν , otherwise it remains positive at any tν . In
any case, the anisotropy factor tends to zero for tν → ∞, because both the outward
and the inward-directed radiation become isotropic.
The maximum value of the anisotropy factor in the Milne-Eddington model at-
mosphere is 0.25. Such value is found at tν = 0 in a high-gradient atmosphere
668 CHAPTER 12
Integration of Eq. (12.15) over frequency yields the transfer equation for I(t, µ)
d
µ I(t, µ) = I(t, µ) − BP (t) , (12.18)
dt
RADIATION ANISOTROPY IN STELLAR ATMOSPHERES 669
where
∞
σ 4
BP (t) = BP ν, T (t) dν = T (t) , (12.19)
π
0
F (t) = F0 (12.20b)
1
K(t) = F0 t + Q , (12.20c)
4
where Q is an integration constant. Equations (12.20a) and (12.18) allow us to
write the zero-order moment of the frequency-integrated intensity in the form (cf.
Eq. (12.7a))
∞
1
J(t) = E1 |t − t| J(t ) dt , (12.21)
2
0
while for the first-order moment we have, considering the flux at the surface (cf.
Eq. (12.7b))
∞
F0 = 2 E2 (t) J(t) dt . (12.22)
0
J(t) = 3 K(t) ,
Following Hopf (1934), the function J(t) is written, for an arbitrary value of t, in
the form
3
J(t) = F0 t + q(t) , (12.23)
4
where q(t) is the so-called Hopf function which obviously satisfies the property
lim q(t) = Q .
t→∞
Substitution of Eq. (12.23) into Eq. (12.21), and evaluation of elementary integrals,1
leads to the following non-homogeneous integral equation for q(t)
∞
1 1
q(t) = E3 (t) + E1 |t − t| q(t ) dt . (12.24)
2 2
0
On the other hand, substitution of Eq. (12.23) into Eq. (12.22) shows that the Hopf
function obeys the relation
∞
1
E2 (t) q(t) dt = . (12.25)
3
0
The Hopf function has been widely studied by means of analytical, as well as
numerical techniques (Chandrasekhar, 1950; Kourganoff and Busbridge, 1952). In
particular, it can be proved that q(0), the surface value of the Hopf function, is
given by (see App. 15)
1
q(0) = √ . (12.26)
3
With modern computing techniques, the evaluation of the Hopf function is rather
straightforward. A simple numerical algorithm is described in App. 16, and a graph
of q(t) is given in Fig. 12.2. The figure√shows that q(t) is a monotonic function
which grows from the surface value 1/ 3 = 0.57735 to the boundary value at
t → ∞, Q = 0.71044.
The anisotropy factor for the frequency-integrated intensity in a grey atmosphere
can easily be expressed in terms of the Hopf function. Considering the grey ana-
logue of Eq. (12.9) one obtains, with the use of Eqs. (12.20c) and (12.23)
Fig.12.2. The Hopf function, solution of the integral equation (12.24), is plotted against optical
depth.
Fig.12.3. The anisotropy factor for the frequency-integrated intensity in the grey model atmo-
sphere is plotted as a function of optical depth. Its surface value is 0.11526.
Figure 12.3 shows a graph of the anisotropy factor w versus Log t. The function w(t)
decreases monotonically with increasing t – as obvious from Eq. (12.27). According
to the discussion presented at the end of the former section this means that, in the
grey model atmosphere, the source-function gradient effect prevails over the surface
effect at all optical depths.
In the grey atmosphere it is also possible to find the behavior with optical depth
of the anisotropy factor at a specific frequency (or wavelength). The expression of
the ‘monochromatic’ anisotropy factor, wν (t), is obtained from Eqs. (12.9), (12.7a)
and (12.7c) by performing the substitutions in Eq. (12.16). It is therefore necessary
to determine the dependence of the temperature T on optical depth, and then
to evaluate numerically the integrals in Eqs. (12.7a,c). Introducing the effective
672 CHAPTER 12
Fig.12.5. The surface value of the anisotropy factor of a grey atmosphere is plotted as a function
of the normalized frequency x.
d − hν −1 h ν dT
βν = ln BP ν, T (t) = 1 − e kB T ,
dt kB T 2 dt
In the previous sections we have analyzed the anisotropy factor in the interior of a
stellar atmosphere. Here we extend such analysis to the ‘outer’ environment of a
star. Referring to Fig. 12.6, let us consider a point P located at a height h over the
stellar surface. Neglecting the possible presence of starspots over the spherical cap
seen from point P, the radiation field is cylindrically symmetrical about the vertical
direction, and its properties are fully described by the two components J00 (ν) and
J02 (ν) of the radiation field tensor or, alternatively, by J00 (ν) and the anisotropy
factor wν .
Similarly to Sect. 12.1, we introduce the two integrals
1 1
1 1
Jν = Iν (µ) dµ , Kν = Iν (µ) µ2 dµ , (12.29)
2 2
−1 −1
where Iν (µ) is the specific intensity of the radiation propagating through point P
along the direction specified by the angle θ, with µ = cos θ. The anisotropy factor
674 CHAPTER 12
Fig.12.6. Geometry for the calculation of the anisotropy factor in the outer layers of a star. R∗
is the stellar radius.
3Kν − Jν
wν = . (12.30)
2 Jν
N
Iν (α) = Iν (0) 1 − ui (ν) 1 − cosi α , (12.31)
i=1
where α is the angle between the propagation direction of the radiation and the ver-
tical to the atmosphere, and where ui (ν) are empirically determined coefficients.1
As apparent from Fig. 12.6, the angle α is related to the angle θ. Since
R∗
sin γ = , (12.32)
R∗ + h
1
The direct determination of these coefficients from observations is possible only for the
sun. In the stellar case, they can be determined from a fit to theoretical results based on model
atmospheres.
RADIATION ANISOTROPY IN STELLAR ATMOSPHERES 675
1
1
N
(µ2 − cos2 γ)i
Kν = Iν (0) u0 (ν) + ui (ν) µ2 dµ ,
2 i=1
sini γ
cos γ
where
N
u0 (ν) = 1 − ui (ν) .
i=1
We now restrict our analysis to the case of a quadratic expansion of the limb-
darkening law (N = 2 in Eq. (12.31)). After some algebra, which involves the
evaluation of elementary integrals, one gets
1
Jν = I (0) a0 + a1 u1 (ν) + a2 u2 (ν)
2 ν
1
Kν = Iν (0) b0 + b1 u1 (ν) + b2 u2 (ν) , (12.34)
2
where
a0 = 1 − Cγ
1 1 Cγ2 1 + Sγ
a1 = Cγ − − ln
2 2 Sγ Cγ
(Cγ + 2)(Cγ − 1)
a2 = (12.35)
3(Cγ + 1)
and
1
b0 = 1 − Cγ3
3
1 3 1 Cγ4 1 + Sγ
b1 = 8Cγ − 3Cγ − 2 −
2
ln
24 8 Sγ Cγ
(Cγ − 1)(3Cγ3 + 6Cγ2 + 4Cγ + 2)
b2 = ,
15(Cγ + 1)
676 CHAPTER 12
with
Cγ = cos γ , Sγ = sin γ .
Substitution into Eq. (12.30) yields
1 c0 + c1 u1 (ν) + c2 u2 (ν)
wν = , (12.36)
2 a0 + a1 u1 (ν) + a2 u2 (ν)
where
c0 = 3b0 − a0 = Cγ Sγ2
1 Cγ2 1 + Sγ
c1 = 3b1 − a1 = 8Cγ − 3Cγ − 8Cγ + 2 + (4 − 3Cγ )
3 2 2
ln
8 Sγ Cγ
Cγ − 1
c2 = 3b2 − a2 = 9Cγ3 + 18Cγ2 + 7Cγ − 4 . (12.37)
15(Cγ + 1)
It is instructive to consider some limiting cases of Eq. (12.36). The first is the case
of a star without limb darkening. Setting u1 (ν) = u2 (ν) = 0, Eq. (12.36) reduces
to
1 c0 1
wν = = Cγ (1 + Cγ ) .
2 a0 2
The anisotropy factor is independent of frequency and is simply due to geometri-
cal effects. Recalling Eq. (12.32), wν can explicitly be expressed in terms of the
height h . We have √
2x + x2 + (1 + x) 2x + x2
wν = ,
2(1 + x)2
where
h
x= .
R∗
In particular, for small heights (h R∗ ) we obtain the asymptotic behavior
.
x h
wν ≈ = ,
2 2R∗
while for large heights we find the obvious result (see comments after Eq. (12.1))
lim wν = 1 .
h→∞
Next we consider the behavior of wν for small and large values of h/R∗ in the
general case where u1 (ν) and u2 (ν) are non-zero. For h → 0 (or γ → π/2) one
obtains from Eq. (12.36), with the help of Eqs. (12.35) and (12.37)
1 15 u1 (ν) + 16 u2 (ν)
lim wν = wν 0 = .
h→0 20 6 − 3 u1 (ν) − 4 u2 (ν)
RADIATION ANISOTROPY IN STELLAR ATMOSPHERES 677
Fig.12.7. The anisotropy factor in the outer layers of the solar atmosphere is plotted against the
height h normalized to the solar radius R . The full lines correspond, from top to bottom, to
the wavelengths 0.37, 0.50, 0.80, 2.0, and 10.0 µm, respectively. The dashed line is obtained by
neglecting limb darkening. Data for limb darkening are from Allen (1973).
By means of a series expansion it can also be proved that for h R∗ one has
9 1 − u1 (ν) − u2 (ν) 20 − 5 u1 (ν) − 8 u2 (ν) h
wν ≈ wν 0 + 2 .
5 6 − 3 u (ν) − 4 u (ν) 2R∗
1 2
3 15 − 7 u1 (ν) − 10 u2 (ν) 1
wν ≈ 1 − .
10 6 − 2 u1 (ν) − 3 u2 (ν) (1 + h/R∗ )2
For the solar case, the quantities u1 (ν) and u2 (ν) are tabulated by Allen (1973),
and the anisotropy factor can be computed via Eq. (12.36). Figure 12.7 shows the
behavior of wν as a function of height over the solar surface expressed in solar radii.
For a given height, the anisotropy factor is larger in the ultraviolet, consistently
with the results obtained in the previous section (cf. Fig. 12.4).
In the former sections of this chapter we have considered the idealized situation of
a plane-parallel, cylindrically symmetrical stellar atmosphere. The assumption of
cylindrical symmetry implies that the only non-zero components of the radiation
678 CHAPTER 12
field tensor are J00 (ν) and J02 (ν). However, such assumption is only approximately
satisfied in real stellar atmospheres, and there are many different phenomena that
may be responsible for symmetry breaking. In these cases, the radiation field can
no longer be described in terms of the mean intensity and anisotropy factor alone.
Further components of the radiation field tensor come into play, as obvious from
Eqs. (5.157) and Table 5.6. Restricting our analysis to ‘classical’ (or unpolarized,
see footnote on p. 663) atmospheres, the non-zero components of the radiation field
tensor are given by 5
dΩ K ,
K
JQ (ν) = T (0, Ω) I(ν, Ω)
4π Q
or, more explicitly, by
5
dΩ
J00 (ν) = I(ν, Ω)
4π
5
1 dΩ
J02 (ν) = √
(3 cos2 θ − 1) I(ν, Ω)
2 2 4π
√ 5
3 dΩ ± iχ
J±1 (ν) = ∓
2
sin θ cos θ e I(ν, Ω)
2 4π
√ 5
3 dΩ ± 2iχ
.
2
J±2 (ν) = sin2 θ e I(ν, Ω)
4 4π
Consider for instance the solar atmosphere. The phenomena of granulation and
supergranulation, or the presence of any structure typical of solar activity, such
as sunspots, faculae, etc.,1 introduce local symmetry breakings. Obviously, the
quantitative evaluation of the radiation field tensor cannot be carried out without
a detailed three-dimensional model of the structure considered and of its interface
with the surrounding atmosphere.
It is possible, however, to write down simple expressions for the radiation field
tensor at a given point P of the outer atmosphere under the assumption of a
‘point-like’ structure. Let us refer to Fig. 12.8, where the z-axis of the right-
handed, orthogonal frame (xyz) is directed along the vertical through P and the
x-axis points in an arbitrary direction. We suppose the spot to be centered at a
point of the solar surface characterized by the colatitude δ and the longitude ϕ in
the spherical coordinate system illustrated in the same figure. We also suppose
that the spot is seen under a small solid angle ∆Ω from point P and that ∆Iν is
the contrast, as seen from point P, of its intensity with respect to the unperturbed
atmosphere. Under these assumptions, the radiation field tensor at point P is given
by2
1
We recall again that we are concerned here with ‘classical’ atmospheres. In this context
typical symmetry-breaking agents, like magnetic fields, have no direct effect. An ‘active region’
is simply characterized by an intensity contrast with respect to the nearby regions.
2
Equations (12.38) are valid provided the spot is visible from point P, which implies
δ < arccos [R /(h + R )] .
RADIATION ANISOTROPY IN STELLAR ATMOSPHERES 679
Fig.12.8. Geometry for the evaluation of the radiation field tensor at point P in the presence of
a spot over the solar surface.
∆Ω
J00 (ν) = J00 (ν) cyl.s. + ∆Iν
4π
1 ∆Ω
J02 (ν) = J02 (ν) cyl.s. + √ (3 cos2 θ0 − 1) ∆Iν
2 2 4π
√
3 ± iχ0 ∆Ω
2
J±1 (ν) = ∓ sin θ0 cos θ0 e ∆Iν
√ 2 4π
3 ± 2iχ0 ∆Ω
2
J±2 (ν) = sin2 θ0 e ∆Iν , (12.38)
4 4π
where [J00 (ν)]cyl.s. and [J02 (ν)]cyl.s. are the components of the radiation field tensor
as computed in the cylindrically symmetrical case (see former sections), and where
the angles θ0 and χ0 are connected with the spot’s coordinates by the simple
relations
sin δ
χ0 = ϕ − π , θ0 = arctan .
1 − cos δ + h/R
Equations (12.38) can easily be generalized to the case where several structures
are present over the solar surface, and clearly show the role played by activity in
determining the radiation field tensor in the outer solar atmosphere.
680 CHAPTER 12
Fig.12.9. Schematic illustration of the radiation field symmetry breaking at point P due to the
presence of a prominence between P and the solar surface.
where Ic(0) is the continuum intensity at disk center, dc is the central depression of
the line (dc < 1), ν0 and ∆νp are the line central frequency and width, respectively,
u1 is the limb-darkening coefficient, and α is the angle between the direction Ω and
the vertical to the atmosphere. The geometry is illustrated in Fig. 12.10: (xyz) is
the reference system (the comoving frame) which is moving with velocity v with
respect to the star; the velocity direction is specified by the angles θv and χv , the
direction Ω by the angles θ and χ ; the angle α is related to θ and to the relative
height h/R∗ of point P by Eqs. (12.33) and (12.32). Introducing these notations,
and recalling Eq. (12.39), the argument of the exponential in Eq. (12.41) takes the
form
1
The transformation between the two reference systems entails indeed further effects of order
v/c. One is aberration, which implies that a ray propagating along the direction Ω in the star’s
frame propagates, in the comoving frame, along a direction Ω given by Ω (1 + v · Ω
/c) − v /c
(to first order in v/c). The second is the fact that the intensity is multiplied by the factor
(ν/ν )3 ≈ 1 − 3(v · Ω)/c
when passing from the first to the second frame. Both effects are however
rather small, and negligible in most applications. Even considering a velocity v 100 km s−1 ,
aberration implies an angular distance between Ω and Ω of the order of 1 arcmin, whereas the
second effect implies a correction to the intensity of order 10 −3 .
682 CHAPTER 12
θ θ
χ χ
Fig.12.10. Geometry for the evaluation of the comoving-frame radiation field tensor at point P.
The reference system (xyz) – the comoving frame – moves with velocity v relative to the star.
The direction of v is specified by the angles θv and χv .
2 2
ν − ν0 ν − ν0 + ν (v/c) cos Θ
− =− , (12.42)
∆νp ∆νp
where, via the cosine theorem
v · Ω
cos Θ = = cos θ cos θv + sin θ sin θv cos(χ − χv ) . (12.43)
v
Substitution of Eqs. (12.41) and (12.42) into Eq. (12.40) leads to the following
expression for the comoving-frame radiation field tensor at frequency ν0
2π γ
I (0) 2 2
1 − d e−ω cos Θ
K
JQ (ν0 ) = c dχ dθ sin θ T Q
K
(0, Ω) c
4π
0 0
× 1 − u1 (1 − cos α) , (12.44)
where
ν0 v
ω= (12.45)
∆νp c
RADIATION ANISOTROPY IN STELLAR ATMOSPHERES 683
Fig.12.11. The radiation field tensor in the comoving frame, computed according to Eq.(12.44),
is plotted against the parameter ω defined in Eq.(12.45). Curves are labelled by the value of θv ,
while χv is set to 0◦ . The average intensity J00 (ν0 ) – upper panel – is normalized√
to the continuum
intensity at disk center, while J02 (ν0 ) – lower panel – is normalized to J00 (ν0 )/ 2. The relevant
parameters are: dc = +0.8 (absorption line), u1 = 0.2, h = 0.01 R∗ . Note in the upper panel the
Doppler brightening effect.
is the ratio between the Doppler shift induced by the velocity v and the width of
the spectral line (both expressed in frequency units).
It should be borne in mind that the key parameter in the Doppler effect is the
v · Ω
component of the velocity v along the propagation direction Ω, = v cos Θ. For
an assigned v , the angle Θ obviously depends both on θ and on χ (see Fig. 12.10
or Eq. (12.43)), and this entails the symmetry breaking: the radiation field at
point P is cylindrically symmetrical (χ-independent) in the star’s frame, but not in
the comoving frame. At the same time, Eqs. (12.44)-(12.45) show that this effect
vanishes for v → ∞ : under this limit the radiation field becomes cylindrically
symmetrical also in the comoving frame.
The integral in Eq. (12.44) can be evaluated numerically. Figures 12.11 and
12.12 show the various components of the radiation field tensor as functions of the
684 CHAPTER 12
Fig.12.12. Same as Fig.12.11 for the components J12 (ν0 ) and J22 (ν0 ). The remaining components
2 (ν ) = (−1)Q J 2 (ν ).
are obtained from J−Q 0 Q 0
Fig.12.13. Same as Fig.12.11 except for the line depression dc , which is set to −4 (emission line).
Note in the upper panel the Doppler dimming effect.
the zero-velocity case. As expected, the Doppler effect becomes ineffective for very
large velocity, since in that case the observer sees, at frequency ν0 , the radiation
emitted by the star in the far wings of the line irrespective of the velocity direction.
Thus for ω → ∞ the anisotropy factor tends to its ω = 0 value,1 and this implies
that the curves in the lower panel of Fig. 12.11 have to go through a maximum or
a minimum. Actually, such extrema are found at ω values of about 1.5 or slightly
less.
Figure 12.12 shows the behavior of the other non-vanishing components of the
comoving-frame radiation field tensor. The interpretation of the different curves
follows from arguments similar to those developed above. In particular, it should
be remarked that J12 and J22 are zero both for ω = 0 and for ω → ∞, because in
either case the radiation field in the comoving frame is cylindrically symmetrical.
The situation is completely reversed for an emission line, as shown in Figs. 12.13
1
The coincidence of these two values is a consequence of the assumption of a frequency-
independent limb-darkening coefficient.
RADIATION ANISOTROPY IN STELLAR ATMOSPHERES 687
and 12.14. In this case one speaks of Doppler dimming effect (upper panel of
Fig. 12.13). The anisotropy factor turns out to be lowered – relative to its zero-
velocity value – for radial velocities (θv 0◦ ), and enhanced for transversal veloc-
ities (θv 90◦ ). Similarly, there is a sign switch for the components J12 andJ22 in
comparison with the absorption line case.
CHAPTER 13
ASTROPHYSICAL APPLICATIONS:
THE OUTER LAYERS OF STELLAR ATMOSPHERES
The theoretical results on atomic polarization derived in Chap. 10, and the study
performed in Chap. 12 on the properties of the radiation field flowing from a stellar
atmosphere allow one to deduce in a rather straightforward way the polarization
signatures of spectral lines formed in an optically thin plasma (like a prominence
or a coronal condensation) located at a certain height over the stellar surface. In
this chapter we present a number of applications of such results with the purpose,
on one side, of discussing the limits of applicability of the theory developed in
this book, and, on the other side, of showing the important role often played by
the Doppler effect in determining the shape of polarimetric profiles. The methods
illustrated in this chapter can be considered as the basic tools for the interpretation
of spectropolarimetric observations in solar prominences or in the solar corona – as
long as the hypothesis of optical thinness is verified – and for the diagnostics of
prominence and/or coronal magnetic fields.
levels. The ratio of such coherences to level populations is roughly the same as
the ratio of the natural width of the level to the Bohr frequency of the coherence.
Therefore, if the Bohr frequency of one of the coherences is ‘large’ (comparable with
∆νp or ∆νw , so that the flat-spectrum approximation is not valid), the coherence
itself is expected to be much smaller that the diagonal elements of the density
matrix (by two orders of magnitude in the example above). Moreover, as illustrated
in Sect. 10.17, such coherences are important in affecting the polarization properties
of the emissivity only in the far wings of the spectral line, where the emissivity itself
is very low in all four Stokes parameters and is often masked by other processes,
like, e.g., the continuum emissivity. These arguments show that, if we exclude some
few special cases involving particularly strong lines with well-developed damping
wings formed in optically thick media, it is usually justified to consider a simplified
atomic model where such coherences are set to zero ab initio. This approximation is
particularly suited for the applications that we are going to develop in this chapter,
which refer to optically thin plasmas.
To conclude, we want to stress that the flat-spectrum approximation in polarized
radiative transfer is analogous to the complete redistribution approximation (in the
atomic frame) in standard radiative transfer. In the latter, only few lines need to
be treated with the more sophisticated theory of partial redistribution; similarly,
in polarized radiative transfer most of the applications to concrete problems can be
handled with the simple approach of the flat-spectrum approximation. Only in few
cases more sophisticated (and more complicated) formalisms have to be invoked
(Bommier, 1997a,b; Landi Degl’Innocenti et al., 1997).
As shown in Sect. 12.4, the radiation field tensor experienced by an observer moving
with velocity v in the outer layers of a stellar atmosphere may strongly depend,
because of the Doppler effect, on v . As a consequence, it has to be expected that
the density matrix of an atomic system moving in the same environment may be
strongly correlated with its velocity.
Given this situation, a statistical description of the atomic system has to be given
in terms of the product of two functions: the velocity distribution function f (v ),
defined such that f (v ) d3v , with
f (v ) d3v = 1 , (13.3)
is the probability of finding an atom of the given species with velocity contained in
the infinitesimal volume d3v of the velocity space, times ρnm (v ), the v -dependent
density matrix, here defined on the basis of the energy eigenvectors. With these
notations, the quantity
dP = f (v ) ρnn (v ) d3v
692 CHAPTER 13
gives, for instance, the probability of finding the atom with velocity v and in the
internal state specified by the eigenvector | n . In the following, we will refer to
products of the form f (v ) ρnm (v ), or – when using the spherical tensor represen-
tation – f (v ) ρK v ) or f (v ) βLS ρK
Q (αJ; Q (J, J ;
v ), with the name of velocity-space
density matrix .
We will now derive a statistical equilibrium equation for the velocity-space density
matrix. To this aim, it should be taken into account that all the processes that we
have considered so far in establishing the time evolution of the density matrix, such
as absorption and emission of photons, inelastic and superelastic collisions with
electrons, and depolarizing collisions with perturbers (mainly neutral hydrogen
atoms), are practically ineffective in changing the velocity of the atom. Consider,
for instance, the emission or the absorption of a photon of visible wavelength by
an atom which is moving with velocity v . The conservation of momentum implies
that the atom experiences a recoil corresponding to a velocity variation ∆v given
by
h
∆v ≈ ,
λ µ mH
where λ is the photon’s wavelength and µ is the atomic weight of the atom. For
example, assuming λ = 5000 Å and µ = 56 (the atomic weight of iron), one gets
∆v 1.4 cm s−1 ,
which should be compared with velocities of the order of some km s−1 that are
typically found in stellar atmospheres. Somewhat more efficient from this point of
view are collisions with thermal electrons, either elastic, inelastic, or superelastic.
If ve is the typical velocity of an electron, the recoil suffered by the atom is given,
in terms of velocity variation, by
m ve
∆v ≈ .
µ mH
Assuming for v and ve the r.m.s. values corresponding to the same temperature,
one has .
ve µ mH
= ,
v m
so that .
∆v m
≈ .
v µ mH
For µ = 56 one gets, for instance
∆v
3.1 × 10−3 .
v
Finally, depolarizing collisions are elastic collisions due to long-range interactions,
and they are also ineffective in changing appreciably the velocity of the atom.
THE OUTER LAYERS OF STELLAR ATMOSPHERES 693
The only processes that are important, under this point of view, are close col-
lisions with atoms. Such collisions, that can be referred to as velocity-changing
collisions, are characterized by very small impact parameters and by a relatively
large exchange of kinetic energy between the colliding atoms. It has then to be
expected that such collisions will also induce transitions between the energy levels
(or sublevels) of the atom, thus affecting its density matrix.
Referring for simplicity to the case of a multi-level atom in the statistical tensor
representation, and taking into account the foregoing considerations, we can write
the statistical equilibrium equation for the velocity-space density matrix in the
form
d d K δ
f (v ) ρK
Q (αJ;
v ) = f (
v ) ρ (αJ;
v ) + f (
v ) ρ K
(αJ;
v ) . (13.4)
dt dt Q 0 δt Q
v.c.c.
The first term in the right-hand side is due to the ‘ordinary’ processes mentioned
above – which are ineffective, to a very good approximation, in changing the ve-
locity of the atom. In particular, the quantity
d K
ρ (αJ; v )
dt Q 0
is the sum of the right-hand side of Eq. (7.11) – radiative processes – and the
right-hand side of Eq. (7.101) – collisional processes. It should be remarked that
the radiative rates TA , TS , RA , and RS (which are proportional to the radiation
field tensor) depend in general on the velocity v .
The second term in the right-hand side of Eq. (13.4) is due to velocity-changing
collisions, and can be regarded as a generalization of the Boltzmann term which is
met in the usual kinetic theory of gases (see e.g. Oxenius, 1986). Neglecting the
internal state of the colliders, such term – that will be referred to in the following
as the generalized Boltzmann term – can be written in the form
δ K
f (v ) ρQ (αJ; v ) =
δt v.c.c.
= f (v ) ρK
v ) Fi (u )
Q (α J ;
i α J K Q
× qi (v , u → v , u ; α J K Q → αJKQ) w d3 u
− f (v ) ρK
Q (αJ;
v)
× Fi (u ) qi (v , u → v , u ; αJKQ → α J K Q ) w d3 u . (13.5)
i α J K Q
which is solved by
d
f (v) = 0
dt
d K d K
ρ (αJ; v ) = ρ (αJ; v ) .
dt Q dt Q 0
These equations show that the density matrix of an atom moving with velocity v
is completely decoupled from the density matrix of atoms moving with different
velocities, and that its value can be found by solving the statistical equilibrium
equations of Chap. 7: obviously, the Doppler effect must be taken into account in
the expressions for the radiative rates involving the radiation field tensor. On the
other hand, the velocity distribution f (v ) remains undetermined, and can only be
established by means of different physical considerations. In many cases, f (v ) can
be simply assumed to be a Maxwellian distribution, possibly centered at a non-zero
velocity, like in the case of the solar wind.1
It should be remarked that, when the velocity-coherence approximation is valid,
the v -dependence of the density matrix is only due to the fact that atoms moving
with different velocities may experience, because of the Doppler effect, different
values of the radiation field tensor. Obviously, when the incident radiation field
presents no spectral structure across each of the frequency intervals centered at
the transition frequencies of the model atom and sufficiently wide to encompass all
possible Doppler shifts, the v -dependence of the density matrix disappears and one
is left with a ‘unique’ density matrix ρK Q (αJ).
The second approximation that can be introduced to bring back Eq. (13.4) to
a more tractable form is the so-called approximation of complete redistribution on
velocities. Here one assumes a priori that velocity-changing collisions are so efficient
in reshuffling the atomic velocities that any velocity/density-matrix correlation
is lost. This implies that the velocity-space density matrix can be written as
f (v ) ρK
Q (αJ), the second factor being independent of
v , which in turn implies that
the term
d K
ρ (αJ; v )
dt Q 0
1
In certain situations, bi-modal or three-modal distributions can be more appropriate.
696 CHAPTER 13
with the velocity-dependent rates replaced by their averages over the velocity dis-
tribution f (v ). Synthetically,
Rate (v ) → f (v ) Rate (v ) d3v . (13.7)
Since the rates depend on v only through the radiation field tensor, it is easily seen,
recalling Eqs. (5.157) and the formulae for the non-relativistic Doppler effect, that
substitution (13.7) is equivalent to
K
K
JQ (ν) → J¯Q
K
(ν) = d3v f (v ) JQ (ν) v , (13.8)
K
where [JQ (ν)]v , the radiation field tensor in the comoving frame, is given by (cf.
Eq. (12.40))
5 , -
dΩ K
3
v · Ω .
K
JQ (ν) v = T Q (i, Ω) Ii ν 1 + ,Ω (13.9)
4π i=0 c
∞ 5 ,
dΩ K v-
3
J¯Q
K
(ν) = dv fc (v) T Q (i, Ω) Ii ν 1 + ,Ω ,
4π i=0 c
−∞
∞ , ν − ν - 5 dΩ 3
c I (ν , Ω)
J¯Q
K
(ν) = dν fc c T K (i, Ω)
ν ν 4π i=0 Q i
−∞
∞
= dν p(ν − ν) JQ
K
(ν ) , (13.10)
−∞
where
c , ν − ν -
p(ν − ν) = f c .
ν c ν
For an isotropic distribution of velocities, substitution (13.7) is thus equivalent
to the substitution of the radiation field tensor with a suitable convolution over
frequencies. In the case of a Maxwellian distribution, where fc (v) is given by (cf.
Eq. (5.42)) 2
1 1 − vv
fc (v) = √ e T ,
π vT
THE OUTER LAYERS OF STELLAR ATMOSPHERES 697
where
vT
∆νD = ν .
c
Up to now, the integral in d3v of the generalized Boltzmann term has been
disregarded. An inspection of the right-hand side of Eq. (13.5), joined with the
hypothesis of the velocity independence of the density matrix, shows that such term
leads to the appearance in the statistical equilibrium equations of two additional
contributions that can be written in the form
+ BT (α J K Q → αJKQ) ρK
Q (α J )
α J K Q
− BR (αJKQ → α J K Q ) ρK
Q (αJ) , (13.12)
α J K Q
where
BT (α J K Q → αJKQ) =
= d3v d3 u f (v ) Fi (u ) qi (v , u → v , u ; α J K Q → αJKQ) w
i
BR (αJKQ → α J K Q ) =
= d3v d3 u f (v ) Fi (u ) qi (v , u → v , u ; αJKQ → α J K Q ) w .
i
As shown in the former section, there is a special case where the density matrix
does not depend on the velocity of the atom. This happens when the radiation field
impinging on the atom is spectrally unstructured at all the transition frequencies
that contribute to establish its statistical equilibrium.
Whereas this situation is common in laboratory experiments, it is much less
frequent in astrophysical plasmas. However, some of the spectral lines that are
currently used for the diagnostics of the outer layers of the solar atmosphere fall
within this category. A typical example is the so-called coronal green line of FeXIV
at 5304 Å. The solar spectrum shows no prominent lines in the neighborhood of
this wavelength, which implies that the excitation of the ions is due to the solar
continuum and is practically independent of the Doppler effect.
In this section we illustrate the main features of the phenomena of resonance
polarization and the Hanle effect for this simple case where it is fully justified to
neglect velocity/density-matrix correlations, so that a unique, velocity-independent
density matrix can be defined.
Referring to Fig. 13.1, we consider an atom (described by a suitable model as
outlined in Sect. 13.1) located at point P in the outer solar atmosphere. We assume
the incident radiation field at P to be unpolarized, cylindrically symmetrical about
the vertical through P, and spectrally unstructured in the neighborhood of each
of the frequencies νi corresponding to the transitions involved in the model atom.
The observer, receiving the radiation along the direction Ω , sees the atom at an
0
apparent height h over the solar limb, the true height h over the solar surface
being related to h and to the aspect angle1 δ by the simple equation
(h + R ) cos δ = h + R ,
where R is the solar radius. From the knowledge of h and of the limb-darkening
coefficients of the solar spectrum, the radiation field tensor can easily be derived at
each of the frequencies νi . Assuming a quadratic limb-darkening law, and drawing
on the results derived in Sect. 12.3, we can write the non-zero components of
1
The determination of the angle δ is often non-trivial and, in some cases, even impossible.
For prominences, one can obtain an estimate of δ by measuring the position of the corresponding
filament over the disk during the few days preceding (or following) the observation.
THE OUTER LAYERS OF STELLAR ATMOSPHERES 699
χ
θ
Fig.13.1. Geometry of a scattering process in the outer layers of the solar atmosphere. The
points to the observer, while the direction e – the reference direction for positive
direction Ω 0 a
Q – is parallel to the solar limb (as seen by the observer). θB and χB are the polar angles of the
magnetic field vector relative to the local vertical.
the radiation field tensor in a reference system with the z-axis directed along the
vertical through P in the form
1
J00 (νi ) = Jν , J02 (νi ) = √ wν Jν , (13.13)
i i i
2
where Jν and wν are given in Eqs. (12.34) and (12.36), respectively.
i i
The radiation field tensor is just one of the factors that have to be specified to find
the density-matrix components of the atom and, ultimately, the Stokes parameters
of the radiation emitted along the direction Ω . The remaining factors are the
0
magnetic field, that can be parameterized through its modulus, B, and the angles
θB and χB as in Fig. 13.1, and the collisional cross sections. Here we are mainly
interested in the basic features of resonance polarization and the Hanle effect in
the outer layers of the solar atmosphere, so we will just suppose that, owing to
the low density of the plasma, collisions can be neglected. Obviously, when using
a specific spectral line as a diagnostic tool for inferring – for instance – the value
of the magnetic field, this approximation has to be carefully tested, case by case.
For example, detailed analyses have shown that such approximation is justified for
the HeI λ 10830 and λ 5876 lines, whereas it is not for the hydrogen lines of the
Balmer series and for the magnetic-dipole forbidden lines that are commonly used
for the diagnostics of coronal fields (FeXIV λ 5304, FeXIII λ 10747 and λ 10798,
FeX λ 6374).1
1
For the hydrogen lines, collisions with electrons (and protons) turn out to be important
700 CHAPTER 13
Having specified all the geometrical and physical parameters, one can turn to the
problem of finding the density matrix of the radiating atom. This is obtained by
solving (for stationary situations) the statistical equilibrium equations that have
been derived in Chap. 7. The relevant equations are Eq. (7.11) for the case of a
multi-level atom, Eq. (7.38) for a multi-term atom, or Eq. (7.65) for a multi-level
atom with hyperfine structure. It should be remarked that all three equations are
valid in a reference system with the z-axis directed along the magnetic field. The
radiative rates appearing in these equations obviously contain the radiation field
tensor evaluated in that reference system, while Eqs. (13.13) refer to a system with
the z-axis in the vertical direction. The two systems are related by a suitable
rotation. Taking into account Eq. (2.78) one has, with evident notations1
K
JQ (νi ) mag.field = D0Q
K
(R) J0K (νi ) vert. , (13.14)
because of the quasi-degeneracy of the energy levels with respect to the quantum number l. For
coronal lines, collisional rates are comparable to radiative rates because of the low values of the
Einstein coefficients (A 101 -102 s−1 ).
1
Alternatively, one can use the equations written in the vertical frame, with the difference
that the magnetic term takes a more complicated form, as explained in Sect. 7.12.
2
Note that this last equation is valid under the limit where all the hyperfine components can
be considered coincident.
THE OUTER LAYERS OF STELLAR ATMOSPHERES 701
where the value of the angle γB is the same as in Eq. (13.15). We recall that the
rotation matrix DPKQ (R) for the composite rotation of Eq. (13.18) can be computed
with the help of Eq. (2.74).
The Stokes parameters of the radiation flowing along the direction Ω can be
0
finally obtained by integration of the radiative transfer equations. Under the hy-
pothesis of an optically thin plasma, one has
) = ε (ν, Ω
Ii (ν, Ω ) ds , (13.19)
0 i 0
) = kA
εi (ν, Ω Φ̂KK (J = 0, Ju = 1 ; ν)
0 L Q
KK Q
) 1
× (−1)Q T Q
K
(i, Ω J K (ν ) , (13.20)
0
1 + i QHu −Q 0
where kLA is the frequency-integrated line absorption coefficient defined in Eq. (9.5),
Φ̂KK
Q is the generalized profile defined in Eq. (10.40)1 – with the individual profiles
Φ(νi − ν) replaced by their Doppler convolutions as in Eq. (13.16) – , Hu is given
by Eq. (10.29), T Q K ) by Eq. (13.17), and J K (ν ), the radiation field tensor
(i, Ω 0 Q 0
at the transition frequency ν0 defined in the magnetic field frame, is given by
Eq. (13.14). Similarly, for the frequency-integrated emission coefficient one can
rewrite Eq. (10.32) in the form
1
) = kA
ε̃i (Ω (−1)Q T Q
K )
(i, Ω J K (ν ) . (13.21)
0 L 0
1 + i QHu −Q 0
KQ
Substituting Eq. (13.20) into Eq. (13.19), and assuming a small-size coronal conden-
sation permeated by a uniform magnetic field, one obtains for the Stokes parameters
of the radiation reaching the observer the same expression as the right-hand side
of Eq. (13.20) with the quantity kLA replaced by the frequency-integrated optical
thickness τL defined by
τL = kLA ds . (13.22)
Similarly, for the frequency-integrated Stokes parameters one obtains the same
expression as the right-hand side of Eq. (13.21) with kLA replaced by τL .
In spite of the simplicity of the model atom considered, the expressions just
derived implicitly depend on several parameters (limb-darkening coefficients of the
continuous solar radiation, height and aspect angle of the coronal condensation,
intensity and direction of the magnetic field, Einstein coefficient of the line, velocity
distribution of the scattering atoms). A full analysis of the interplay of the different
parameters is well outside the purposes of this book. In the following we limit
ourselves to present a number of illustrative results for a few special cases.
a) Resonance polarization
In the absence of magnetic fields, the expressions of the emitted Stokes parameters
considerably simplify, since Hu = 0 and it is not necessary to perform the rotation
(13.15) to change the reference system. On the other hand, the generalized profiles
simplify according to Eq. (A13.5). With the help of Table 5.6 one obtains for the
non-zero Stokes parameters
) = τ J 0 (ν ) + √ 1
I(ν, Ω 0 L 0 0 3 sin2 δ − 1 J02 (ν0 ) φ̂(ν)
2 2
3
) = τ √ cos δ J 2 (ν ) φ̂(ν) ,
2
Q(ν, Ω 0 L 0 0 (13.23)
2 2
1
The generalized profiles for the special case J = 0, Ju = 1 are explicitly given in App. 13
(see Eqs. (A13.11)).
THE OUTER LAYERS OF STELLAR ATMOSPHERES 703
K
where the tensor JQ (ν0 ) is evaluated in the vertical frame, and where the real
profile φ̂(ν), which takes the Doppler effect due to the velocity distribution of
atoms into account, is given by
, -
v · Ω
φ̂(ν) = d3v f (v ) φ ν0 1 + 0
−ν . (13.24)
c
Fig.13.3. Polarization diagram for the radiation emitted along the direction Ω in the geometry
0
of Fig.13.1. The graph refers to a two-level atom (J = 0, Ju = 1) located in the plane of the sky
(δ = 0◦ ) and illuminated by the continuous solar radiation at 5000 Å in the presence of a horizontal
magnetic field (θB = 90◦ ). Full lines correspond to χB = const., while broken lines correspond
to constant magnetic field strength, parameterized through the quantity Hu (see Eq.(10.29)).
defined by
) dν
Q(ν, Ω ) dν
U (ν, Ω
0 0
p̃Q = , p̃U = .
) dν
I(ν, Ω ) dν
I(ν, Ω
0 0
As apparent from Eq. (13.21), the corresponding quantity p̃V vanishes identically,
K
because the only non-zero components of JQ (ν0 ) are those with K = 0 or K = 2,
and the components T 0 (3, Ω0 ) and T Q (3, Ω0 ) are zero (see Table 5.6).
0 2
1
It should be recalled that such diagrams have been introduced for the diagnostics of magnetic
fields in prominences (Bommier, 1977), and that it is generally believed that in these objects the
magnetic field is horizontal in order to sustain the prominence material against gravity.
THE OUTER LAYERS OF STELLAR ATMOSPHERES 705
Fig.13.4. Same as Fig.13.3 for a different aspect angle (δ = 30◦ ). The solid lines labelled with 1,
2, . . . , 12 correspond to χB = 0◦ , 30◦ , . . . , 330◦ , respectively. The broken lines correspond to
the same Hu values as in Fig.13.3.
Fig.13.5. Same as Fig.13.3 for θB = 45◦ and δ = 30◦ . The labelling of the curves is the same as
in Fig.13.4.
1
Were not for the presence of atomic polarization, the V profile would be well-represented
by Eq. (11.2), because the magnetic field is very weak (νL /∆νD 2 × 10−3 ). Since, in the
case considered, atomic polarization is about 10% (as shown by the values of Q/I and U/I in
THE OUTER LAYERS OF STELLAR ATMOSPHERES 707
Fig.13.6. The Stokes parameters, computed from Eq.(13.20) for the geometry of Fig.13.1, are
plotted against the reduced frequency (ν0 − ν)/∆νD . The main geometrical parameters are:
δ = 0◦ , θB = 90◦ , χB = 0◦ . The remaining parameters, as well as the normalization of the
Stokes parameters, are specified in the text. Note the different scale of the panels.
)
Q(ν, Ω )
U (ν, Ω )
V (ν, Ω
)=
pQ (ν, Ω 0
, )=
pU (ν, Ω 0
, )=
pV (ν, Ω 0
,
0 ) 0 ) 0 )
I(ν, Ω 0 I(ν, Ω 0 I(ν, Ω 0
some departures from this idealized situation clearly appear (see Fig. 13.7). In the
line core, up to a distance of approximately two Doppler widths from line center,
pQ and pU are practically constant and equal, with very good approximation, to
the corresponding values p̃Q and p̃U relative to the integrated profiles. Beyond two
Doppler widths the Hanle effect disappears – consistently with our discussion of
Sect. 10.4 – so that pU goes asymptotically to zero, whereas pQ approaches the
limiting value corresponding to resonance polarization in the absence of magnetic
Fig. 13.6), it has to be expected that Eq. (11.2) represents the V profile with a precision of the
same order.
708 CHAPTER 13
Fig.13.7. The fractional polarization in the three Stokes parameters Q, U , and V of Fig.13.6 is
plotted against the reduced frequency (ν0 − ν)/∆νD (solid lines). The dashed lines represent the
values obtained from frequency-integrated profiles.
fields (pQ 0.28, cf. Fig. 13.3). It should be remarked that the frequency distance
from line center where the curves pQ and pU start to inflect (about two Doppler
widths in the example of Fig. 13.7) is a function of the reduced damping constant,
Γ A
a= = ,
∆νD 4π ∆νD
and that such distance increases for decreasing a, and vice versa. It should also be
remarked that the behavior illustrated in Fig. 13.7 may substantially be altered by
any source of (unpolarized) continuous emission that may add to the line emission.
As apparent from Fig. 13.6, at two Doppler widths from line center the intensity
is reduced by a factor 102 with respect to its peak value.
It is interesting to compare the Stokes profiles in Fig. 13.6 with those in Fig. 10.3.
The two figures refer to the same transition (J = 0, Ju = 1) and to the same ge-
ometry (a 90◦ scattering with the magnetic field pointing to the observer); the
Einstein coefficient A, the Landé factor, and the magnetic field intensity are the
same; stimulated emission is neglected in both cases. The main difference is that
Fig. 10.3 refers to the rest frame of the atom, while Fig. 13.6 refers to the observer’s
frame, the emitting atoms being characterized by an assigned velocity distribution.
The second difference concerns the anisotropy of the incident radiation field, which
is much smaller in the case of Fig. 13.6 (in Fig. 10.3 we had a radiation beam).
The profiles in Fig. 13.6 are convolutions of profiles like those in Fig. 10.3 with
the velocity distribution of atoms. As a result, the ‘structure’ visible in Fig. 10.3
– characterized by typical widths of order Γ = A/(4π) – is completely lost in
Fig. 13.6, where the typical width is the Doppler broadening ∆νD (numerically,
THE OUTER LAYERS OF STELLAR ATMOSPHERES 709
Fig.13.8. Same as Fig.13.6 for a magnetic field lying in the plane of the sky (δ = 0◦ , θB = 45◦ ,
χB = −90◦ ). Note the presence of a weak circular polarization signal.
∆νD 103 × Γ ). In Fig. 13.6 the polarization in the far wings is reduced because
of the smaller anisotropy (pQ 0.28, to be compared with pQ = 1). Moreover,
as shown by Fig. 13.7, the polarization within the line is further reduced by the
Doppler effect (in Fig. 10.3 the radiation is totally polarized at every single fre-
quency).
The polarization profiles in the Hanle effect regime may show, in some cases,
rather peculiar features. A striking example is given in Fig. 13.8, which corresponds
to the same set of parameters as Fig. 13.6 except for the magnetic field’s direction,
which is now characterized by the angles θB = 45◦ , χB = −90◦ . In spite of the fact
that the magnetic field is now lying in the plane of the sky, and is thus perpendicular
to the line of sight, the V Stokes parameter shows a typical antisymmetrical profile
that might erroneously be interpreted (according to Eq. (11.2)) as due to a very
weak magnetic field having a non-zero component along the line of sight. In fact,
this circular polarization profile originates from the terms with K = 1, K = 2,
Q = ±1 in Eq. (13.20), whose contribution does not vanish, in general, even for
magnetic fields perpendicular to the line of sight.
710 CHAPTER 13
It is nowadays believed (on the basis of several observations) that magnetic fields
in solar prominences typically range from 0 to 100 G, with most of reliable mea-
surements falling in a more restricted interval centered at about 20 G. On the other
hand, as we have seen on several examples developed in this book (cf. Sects. 5.9,
10.3, and 13.3), the Hanle effect, as observed in a given spectral line, turns out to be
particularly sensitive to magnetic field values such that the dimensionless parame-
ter Hu , defined in Eq. (10.29), is contained in an interval that we can – somewhat
arbitrarily – set to (0.1–10). As shown by Eq. (10.29), the range of sensitivity
of the Hanle effect in terms of magnetic field’s intensity depends on the Einstein
coefficient of the line and on the Landé factor of the upper level involved in the
transition. Assuming for these quantities the typical values A = 5 × 107 s−1 and
gα J = 1, we obtain a range that extends between 0.6 G and 60 G. Therefore, the
u u
Hanle effect turns out to be a particularly suitable diagnostic tool for the magnetic
fields typically found in solar prominences.1
Among the many spectral lines that are observed in solar prominences, one may
ask which would be the most suitable for this type of diagnostics. The answer is
that the ‘ideal’ spectral line should have the following properties:
a) It should be sufficiently bright in the prominence spectrum to allow high precision
polarimetric measurements. Typically, an error less than at least 10−3 in the
observed quantities p̃Q and p̃U is required.
b) The spectral line itself, and all the other lines involved in the model atom
needed to treat the problem, should be optically thin. When this requirement is
K
fulfilled, the radiation field tensor JQ (νi ) of Eqs. (13.13) can be simply computed
from the photospheric intensity, whose spectral and center-to-limb properties are
K
well-known. For optically thick lines, the tensor JQ (νi ) can only be computed by
introducing a model for the thermodynamic structure of the prominence and its ge-
ometry (vertical slab, horizontal slab, cylinder, arch, etc.). This obviously leads to
a strongly model-dependent diagnostics (for an example, see Landi Degl’Innocenti
et al., 1987).
c) Since high velocities are often observed in prominences,2 all the lines involved
in the model atom should fall within continuum windows of the photospheric solar
spectrum, in order to avoid possible contaminations of the radiation field tensor
K
JQ (νi ) due to the Doppler effect. If this is not the case, the theory presented in the
former section – which implies the absence of velocity/density-matrix correlations –
cannot be applied, and one has to resort to a more complicated formalism (see
Sect. 13.7). It should be noticed that this is a rather severe requirement because
1
Early suggestions for using the Hanle effect for the measurement of magnetic fields in
prominences were given by Öhman (1929).
2
According to Tandberg-Hanssen (1995), non-thermal velocities up to 10 km s −1 are often
observed in the outer edges of quiescent prominences. In active prominences velocities can be
even higher.
THE OUTER LAYERS OF STELLAR ATMOSPHERES 711
λ
λ
λ λ
Fig.13.9. The model atom of the triplet system of HeI (orthohelium) used for the diagnostics
of magnetic fields in prominences. The dashed lines refer to two infrared transitions that can
be omitted in the theoretical calculations. Note that each of the terms 3 P and 3 D is actually
composed of three different J-levels.
it implies that all the lines involved in the model atom are not present in the
photospheric spectrum.
d) Since depolarizing collisions are often poorly known, it may be convenient to
avoid lines whose interpretation requires atomic models involving transitions with
low values of the A Einstein coefficient. By so doing, one can avoid the introduc-
tion of depolarizing rates into the statistical equilibrium equations and the ensuing
dependence of the results on a further parameter (the number density of particles).
On the other hand, when the cross sections for depolarizing collisions are accurately
known (either from laboratory experiments or from theoretical calculations), ob-
servations of linear polarization may also be used as a diagnostic tool for inferring
the particle density – although the disentangling process generally requires the use
of simultaneous observations in several lines. An example of such procedure has
been given by Bommier et al. (1986a,b).
One line which meets to a large extent the above requirements is the D3 line of
HeI at 5876 Å. According to Bommier (1977), the polarization characteristics of
this line can be described by means of the atomic model of Fig. 13.9. The model
involves 5 terms (11 levels) and 6 transitions, though the two infrared transitions
drawn in broken lines in the figure can safely be omitted. The two visible lines
at 5875.6 Å and 7065.3 Å fall into broad continuum windows. The infrared line at
10830 Å presents in the photospheric spectrum a very shallow depression (of the
order of 5% of the continuum) that can safely be neglected. The only problem is
712 CHAPTER 13
caused by the UV line at 3888.6 Å, which falls into a complicated region of the
spectrum covered by several absorption lines of FeI. Since, however, this is a line
of secondary importance in establishing the statistical equilibrium of the atomic
model, it is reasonable to neglect the exact wavelength dependence of the solar
spectrum in the neighborhood of this line and to smooth it through a convolution
with a rectangular profile having a width of the order of 1 Å.
Concerning optical thickness, it can be stated with a good degree of confidence
that the four lines involved in the calculations are indeed optically thin in quiescent
prominences, except for the infrared line λ 10830 which, in some cases, may reach
non-negligible values of optical thickness.1
Taking into account these remarks, and considering that the four lines involved
in the model have pretty large values of the A Einstein coefficient (so that collisions
can be neglected), one can apply to this model atom the theory developed in the
former section.
Figure 13.10 shows the polarization diagram for the frequency-integrated radia-
1
Polarimetric observations of this line, performed by Lin et al. (1998) on filaments over the
disk, provide a clear indication of non-negligible optical thickness along the vertical direction in
the observed objects.
THE OUTER LAYERS OF STELLAR ATMOSPHERES 713
tion emitted in the D3 line. The diagram refers to a prominence observed in the
plane of the sky (δ = 0◦ ) at a height h = 0.073 R (corresponding to 70 arcsec).
The atomic data for the fine-structure intervals of the terms 3 P and 3 D of the
model atom are from Wieder and Lamb (1957), Pichanik et al. (1968), Kponou et
al. (1971), and Tam (1975), whereas the Einstein coefficients of the transitions are
from Wiese et al. (1966). The limb-darkening coefficients and the intensities of the
continuum radiation (needed to compute the radiation field tensor components J00
and J02 at the frequencies of the transitions) were obtained from the tables of Allen
(1973) by interpolation. For the UV line at 3888.6 Å, the continuum intensity is
reduced by a factor 5 (see our previous discussion).
The shape of the diagram presents many of the features illustrated on simpler
model atoms in Chap. 10. In particular, the feature present in the upper part of
the diagram is due to the partial relaxation of coherences in the levels 2s 3 S and
2p 3 P (lower-level Hanle effect, cf. Sects. 10.8-10.9), while the loops appearing for
field intensities 10 G are due to level-crossing interferences in the upper term of
the D3 line 3d 3 D (cf. Sect. 10.18).
Diagrams like that of Fig. 13.10 have been extensively used by Bommier and
collaborators for the diagnostics of magnetic fields in prominences (Bommier et al.,
1994). It should be remarked that the intrinsic symmetry of the diagram leaves
a fundamental ambiguity in the determination of the magnetic field vector. Such
ambiguity can be removed either with the help of simultaneous measurements in
optically thick lines or by taking advantage of the rotation of the sun under the
hypothesis of a rigid or quasi-rigid rotation of the prominence (Bommier et al.,
1981, 1994).
The diagnostics of prominence magnetic fields can also be performed by inter-
preting directly the Stokes parameters profiles observed in the D3 line. This line
is in fact composed of six fine-structure components, five of which are very close
in wavelength while the sixth, originating from the transition 2p 3 P0 − 3d 3 D1 , is
well-separated, lying at a distance of approximately 350 mÅ to the red from the
center of gravity of the others. In practice, it is like observing two distinct lines for
which two separate polarization diagrams can be drawn.1 Full theoretical details
of this diagnostic procedure are given in Landi Degl’Innocenti (1982b). Its appli-
cations to the interpretation of Stokes profiles from prominences can be found in
Athay et al. (1983) and Querfeld et al. (1985).
An interesting feature of the D3 line is the appearance of a complicated circular
polarization profile. This is because the usual Zeeman effect mixes with the effect
of atomic orientation due to the alignment-to-orientation conversion mechanism
discussed in Sect. 10.20. Figures 13.11 and 13.12 show the Stokes profiles computed
according to Eq. (7.47e). One can notice that the signature of the Zeeman effect is
largely masked by the presence of atomic orientation which produces the prominent
negative lobe.
1
The polarization diagrams for the frequency-integrated radiation in the two components
can be computed from Eqs. (7.48g) and (7.48c). In the latter, one should omit the profile φ and
restrict the summation over J to the values J = 1, 2 for the blue component and J = 0 for the
red component.
714 CHAPTER 13
Fig.13.11. The theoretical Stokes profiles of the D 3 line, normalized to the maximum intensity, are
plotted as a function of the wavelength distance from the center of the red component (expressed
in Å). The profiles refer to a point located in the plane of the sky at a height of 70 . The magnetic
field vector is specified by B = 35 G, θB = 90◦ , χB = 0◦ . The Doppler width is ∆λD = 137 mÅ,
corresponding to a thermal velocity of 7 km s −1 .
Apart from the Hanle effect, the diagnostics of magnetic fields in prominences
can also be carried out with the traditional technique of the longitudinal magne-
tograph.1 As discussed in Sect. 11.1, such technique is based on Eq. (11.2), which
implies both the weak field approximation and the absence of atomic polarization
in the levels involved in the transition. Whereas the weak field approximation is
definitely justified in prominences, the existence of atomic polarization implies that
the magnetograph technique fails in general to give reliable measurements of B in
these objects. This is especially true for fine-structured and hyperfine-structured
lines, whose circular polarization profiles are deeply contaminated by the presence
of a ‘spurious’ signal due to the atomic orientation induced in the upper level by the
alignment-to-orientation conversion mechanism. A typical example is the V pro-
1
Early measurements of the longitudinal component of the magnetic field in prominences
were reported by Zirin and Severny (1961) and by Rust (1966).
THE OUTER LAYERS OF STELLAR ATMOSPHERES 715
Fig.13.12. Same as Fig.13.11 for the fractional polarization in the three Stokes parameters Q, U ,
and V . Note the disappearance of the Hanle effect in the far wings.
file of the D3 line shown in Fig. 13.11. There is no doubt that the magnetograph
technique applied to that profile would produce an erroneous result.
For ‘spectroscopically simple’ lines, on the other hand, Eq. (11.2) gives indeed a
zero-order approximation, that might be improved by solving the statistical equi-
librium equations and by computing, via Eqs. (10.39) and (A13.6), the first-order
correction to the V Stokes parameter due to the presence of atomic polarization.
In general, this correction is expected to be rather small (typically of the order
of 10%–20%), so that Eq. (11.2) should give a fairly good representation of the V
profile.1 Unfortunately, among the many lines that have extensively been used for
the diagnostics of magnetic fields in prominences through the magnetograph tech-
nique (Hα, Hβ, HeI λ 4771, HeI D3 , NaI D1 and D2 , Mg b), only the magnesium
lines fall into this category.
The procedure that has been outlined in Sect. 13.3 for deducing the density matrix
of an atom (or ion) located in the outer layers of a stellar atmosphere, and hence
the polarization properties of the radiation emitted in one of its spectral lines, can
be directly applied to the interpretation of coronal observations in the well-known
forbidden lines of iron. These are the so-called green line of FeXIV at 5304 Å, the
1
Note, however, that the zero-order approximation predicts V = 0 when the magnetic field is
perpendicular to the line of sight. In that case, the only contribution to the circular polarization
profile comes from the first-order correction, and Eq. (11.2) cannot be applied (see Fig. 13.8).
716 CHAPTER 13
so-called red line of FeX at 6374 Å, and the two infrared lines of FeXIII at 10747 Å
and 10798 Å. All the transitions involved in the model atoms needed to interpret
these lines do not match significant features of the solar spectrum, so that the flat-
spectrum approximation is well-justified and velocity/density-matrix correlations
can be neglected.1
Notwithstanding the similarities of the theoretical approach, the interpretation of
polarimetric observations in coronal forbidden lines involves some differences with
respect to the ‘prominence case’ that have to be discussed in some detail.
The first difference concerns the atomic models. Because of the large ionization
degree of the iron ions, fine-structure intervals turn out to be very large, which
entails that such ions can be treated as multi-level atoms, rather than multi-term
atoms.
The second difference arises from the small values of the Einstein coefficient. The
four lines mentioned above, as well as the other lines involved in the model atoms,
are magnetic-dipole forbidden lines with values of A(αu Ju → α J ) ranging from
10 s−1 to 102 s−1 . This means a difference of the order of 106 with respect to the
Einstein coefficient of typical prominence lines, and implies that collisional rates
have to be included in the statistical equilibrium equations, as they are in fact
comparable to the radiative rates, or even larger (Sahal-Bréchot, 1974a,b; 1977).
This is a serious drawback because collisional cross sections are, in general, rather
poorly known; a major effort is still needed on this point in order to attain fully
reliable diagnostic techniques.
The third difference is due to the magnetic-dipole character of the transitions.
As shown in Sect. 6.8, this implies the following transformation on the geometrical
tensor T QK – see Eq. (5.155)
(i, Ω)
TQ
K → γ T K (i, Ω)
(i, Ω) , (13.25)
i Q
For the case we are considering here, transformation (13.25) has no effect on the sta-
K
tistical equilibrium equations because the radiation field tensor JQ (νi ) appearing in
the radiative rates involves only the unpolarized solar radiation (cf. Eqs. (5.157)).
On the contrary, such transformation is fundamental in determining the Stokes pa-
rameters of the radiation emitted in a magnetic-dipole transition: it entails a sign
switch of the emission coefficients εQ and εU with respect to the usual electric-
dipole case.
The fourth difference is also connected with the low values of the Einstein coef-
ficient: even for magnetic fields as weak as 10−4 G, the parameter Hu defined in
1
The only exceptions may be provided by some UV transitions needed for the interpretation
of the green line. However, such transitions are dominated by collisional rates, thus the solar
UV spectrum plays no significant role in the statistical equilibrium equations (Casini and Judge,
1999).
THE OUTER LAYERS OF STELLAR ATMOSPHERES 717
Eq. (10.29) turns out to be much larger than unity. This means that, in the ref-
erence system of the magnetic field, all the density-matrix elements ρK Q (αJ) with
Q = 0 can simply be set to zero. From the diagnostic point of view, this also
implies that there is no possibility of measuring the magnetic field intensity from
linear polarization observations in these lines.
Finally, the fifth difference is related to the fact that the corona, as observed
in the forbidden lines mentioned above, does not present, in general, remarkable
condensations. Therefore, the diagnostic content of observations in these lines is
non-local, and one must ultimately rely either on coronal models of the density and
magnetic field, or on tomographic techniques based on the hypothesis of a rigid or
quasi-rigid rotation.
The peculiarities outlined above have some consequences that are worth to be
discussed in detail. The first thing to notice is that the statistical equilibrium
equations for the density matrix take, in this case, a relatively simple form. Writing
these equations in the reference system of the magnetic field, it turns out that
the only non-vanishing statistical tensors are those of the form ρK Q (αJ) with K
even and Q = 0. The condition on Q has been discussed above, whereas the
condition on K follows from arguments similar to those presented in Sect. 10.7,1
which remain valid even considering the presence of collisions and/or stimulation
effects. It follows that for an atomic model composed of N levels with quantum
numbers (αi Ji ), with i = 1, . . . , N , the total number of equations (and unknowns)
is ⎧
⎪
⎪ %
N
⎪
⎨ (Ji + 1) for Ji integers
i=1
Neq =
⎪
⎪ %
N
⎪
⎩ (Ji + 12 ) for Ji half-integers .
i=1
hν
)=
εi (ν, Ω N 2Ju + 1 A(αu Ju → α J )
0
4π
× ) ρK (α J ) Φ̂KK (J , J ; ν) ,
γi T 0K (i, Ω 0 0 u u 0 u
K even K
where γi is defined in Eq. (13.26), the tensor ρK 0 is defined in the magnetic frame,
and the symbol Φ̂KK
Q (J ,
uJ ; ν) has been introduced in Eq. (13.20). This expres-
sion considerably simplifies by introducing the angle Θ between the magnetic field
vector and the direction Ω , and by defining the reference direction for the Stokes
0
parameters in such a way that the positive Q direction coincides with the projec-
onto the plane perpendicular to Ω
tion of B . By so doing, the components of the
0
tensor T 0K can be simply deduced from Table 5.6 with the substitutions θ = Θ,
) vanish. As a
γ = 0, which imply, in particular, that all the components T 0K (2, Ω 0
consequence, the emission coefficient in the U Stokes parameter is zero, so that the
linear polarization of the radiation emitted along Ω is either parallel or perpen-
0
dicular to the projection of the magnetic field vector onto the plane perpendicular
to Ω .1
0
An analytical expression for the non-vanishing Stokes parameters can be found
with the help of Eq. (A13.6), which is well-verified in the corona given the weak
intensity of the magnetic field. After some Racah algebra we obtain for the emission
coefficient in the four Stokes parameters
) = C ρ0 (α J ) + √1
ε0 (ν, Ω 0 0 u u 3 cos2 Θ − 1 wJ(2)J ρ20 (αu Ju ) φ̂(ν)
2 2 u
3
) = C √ sin2 Θ w(2) ρ2 (α J ) φ̂(ν)
ε1 (ν, Ω 0 Ju J 0 u u
2 2
ε (ν, Ω )=0
2 0
) = C cos Θ ν ḡ ρ0 (α J ) + ∆ ρ2 (α J ) ∂ φ̂(ν)
ε3 (ν, Ω 0 L 0 u u 0 u u , (13.27)
∂ν
where
hν
C= N 2Ju + 1 A(αu Ju → α J ) ,
4π
and where wJ(2)J is the symbol defined in Eq. (10.12) – see also Table 10.1 – , φ̂(ν)
u
is given by Eq. (13.24), νL is the Larmor frequency given by Eq. (3.10), ḡ is the
effective Landé factor of Eq. (3.44), and ∆ is given by
⎡
∆ = −3 2Ju + 1 ⎣ gα J (−1)1+J −Ju Ju (Ju + 1)(2Ju + 1)
u u
! !
2 1 1 1 1 1
×
Ju Ju Ju J Ju Ju
⎧ ⎫⎤
⎨ J Ju 1⎬
+ gα J (J + 1)(2J + 1) J Ju 1 ⎦. (13.28)
J
⎩ ⎭
1 2 1
1
Note that this property is valid locally; line-of-sight integration of the emission coefficient
can obviously invalidate this statement.
THE OUTER LAYERS OF STELLAR ATMOSPHERES 719
value of ρ20 (αu Ju ) can be obtained from Eq. (10.50). One has
(2) 2
2J + 1 B(α J → αu Ju ) wJu J J0 (ν0 ) 0
ρ20 (αu Ju ) = ρ (α J ) , (13.29)
2Ju + 1 A(αu Ju → α J ) 1 + + δu(2) 0
where the quantities and δu(2) are defined in Eq. (10.51), and where J02 (ν0 ) is
the radiation field tensor evaluated in the magnetic frame. Taking into account
Eqs. (13.14)-(13.15) and the expression of the rotation matrix D00 2
(see Table 2.1),
it is easily found, by substitution of Eq. (13.29) into Eq. (13.27), that the sign of
) is the same as the sign of the quantity (3 cos2 θ −1), where θ is the angle
ε1 (ν, Ω 0 B B
between the magnetic field vector and the vertical. Bearing in mind the definition
of the Van Vleck angle θV (see Eq. (5.100)), one has1
)>0
ε1 (ν, Ω )
(linear polarization parallel to B for 0 < θB < θV
0
or π − θV < θB < π
)<0
ε1 (ν, Ω ) for θ < θ < π − θ .
(linear polarization perpendicular to B
0 V B V
It should be mentioned that the diagnostics of coronal magnetic fields based on ob-
servations of linear polarization in forbidden lines was first suggested by Charvin
(1965), and that important theoretical contributions where brought by House
(1970a,b; 1971), Sahal-Bréchot (1974a,b; 1977) and, more recently, by Casini and
Judge (1999). From the observational side, successful results were obtained by
Mickey (1973), Arnaud (1982), Querfeld and Smartt (1984), and Arnaud and
Newkirk (1987).
Obviously, the diagnostics of magnetic fields in the corona can also be performed
via the more traditional techniques based on circular polarization. For such ob-
servations, the magnetograph relation (Eq. (11.2)) is still valid, although the effec-
tive Landé factor, ḡ, should be corrected to allow for atomic polarization. From
Eqs. (13.27) we easily obtain
∂I
V (λ) = −∆λB ḡ κ cos Θ ,
∂λ
where the ‘correction factor’ κ is given by (cf. Casini and Judge, 1999)
∆
1+ ḡ σ02 (αu Ju )
κ= ,
1+ 1
√
2 2
(3 cos2 Θ − 1) wJ(2)J σ02 (αu Ju )
u
with ∆ given by Eq. (13.28) and σ02 (αu Ju ) = ρ20 (αu Ju )/ρ00 (αu Ju ).
Early attempts of measuring the longitudinal component of the coronal magnetic
field through this technique were made by Harvey (1969), who could only give
1
Note that this conclusion is just the opposite of that obtained in Sect. 5.8 for the resonance
scattering of a classical oscillator. The difference is due to the fact that now we are dealing with
a magnetic-dipole transition.
720 CHAPTER 13
an upper limit of about 40 G. Recently Lin et al. (2000) have detected clear
antisymmetrical circular polarization signals in the FeXIII λ 10747 line from a
coronal condensation located over an active region. Two observations, interpreted
according to Eq. (11.2), give magnetic field components of 33 G at 0.15 R , and
10 G at 0.12 R , respectively.
We will now consider a physical situation more complicated than that of Sects. 13.3-
13.5. We still assume that the particle density is much smaller than the critical
value nc of Eq. (13.6), so that the generalized Boltzmann term can be neglected;
but we suppose that the transition frequencies involved in the model atom may
correspond to significant spectral features of the incident radiation field, so that
– contrary to the previous sections – the density matrix of atoms moving with ve-
locity v will depend on v . Under such circumstances, the density matrices of atoms
moving with different velocities are completely decoupled (see Sect. 13.2). The
v -dependent density matrix can be determined by solving the ‘ordinary’ statistical
equilibrium equations, which are written in the atomic rest frame (the comoving
frame introduced in Sect. 12.4). Once the equations are solved, the emissivity in
that frame can be calculated from the expressions derived in Chap. 7. To ob-
tain the emissivity in the observer’s frame, we must take into account the Doppler
shifts corresponding to the velocities of the individual atoms and integrate over the
velocity distribution.
In this section we illustrate these concepts by referring to a simple case where
an approximate analytical expression can be found for the v -dependent density
matrix. The problem that we are going to address consists in finding the radiation
scattered by a collection of two-level atoms (with unpolarized lower level) having an
arbitrary distribution of velocities and illuminated by an anisotropic and frequency-
dependent radiation field. The basic features of this problem have already been
analyzed by Sahal-Bréchot et al. (1998), who also carried out an application to
the OVI coronal line at 1032 Å, and by de Kertanguy (1998) who studied the
polarization of Hα in solar spicules.
Let us consider an atom located at a point P in the outer layers of a stellar atmo-
sphere. The two-level model atom with unpolarized lower level has been analyzed
in Sects. 10.1-10.2, where it was found that, neglecting the effects of magnetic fields
and collisions, the upper-level statistical tensors are given by Eq. (10.13). That ex-
pression is valid in the comoving frame. In the observer’s frame, the upper-level
statistical tensors can be written in the form
K
where [JQ (ν0 )]v is given by Eq. (13.9),
5 , -
K dΩ K
3
v · Ω .
JQ (ν0 ) v = T (i, Ω) Ii ν0 1 + ,Ω (13.31)
4π i=0 Q c
Substitution of Eq. (13.30) into Eq. (13.32) leads, with the help of Eqs. (10.6),
(9.5), and (10.17), to the expression
= kA
εi (ν, Ω) WK (J , Ju ) (−1)Q T Q
K
(i, Ω)
L
KQ
, -
v · Ω K
× 3
d v f (v ) δ ν0 1 + − ν J−Q (ν0 ) v . (13.33)
c
1
A rigorous treatment of this problem requires more sophisticated theories of frequency-
redistribution effects.
2
This substitution, joined with the approximation contained in Eq. (13.30), is often referred
to as the coherent-scattering approximation. It introduces an error of the order of Γ/∆νe , where
∆νe is the width of the emission profile.
722 CHAPTER 13
; ν, Ω
where the redistribution function Rij (ν, Ω ) is given by1
; ν, Ω
Rij (ν, Ω ) =
v · Ω v · Ω
= Pij (Ω, Ω ) d v f (v ) δ ν − ν0 − ν0
3
δ ν0 − ν + ν0 . (13.35)
c c
where J¯QK
(ν0 ) is given by Eq. (13.8).
Equations (13.33) and (13.36) are deeply different. The former involves a sin-
gle integral over the velocity space, while the latter involves the product of two
independent integrals. Only when the radiation field presents no variation across
a spectral interval centered at ν0 and encompassing all the possible Doppler shifts
induced by the velocity distribution f (v ), do the two equations coincide – because,
in that case, [J−QK
(ν0 )]v is independent of v and equal to J¯−Q
K
(ν0 ). Obviously, it
1
The notation employed here for the redistribution function differs from that employed in
the classical book of Mihalas (1978). Apart from the obvious difference that we have here a 4 × 4
matrix-type redistribution function instead of a scalar one, there is an ordering inversion in the
arguments. What we write as R00 (ν, Ω ; ν, Ω
) is written by Mihalas as R(ν ,
n ; ν, n).
THE OUTER LAYERS OF STELLAR ATMOSPHERES 723
Fig.13.13. Geometry of a scattering process in the outer layers of the solar atmosphere, in the
) is the same as in Fig.13.1.
presence of a velocity distribution. The observer’s frame (ea ,eb , Ω 0
The direction of the atomic velocity v is specified by the angles θv and χv .
is Eq. (13.33) that gives, in general, the correct physical description of scattering
phenomena in very diluted plasmas.
It is also interesting to observe that the two equations give the same result for
the frequency-integrated emission coefficient. Bearing in mind Eq. (13.3), one gets
≡ dν ε (ν, Ω)
ε̃ (Ω) = ε̃ (Ω)
≡ dν ε (ν, Ω) =
i i i c.r.v. i c.r.v.
= kLA WK (J , Ju ) (−1)Q T Q
K J¯K (ν ) .
(i, Ω) (13.37)
−Q 0
KQ
where
vx = v sin θv cos χv ,
and where
√
1 3
L0 (v ) = J00 (ν0 ; v, θv , 0) − √ W2 J02 (ν0 ; v, θv , 0) + W2 cos 2χv J22 (ν0 ; v, θv , 0)
2 2 2
√
3 3
L1 (v ) = √ W2 J0 (ν0 ; v, θv , 0) +
2
W2 cos 2χv J22 (ν0 ; v, θv , 0)
2 2 2
√
L2 (v ) = 3 W2 sin χv J12 (ν0 ; v, θv , 0)
L3 (v ) = 0 . (13.39)
The expressions just derived show the existence of an important physical phe-
nomenon induced by velocity on the polarization of the scattered radiation. As
shown in Sect. 12.4, the component J12 (ν0 ; v, θv , 0) of the comoving frame radiation
field tensor is generally non-zero. Therefore, as apparent from Eqs. (13.40) and
(13.39), the frequency-integrated U Stokes parameter of the scattered radiation
does not vanish in general (unless the velocity distribution is isotropic or at least
symmetrical about the vertical). This implies a rotation of the polarization direc-
tion with respect to the tangent to the solar limb. From this point of view, an
anisotropic distribution of velocities acts similarly to a magnetic field, and much
care has to be taken, when interpreting observations, to disentangle the rotation
induced by velocities from that due to the Hanle effect.
As an application of Eq. (13.40) to the diagnostics of velocities, we consider a
stream of atoms moving with a bulk velocity w and we suppose, for the sake of
simplicity, that there is no dispersion around this value, so that the function f (v )
is given by
f (v ) = δ (3) (v − w
).
THE OUTER LAYERS OF STELLAR ATMOSPHERES 725
Fig.13.14. Velocity diagram for the scattering by a two-level atom in the geometry of Fig.13.13.
The scattering point is characterized by δ = 0◦ and h = 0.01 stellar radii. The stellar spectrum
presents an absorption line, characterized by dc = 0.8 and u1 = 0.2 (see Eq.(12.41)), at the
transition frequency. The atom moves relative to the star with a velocity w of fixed modulus.
The ratio between the Doppler shift induced by w and the width of the absorption line is ω = 1.4
(see Eq.(12.45)). Full lines correspond to χw = const., broken lines to θw = const. The parameter
W2 in Eqs.(13.39) is set to unity, corresponding to the transition (J = 0, Ju = 1).
) = k A L (w
ε̃i (Ω 0 L i ),
)
ε̃Q (Ω )
ε̃U (Ω
0 0
p̃Q = , p̃U =
ε̃ (Ω )
ε̃ (Ω )
I 0 I 0
against each other, while allowing the angles θw and χw to vary in the intervals
0◦ ≤ θw ≤ 90◦ , −90◦ ≤ χw ≤ 90◦ .
726 CHAPTER 13
Fig.13.15. Velocity diagram for the same physical situation considered in Fig.13.14, obtained by
varying the velocity modulus. Only the ‘external borders’, corresponding to χw = −90◦ and
+90◦ (right and left side respectively, cf. Fig.13.14) are shown. Curves are labelled by the value
of ω (see Eq.(12.45)).
The velocity diagram of Fig. 13.14 presents a high degree of degeneracy. From
Eqs. (13.39), and from the properties of the comoving-frame radiation field tensor,1
it can easily be seen that both the transformation χw → π − χw and the transfor-
mation w → −w (i.e., θw → π − θw , χw → π + χw ) leave the quantities p̃Q and
p̃U unchanged. Therefore, each point of the diagram corresponds to four possible
determinations: the nominal values (θw , χw ) and the combinations (θw , π − χw ),
(π − θw , π + χw ), (π − θw , −χw ). A further reason of degeneracy is the velocity
modulus, kept fixed to a specific value in the diagram. Obviously, for w = 0 the
diagram reduces to a single point, representing the fractional linear polarization
due to the scattering on a static atom. When w is increased the diagram grows,
reaching its maximum dimensions for a definite w value, and then decreases, as
illustrated in Fig. 13.15. It can be shown that for w → ∞ the diagram reduces
again to a single point, which coincides with the point corresponding to w = 0
because of the assumption of a frequency-independent limb-darkening coefficient.
1
Note that Eqs. (A17.1) and (A17.9) yield the relation
K
JQ (ν0 ; v, π − θv , 0) = e iQπ JQ
K
(ν0 ; v, θv , 0) .
THE OUTER LAYERS OF STELLAR ATMOSPHERES 727
Fig.13.16. Observer’s view of resonance polarization due to an atom moving with velocity w
off the solar limb. The velocity vector lies in the plane of the sky. The direction of linear
polarization (solid line) is intermediate between the parallel to the solar limb (dashed line) and
the perpendicular to the velocity (dotted line).
Fig.13.17. Velocity diagram for an emission line. All the parameters are the same as in Fig.13.14
except for dc and ω, now set to −4 and 2.5, respectively.
to the opposite orientation (parallel to w ) of the dotted line in Fig. 13.16. The
resulting polarization will thus be rotated in the counterclockwise direction with
respect to the parallel to the solar limb.
Consider now the effect of velocity/density-matrix correlations on the polariza-
tion profiles of the scattered radiation. The frequency-dependent emission coef-
ficient is given by Eq. (13.38), and strongly depends on the velocity distribution
f (v ) of the atoms. We consider here the case of a displaced Maxwellian, namely
Fig.13.18. Stokes profiles of the radiation scattered in the direction Ω (Fig.13.13 with δ = 0◦ )
0
by an ensemble of atoms characterized by the velocity distribution of Eq.(13.41). The profiles are
normalized to the maximum intensity and are plotted against the reduced frequency (ν0 −ν)/∆νD .
The solid line is obtained by taking velocity/density-matrix correlations into account, the broken
line corresponds to complete redistribution on velocities. The combination of parameters is:
dc = 0.8, u1 = 0.2, h = 0.01 R , ω = 1, w = vT , θw = 60◦ , χw = −45◦ , W2 = 1 (see text
for explanation of the symbols). Note the blue-shift due to the negative component of the bulk
velocity w
along the line of sight.
)
εQ (ν, Ω )
εU (ν, Ω
)= 0 )= 0
pQ (ν, Ω 0 , pU (ν, Ω 0
ε (ν, Ω ) ε (ν, Ω )
I 0 I 0
are compared with the corresponding quantities obtained under the assumption of
complete redistribution on velocities. The figure clearly shows the exact symmetry
730 CHAPTER 13
of these profiles about line center (which reflects the symmetry of the intensity
)
profile of the photospheric absorption line) and the substantial decrease of pU (ν, Ω 0
in the wings. This is a further similarity between the Hanle effect and the effects
of a velocity distribution on resonance polarization.
The results obtained in the former section can be easily generalized to the case
where a weak magnetic field is present. By ‘weak’ we mean here a field such that
νL ∆νp ,
so that the flat-spectrum approximation is well-verified (see Sect. 13.1) and the
splittings between the different Zeeman components of the transition can be ne-
glected. The results obtained by this approach are obviously in error by a quantity
of order νL /∆νp – or Γ/∆νp , where Γ is the natural width in frequency units of
the upper level of the transition, if this ratio is larger than the former. We recall
again that more sophisticated theories of frequency redistribution are needed to
overcome such limitation.
The generalization of the equations of the former section to the case of the Hanle
effect is obtained starting from the expression of the upper-level statistical tensors
in the presence of a magnetic field (i.e., from Eq. (10.27) instead of Eq. (10.13)).
THE OUTER LAYERS OF STELLAR ATMOSPHERES 731
2J + 1 B(α J → αu Ju ) K
ρK
Q (αu Ju ;
v) = wJ(K)J (−1)Q J−Q (ν0 ) v
2Ju + 1 A(αu Ju → α J ) + 2πi νL gα J Q u
u u
where all the tensorial quantities are defined in a reference system having the z-axis
in the magnetic field direction.1 On the other hand, the expression for the emission
coefficient given in Eq. (13.32) is still valid. Substitution of Eq. (13.42) yields
1
= kA
εi (ν, Ω) WK (J , Ju ) (−1)Q T Q
K
(i, Ω)
L
1 + i QHu
KQ
, -
v · Ω K
× d3v f (v ) δ ν0 1 + − ν J−Q (ν0 ) v , (13.43)
c
5
3
dΩ
= kA
εi (ν, Ω) L dν ; ν, Ω
Rij (ν, Ω ; B)
I (ν , Ω
j
) ,
4π j=0
where
; ν, Ω
Rij (ν, Ω ; B)
= P (Ω, ; B)
Ω
ij
v · Ω v · Ω
× d v f (v ) δ ν − ν0 − ν0
3
δ ν0 − ν + ν0 , (13.44)
c c
Ω
Pij (Ω, ; B)
being the quantum-mechanical scattering phase matrix in the pres-
ence of a magnetic field of Eq. (10.33). It is easily seen that Eq. (13.36) generalizes
into
1
εi (ν, Ω) = kLA WK (J , Ju ) (−1)Q T Q
K
(i, Ω)
c.r.v. 1 + i QHu
KQ
/ , 0
-
v · Ω
× 3
d v f (v ) δ ν0 1 + −ν J¯−Q
K
(ν0 ) , (13.45)
c
Fig.13.20. Velocity-Hanle diagram corresponding to the physical situation of Fig. 13.14 with
the additional presence of a magnetic field of constant modulus (Hu = 2) parallel to w
. All the
remaining parameters have the same values as in Fig.13.14.
The expressions now derived depend on a very large number of parameters. A few
applications are described in the following.
First of all, we will examine how the frequency-integrated linear polarization
of the scattered radiation is affected by the simultaneous presence of a velocity
distribution and a magnetic field. To make an example, we consider the physical
situation of Fig. 13.14, where the atoms move with a unimodal velocity w and the
incident radiation presents an absorption line at the transition frequency, and we
‘add’ a magnetic field B of constant modulus parallel to w.
The resulting ‘velocity-
Hanle diagram’ is shown in Fig. 13.20. The mirror-like diagram of Fig. 13.21 refers
to the same situation with B antiparallel to w.
Comparison of Figs. 13.20-13.21 with Fig. 13.14 shows that the magnetic field has
an important effect on the frequency-integrated fractional polarization, especially
for large inclinations with respect to the vertical. Another obvious effect of the
THE OUTER LAYERS OF STELLAR ATMOSPHERES 733
magnetic field is the symmetry breaking of the diagram. Whereas in the zero-field
case the transformation
χw → −χw
produces a sign switch in p̃U (leading to a diagram symmetrical about the vertical
axis), this symmetry is lost in the magnetic case and is replaced by a more general
one,
p̃Q → p̃Q
for χw → −χw and B → −B ,
p̃U → −p̃U
where B → −B has to be understood in the sense that the magnetic field vector
changes from parallel to antiparallel to the velocity vector w.
Similarly to the non-magnetic case discussed in Sect. 13.6, the velocity-Hanle
diagrams of Figs. 13.20-13.21 present a high degree of degeneracy. It can indeed
be shown from Eq. (13.46), using the properties of the comoving-frame radiation
field tensor and those of the rotation matrices, that a single point in the diagram
of Fig. 13.20 corresponds to four determinations: the nominal set of parameters
θw , χw , ξ = +1
Fig.13.22. Velocity-Hanle diagram for a magnetic field of constant modulus (Hu = 0.4) and
direction (pointing to the observer). The other parameters have the same values as in Fig.13.14.
θw , π − χw , ξ = −1
π − θw , π + χw , ξ = −1
π − θw , −χw , ξ = +1 ,
Fig.13.23. Same as Fig.13.18 except for the direction of the bulk velocity w , here parallel to
the local vertical (θw = 0◦ ), and for the presence of a magnetic field pointing in the same
direction (θB = 0◦ , Hu = 2 ). Full line: taking velocity/density-matrix correlations into account
(Eq.(13.43)). Dashed line: assuming complete redistribution on velocities (Eq.(13.45)).
ASTROPHYSICAL APPLICATIONS:
STELLAR ATMOSPHERES
1
The name ‘non-LTE’ reminds the upgrading of the older and simpler theory based on the
hypothesis of Local Thermodynamic Equilibrium (LTE).
738 CHAPTER 14
radiative
collisions transfer
sol
equations
te
wri
(atmospheric
ve
model)
compare with
ρ α νΩ
observed νΩ
magnetic
sol
field statistical
ve
te
wri
equilibrium
equations
Fig.14.1. Schematic representation of the non-LTE problem of the 2 nd kind. In principle, the
problem is solved by iterating the operations indicated in the loop until self-consistency is reached.
Φ(να J Mu , α J M − ν) → Φ̂(να J Mu , α J M − ν) =
u u u u
, -
v · Ω
= 3
d v f (v ) Φ να J Mu , α J M 1+ −ν . (14.1)
u u c
Substituting the values of ρK Q (αJ) at a given point into Eqs. (7.15), we can compute
the transfer coefficients at that point for any direction and frequency.
On the other hand, the knowledge of the transfer coefficients at each point of
the medium allows one to solve, via the methods developed in Chap. 8, the trans-
fer equation for the (polarized) radiation travelling along any given direction. In
other words, we can compute the Stokes parameters I(ν, Ω), Q(ν, Ω), U (ν, Ω),
and
V (ν, Ω) at any point.
This allows one to compute the radiation field tensor J¯Q K
(ν) of Eq. (13.8) at each
point, and then the radiative rates appearing in the statistical equilibrium equations
via Eqs. (7.14).1 Adding the collisional rates of Sect. 7.13 (see Eq. (7.101)), and
solving the statistical equilibrium equations, we obtain new values for the statistical
tensors ρKQ (αJ).
2
The loop of Fig. 14.1 is thus closed, and – at least in principle – it is sufficient to
repeat it as many times as needed until a self-consistent solution for the statistical
tensors and the radiation field is reached, which solves the non-LTE problem of the
2nd kind.
In the following sections we will present some practical applications of the scheme
just described. However, before proceeding, we feel necessary to add some general
remarks and considerations which may help to clarify the physical and mathemat-
ical aspects of the problem.
a) Under the limit of very large depolarizing rates, or, in other words, when the
quantities D(K) (αJ) defined in Eq. (7.102) are such that
D(K) (αJ) → ∞ (K ≥ 1)
for each (αJ)-level of the atomic model, all the statistical tensors – except those
of rank 0 – vanish. In this case there is no atomic polarization, and the physical
state of the atom is completely specified by the level populations (or the statistical
tensors ρ00 (αJ)). If, in addition, the magnetic field is zero throughout the medium,
the radiation field is unpolarized, so it is fully specified by its intensity (the Stokes
parameter I(ν, Ω)). As a result, the non-LTE problem of the 2nd kind reduces to
the non-LTE problem of the 1st kind. The latter is thus a special case of the former,
corresponding to the two conditions just specified.
1 K (ν) with J¯K (ν) in Eqs. (7.14) is the obvious counterpart of the
The replacement of JQ Q
substitution in Eq. (14.1).
2
In this chapter we adopt the assumption of complete redistribution on velocities in its more
restrictive meaning, where the generalized Boltzmann term is neglected (see comments at the end
of Sect. 13.2). Accordingly, the contributions of Eq. (13.12) are disregarded.
740 CHAPTER 14
te
wri
(atmospheric
model)
ρ α
magnetic
field
Fig.14.2. The self-consistency loop of Fig.14.1 is ‘broken off’ by the LTE assumption, which sets
the density-matrix elements to fixed values, independent of the radiation field.
A possibility is provided by the LTE hypothesis, which implies that the statistical
tensors at any point of the medium are univocally determined by the local values
of the thermodynamic parameters. The statistical equilibrium equations are in
this case ‘implicitly solved’, so that – as schematically illustrated in Fig. 14.2 –
to obtain the Stokes parameters of the radiation travelling along any direction in
the medium one only needs to solve the radiative transfer equations. Most of the
results obtained in Chaps. 9 and 11 are based on this approach.
The second possibility takes place when the medium is optically thin at each of
the wavelengths corresponding to the different transitions involved in the model
atom considered, so that the radiation field impinging on the atom is fixed by the
boundary conditions. Knowing the radiation field and the thermodynamic parame-
ters of the medium, one can write down the statistical equilibrium equations. Their
solution provides the values of the statistical tensors, which allow one to determine
the emissivity and absorptivity properties of the medium and, consequently, the
spectropolarimetric profiles of the emerging radiation. This second possibility is
illustrated in Fig. 14.3. The results obtained in Chaps. 10 and 13 follow from an
approach of this kind.
f) The final remark concerns the limitations related to the flat-spectrum approx-
imation and to the approximation of complete redistribution on velocities. The
problems raised by the flat-spectrum approximation have already been discussed,
though in a different context, in Sect. 13.1. The conclusions that have been reached
there can be easily transferred to the non-LTE problem of the 2nd kind. In partic-
ular, one has to be especially careful with the limitations on the magnetic field’s
intensity (νL ∆νp ) when dealing with the multi-level atom, and with the lim-
itations of Eq. (13.1) when dealing with the multi-term atom or the multi-level
atom with hyperfine structure. If such limitations are not satisfied, one can still
treat the non-LTE problem of the 2nd kind under the flat-spectrum approximation
provided the interferences which violate the limitations themselves are set to zero
742 CHAPTER 14
te
wri
(atmospheric
model)
ρ α νΩ
magnetic
sol
field statistical
ve
te
wri
equilibrium
equations
Fig.14.3. In an optically thin plasma – for instance in a solar prominence – the radiation field
illuminating the atomic system is known a priori. This again ‘breaks off’ the self-consistency loop
of Fig.14.1.
ab initio. This approach fails to give an accurate description of the far wings of
the lines, but this is usually a minor drawback since the diagnostic content of such
spectral intervals is often limited. As remarked in Sect. 13.1, only for particularly
strong spectral lines this approach proves to be inadequate, and a theory including
partial frequency redistribution effects is required.
As far as the approximation of complete redistribution on velocities is concerned,
the situation is quite different. In principle, this approximation might be dropped
by introducing – as in Sect. 13.2 – velocity/density-matrix correlations, thus de-
scribing the atomic system in terms of velocity-dependent statistical tensors. This
implies, however, a dramatic increase in the number of unknowns, which has so far
discouraged any attempt to tackle the non-LTE problem of the 2nd kind in these
more general terms. Yet, the results on resonance scattering and the Hanle effect
obtained in Chap. 13 clearly show that the introduction of velocity/density-matrix
correlations into the non-LTE theory may be important for the correct interpre-
tation of spectropolarimetric profiles. For the time being, the approximation of
complete redistribution on velocities can simply be considered as an artifice used
to bring an extremely complex problem to a tractable form.
possible atomic model, namely a two-level atom without hyperfine structure and
with unpolarized lower level. Consistently with the last assumption, we suppose
that the radiation field impinging on the atom is weak, in the sense that the average
number of photons per mode, n̄, is much smaller than unity, which allows one to
neglect stimulated emission.
We suppose that a collection of such atoms is distributed within a static medium
of arbitrary shape. In this medium the atoms interact with a magnetic field, B,
and with a population of colliding particles having a Maxwellian distribution of
velocities characterized by the temperature T . The atoms have also a Maxwellian
distribution of velocities, such that the corresponding Doppler width of the absorp-
tion profile is ∆νD (in frequency units).
No restriction is made on the spatial variation of the temperature T of the col-
liders and of the magnetic field vector B, as well as on the densities of the atoms
and of the colliders. However, we suppose for simplicity the Doppler width ∆νD
to be constant throughout the medium.
Furthermore, we suppose that the magnetic field is weak (in the sense that the
associated Larmor frequency νL is much smaller than ∆νD ) and that the inverse life-
time of the upper level, γu , is also much smaller than ∆νD , so that the flat-spectrum
approximation is satisfied.1 Finally, we adopt the approximation of complete redis-
tribution on velocities, which implies that at any point P of the medium the atom
can be described by a unique, velocity-independent density matrix that we denote
by K
ρQ (αJ) x ,
with (αJ) = (αu Ju ) for the upper level and (αJ) = (α J ) for the lower level, where
x is the coordinate of point P.
Now we write the statistical equilibrium equation for the multipole moments of
the upper level. Such equation has already been written down explicitly in Chap. 10
(see Eq. (10.1)): however, Eq. (10.1) holds in a reference system with the z-axis
parallel to the magnetic field. Since in the present case the magnetic field vector
is allowed to vary in the medium, it is convenient to introduce a fixed reference
system Σ and to characterize the direction of the field at each point through the
angles θB and χB defined in Fig. 14.4. Then we have to transform Eq. (10.1) as
explained in Sect. 7.12. This leads to the following equation2
d K K
ρQ (αu Ju ) x = −2πi νL gα J KQQ
K
ρQ (αu Ju )
x
dt u u
Q
+ TA (αu Ju KQ, α J K Q ) ρK
Q (α J ) x
+
K Q
1
The applicability of the flat-spectrum approximation follows from the two inequalities
γu ∆νD and νL ∆νD . The latter obviously implies an upper limit on the magnetic field
intensities that can be handled by this formalism.
2
Note that the last two lines of Eq. (10.1) are not affected by a rotation of the reference
system because of the assumed isotropy of collisions.
744 CHAPTER 14
θ Ω
θ
Ω γ
χ
χ
Fig.14.4. The fixed reference system Σ ≡ (xyz) used for the calculations of this section. At each
point P of the medium, the magnetic field vector is specified by the angles θB and χB . ea (Ω)
are the polarization unit vectors for the radiation flowing through P in the direction
and eb (Ω)
≡ (θ, χ).
Ω
− RE (αu Ju KQK Q ) + RS (αu Ju KQK Q ) ρK
Q (αu Ju ) x
K Q
2J + 1 (K)
+ CI (αu Ju , α J ) ρK
Q (α J ) x
2Ju + 1
− CS(0) (α J , αu Ju ) + D(K) (αu Ju ) ρKQ (αu Ju ) x
, (14.2)
where all the rates are evaluated at point x, νL is the Larmor frequency at the
same point, and the kernel KQQ
K
is explicitly given by Eq. (7.79),
∗
KQQ
K
= DQ
K
Q (RB ) Q DQ
K
Q (RB ) , (14.3)
Q
RB being the rotation that carries the local ‘magnetic’ reference system (having
the z-axis aligned with the magnetic field) into the fixed reference system Σ. In
terms of Euler angles one simply has
where γB is an arbitrary angle that can be set to zero. The main properties and
the explicit expressions of the components of KQQ
K
are given in App. 19.
STELLAR ATMOSPHERES 745
The assumptions made at the beginning of this section yield two basic simplifi-
cations in Eq. (14.2):
- because stimulation effects are neglected, the relaxation rate RS is zero;
- because lower-level polarization is neglected, the statistical tensors of the lower
level reduce to ρK 0
Q (α J ) = ρ0 (α J ) δK0 δQ0 . This implies that the only radiative
rate TA needed in Eq. (14.2) is TA (αu Ju KQ, α J 0 0). Such rate is explicitly given
– in the atomic rest frame – by Eq. (10.9). Because of the assumption of complete
K
redistribution on velocities, the radiation field tensor JQ (ν0 ) in Eq. (10.9) has to be
¯K
replaced with the ‘average’ radiation field tensor JQ (ν0 ) given by Eq. (13.8). And
since the velocity distribution of the atoms is assumed to be Maxwellian, J¯Q K
(ν0 )
is given by (cf. Eqs. (13.10) and (13.11))
∞
J¯Q
K
(ν0 ) = dν p(ν0 − ν) JQ
K
(ν) , (14.5)
−∞
with ν
0
−ν 2
1 1 −
p(ν0 − ν) = √ e ∆νD
. (14.6)
π ∆νD
Recalling the expression of the radiative rate RE (Eq. (7.14e)), we can rewrite
Eq. (14.2) in the form
d K K
ρQ (αu Ju ) x = −2πi νL gα J KQQ
K
ρQ (αu Ju )
x
dt u u
Q
− A(αu Ju → α J ) + CS(0) (α J , αu Ju ) + D(K) (αu Ju ) ρKQ (αu Ju ) x
2J + 1
+ B(α J → αu Ju ) wJ(K)J (−1)Q J¯−QK
(ν0 )
2Ju + 1 u
+ δK0 δQ0 CI(0) (αu Ju , α J ) ρ00 (α J ) x ,
c2 2Ju + 1
= wJ(K)J (−1)Q J¯−Q
K
(ν0 ) + δK0 δQ0 BP (T ) ρ00 (α J ) x , (14.8)
2hν03 2J + 1 u
746 CHAPTER 14
where
hν
2hν03 − k T0
BP (T ) = e B (14.9)
c2
is the Planck function in the Wien limit (where stimulation effects are neglected).
In view of the following applications, it is convenient to rewrite Eq. (14.8) in
a more compact form by introducing suitable ‘source functions’ for the different
statistical tensors. Defining
2hν 3 2J + 1 ρKQ (αu Ju )
K
SQ (x ) = 20 x , (14.10)
c 2Ju + 1 ρ00 (α J ) x
K
The quantities SQ (x ) can be referred to as the irreducible components of the two-
level source function. They represent the obvious extension of the usual concept
of source function to the ‘polarized case’, as it is easily seen by considering the
(K = 0, Q = 0) component
2hν 3 2J + 1 ρ00 (αu Ju ) x
S00 (x ) = 20 ,
c 2Ju + 1 ρ00 (α J ) x
where nα J and nα J are the overall populations at point x of the upper and
u u
lower level, respectively. The last expression shows that S00 (x ) is nothing but
the ‘classical’ source function for a two-level atom under the limit of negligible
stimulation effects. It should also be noticed that, according to Eq. (14.11), S00 (x )
obeys the equation (see Eqs. (7.102), (14.7), (10.14) and (A19.2))
or
S00 (x ) = (1 − ) J¯00 (ν0 ) + BP (T ) , (14.13)
where
CS(0) (α J , αu Ju )
= = .
1+ A(αu Ju → α J ) + CS(0) (α J , αu Ju )
STELLAR ATMOSPHERES 747
Equation (14.13) is the ‘classical’ expression relating the source function to the
angle and frequency averaged radiation field and to the Planck function.
We point out that the preceding equations (particularly Eqs. (14.8) and (14.13))
are basically the same as those derived in Sect. 10.6 (see Eqs. (10.50) and (10.52),
respectively). The only difference is that Eq. (10.50) holds in a reference system
where the atom is at rest and with the z-axis parallel to the magnetic field, whereas
Eq. (14.8) holds in the fixed reference system Σ of Fig. 14.4, the atoms being
characterized by an assigned velocity distribution and the magnetic field’s direction
by the angles θB , χB . This implies the appearance in Eq. (14.8) of the kernel KQQ
K
d 3
=−
Ii (ν, Ω) A
Kij +ε
Ij (ν, Ω) (i = 0, . . . , 3) , (14.14)
i
ds j=0
hν
ηiA = N (2J + 1) B(α J → αu Ju )
4π
!
√ 1 1 0
× 3 (−1)1+J +Ju ρ00 (α J ) p(ν0 − ν) δi0 ,
J J Ju
1
It should be remarked that such substitution, based on the assumptions νL ∆νD and
γu ∆νD , implies the same order of approximation as the flat-spectrum approximation that we
have used to derive the statistical equilibrium equation.
748 CHAPTER 14
x
−τν ( x )
x,
=
Ii (ν, Ω) p(ν0 − ν) kLA (x ) e wJ(K)J T Q
K S K (x ) ds
(i, Ω) Q
u
x0
KQ
−τν ( x0 )
x, (b)
+e ,
Ii (ν, Ω) (14.16)
STELLAR ATMOSPHERES 749
(b) is the Stokes vector of the radiation entering the medium at point
where Ii (ν, Ω)
x0 along the direction Ω, s is the coordinate of x reckoned along Ω
(s = |x −x |),
0
and τν (x, x ) is the optical depth at frequency ν between points x and x ,
x
τν (x, x ) = p(ν0 − ν) kLA (x ) ds . (14.17)
x
From Eq. (14.16) it is now possible to find the expression for the radiation field
tensor at point x. Using Eqs. (14.5) and (5.157) we obtain two contributions,
arising from the two terms in the right-hand side of Eq. (14.16). We can write
K K
J¯Q
K
(ν0 ) = J¯Q (ν0 ) I + J¯Q (ν0 ) E , (14.18)
∞ 5
x
K dΩ K
3
J¯Q (ν0 ) I = dν p(ν0 − ν) T Q (i, Ω) ds p(ν0 − ν)
4π i=0
−∞
x0
−τν ( x)
x,
× kLA (x ) e wJ(KJ) T Q
K K
x ) ,
(i, Ω) SQ ( (14.19)
u
K Q
∞ 5
K dΩ K −τν (x,x0 ) (b)
3
J¯Q (ν0 ) E = dν p(ν0 − ν) T (i, Ω) e Ii (ν, Ω) . (14.20)
4π i=0 Q
−∞
Equation (14.19) can be cast in a simpler form by changing the double integral in
dΩ and ds into a volume integral. Since
we get
∞ −τ ( x )
K k A (x ) e ν
x,
J¯Q (ν0 ) I = dν [p(ν0 − ν)] 2 3
d x L
4π(x − x )2
−∞
3
× TQ
K
(i, Ω) wJ(KJ) T Q
K K
x ) .
(i, Ω) SQ ( (14.22)
u
i=0 K Q
Finally, we can substitute the expression of the radiation field tensor at point x
into the statistical equilibrium equation. From Eqs. (14.11), (14.18), and (14.22)
750 CHAPTER 14
we obtain
K
1 + + δu(K) SQ (x ) + iHu KQQ
K K
SQ (
x) =
Q
K
= δK0 δQ0 BP (T ) + wJ (K)
J (−1)Q J¯−Q (ν0 ) E
u
kLA (x )
+ d3 x GKQ,K Q (x, x ) SQ
K
x ) ,
( (14.23)
4π(x − x )2
K Q
where
∞
2 −τ (x,x )
GKQ,K Q (x, x ) = dν p(ν0 − ν) e ν
−∞
3
× wJ(K)J wJ(KJ)
(−1)Q T−Q
K T K (i, Ω)
(i, Ω) . (14.24)
u u
Q
i=0
kLA
dtL = − dz , (14.25)
∆νD
STELLAR ATMOSPHERES 751
and assuming that the stellar atmosphere is not illuminated by external sources of
radiation, Eq. (14.23) takes the form
K
1 + + δu(K) SQ (tL ) + iHu KQQ
K K
SQ (tL ) = δK0 δQ0 BP (T )
Q
∞ ∞ ∞
∆νD
+ dtL dx dy GKQ,K Q (x, x ) SQ
K
(tL ) , (14.26)
4π(x − x )2
0 −∞ −∞ K =0,2 Q
where the indices K and K are restricted to the values 0 and 2. The integral
over x and y can be performed by introducing cylindrical coordinates. These
calculations, that are developed in detail in App. 21, restrict the summation over
Q in the last term of Eq. (14.26) to the single value Q = Q and lead to the
following equation
K
1 + + δu(K) SQ (tL ) + iHu KQQ
K K
SQ (tL ) =
Q
∞
K
= δK0 δQ0 BP (T ) + GKQ,K Q |tL − tL | SQ (tL ) dtL , (14.27)
K =0,2 0
These quantities are real and satisfy the symmetry properties (cf. Eqs. (A20.5)
and (A20.6))
GK Q,KQ (x) = GK −Q,K −Q (x) = GKQ,K Q (x) . (14.29)
The explicit expressions of the multipolar kernels appearing in Eq. (14.27) can be
derived from Eqs. (A21.8), (A21.7) and (A20.11). We obtain
∞
1 2
G00,00 (x) = dv ϕ(v) E1 x ϕ(v)
2
−∞
∞
1 2
G00,20 (x) = √ wJ(2)J dv ϕ(v) 3E3 x ϕ(v) − E1 x ϕ(v)
4 2 u
−∞
∞
1 2
G20,20 (x) = W2 (J , Ju ) dv ϕ(v) 5E1 x ϕ(v) −12E3 x ϕ(v) +9E5 x ϕ(v)
8
−∞
∞
3 2
G21,21 (x) = W (J , J ) dv ϕ(v) E1 x ϕ(v) + E3 x ϕ(v) − 2E5 x ϕ(v)
8 2 u
−∞
752 CHAPTER 14
∞
3 2
G22,22 (x) = W (J , J ) dv ϕ(v) E1 x ϕ(v) + 2E3 x ϕ(v)
16 2 u
−∞
+ E5 x ϕ(v) . (14.30)
The remaining components can be deduced from Eqs. (14.29). In these expressions,
En is the nth exponential integral defined in Eq. (12.6), the symbol W2 (J , Ju ) is
defined in Eq. (10.17), v and ϕ(v) are the reduced frequency and the Gaussian
profile of Eq. (A21.5),
ν0 − ν 1 −v2
v= , ϕ(v) = √ e . (14.31)
∆νD π
Recalling that
∞
1
En (x) dx = (n = 1, 2, . . .) ,
n
0
and that
∞
ϕ(v) dv = 1 ,
−∞
∞ ∞
1
G00,00 (x) dx = , G00,20 (x) dx = 0 ,
2
0 0
∞
7
G2Q,2Q (x) dx = W (J , J ) (Q = 0, ±1, ±2) . (14.32)
20 2 u
0
The multipolar kernels can be evaluated from Eqs. (14.30) via numerical computa-
tions. Figure 14.6 shows a plot of these quantities obtained by Gauss-Hermite inte-
gration. The plot refers to the transition (J = 0, Ju = 1) which implies wJ(2)J = 1
u
and W2 (J , Ju ) = 1.
Equation (14.27) can be solved by means of different numerical techniques. A
possible algorithm is similar to the one described in App. 16 for the solution of the
Hopf equation and is based on a discretization grid on tL and the transformation of
the system of integral equations into a linear system of algebraic equations. Details
can be found in Bommier et al. (1991). Other methods have also been devised,
among which we wish to quote, for their speed and accuracy, the operator-splitting,
iterative methods described by Trujillo Bueno and Manso Sainz (1999).
Once the irreducible components of the source function are known, the Stokes
parameters profiles of the radiation emerging from the atmosphere can be deduced
STELLAR ATMOSPHERES 753
Fig.14.6. The multipolar kernels defined in Eqs.(14.30) are plotted as a function of x for the
transition (J = 0, Ju = 1). The curves are labelled according to the following code: 1) G00,00 ;
2) G00,20 ; 3) G20,20 ; 4) G21,21 ; 5) G22,22 . Note the sign change of curve 2 which occurs at x =
0.75125.
∞ , µτ -
−τν
=
Ii (ν, Ω) wJ J T Q
(K)
K
(i, Ω) e K
SQ ν
dτν . (14.33)
u ϕ(v)
K=0,2 Q 0
Since under this limit the irreducible components of the source function appearing
in Eq. (14.33) become independent of optical depth, the Stokes parameters of the
emerging radiation turn out to be independent of frequency. This is not the case
when continuum opacity is taken into account (see Sect. 14.7).
Particularly important for the theory of radiative transfer of polarized radiation
is the special case of Eq. (14.27) which is obtained by setting Hu = δu(K) = 0 and by
754 CHAPTER 14
∞
(1 + ) S02 (tL ) = G20,00 |tL − tL | S00 (tL ) dtL
0
∞
+ G20,20 |tL − tL | S02 (tL ) dtL . (14.35)
0
which can conveniently be used to test the accuracy of the numerical method
employed for solving the system.
Numerical results on the integration of the system of Eqs. (14.35) and on the en-
suing spectropolarimetric profiles have been published by different authors (Rees,
1978; Faurobert-Scholl and Frisch, 1989; Ivanov, 1990; Bommier et al., 1991; Bom-
mier and Landi Degl’Innocenti, 1996; Ivanov et al., 1997; Trujillo Bueno and Manso
Sainz, 1999). Other numerical results on the profiles have been obtained as partic-
ular cases of more general approaches involving partial redistribution effects (see
Sect. 14.8 and references therein). In Figs. 14.7-14.9 we show some examples for
the transition (J = 0, Ju = 1).
A particularly significant numerical result is the surface value of the ratio of the
two components of the source function, σ = S02 (0)/S00 (0). This ratio is related
1 2 with
In the system of Eq. (14.27) with Hu = 0, the equations involving the components SQ
Q = 0 are decoupled from the equation containing the source term BP (T ). As a consequence,
such components are zero.
2 √
In our notations, this law should better be referred to as the generalized - law.
STELLAR ATMOSPHERES 755
Fig.14.7. The logarithm of the component S00 (tL ) of the source function (normalized to BP ), and
the ratio S02 (tL )/S00 (tL ) – upper and lower panel, respectively – are plotted against the logarithm
of optical depth. The curves are obtained by solving Eqs.(14.35) – homogeneous, isothermal, non-
magnetic, non-depolarizing atmosphere – for the transition (J = 0, Ju = 1) and for two different
values of .
)
Q(ν, Ω 3 wJ(2)J σ
p ≡ = √ u
. (14.36)
)
I(ν, Ω 2 2 − wJ(2)J σ
u
For the transition (J = 0, Ju = 1) – which implies wJ(2)J = 1 – and for = 10−4 ,
u
the published values of σ are 0.08294 (Bommier and Landi Degl’Innocenti, 1996)
and 0.08292 (Trujillo Bueno and Manso Sainz, 1999), corresponding to the p values
1
In Eq. (14.36), the reference direction for positive Q is parallel to the stellar surface (θ = 90◦ ,
γ = 90◦ in Table 5.6).
756 CHAPTER 14
Fig.14.8. The emerging profiles of the intensity (normalized to BP ) and of the ratio Q/I – left
and right panel, respectively – corresponding to the irreducible components of the source function
shown in Fig.14.7. The profiles, obtained from Eq.(14.33) for a heliocentric angle θ = 84◦ .3
(µ = 0.1), are plotted against the reduced frequency (ν0 − ν)/∆νD . The positive Q direction is
parallel to the stellar surface.
Fig.14.9. Limb-darkening curve for the fractional polarization at line center, corresponding to the
irreducible components of the source function of Fig.14.7.
0.09063 and 0.09061, respectively. These values are in agreement with the result
of Ivanov et al. (1997), who give for p the asymptotic expression
√
p = 0.09443 − 0.3805 .
Apart from the special case of Eqs. (14.35), it is important to remark that
Eq. (14.27) holds without any limitation on the optical depth variation of the
different parameters. The quantities , δu(K) , Hu , BP (T ), θB , and χB , which appear
either explicitly or implicitly (through the kernel KQQ K
) in this equation, can be
arbitrary functions of optical depth. A full analysis of the solution of Eq. (14.27) for
different atomic transitions and for different values (and depth dependence) of the
STELLAR ATMOSPHERES 757
above-mentioned parameters is well outside the purposes of this book. Some results
can be found in Bommier et al. (1991) and in Bommier and Landi Degl’Innocenti
(1996).
For further applications involving turbulent magnetic fields, it is convenient to
rewrite Eq. (14.27) by introducing as unknowns the irreducible components of
the source function expressed in the local magnetic frame.1 Such components
K
will be denoted by the symbol [SQ (tL )]M and are still given by Eq. (14.10), the
statistical tensors in the right-hand side being now defined in the local magnetic
frame. Referring to the geometry of Fig. 14.4, and denoting by RB the rotation that
carries the magnetic frame at tL into the fixed reference frame Σ (see Eq. (14.4)),
we have from Eqs. (7.76) and (7.77)
∗
K
SQ (tL ) = SQK
(tL )
M
DQ
K
Q (RB ) (14.37)
Q
(tL ) DQQ (RB ) .
K K K
SQ (tL ) M = SQ (14.38)
Q
∞
K
+ ĜKQ,K Q |tL − tL | SQ (tL )
M
dtL , (14.39)
K =0,2 Q 0
where
ĜKQ,K Q |tL − tL | =
K ∗
= DQQ
K
(RB ) GKQ ,K Q |tL − tL | DQ Q (RB ) , (14.40)
Q
RB being the rotation that carries the magnetic frame at tL into the frame Σ.
The kernels ĜKQ,K Q are a generalization of the multipolar kernels GKQ,K Q
introduced in Eq. (14.28). Since these quantities depend on the magnetic field’s
direction at different optical depths, in general they do not satisfy specific symme-
try properties. However, if the magnetic field’s direction is independent of optical
depth, and if the x-axis of the magnetic frame is chosen to lie in the plane con-
taining the vertical to the atmosphere and the magnetic field vector itself (γB = 0
in Eq. (14.4)), it can be shown, using Eqs. (14.29) and (2.68), that the kernels
ĜKQ,K Q are real and obey the symmetry property
∞ ∞
1
Ĝ00,00 (x) dx = , Ĝ00,2Q (x) dx = 0 ,
2
0 0
∞
7
Ĝ2Q,2Q (x) dx = δQQ W (J , J ) , (14.42)
20 2 u
0
and then to return to the fixed reference system Σ through Eqs. (14.37) and (14.38).
As a result of this back-transformation we obtain the further equivalent equation
K
SQ (tL ) = δK0 δQ0 BP (T )
∞
K
+ K
MQQ (tL ) GKQ ,K Q |tL − tL | SQ
(tL ) dtL , (14.43)
K =0,2 Q 0
where
1
∗
K
MQQ (tL ) = DQ
K
Q (RB ) DQ Q (RB )
K
.
Q
1 + + δu + i Q Hu
(K)
K
The quantity MQQ (tL ) now introduced bears a strong resemblance with the mag-
netic kernel MK
QQ B) defined in Eq. (5.166). Indeed, from Eqs. (14.4) and (2.71)
(
we have
1
K
MQQ (tL ) = DQQ
K
(χB θB γB ) DQ Q (−γB −θB −χB )
K
,
Q
1 + + δu + i Q Hu
(K)
1
Equations (14.42) are valid (provided the magnetic field’s direction is depth-independent)
irrespective of the value of γB .
STELLAR ATMOSPHERES 759
which is the same as Eq. (5.166) except for a slight difference in the denominator
of the last factor.
Equation (14.43) can conveniently be used to handle the case of an atmosphere
pervaded by a microturbulent magnetic field. By ‘microturbulent’ we mean here
that, at any optical depth, the magnetic field has an isotropic distribution of direc-
tions and that the direction at optical depth tL is uncorrelated with the direction
at optical depth tL , no matter how close the two points are in space. Under this
hypothesis, and under the further assumption that the quantities , δu(K) , BP , and
Hu are deterministic, one obtains, averaging over the distribution of magnetic fields
at each depth
K
SQ (tL ) = δK0 δQ0 BP (T )
∞
K
+ K
MQQ (tL ) GKQ ,K Q |tL − tL | SQ (tL ) dtL . (14.44)
K =0,2 Q 0
∞
(1 + + δu(2) ) S02 (tL ) = µ2 (tL ) G20,00 |tL − tL | S00 (tL ) dtL
0
∞
+ G20,20 |tL − tL | S02 (tL ) dtL . (14.46)
0
760 CHAPTER 14
Apart from the presence of the factor µ2 (tL ) in the right-hand side of the second
equation, this is the same as the system holding for S00 (tL ) and S02 (tL ) in a non-
magnetic atmosphere (cf. Eqs. (14.35)),1 and obviously reduces to it under the
limit Hu → 0. Formally, one can pass from the non-magnetic to the ‘turbulent’
atmosphere by the simple transformations
G20,00 |tL − tL | → µ2 (tL ) G20,00 |tL − tL |
G20,20 |tL − tL | → µ2 (tL ) G20,20 |tL − tL | .
1 3
µ2 (tL ) = 1+ .
4 1 + 4(Hu )2
The results obtained in the former section are restricted to a physical regime where
the magnetic field is either zero or, if present, is so weak that the Larmor frequency
νL is much smaller than the Doppler broadening ∆νD . When this inequality is not
satisfied, one enters a different regime that can be simply referred to as the ‘strong
field regime’.2 From one side, the Larmor frequency is much larger than the in-
verse lifetime of the upper level γu , and from the other side it is comparable with
or even larger than ∆νD . As explained in Sect. 13.1, the formalism developed in
this book – based on the flat-spectrum assumption – can still be applied in this
physical regime provided all interferences between different Zeeman sublevels are
neglected a priori. Under this approximation, the atom is described only in terms
of populations of Zeeman sublevels, i.e., only by the diagonal elements of the den-
sity matrix ραJ (M, M ) in the standard representation, or only by the components
ρK
0 (αJ) in the statistical tensor representation.
1 (2)
In Eqs. (14.35), δu was set to zero. Moreover, and BP (T ) were assumed to be
depth-independent, but this does not change the structure of the equations.
2
According to the classification proposed in Sect. 5.16, we deal here with regimes III, IV and
V of panel (a) in Fig. 5.18.
STELLAR ATMOSPHERES 761
In this section we present another example of the general non-LTE problem of
the 2nd kind by considering, similarly to the previous section, a two-level atom
without hyperfine structure and with unpolarized lower level. We suppose again
that a collection of such atoms is distributed within a static medium of arbitrary
shape, where it interacts with a strong magnetic field and with a population of
colliding particles having a Maxwellian distribution of velocities. We assume, as
in the former section, that the atoms themselves have a Maxwellian distribution
of velocities characterized by the parameter ∆νD , and that the radiation field is
weak so that stimulated emission can be neglected. Finally, we adopt again the
approximation of complete redistribution on velocities and, to avoid the introduc-
tion of a too heavy formalism, we suppose that the Doppler broadening ∆νD and
the magnetic field vector are constant throughout the medium, whereas we do not
impose any restriction on the spatial variation of the other physical parameters.
The derivation presented below follows closely the line of reasoning of the former
section. A similar derivation has been given by Landi Degl’Innocenti et al. (1991).
At a point P of the medium, having coordinate x, we write the statistical equi-
librium equation for the multipole moments of the density matrix, defined in the
reference system of the magnetic field. To this aim, we use for the radiative rates
the expressions developed in Sect. 7.4 and, in particular, Eqs. (7.20a) and (7.20e),
whereas for the collisional rates we adopt the same expressions as those of the for-
mer section (see Sect. 7.13.d). Recalling the hypothesis of the unpolarized lower
level, neglecting stimulated emission, taking into account the approximation of
complete redistribution on velocities through the ‘recipe’ of Eq. (13.8), and per-
forming some algebra along the lines of the former section, one obtains, in station-
ary situations, the following equation
c2 2Ju + 1
1 + + δu(K) ρK0 (αu Ju ) =
x 2hν03 2J + 1
× 3(2Ju + 1)(2K + 1)(2K + 1) (−1)1+Ju −M
K Mu M q
2
Ju J 1 Ju Ju K 1 1 K
×
−Mu M −q Mu −Mu 0 q −q 0
!
× J¯0K (να J Mu , α J M ) + δK0 BP (T ) ρ00 (α J ) x , (14.47)
u u
with J¯0K (να J M , α J M ) defined according to Eqs. (14.5) and (14.6).
u u u
The quantity in square brackets in the right-hand side of Eq. (14.47) can be ex-
pressed in terms of the convolution of the generalized profiles defined in Eq. (10.40)
with the velocity distribution of atoms. Such convolution has already been intro-
duced in Chap. 13 and denoted by the symbol Φ̂KK Q (J , Ju ; ν) – see Eq. (13.20).
Here we define a slightly different quantity by setting, for Q = 0
762 CHAPTER 14
Φ̃KK
0 (J , Ju ; ν) = 3(2Ju + 1)(2K + 1)(2K + 1)
2
1+Ju −Mu +q Ju J 1 Ju Ju K
× (−1)
−Mu M −q Mu −Mu 0
Mu M q
1 1 K
× p(να J M , α J M − ν) , (14.48)
q −q 0 u u u
where the profile p is given by Eq. (14.6).1 With this position, Eq. (14.47) takes
the form
c2 2Ju + 1
1 + + δu(K) ρK0 (αu Ju ) =
x 2hν03 2J + 1
∞ !
K
0
× dν Φ̃KK
0 (J ,
uJ ; ν) J0 (ν) + δ K0 BP (T ) ρ0 (α J ) x ,
K −∞
or, introducing the irreducible components of the two-level source function defined
in Eq. (14.10)2
1 + + δu(K) S0K (x ) M = δK0 BP (T )
∞
+ dν Φ̃KK
0 (J , Ju ; ν) J0K (ν) . (14.49)
K −∞
1
The two quantities Φ̂KK0 and Φ̃KK
0 just differ in the fact that the first is a weighted sum of
Voigt profiles centered at the individual frequencies of the Zeeman components of the transition,
while in the second the Voigt profiles are replaced by Gaussian profiles.
2 K (
Consistently with the notations employed in the former section, we use the symbol [SQ x)]M to
recall that the irreducible components of the source function are defined in the magnetic reference
system.
STELLAR ATMOSPHERES 763
The radiative transfer coefficients ηiA , ρAi , and εi can be found from Eqs. (7.21).
Taking into account the hypothesis of complete redistribution on velocities, ne-
glecting for consistency the finite width of the levels, and recalling the definition
of the symbol Φ̃KK
0 of Eq. (14.48), one has (see Eq. (A13.9))
= kA
ηiA (ν, Ω) Φ̃0K (J , J ; ν) ,
T 0K (i, Ω) (14.51)
L 0 u
K
where D is the Dawson function defined in Eq. (5.56). Finally (see Eqs. (A13.8),
(7.8), (14.10), (10.6), and (14.15))
= kA
εi (ν, Ω) S K (x ) Φ̃KK (J , J ; ν) .
T 0K (i, Ω) (14.54)
L 0 M 0 u
KK
The radiative transfer equation can be solved by the methods developed in Chap. 8.
Substituting Eq. (14.54) into Eq. (8.14) and referring to Fig. 14.5, one obtains, for
the Stokes parameters of the radiation propagating through point P along the
direction Ω
x
3
Ii (ν, Ω) = Oij (x, x ; ν) kLA (x ) S K (x ) Φ̃KK (J , J ; ν) ds
T 0K (j, Ω) 0 M 0 u
x0 j=0
KK
3
(b)
+ ,
Oij (x, x0 ; ν) Ij (ν, Ω) (14.55)
j=0
where Oij (x, x ; ν) is the evolution operator between points x and x, s is the
coordinate of x reckoned along the direction Ω, and I (b) (ν, Ω)
j
are the Stokes
parameters of the radiation entering the medium at point x0 and directed along Ω.
Because of the assumption of constant magnetic field and Doppler broadening, the
propagation matrix K A of Eq. (14.50) can be written, according to Eqs. (14.51)
and (14.52), in the form
,
K A = kLA H(ν, Ω) (14.56)
764 CHAPTER 14
where kLA depends on x, while the matrix H depends on frequency and direction
but is independent of x. It follows that the evolution operator has the form of
Eq. (8.22),
−τ ( x ) H(ν,Ω)
x,
Oij (x, x ; ν) = e , (14.57)
ij
where
x
τ (x, x ) = kLA (x ) ds .
x
Using Eq. (14.55), it is now possible to find an expression for the radiation field
tensor J0K (ν) to be substituted into the statistical equilibrium equation (14.49).
From the definition in Eqs. (5.157) one gets
J0K (ν) = J0K (ν) I + J0K (ν) E , (14.58)
5 x
K dΩ K
3 3
J0 (ν) I = T 0 (i, Ω) ds Oij (x, x ; ν) kLA (x )
4π i=0 j=0
x0
× S K (x ) Φ̃K K (J , J ; ν) ,
T 0K (j, Ω) 0 M 0 u
K K
5
K dΩ K
3 3
(b) .
J0 (ν) E = T 0 (i, Ω) Oij (x, x0 ; ν) Ij (ν, Ω)
4π i=0 j=0
Now we substitute Eq. (14.58) into Eq. (14.49) and transform the double integral
over dΩ and ds into an integral in d3 x (see Eq. (14.21)). This leads to the
following equation
∞
(K)
1 + + δu S0K (x ) M = δK0 BP (T ) + dν Φ̃KK
0 (J , Ju ; ν) J0K (ν) E
K −∞
kLA (x )
+ d3 x
G̃K0,K 0 (x, x ) S0K (x ) M , (14.59)
4π(x − x ) 2
K
STELLAR ATMOSPHERES 765
where
∞
K
G̃K0,K 0 (x, x ) = dν Φ̃KK
0 (J , Ju ; ν) Φ̃K
0 (J , Ju ; ν)
K K −∞
3
3
× O (x, x ; ν) T K (j, Ω)
T 0K (i, Ω) . (14.60)
ij 0
i=0 j=0
The quantities G̃K0,K 0 are a natural generalization (though restricted to the values
Q = Q = 0) of the multipole coupling coefficients GKQ,K Q defined in Eq. (14.24).
Their main properties are collected in App. 23.
Equation (14.59) is a system of linear, non-homogeneous, integral equations in the
unknowns [S0K (x )]M . When the local properties of the medium and the boundary
conditions are specified, the system can be solved, and the Stokes parameters of
the radiation emerging from the medium can be computed by means of Eq. (14.55).
Owing to property (A23.6), Eq. (14.59) decouples in two distinct sets of equations,
involving the irreducible components of the source function of even rank (K =
0, 2, . . .) and of odd rank (K = 1, 3, . . .), respectively. In the latter, the only
source terms may be provided by the radiation entering the medium through the
boundary, and it can be shown, using property (A13.4) of the generalized profiles,
that such terms are zero unless the boundary radiation field either contains some
net circular polarization, or is characterized by I, Q, and U Stokes parameters that
are not symmetrical about line center. Excluding these cases, all the components
[S0K (x )]M with K odd are everywhere zero in the medium, and one is left with the
K
unknowns [S00 (x )]M , [S02 (x )]M , . . . , [S0 max (x )]M , where
2Ju if Ju is integer
Kmax = (14.61)
2Ju − 1 if Ju is half-integer .
Similarly to the previous section, we can apply Eq. (14.59) to the special case of a
plane-parallel, semi-infinite stellar atmosphere. Introducing the line optical depth
of Eq. (14.25), and assuming that the atmosphere is not illuminated by external
sources of radiation, we obtain
1 + + δu(K) S0K (tL ) M = δK0 BP (T )
∞
+ G̃K0,K 0 |tL − tL | S0K (tL ) M dtL , (14.62)
K =0,2,...,Kmax 0
∞ ∞
∆νD
G̃K0,K 0 |tL − tL | =
dx dy G̃ x, x ) .
( (14.63)
4π(x − x )2 K0,K 0
−∞ −∞
766 CHAPTER 14
Performing some algebra similar to that developed in App. 21, Eq. (14.63) can be
cast in the form
π/2 2π
∆νD
G̃K0,K 0 (x) = dθ tan θ dχ
4π
0 0
3
3 ∞
K
× dν Φ̃KK
0 (J , Ju ; ν) Φ̃K
0 (J , Ju ; ν)
i=0 j=0 K K −∞
x ∆νD
− H(ν,Ω)
× T 0K (i, Ω) e cos θ .
T 0K (j, Ω) (14.64)
ij
The kernels G̃K0,K 0 (x) obey some important properties. The first is the symme-
try property
G̃K 0,K0 (x) = G̃K0,K 0 (x) , (14.65)
which follows directly from the analogous property of the generalized multipole
coupling coefficients (Eq. (A23.7)).
A second property concerns the integrals of some of the kernels G̃K0,K 0 . One
has
∞ ∞
1
G̃00,00 (x) dx = , G̃00,20 (x) dx = 0 . (14.66)
2
0 0
where H −1 is the inverse of the matrix H. On the other hand, from Eqs. (14.50),
(14.51), and (14.56), we have
Φ̃0K = H(ν, Ω)
(J , Ju ; ν) T 0K (i, Ω) , (14.67)
0 0i
K
so that
∞ π/2 2π
∞
1
K
.
G̃00,K 0 (x) dx = dθ sin θ dχ dν Φ̃K
0 (J , Ju ; ν) T 0K (0, Ω)
4π
0 0 0 K −∞
Taking finally into account Eq. (A13.3), and considering that (see Table 5.6)
The formalism developed in the former section is suitable to discuss in some detail
the special case of the non-LTE problem that has been classified in Sect. 14.1 as
non-LTE of order 1.5. This is an intermediate non-LTE regime where it is assumed
that the atomic system is not polarized whereas the radiation field, because of the
presence of a magnetic field, is polarized. As mentioned in Sect. 14.1, this physical
situation has received considerable attention in the literature because it is consid-
ered appropriate to describe the process of formation of weak or medium-strong
spectral lines in relatively dense and strongly magnetized plasmas, like sunspots
or the atmospheres of magnetic stars. Owing to the relatively high density of
the plasma, it is believed that depolarizing collisions are so efficient to completely
destroy atomic polarization, which considerably simplifies the description of the
atomic system.
Restricting our analysis to a two-level atom, the non-LTE problem of order 1.5
reduces to the solution of the following equation (obtained from Eq. (14.62) under
the limit δu(K) → ∞ for K = 2, 4, . . . , Kmax , with Kmax given by Eq. (14.61))1
∞
(1 + ) S00 (tL ) = BP (T ) + G̃00,00 |tL − tL | S00 (tL ) dtL , (14.68)
0
where we have omitted the bracket [. . .]M around the symbol S00 (tL ) because this
1
Equation (14.62) holds for a two-level atom with unpolarized lower level. In the present
regime, both levels are unpolarized by definition.
768 CHAPTER 14
component of the source function does not depend on the reference system (see
Eq. (14.38)).
The explicit expression of the kernel can be obtained from Eq. (14.64) with the
help of Eq. (14.67). One gets
π/2 2π ∞
∆νD
G̃00,00 (x) = dθ tan θ dχ dν H , (14.69)
4π
0 0 −∞
In order to simplify Eq. (14.68), two different approximations have been proposed
in the literature, referred to as the field-free and the polarization-free approxima-
tion, respectively. The former (Rees, 1969) is based on the substitution of the
quantity H with its limiting value obtained by setting to zero the magnetic field.
In other words, with evident notations
Hf.f. = lim H .
B→0
From Eqs. (14.51)-(14.52), using Eqs. (A23.2) and (A23.3), it can easily be seen
that under this limit the matrix H reduces to
H(ν, Ω) = δij p(ν0 − ν) ,
ij
where p(ν0 − ν) is the profile defined in Eq. (14.6). Substitution into Eq. (14.70)
shows that the kernel G̃00,00 takes the form
∞
1 2
G̃00,00 (x) = dv ϕ(v) E1 xϕ(v) ,
f.f. 2
−∞
where v and ϕ(v) are defined in Eq. (14.31), and where E1 is the first exponential
integral defined in Eq. (12.6). Obviously, under this approximation Eq. (14.68)
reduces to the standard equation of the non-LTE problem of the 1st kind, and
S00 (tL ) is just the corresponding source function.
The polarization-free approximation (Trujillo Bueno and Landi Degl’Innocenti,
1996) is in principle less restrictive. It consists in retaining the diagonal elements
of the matrix H(ν, Ω) while setting to zero the off-diagonal ones. In this approxi-
mation one has
H(ν, Ω) ,
= δij h0 (ν, Ω)
ij
A
= η0 (ν, Ω) =
h0 (ν, Ω) Φ̃0K (J , J ; ν) .
T 0K (0, Ω) 0 u
kLA
K
One then gets, substituting into Eqs. (14.69)-(14.70) and performing the integration
over χ
π/2 ∞
∆νD x ∆νD
2 e− cos θ
h0 (ν,Ω)
G̃00,00 (x) = dθ tan θ dν h0 (ν, Ω) .
p.f. 2
0 −∞
It should be remarked that, although the field-free and the polarization-free ap-
proximations have been discussed here in the special case of a two-level atom, they
are indeed more general and can be applied to multi-level atoms as well. Given a
model atmosphere, the field-free approximation, for instance, tackles the non-LTE
problem of order 1.5 through the following strategy: a) the statistical equilibrium
equations for the level populations are solved as if the magnetic field were not
present; b) from such values, and from the value of the magnetic field vector, one
finds the expressions of the radiative transfer coefficients; c) the radiative transfer
equation is finally solved for any desired line of sight. This procedure implies the
use of a standard non-LTE numerical code for the solution of the statistical equi-
librium equations (point a). The complications arising from the presence of the
magnetic field and from polarization only intervene at points b) and c), when the
self-consistency loop has already been solved. Given its simplicity, the field-free
approximation has been widely used in the past, especially to compute theoretical
Stokes profiles of weak or medium-strong spectral lines originating from magnetized
atmospheres of relatively high density, such as sunspots or Ap stars.
In Sects. 14.2 and 14.3 we have analyzed the non-LTE problem of the 2nd kind
for a two-level atom with unpolarized lower level. The hypothesis of the absence
of atomic polarization in the lower level, joined with the further hypothesis of ne-
glecting stimulated emission, has allowed us to reduce the general non-LTE problem
(summarized by the self-consistency loop of Fig. 14.1) to the solution of a linear
system of integral equations. However, this linearity property is strictly confined to
the particular case just mentioned. It is no longer valid both when a model atom
with more than two levels is considered and, even for a two-level model atom, when
either the approximation of the unpolarized lower level or the approximation of ne-
glecting stimulated emission cannot be applied. In such cases, the solution of the
self-consistency loop has to be performed by means of numerical algorithms, and
it is usually impossible to establish general analytical results.
As an illustration of the problems involved, we consider in this section the sim-
plest possible atomic model leading to a non-linear set of equations, namely a
770 CHAPTER 14
d 2
ρ (α J ) = −B(α J → αu Ju )
dt 0 x
& 1 '
× J¯02 (ν0 ) ρ00 (α J ) x + J¯00 (ν0 ) ρ20 (α J ) x − √ J¯02 (ν0 ) ρ20 (α J ) x
2
− CI(0) (αu Ju , α J ) + D(2) (α J ) ρ20 (α J ) x ,
where J¯Q
K
(ν0 ) is given by Eq. (14.5).
These two equations can be cast in a simpler form by introducing a scalar source
function, S(x ), defined by (cf. Eq. (14.10))
2hν03 √ ρ00 (αu Ju ) x
S(x ) = 2 3 0 , (14.71)
c ρ0 (α J ) x
and a parameter, σ(x ), describing the amount of alignment in the lower level
2
ρ0 (α J ) x
σ(x ) = 0 . (14.72)
ρ0 (α J ) x
1
The hypotheses are the following: static medium, atoms and colliding particles with
Maxwellian velocity distributions, constant Doppler broadening, weak radiation field (negligible
stimulated emission), flat-spectrum approximation, approximation of complete redistribution on
velocities, negligible continuum opacity.
STELLAR ATMOSPHERES 771
Use of Eqs. (10.49) and (7.8) allows one to rewrite the two equations, for stationary
situations, in the form
1
(1 + ) S(x ) = J¯00 (ν0 ) 1 + √ w̄ σ(x ) + BP (T )
2
1 w̄
σ(x ) = − √ , (14.73)
2 1− 1
2 w̄ + β + δ
√ J¯02 (ν0 )
w̄ = 2 ¯0 ,
J0 (ν0 )
is defined in Eqs. (14.7), BP (T ) is the Planck function in the Wien limit (see
Eq. (14.9)), and where the quantities β and δ are given by
B (T ) D(2) (α J )
β = ¯0P , δ = .
J0 (ν0 ) B(α J → αu Ju ) J¯00 (ν0 )
where kLA and p(ν0 − ν) are defined, respectively, in Eqs. (14.15) and (14.6), and
where h0 , h1 , and ε0 are given by
1
h0 = 1 + √ 3 cos2 θ − 1 σ(x )
2 2
3
h1 = − √ sin2 θ σ(x )
2 2
ε0 = S(x ) ,
θ being the angle between the z-axis and the direction Ω (see Fig. 14.4).
The radiative transfer equation can be solved through the methods outlined in
Chap. 8 (see Eq. (8.28)). For a plane-parallel, semi-infinite atmosphere, introducing
772 CHAPTER 14
the optical depth tL of Eq. (14.25) and setting µ = cos θ, one has, for the radiation
at optical depth tL propagating outward (µ > 0)
∞ / 0
ϕ(v) H+ (tL ,tL ) ϕ(v) H− (tL ,tL )
= 1 − − dtL
I(ν, Ω) ϕ(v) S(tL ) e µ
+ e µ
2 µ
tL
∞ / 0
ϕ(v) H+ (tL ,tL ) ϕ(v) H− (tL ,tL )
= 1 − − dtL
Q(ν, Ω) ϕ(v) S(tL ) e µ
− e µ
, (14.75)
2 µ
tL
where v and ϕ(v) are the reduced frequency and the Gaussian profile of Eq. (14.31),
and where
tL
1
H+ (tL , tL ) = 1 + √ 3µ2 − 2 σ(tL ) dtL
2
tL
tL
1
H− (tL , tL ) = 1 + √ σ(tL ) dtL .
2
tL
Equations (14.75) and (14.76) can be used to obtain the two components of the
radiation field tensor, J¯00 and J¯02 , through the usual expressions (see Eqs. (5.157)
and (14.5))
∞ 1
1
J¯00 (ν0 ) = dv ϕ(v)
dµ I(ν, Ω)
2
−∞ −1
∞ 1
1 2
J¯02 (ν0 ) = √ dv ϕ(v) dµ + 3 µ2 − 1 Q(ν, Ω)
3µ − 1 I(ν, Ω) .
4 2
−∞ −1
Substitution of these expressions into Eqs. (14.73) closes the non-LTE loop of the
2nd kind, which – as anticipated – shows a complicated non-linear character.
STELLAR ATMOSPHERES 773
Fig.14.10. The logarithm of the source function S of Eq.(14.71) normalized to BP , and the reduced
statistical tensor σ of Eq.(14.72) – upper and lower panel, respectively – are plotted against the
logarithm of optical depth for two different values of . The plots refer to a homogeneous,
isothermal, non-magnetic, non-depolarizing atmosphere and to the transition (J = 1, Ju = 0).
As pointed out in Sect. 14.1, a solution of the non-LTE loop can be found only
via sophisticated numerical methods. Some results, obtained through accelerated
iterative schemes, are presented in Figs. 14.10 and 14.11. We point out that the
surface value of lower-level atomic polarization, σ(0), obtained for a homogeneous,
isothermal, non-depolarizing atmosphere with = 10−4 , turns out to be equal to
−0.09171. This ratio is related to the fractional linear polarization observed in the
tangential direction by the expression, easily derived from Eq. (14.74)1
)
Q(ν, Ω 3 σ(0)
p ≡ =− √ , (14.77)
I(ν, Ω ) 2 2 − σ(0)
Fig.14.11. The emerging profiles of the intensity (normalized to BP ) and of the ratio Q/I corre-
sponding to the functions S(tL ) and σ(tL ) of Fig.14.10 are plotted against the reduced frequency
(ν0 − ν)/∆νD (solid line). The plots refer to a heliocentric angle θ = 84◦ .3 (µ = 0.1). The
positive Q direction is parallel to the stellar surface. The dotted curves in the right panel show,
for comparison, the Q/I profiles corresponding to the ‘reversed’ transition (J = 0, Ju = 1) – cf.
Fig.14.8.
which, for the σ(0) value quoted above, yields p = 0.09422. Note that both
|σ(0)| and p turn out to be larger than the corresponding values obtained for the
‘inverted’ transition (J = 0, Ju = 1) – see the discussion following Eq. (14.36).
This result shows that, in the absence of depolarizing collisions, dichroism is a very
efficient mechanism for introducing linear polarization in spectral lines.
It should be remarked that in recent times the non-LTE problem of the 2nd kind
has been solved for a variety of two-level model atoms with different values of J
and Ju , and also for more complicated atomic models involving up to 5 different
levels (Trujillo Bueno, 1999; Manso Sainz and Trujillo Bueno, 2001; Manso Sainz,
2002).
In the former sections we have shown how the self-consistency loop underlying the
non-LTE problem of the 2nd kind can be solved in some rather schematic physical
situations involving simple atomic models and highly idealized, unidimensional (or
plane-parallel) atmospheres. However, when trying to interpret real observations
from stellar atmospheres, and particularly solar observations, one has to be aware
of the presence of additional facts which tend to complicate, to some extent, the
physical scenario.
First of all, it should be kept in mind that the solar atmosphere is highly struc-
tured. Even considering the so-called ‘quiet’ atmosphere, the presence of con-
vective motions which manifest themselves through the well-known phenomena of
granulation and supergranulation raises serious doubts about the reliability of the
STELLAR ATMOSPHERES 775
1
For obvious symmetry reasons, it is impossible to detect this phenomenon in an isolated
star, because of the lack of spatial resolution. On the contrary, the phenomenon can show up
with a typical time-dependent signal when the star is eclipsed by a dark companion (see Landi
Degl’Innocenti et al., 1988, for an analysis of the time variability of the polarization signal during
the eclipse).
776 CHAPTER 14
Eq. (7.16a)1
hν
ηiA (ν, Ω) ph
= N h n() (2J + 1) B(αJ → α+ J+ , j, J )
4π
jJ
√ !
1 1 K ρK (αJ) . (14.78)
× 3 (−1)1+J+J +K TQ K
(i, Ω)
J J J Q
KQ
1
ν= Eα J + − EαJ ,
h + +
n() d is the number of quantum states of the free electron with energy contained
in the interval (, + d), and, finally
32π 4
B(αJ → α+ J+ , j, J ) = | αJ d α+ J+ , j, J |2 . (14.79)
3h2 c
8π 3 ν
ηiA (ν, Ω) ph
= N n() (2J + 1) | αJ d α+ J+ , j, J |2
3c
jJ
√ !
1 1 K ρK (αJ) . (14.80)
× 3 (−1)1+J+J +K TQ
K
(i, Ω)
J J J Q
KQ
It is interesting to connect this expression with the ‘classical’ expression (see e.g.
Mihalas, 1978) for the absorption coefficient due to photoionization. Such coeffi-
cient, denoted by kph (ν), is generally expressed in the form
kph (ν) = NαJ σph (ν) αJ , (14.81)
where NαJ is the number density of atoms in the photoionized level and [σph (ν)]αJ
is the photoionization cross section for the same level. From Eq. (14.80), assuming
the level (αJ) to be unpolarized ( ρK 0
Q (αJ) = δK0 δQ0 ρ0 (αJ) ), and taking into
account that
√
NαJ = 2J + 1 ρ00 (αJ) N , =1,
T 00 (0, Ω)
1
Equation (14.78) refers to a single photoionization process from a given level (αJ) to
a given level (α+ J+ ). Moreover, the sum over the final states also implies an integral over the
energy of the free electron. This integral is easily performed taking into account that the profile
φ contained in Eq. (7.16a) is practically a Dirac delta-function.
STELLAR ATMOSPHERES 777
and that !
1 1 0 1
= (−1)1+J+J ,
J J J 3(2J + 1)
one gets, by comparison with Eq. (14.81)
8π 3 ν
σph (ν) αJ = n() | αJ d α+ J+ , j, J |2 .
3c jJ
Recalling Eq. (10.11), this allows one to cast Eq. (14.80) in the more significant
form
A
ηi (ν, Ω) = NαJ σph (ν) αJ (−1)K WαJ (K)
() T Q
K σ K (αJ) , (14.82)
(i, Ω)
ph Q
KQ
Q (αJ)/ρ0 (αJ) and where the symbol WαJ (), which represents
K (K)
where σQ (αJ) = ρK 0
(K)
an extension of the symbol wJJ of Eq. (10.11) to photoionization, is defined by
(K)
wJJ | αJ d α+ J+ , j, J |2
jJ
WαJ () =
(K)
. (14.83)
| αJ d α+ J+ , j, J |2
jJ
The explicit evaluation of the reduced matrix elements appearing in Eq. (14.83)
is, in general, quite involved. In App. 24 we consider a particularly simple case
where the atomic states can be described in the L-S coupling scheme and where
the photoionized electron belongs, before photoionization, to an open shell.
We now turn to discuss scattering phenomena in the continuum, namely Thomson
scattering on free electrons and Rayleigh scattering on atoms or molecules. These
phenomena have been analyzed in great detail by Chandrasekhar (1950). Here we
just recall his conclusions and adapt his results to the formalism used in this book.
In the non-relativistic limit, i.e., at frequencies ν such that hν mc2 , m being
the electron mass, Thomson scattering is characterized by a frequency-independent,
non-dichroic cross section given by
8π e40
σT =
3 m 2 c4
(where e0 is the electron charge and c the speed of light), and by a scattering phase
matrix which, in our notations, has been denoted by the symbol Rij (Ω, Ω ) – or
Ω
Rij (Ω, ; 0) – and which is given by (cf. Eqs. (5.137), (5.138), and (10.21))
Ω
Rij (Ω, ) = (−1)Q T Q
K
(i, Ω) ) .
T K (j, Ω (14.84)
−Q
KQ
Moreover, it has to be taken into account that scattering is coherent in the reference
frame of the electron.
778 CHAPTER 14
Ne being the number density of electrons, and by an emission coefficient εi (ν, Ω)
which, taking into account the Doppler effect due to the motion of the electrons,
is given by1
5
dΩ
3
εi (ν, Ω) = Ne σT Ω
Rij (Ω, ) dν d3v f (v )
T.sc 4π j=0
v · Ω v · Ω
×δ ν −ν +ν − ν Ij (ν , Ω
) , (14.86)
c c
5
dΩ
3
εi (ν, Ω) = Ne σT Ω
Rij (Ω, ) dν
T.sc 4π j=0
(ν−ν )2
1 − 2
× e 2(1−cos Θ) ∆νe Ij (ν , Ω
) , (14.87)
2π(1 − cos Θ) ∆νe
where Θ is the angle between the directions Ω and Ω , and where ∆ν is the
e
Doppler broadening (in frequency units) corresponding to the thermal velocity of
the electrons, .
ν 2kB Te
∆νe = , (14.88)
c m
with kB the Boltzmann constant and Te the electronic temperature.
Equation (14.86) can be cast in a different form recalling Eq. (14.84) and the
definition of the radiation field tensor in the comoving frame. With easy transfor-
mations one gets
εi (ν, Ω) = Ne σT (−1)Q T Q
K
(i, Ω)
T.sc
KQ
, -
v · Ω
× d3v f (v ) J−Q
K
ν−ν , (14.89)
c
v
where the radiation field tensor in the comoving frame is given by Eq. (13.31).
1
In this formula, as well as in Eqs. (14.87) and (14.89), we suppose v/c 1.
STELLAR ATMOSPHERES 779
Ne → NH , σT → σR
(H)
, ∆νe → ∆νH ,
(H)
where NH is the number density of hydrogen atoms, σR is the frequency-dependent
Rayleigh scattering cross section, and ∆νH is the Doppler broadening (in frequency
units) corresponding to the thermal velocity of the hydrogen atoms. The explicit
(H)
expression for σR is quite involved (Kramers, 1924), but a good approximation at
optical wavelengths is provided by the formula
(H) ν4
σR = σT , (14.90)
(ν 2 − ν02 )2
5
dΩ
3
εi (ν, Ω) = Ne σT ) I (ν, Ω
Ω
R (Ω, )
T.sc 4π j=0 ij j
= Ne σT (−1)Q T Q
K J K (ν) ,
(i, Ω) (14.91)
−Q
KQ
The expressions now derived allow one to deduce the polarization properties of the
continuum spectrum of the solar atmosphere. In the plane-parallel approximation,
owing to the cylindrical symmetry of the environment, the transfer of polarized
radiation can be described by a simplified equation which, by a suitable choice
of the reference direction,1 only involves the two Stokes parameters I(ν, Ω) and
Q(ν, Ω). This follows from the fact that the only non-zero components of the
K K
tensor JQ (ν) – appearing in Eqs. (14.89) and (14.91) – and σQ (αJ) – appearing
in Eq. (14.82) – are those with (K = 0, Q = 0) and (K = 2, Q = 0).2 Taking
into account the expressions of the tensor T Q K – Table 5.6 – it can easily be
(i, Ω)
seen that the only non-vanishing components of [εi (ν, Ω)] A
T.sc and of [ηi (ν, Ω)]ph
are those with i = 0, 1, so that the transfer equation has the form
d
I(ν, Ω)
η0 (ν, Ω)
η1 (ν, Ω)
I(ν, Ω)
ε0 (ν, Ω)
=− + , (14.92)
ds
Q(ν, Ω)
η1 (ν, Ω)
η0 (ν, Ω)
Q(ν, Ω)
ε (ν, Ω)
1
where the transfer coefficients η0 , η1 , ε0 , and ε1 result from the sum of the contribu-
tions of all the processes responsible for absorption and emission in the continuum.
Since the degree of anisotropy in the solar atmosphere is everywhere weak, and
since the major source of continuum opacity at visible wavelengths (the H − ion)
contributes neither to dichroism nor to emission of polarized radiation, it follows
that
η (ν, Ω)
η (ν, Ω) , ε (ν, Ω)
ε (ν, Ω) ,
1 0 1 0
so that Eq. (14.92) can be solved by a perturbative approach. One thus gets the
zero-order equation
d = −η (ν, Ω)
I(ν, Ω)
+ ε (ν, Ω)
,
I(ν, Ω) 0 0 (14.93)
ds
d
= −η (ν, Ω)
Q(ν, Ω) Q(ν, Ω)
+ ε (ν, Ω)
− η (ν, Ω)
I(ν, Ω)
, (14.94)
0 1 1
ds
1
This choice is illustrated in Fig. 14.4, where z is the vertical to the atmosphere and where
the angle γ has to be set to π/2 (or 0).
2
The tensor components with K = 1 can be excluded through general considerations involv-
ing the absence of circular polarization inside the atmosphere (see, for instance, the discussion
following Eq. (14.23)). On the contrary, the presence of a magnetic field inclined with respect
to the vertical may introduce non-vanishing σQ 2 (αJ) components with Q = 0. When dichroism
phenomena are important, Eq. (14.92) is valid only in the absence of such fields.
STELLAR ATMOSPHERES 781
=
η0 (ν, Ω) NαJ σph (ν) αJ + Ne σT + NH σR
(H) (He)
+ NHe σR
αJ
=
η1 (ν, Ω) NαJ σph (ν) αJ WαJ
(2) σ 2 (αJ)
() T 02 (1, Ω) 0
αJ
=B
ε0 (ν, Ω) NαJ σph (ν) αJ + Ne σT + NH σR
(H) (He)
+ NHe σR J00 (ν)
P
αJ
dz ,
dtν = −η0 (ν, Ω)
z being the vertical to the atmosphere (directed upwards), and the continuum
source function
ε (ν, Ω)
Sc (ν) = 0 .
η (ν, Ω)0
We easily obtain
Sc (ν) = (1 − β) BP + β J00 (ν) , (14.96)
(H) (He)
Ne σT + NH σR + NHe σR
β= . (14.97)
NαJ σph (ν) αJ + Ne σT + NH σR (H) (He)
+ NHe σR
αJ
Integration of Eq. (14.93) gives, for the intensity of the radiation emerging from
the solar atmosphere
∞ t
− ν dtν
I(ν, Ω) = Sc (ν) e µ , (14.98)
µ
0
= Q(ν, Ω)
Q(ν, Ω)
+ Q(ν, Ω) ,
sc ph
1
where, recalling Eqs. (14.95) and (14.98), and the expression of the tensor T 02 (1, Ω)
∞
3 t
− µν dtν
Q(ν, Ω) sc
= √ 1 − µ2
β J02 (ν) e , (14.99)
2 2 µ
0
and
∞ NαJ σph (ν) αJ WαJ
(2)
() σ02 (αJ)
3
Q(ν, Ω) ph
= − √ 1 − µ2 1−β αJ
2 2 NαJ σph (ν) αJ
0 αJ
∞ t
− µν dtν dtν
× Sc (ν) e . (14.100)
µ µ
tν
The emerging intensity and the scattering contribution to the Q Stokes parameter
can be computed by standard numerical techniques from a given model of the solar
atmosphere. Since in the solar atmosphere the contribution of scattering processes
to the continuum opacity is small (β 1), one can simply set Sc (ν) = BP (ν) – see
Eq. (14.96). The emerging intensity can then be evaluated using Eq. (14.98), and
the tensor J02 (ν) at any optical depth can be obtained by a numerical quadrature.
The scattering contribution to Q can finally be calculated from Eq. (14.99). De-
tailed theoretical results on the fractional polarization, obtained through numerical
procedures similar to the one just described, can be found in Débarbat et al. (1970)
and in Fluri and Stenflo (1999).2
On the contrary, the contribution to the continuum linear polarization arising
from dichroism is more difficult to compute because of the presence in Eq. (14.100)
of the atomic polarization factors, σ02 (αJ), relative to all the atomic levels of differ-
ent atomic species from which significant photoionization can take place. Obviously,
the evaluation of the factors σ02 (αJ) requires the solution of the non-LTE problem
of the 2nd kind for multi-level atoms, a difficult subject which is presently in a
phase of development and upon which we will not dwell any longer in this book.
1
The reference direction for the Stokes parameters is defined here as in Fig. 14.4 with γ = π/2.
Positive Q means linear polarization parallel to the solar limb.
2
Note that very little progress has been made in this field from the observational side.
To the authors’ knowledge, the most complete measurements of continuum linear polarization at
the solar limb are still those by Leroy (1972). Such observations, however, were performed with
broad-band filters and are therefore contaminated by the spurious signals due to spectral lines.
Observations of the true continuum polarization in selected spectral windows have been presented
by Mickey and Orrall (1974) and by Wiehr (1975, 1978).
STELLAR ATMOSPHERES 783
1
= k (c) + k A 1 + √
η0 (ν, Ω) ν L 3 cos2 θ − 1 wJ(2)J σ02 (α J ) φ̂(ν0 − ν)
2 2 u
= kA √3
η1 (ν, Ω) L sin2 θ wJ(2)J σ02 (α J ) φ̂(ν0 − ν)
2 2 u
3 , -
= k (c) (1 − β) B + v · Ω
ε0 (ν, Ω) ν P β i d3vi f (vi ) J00 ν − ν i +
i=1
c
vi
1
The reference direction for the definition of the Stokes parameters is the same as for
of Fig. 14.4 with γ = π/2.
Eq. (14.92), namely the direction ea (Ω)
784 CHAPTER 14
, - !
1 3
v · Ω
+ √ 3 cos2 θ − 1 βi d3vi f (vi ) J02 ν − ν i
2 2 i=1
c
vi
1
+ kLA SL 1 + √ 3 cos2 θ − 1 wJ(2)J σ02 (αu Ju ) φ̂(ν0 − ν)
2 2 u
3 , -
3
= √ v · Ω
ε1 (ν, Ω) sin2 θ kν(c) βi d3vi f (vi ) J02 ν − ν i
2 2 i=1
c
vi
!
+ kLA SL wJ(2)J σ02 (αu Ju ) φ̂(ν0 − ν) .
u
(c)
In these expressions, kν is the continuum absorption coefficient due to photoion-
ization and scattering processes, kLA and SL are the line opacity and the line source
function defined, respectively, in Eqs. (14.15) and (14.12), σ02 (α J ) and σ02 (αu Ju )
are the reduced statistical tensors defined by
Fig. 14.4 (z being the vertical to the solar atmosphere), the profile φ̂(ν0 − ν) is
defined according to Eq. (14.1), BP (ν) is the Planck function, and βi (i = 1, 2, 3)
is the fractional contribution to the opacity coming from scattering by electrons,
H atoms, and He atoms, respectively
(H) (He)
Ne σT NH σR NHe σR
β1 = (c)
, β2 = (c)
, β3 = (c)
;
kν kν kν
finally (cf. Eq. (14.97))
3
β= βi .
i=1
In the solar atmosphere, the reduced statistical tensors of rank 2 are generally
very small ( σ02 (α J ), σ02 (αu Ju ) 1 ), as well as the scattering contribution to the
continuum opacity (β 1).1 This implies that the fractional linear polarization
Q/I is also small, so that the radiative transfer equation can be solved by a per-
turbative approach, similarly to the previous section. For the intensity one obtains
the zero-order equation
d = − k (c) + k A φ̂(ν − ν) I(ν, Ω)
+ k (c) B + k A S φ̂(ν − ν) , (14.101)
I(ν, Ω) ν L 0 ν P L L 0
ds
1
In the top layers of the atmosphere, and especially in the UV region of the spectrum, this
inequality may well be violated, β attaining values close to unity. This happens, however, in a
layer of very small optical thickness, that can be safely neglected for our considerations.
STELLAR ATMOSPHERES 785
whereas for the Q Stokes parameter one gets the first-order equation
3 , -
3 v · Ω
+ √ kν(c) sin2 θ βi d3vi f (vi ) J02 ν − ν i . (14.102)
2 2 i=1
c
vi
d ) = d Q(ν, Ω
)=0.
I(ν, Ω
ds ds
(c)
)= kν BP + kLA SL φ̂(ν0 − ν)
I(ν, Ω (c)
, (14.103)
kν + kLA φ̂(ν0 − ν)
)
Q(ν, Ω )
Q(ν, Ω )
Q(ν, Ω
= + ,
)
I(ν, Ω )
I(ν, Ω line
)
I(ν, Ω sc
, -
3
vi · Ω
βi d3vi f (vi ) J02 ν − ν
)
Q(ν, Ω 3 c
vi
= √ kν(c) i=1
. (14.105)
)
I(ν, Ω sc 2 2 (c)
kν BP + kL SL φ̂(ν0 − ν)
A
All the quantities in the right-hand sides of Eqs. (14.103), (14.104) and (14.105)
have to be evaluated at the top layers of the atmosphere (formally at τ = 0).
786 CHAPTER 14
)
Q(ν0 , Ω 3
= √ wJ(2)J σ02 (αu Ju ) − wJ(2)J σ02 (α J ) , (14.106)
)
I(ν0 , Ω line 2 2 u u
(c)
b) weak line (kLA φ̂(0) kν )
)
Q(ν0 , Ω 3 k A φ̂(0) S
= √ L (c) wJ(2)J σ02 (αu Ju ) L − wJ(2)J σ02 (α J ) . (14.107)
)
I(ν0 , Ω line 2 2 kν u BP u
Equation (14.106) has been proposed by Trujillo Bueno (1999) to give an estimate
of the polarization signal observed in spectral lines close to the solar limb, and to
point out the importance of lower-level atomic polarization in the solar atmosphere.
Considering a multiplet of lines sharing the same upper level, and neglecting lower-
level polarization, we see from Eq. (14.106) that the observed fractional polarization
in the different lines has to be proportional to the value of the corresponding symbol
wJ(2)J . Following Trujillo Bueno (1999), consider for instance the MgI triplet in the
u
green region of the solar spectrum. The three lines at 5167.3 Å, 5172.7 Å, and
5183.6 Å correspond to the transition between a common upper level 3s 4s 3 S1 and
a lower level 3s 3p 3 PoJ with J = 0, 1, and 2, respectively. In the absence of lower-
level polarization, the fractional linear polarization should therefore be proportional
(2)
to w10 (2)
, w11 (2)
, w12 respectively, or, according to Table 10.1, to 1.0, −0.5, 0.1. Since
the observed values of the limb fractional polarization in the three lines are very
far from satisfying these relations, it follows that lower-level polarization must play
a significant role.
It is also important to remark that, according both to Eq. (14.106) and to
Eq. (14.107), the line contribution to the fractional polarization may be either
positive or negative, because both expressions contain the difference between two
terms that may themselves be positive or negative.
Turning to the scattering contribution given by Eq. (14.105), it is important
to distinguish between Thomson scattering and Rayleigh scattering. In the first
case, we should keep in mind that the thermal velocities of electrons are at least
two orders of magnitude larger than the thermal velocities of most of the atoms
contributing to the Fraunhofer spectrum. The ratio between the thermal velocity of
an electron and that of a sodium atom, for instance, is 205, while for an iron atom
such ratio is 319. It follows that the integral over the electron velocities contained
in the numerator of the right-hand side of Eq. (14.105) completely washes out any
spectral detail of the tensor J02 (ν), yielding an expression of the form
STELLAR ATMOSPHERES 787
)
Q(ν, Ω 3
(c)
kν β1 J02 ν
= √ , (14.108)
)
I(ν, Ω T.sc 2 2 kν(c) BP + kLA SL φ̂(ν0 − ν)
is in practice the average of J02 (ν) over a broad frequency interval centered at the
frequency ν and having a width of the order of 2∆νe (cf. Eq. (14.87)), with ∆νe
given by Eq. (14.88).1 This phenomenon bears a strong resemblance with another
phenomenon that solar physicists have known for a long time: the photospheric
radiation scattered by electrons in the corona is indeed strongly polarized and has
a featureless spectrum – whence the name of K (or continuum) corona.
As Eq. (14.108) clearly shows, the Thomson scattering contribution to fractional
polarization is characterized by a frequency dependence typical of the absorption
lines of the usual intensity spectrum, with a positive wing value2 and a depression
at line center. Such depression is totally independent of the intrinsic polarizability
characteristics of the line, like atomic polarization or w(2) factors.
For the Rayleigh scattering contribution to fractional polarization similar argu-
ments can be repeated, and an equation strictly similar to Eq. (14.108) holds. The
only difference is that the factor β1 has to be replaced by β2 (for scattering on hy-
drogen atoms) or β3 (for scattering on helium atoms), whereas the average J02 ν is
still given by Eq. (14.109), the integral being performed on the hydrogen or helium
atom velocities, respectively. Since the hydrogen atoms have thermal velocities
that are several times (about 5 for sodium, 7 for iron, etc.) larger than the thermal
velocities of most of the atoms contributing to the Fraunhofer spectrum, it follows
that the frequency dependence of J02 (ν) will be substantially smoothed by the ve-
locities of the hydrogen atoms but not totally destroyed. A qualitative sketch of
the contribution to the fractional polarization arising from scattering processes is
shown in Fig. 14.12.
We want to remark that the results derived in this section are strongly dependent
on the approximation of plane-parallel atmosphere, thus they have to be used with
much care when interpreting polarimetric observations of the solar limb spectrum.
The non-LTE theory that we have introduced in this chapter, schematically sum-
marized in the self-consistency loop of Fig. 14.1 and usually referred to as ‘non-LTE
1
In terms of wavelength, and assuming an electronic temperature Te of 6000 K, the width of
the interval is about 10 Å at 5000 Å.
2
J02 (ν) is generally positive in the top layers of a stellar atmosphere (see Chap. 12).
788 CHAPTER 14
Fig.14.12. Schematic behavior, as a function of the reduced frequency (ν0 − ν)/∆νD , of the
fractional polarization introduced by Thomson and Rayleigh scattering (full and dotted line,
respectively) in the neighborhood of a spectral line of central frequency ν0 ; ∆νD is the Doppler
width of the profile φ̂(ν0 − ν) of Eq.(14.108). The plot refers to the radiation propagating along
the top layer of a plane-parallel atmosphere. The reference direction for positive Q is parallel
to the atmosphere, and the fractional polarization is normalized to the value of the continuum
spectrum.
of the 2nd kind’, rests, as a foundation stone, on the concept of atomic density ma-
trix. Such concept generalizes to polarized radiative transfer the usual concept of
level populations employed in the non-LTE theory of the 1st kind, and appears to
be well-suited to handle most transfer problems.
There are however some special cases where a different approach, not based on
the density matrix, can be followed. These are the cases where the basic interaction
process between radiation and atoms is a scattering process for which the emission
coefficient vector can be expressed by a linear relation of the form
5 ∞
3
dΩ
=k
εi (ν, Ω) s dν ; ν, Ω
Rij (ν, Ω ) I (ν , Ω
j
) , (14.110)
4π j=0
0
d = −k I (ν, Ω)
+ ε (ν, Ω)
.
I (ν, Ω) (14.111)
ds i s i i
As shown several times in this book, a linear equation similar to Eq. (14.110)
can in fact be established under severe restrictive hypotheses which imply two-level
(or two-terms) atomic models, the absence of atomic polarization in the lower level
(or lower term), and negligible stimulated emission. Under these circumstances,
the density matrix can be factored out and the non-LTE problem of the 2nd kind
reduces to the self-consistent solution of Eqs. (14.110) and (14.111). In other words,
there is no need to solve the statistical equilibrium equations for the density-matrix
elements because such solution is implicitly contained in Eq. (14.110). Obviously,
in these cases the theory provides explicit expressions for the coefficient ks and for
the redistribution matrix.
In general, an approach based on Eqs. (14.110) and (14.111), with suitable expres-
; ν, Ω
sions for ks and Rij (ν, Ω ), avoiding the introduction of the density matrix,
can be more appropriate for the description of certain phenomena. An important
example is the non-LTE problem concerning the interpretation of the extended
wings of some particularly strong spectral lines formed in the solar atmosphere.
Even the sophisticated redistribution matrices deduced in Chap. 13, which take
velocity/density-matrix correlations into account (see Eqs. (13.35) and (A18.1)-
(A18.2) for the non-magnetic case, and Eq. (13.44) for the magnetic case) are
unable to explain such wings, and have to be replaced by redistribution matrices
which include partial redistribution effects (in the atomic rest frame). The density-
matrix theory developed in this book, being based on the Markov approximation,
cannot account for such effects.
Much work has been devoted in the past to find the correct expression for the
redistribution matrix capable of taking into account, at the same time, partial redis-
tribution effects, depolarizing collisions, and the presence of a magnetic field. Im-
portant contributions to this subject have been given by Omont et al. (1972), Sten-
flo (1994), Bommier (1997a,b), and Landi Degl’Innocenti et al. (1997). Further ef-
forts have been devoted to find numerical, self-consistent solutions to Eqs. (14.110)-
(14.111) in plane-parallel atmospheres, either non-magnetized or magnetized, using
various forms of the redistribution matrix and often introducing ad hoc approxima-
tions on the matrix itself.1 Remarkable results on this subject have been obtained
by Rees and Saliba (1982), Saliba (1986), Faurobert (1987, 1988), and Nagendra
(1988, 1994) for the non-magnetic case, and by Faurobert-Scholl (1991, 1992) and
Nagendra et al. (1998) for the case of the Hanle effect.
1
From a historical point of view, it should be recalled that the first self-consistent solution
of Eqs. (14.110)-(14.111) in a plane-parallel atmosphere has been given, through semi-analytical
methods, by Chandrasekhar and Breen (1947). The problem solved by Chandrasekhar is that of an
atmosphere composed of Thomson-scattering free electrons; in this case one has (see Eqs. (14.85)
; νΩ
and (14.91)) ks = Ne σT , Rij (ν, Ω ) = R (Ω, ) δ(ν − ν ).
Ω
ij
APPENDIX
A1. A Fortran Code for Computing 3-j, 6-j, and 9-j Symbols
The following code computes 3-j, 6-j, and 9-j symbols.1 3-j symbols are evaluated
from Eqs. (2.19) and (2.22), 6-j symbols from Eq. (2.35), and 9-j symbols from
Eq. (2.48). The variables appearing as dummy arguments of the three Fortran-
functions W3JS, W6JS, W9JS are integers representing the values of the corre-
sponding quantum numbers multiplied by 2. For instance
1 3
2 2 2
= W3JS (4, 1, 3, −2, −1, 3)
−1 − 21 3
2
3 1 !
1 2 2
3 = W6JS (2, 3, 1, 3, 2, 4)
2 1 2
⎧ 1 3 ⎫
⎨2 2 2 ⎬
1 1
1 = W9JS (4, 1, 3, 1, 1, 2, 5, 2, 5) .
⎩ 25 2
5
⎭
2 1 2
The various conditions that must be satisfied by the arguments of each symbol are
explicitly checked within the code. If any of these conditions is not satisfied, the
code returns the value 0 for the symbol. The instruction CALL FACTRL must
appear in the main program before any of these Fortran-functions is used.
FUNCTION W3JS(J1,J2,J3,M1,M2,M3)
INTEGER Z,ZMIN,ZMAX
COMMON/FACT/FACT(0:301)
W3JS=0.0
IF(M1+M2+M3.NE.0) GOTO 1000
IA=J1+J2
IF(J3.GT.IA) GOTO 1000
IB=J1-J2
IF(J3.LT.IABS(IB)) GOTO 1000
JSUM=J3+IA
IC=J1-M1
ID=J2-M2
IF(MOD(JSUM,2).NE.0) GOTO 1000
IF(MOD(IC,2).NE.0) GOTO 1000
IF(MOD(ID,2).NE.0) GOTO 1000
IF(IABS(M1).GT.J1) GOTO 1000
IF(IABS(M2).GT.J2) GOTO 1000
IF(IABS(M3).GT.J3) GOTO 1000
IE=J3-J2+M1
IF=J3-J1-M2
1
The code is due to Stephen Jackson, formerly at the National Center for Atmospheric
Research, Boulder, Colorado.
792 APPENDIX
ZMIN=MAX0(0,-IE,-IF)
IG=IA-J3
IH=J2+M2
ZMAX=MIN0(IG,IH,IC)
CC=0.0
DO 200 Z=ZMIN,ZMAX,2
DENOM=FACT(Z/2)*FACT((IG-Z)/2)*FACT((IC-Z)/2)
1 *FACT((IH-Z)/2)*FACT((IE+Z)/2)*FACT((IF+Z)/2)
IF(MOD(Z,4).NE.0) DENOM=-DENOM
CC=CC+1.0/DENOM
200 CONTINUE
CC1=FACT(IG/2)*FACT((J3+IB)/2)*FACT((J3-IB)/2)
1 /FACT((JSUM+2)/2)
CC2=FACT((J1+M1)/2)*FACT(IC/2)*FACT(IH/2)
1 *FACT(ID/2)*FACT((J3-M3)/2)*FACT((J3+M3)/2)
CC=CC*SQRT(CC1*CC2)
IF(MOD(IB-M3,4).NE.0) CC=-CC
W3JS=CC
1000 RETURN
END
FUNCTION W6JS(J1,J2,J3,L1,L2,L3)
INTEGER W,WMIN,WMAX
INTEGER SUM1,SUM2,SUM3,SUM4
COMMON/FACT/FACT(0:301)
W6JS=0.0
IA=J1+J2
IF(IA.LT.J3) GOTO 1000
IB=J1-J2
IF(IABS(IB).GT.J3) GOTO 1000
IC=J1+L2
IF(IC.LT.L3) GOTO 1000
ID=J1-L2
IF(IABS(ID).GT.L3) GOTO 1000
IE=L1+J2
IF(IE.LT.L3) GOTO 1000
IF=L1-J2
IF(IABS(IF).GT.L3) GOTO 1000
IG=L1+L2
IF(IG.LT.J3) GOTO 1000
IH=L1-L2
IF(IABS(IH).GT.J3) GOTO 1000
SUM1=IA+J3
SUM2=IC+L3
SUM3=IE+L3
SUM4=IG+J3
IF(MOD(SUM1,2).NE.0) GOTO 1000
IF(MOD(SUM2,2).NE.0) GOTO 1000
IF(MOD(SUM3,2).NE.0) GOTO 1000
IF(MOD(SUM4,2).NE.0) GOTO 1000
WMIN=MAX0(SUM1,SUM2,SUM3,SUM4)
II=IA+IG
IJ=J2+J3+L2+L3
IK=J3+J1+L3+L1
APPENDIX 793
WMAX=MIN0(II,IJ,IK)
OMEGA=0.0
DO 200 W=WMIN,WMAX,2
DENOM=FACT((W-SUM1)/2)*FACT((W-SUM2)/2)*FACT((W-SUM3)/2)
1 *FACT((W-SUM4)/2)*FACT((II-W)/2)*FACT((IJ-W)/2)
2 *FACT((IK-W)/2)
IF(MOD(W,4).NE.0) DENOM=-DENOM
OMEGA=OMEGA+FACT(W/2+1)/DENOM
200 CONTINUE
THETA1=FACT((IA-J3)/2)*FACT((J3+IB)/2)*FACT((J3-IB)/2)
1 /FACT(SUM1/2+1)
THETA2=FACT((IC-L3)/2)*FACT((L3+ID)/2)*FACT((L3-ID)/2)
1 /FACT(SUM2/2+1)
THETA3=FACT((IE-L3)/2)*FACT((L3+IF)/2)*FACT((L3-IF)/2)
1 /FACT(SUM3/2+1)
THETA4=FACT((IG-J3)/2)*FACT((J3+IH)/2)*FACT((J3-IH)/2)
1 /FACT(SUM4/2+1)
THETA=THETA1*THETA2*THETA3*THETA4
W6JS=OMEGA*SQRT(THETA)
1000 RETURN
END
FUNCTION W9JS(J1,J2,J3,J4,J5,J6,J7,J8,J9)
KMIN=ABS(J1-J9)
KMAX=J1+J9
I=ABS(J4-J8)
IF(I.GT.KMIN) KMIN=I
I=J4+J8
IF(I.LT.KMAX) KMAX=I
I=ABS(J2-J6)
IF(I.GT.KMIN) KMIN=I
I=J2+J6
IF(I.LT.KMAX) KMAX=I
X=0.
DO 1 K=KMIN,KMAX,2
S=1.
IF(MOD(K,2).NE.0) S=-1.
X1=W6JS(J1,J9,K,J8,J4,J7)
X2=W6JS(J2,J6,K,J4,J8,J5)
X3=W6JS(J1,J9,K,J6,J2,J3)
X=X+S*X1*X2*X3*FLOAT(K+1)
1 CONTINUE
W9JS=X
RETURN
END
SUBROUTINE FACTRL
COMMON/FACT/FACT(0:301)
DATA NFAC/31/
FACT(0)=1.0
DO 10 I=1,NFAC
10 FACT(I)=FACT(I-1)*FLOAT(I)
RETURN
END
794 APPENDIX
This is simply the contraction of four 3-j symbols over five angular momentum
components, and closely resembles the right-hand side of Eq. (2.34) with e = e,
= . In Eq. (2.34), the only component which is not a summation index ()
appears in the first and fourth symbols, while the corresponding component (Q) in
Eq. (A2.1) appears in the third and fourth symbols. Thus we start, for instance,
from the third 3-j in Eq. (A2.1) and substitute k = c, k = d, K = e, q = γ, q = −δ,
Q = −. By so doing, the third 3-j symbol in Eq. (A2.1) takes the same form as
the first 3-j in Eq. (2.34).
Looking again at Eq. (2.34), we see that the second 3-j contains the column (d, δ)
which appears also (apart from a sign) in the first 3-j. According to our previous
substitution, d and δ correspond to k and −q , respectively, so that we must pick
the second 3-j in Eq. (A2.1) and reshape it, using the symmetry properties of the
3-j symbols, in one of the two equivalent forms
J J k k J J
→
−M M q −q −M M
k J J
→ (−1)k +J +J . (A2.2)
−q M −M
Of these two forms, it is the second one that brings the right-hand side of Eq. (A2.1)
into a form equivalent to the right-hand side of Eq. (2.34). Indeed, choosing the
second form, we substitute J = b, J = f , M = −β, M = −φ, and we can now
reshape the first 3-j in Eq. (A2.1) in the form
J J k k+J+J J k J
→ (−1)
−M M q −M −q M
to get, by the substitutions J = a, M = α and apart from a sign factor, the third
3-j in Eq. (2.34).1
1
Note that if the first form were chosen for the 3-j in Eq. (A2.2) (implying the substitutions
J = b, J = f , M = β, M = φ), it would be impossible to arrange the first 3-j in Eq. (A2.1)
in the form of either the third or the fourth 3-j in Eq. (2.34).
APPENDIX 795
we obtain, by the substitutions given above, the correct 3-j appearing at the fourth
place in Eq. (2.34).
Summarizing all the substitutions,
J =a M =α
J = f M = −φ
J = b M = −β
k=c q=γ
k = d q = −δ
K=e Q = − , (A2.3)
Now we must work on the sign factor to cast it in a form such that Eq. (2.34) can
be applied. In general, when performing this kind of calculations on sign factors
appearing in Racah-algebra expressions, a number of rules can be applied. If Σ is
an arbitrary linear combination of angular momentum and z-component quantum
numbers, these rules can be summarized as follows
I)
a b c a b c
(−1)a±α+Σ = (−1)−a∓α+Σ ,
α β γ α β γ
as (a ± α) is an integer;
II)
a b c a b c
(−1)α+β+γ+Σ = (−1)Σ ,
α β γ α β γ
as (α + β + γ) = 0 because of the presence of the 3-j symbol;
III)
a b c a b c
(−1)a±b±c+Σ = (−1)−a∓b∓c+Σ ,
α β γ α β γ
as the quantity (a ± b ± c) is an integer;
796 APPENDIX
IV)
(−1)3a+Σ = (−1)−a+Σ , (−1)4a+Σ = (−1)Σ ,
as 4a is an even integer.
With the help of these rules, the sign factor appearing in Eq. (A2.4) can be
transformed in the following way
(−1)a+3b+3c+d−e+f −β− =
S1 is the correct sign factor appearing in Eq. (2.34), while S2 can be transformed
by replacing, according to Rule I, (−b − β) by (b + β) and (−d − δ) by (d + δ).
Thus we obtain
S2 = (−1)a+4b+2c+2d−e+f −γ+δ− .
Now we can eliminate the terms 4b and (−γ + δ − ) according to Rules IV and II,
respectively, and we get
S2 = (−1)a+2c+2d−e+f .
Since S2 does not depend on the summation indices in Eq. (A2.4), we can apply
Eq. (2.34) to get
!
1 a b c
S = (−1)a+e+f ,
2e + 1 d e f
and recalling the substitutions (A2.3) we obtain the final result
! !
J+J +K 1 J J k J+J +K 1 J J K
S = (−1) = (−1) .
2K + 1 k K J 2K + 1 k k J
Obviously, the calculation scheme presented here is nothing but an example, and
different procedures can be followed to obtain the same result.
In classical physics, the momentum of the electromagnetic field is given by (see e.g.
Jackson, 1962)
1
P = ×B
E dV .
4πc V
APPENDIX 797
= curl A,
Since B the j-th Cartesian component of P is
1 ∂Am ∂Aj
Pj = Em − Em dV . (A3.1)
4πc m V ∂xj ∂xm
On the other hand, this integral can be transformed into a surface integral. As-
suming that the fields vanish at infinity, Eq. (A3.1) reduces to
1 ∂Am
Pj = E dV . (A3.2)
4πc m V m ∂xj
Similarly, writing the angular momentum of the electromagnetic field in the form
= 1
M × B)
r × (E dV ,
4πc V
one obtains for the j-th component of M
1 ∂Am ∂Al
Mj = jkl xk Em − Em dV .
4πc V ∂xl ∂xm
klm
and observing that the first integral in the right-hand side is zero we obtain
1 ∂Am 1
Mj = jkl xk Em dV + jkl Ek Al dV . (A3.3)
4πc V ∂xl 4πc V
klm kl
We now recall that in Quantum Mechanics the momentum operator p̂j and the
orbital angular momentum operator l̂j are given by
∂
p̂j = −ih̄
∂xj
∂
l̂j = −ih̄ jkl xk ,
∂xl
kl
798 APPENDIX
These expressions suggest quite naturally that the first term in the right-hand side
of Eq. (A3.4) can be regarded as the orbital angular momentum, and the second
term as the intrinsic angular momentum (or spin) of the radiation.
Now we introduce the formalism of second quantization to derive the expressions
of the operators Pˆ and Mˆ according to the Correspondence Principle. For the
operator Pˆ
we start from Eq. (A3.2). Substituting for A and E
the expressions of
the corresponding operators given by Eqs. (4.30) and (4.33), and carrying out the
volume integration along the same lines leading to Eq. (4.24), one obtains
ˆ hν λ) a† (ν, Ω,
λ) + a† (ν, Ω,
P = Ω a(ν, Ω, λ) a(ν, Ω,
λ) .
2c
ν Ωλ
− a† (ν, Ω, λ ) e (Ω)
λ) a(ν, Ω,
λ
∗ × e (Ω)
λ
. (A3.5)
The expression in square bracket can be related to the Stokes parameter operator
defined in Sect. 4.4. Let us consider, for instance, the representation (4.38)
V̂ (ν, Ω)
The cross products are
for the unit vectors eλ (Ω).
e+1 (Ω) ∗ = iΩ
× e (Ω)
+1
e−1 (Ω) ∗ = −i Ω
× e (Ω)
−1
∗ = e (Ω)
× e (Ω)
e+1 (Ω) ∗=0,
× e (Ω)
−1 −1 +1
APPENDIX 799
Although this result has been obtained using the specific unit vectors of Eqs. (4.38),
it can be shown directly that it holds also for the more general unit vectors defined
in Eqs. (1.41).
Equations (A3.6) and (A3.7) have a simple physical interpretation. They show
that each circularly polarized photon carries an intrinsic angular momentum of
magnitude h̄, parallel or antiparallel to the propagation direction Ω. It is also
seen that a positive (or right-handed) circular polarization corresponds to a nega-
tive projection of the angular momentum on the propagation direction (negative
helicity), and vice versa.
The inconsistency between the sign of circular polarization and the sign of helicity
is obviously due to the convention adopted in this book for the sign of circular
polarization. The reasons of our choice are mainly of practical nature and are
explained in Sect. 1.2. In any case, it should be remarked that even the definition
of positive helicity is purely conventional.
In Sect. 7.13.a we have seen that the hypothesis of isotropic collisions implies that
the transfer collisional rates CI (αJM M , α J M M ) must satisfy Eq. (7.84) for
any arbitrary rotation R of the reference system. Instead of transforming the
right-hand side of Eq. (7.84) via Eqs. (7.85), we can couple the rotation matrices
in a different way, writing
J ∗
DN M (R) DNJ
M (R) =
N −M J J K J J K
= (−1) (2K + 1) DPKQ (R)∗ ,
N −N P M −M Q
K
800 APPENDIX
J ∗
DN
J
M (R) D
N M (R) =
N −M J J K J J K
= (−1) (2K + 1) DPK Q (R)∗ ,
N −N P M −M Q
K
DPKQ (R)∗ DPK Q (R)∗ =
K K K K K K
= (2K + 1) DPK Q (R) . (A4.1)
P P P Q Q Q
K
Since the right-hand side must be independent of the rotation R, the index K can
only take the value K = 0. This implies K = K , P = −P , Q = −Q . We thus
obtain, using Eq. (2.26a)
CI (αJM M , α J M M ) = (−1)M −M (2J + 1)
J J K
J J K
× ΓI(K) (αJ, α J ) , (A4.2)
−M M Q −M M Q
K
where we have defined another set of multipole components of the collisional rates,
given by
2K + 1
ΓI(K) (αJ, α J ) =
2J + 1
N −N J J K J J K
× (−1)
−N N P −N N P
N N N N P
× CI (αJN N , α J N N ) .
The multipole components ΓI(K) are of course related to the multipole components
(K)
CI defined in Eq. (7.87). To find this relation we start from Eq. (7.87) and write
APPENDIX 801
with the inverse formula (that can be deduced with the help of Eq. (2.39))
2K + 1
ΓI(K) (αJ, α J ) =
(2J + 1)(2J + 1)
!
J J K
× (−1)J+J −K +K
(2K + 1) CI(K ) (αJ, α J ) .
J J K
K
J Ju K
J Ju K
× ΓS(K) (αJ, αu Ju )
−M Mu Q −M Mu Q
K
with
2K + 1
ΓS(K) (αJ, αu Ju ) =
2Ju + 1
Nu −Nu J Ju K J Ju K
× (−1)
−N Nu P −N Nu P
N N Nu Nu P
× CS (αJN N , αu Ju Nu Nu ) .
The multipole components CS(K) defined in Eq. (7.89) are related to these new
components by
J J K
J J K
× ΓE(K) (αJ)
−M M Q −M M Q
K
with
2K + 1
ΓE(K) (αJ) =
2J + 1
N −N J J K J J K
× (−1)
−N N P −N N P
NN N N P
and the relation with the multipole components CE(K) defined in Eq. (7.100) is
!
−K J J K
CE(K) (αJ) = (2J + 1) (−1)2J+K ΓE(K ) (αJ) . (A4.3)
J J K
K
When the interaction between the atomic system and the collider is described by
a single operator of rank K̃ we have
!
J J K
Ju Ju K̃
CS(K) (αJ, αu Ju ) = (−1)K ! CS(0) (αJ, αu Ju )
J J 0
Ju Ju K̃
and !
J J K
J J K̃
CE(K) (αJ) = (−1)K ! CE(0) (αJ) .
J J 0
J J K̃
−x Z
e ,
w
= (w1 , w2 , w3 ) , y = (y1 , y2 , y3 )
1 1
a = w
+ iy = w1 + i y1 , w2 + i y2 , w3 + i y3
2 2
∗ 1 1
b = a = w − i y = w1 − i y1 , w2 − i y2 , w3 − i y3 .
2 2
804 APPENDIX
= (A , A , A ) ,
A = (B , B , B )
B
1 2 3 1 2 3
defined by
⎛ ⎞
0 1 0 0
⎜ 1 0 0 0 ⎟
A1 = B1∗ = ⎝ ⎠ (A5.1a)
0 0 0 −i
0 0 i 0
⎛ ⎞
0 0 1 0
⎜ 0 0 0 i⎟
A2 = B2∗ = ⎝ ⎠ (A5.1b)
1 0 0 0
0 −i 0 0
⎛ ⎞
0 0 0 1
⎜ 0 0 −i 0⎟
A3 = B3∗ = ⎝ ⎠, (A5.1c)
0 i 0 0
1 0 0 0
+ b · B
Z = w0 1 + a · A , (A5.2)
where δjk is the Kronecker symbol and jkl is the antisymmetric tensor (defined
on p. 7). Since by a well-known theorem on matrices1
M1 + M2 M1 M2
e = e e if M1 , M2 = 0 , (A5.6)
−x Z −x w0 1 −x
a·A
−x b·B
e = e e e . (A5.7)
1
See e.g. Cohen-Tannoudji et al., 1977, p.170.
APPENDIX 805
Now we turn to the evaluation of the single factors appearing in this equation.
Recalling Eq. (8.21), and observing that 1 n = 1 , we have
∞
−x w0 1 (−1)n −x w0
e = (xw0 )n 1 = e 1 . (A5.8)
n=0
n!
where
1 2
a2 = a · a = w − y2 + 2 i w
· y (A5.9)
4
with
·w
w2 = w = w12 + w22 + w32
y 2 = y ·
y = y12 + y22 + y32 .
−x
a·A
x2 a2 x4 a4 x x3 a2 x5 a4 ,
e = 1+ + + ··· 1 − + + + · · · a · A
2! 4! 1! 3! 5!
−x
a·A
sinh(xa) ,
e = cosh(xa) 1 − a · A (A5.10)
a
where a is either of the two possible determinations of the square root of the
complex number a2 (note that Eq. (A5.10) is invariant under interchange of a
and −a ). In a similar way one gets
−x b·B
sinh(xb)
e = cosh(xb) 1 − b·B , (A5.11)
b
where
1 2
b2 = b · b = (a2 )∗ = w − y2 − 2 i w
· y (A5.12)
4
and where b is either of the two possible determinations of the square root of b2 .
1
The special case a2 = 0 will be considered later.
806 APPENDIX
Substitution of Eqs. (A5.8), (A5.10), and (A5.11) into Eq. (A5.7) gives
−x Z −x w0 sinh(xa)
e = e cosh(xa) 1 − a · A
a
sinh(xb)
× cosh(xb) 1 − b·B . (A5.13)
b
Now we choose the determination of a and b setting, in terms of the real numbers α
and β
1 1
a= α + iβ , b = a∗ = α − iβ , (A5.14)
2 2
with the supplementary condition
α ≥ 0. (A5.15)
The expressions for α and β can be found using Eqs. (A5.9) and (A5.14). One
gets the equation
w2 − y 2 + 2 i w
·
y = α2 − β 2 + 2 i αβ ,
α = Λ+ (w
, y ) β = σ Λ− (w
, y ) , (A5.17)
where1
Λ+ (w
, y ) = (w2 − y 2 )2 /4 + (w
· y )2 + (w2 − y 2 )/2
Λ− (w
, y ) = (w2 − y 2 )2 /4 + (w
· y )2 − (w2 − y 2 )/2 (A5.18)
· y
w
· y ) =
σ = sign (w . (A5.19)
|w
· y |
1
Note that when w · y = 0, σ is undetermined. In this case, however, being Λ− (
w , y ) = 0,
the final expression for exp(−xZ) is independent of σ (see Eqs. (A5.20)-(A5.21)).
APPENDIX 807
,
Finally, substituting Eq. (A5.17) into Eq. (A5.16) and evaluating the matrices a · A
b · B
, and their product, one gets the expression
−x Z −x w
e = e 0
1
× cosh x Λ+ (w
, y ) + cos x Λ− (w , y ) M1 (w , y )
2
− sin x Λ− (w , y ) M2 (w , y )
− sinh x Λ+ (w , y ) M3 (w , y )
!
1
+ cosh x Λ+ (w , y ) − cos x Λ− (w , y ) M4 (w , y ) , (A5.20)
2
M1 (w
, y ) = 1 (A5.21a)
M2 (w
, y ) = (A5.21b)
⎛ ⎞
0 Λ− w1 − σΛ+ y1 Λ− w2 − σΛ+ y2 Λ− w3 − σΛ+ y3
⎜ ⎟
1 ⎜ Λ− w1 − σΛ+ y1 0 σΛ+ w3 + Λ− y3 −σΛ+ w2 − Λ− y2 ⎟
= ⎜⎜
⎟
⎟
Θ ⎝ Λ− w2 − σΛ+ y2 −σΛ+ w3 − Λ− y3 0 σΛ+ w1 + Λ− y1 ⎠
Λ− w3 − σΛ+ y3 σΛ+ w2 + Λ− y2 −σΛ+ w1 − Λ− y1 0
M3 (w
, y ) = (A5.21c)
⎛ ⎞
0 Λ+ w1 + σΛ− y1 Λ+ w2 + σΛ− y2 Λ+ w3 + σΛ− y3
⎜ ⎟
1 ⎜ Λ+ w1 + σΛ− y1 0 −σΛ− w3 + Λ+ y3 σΛ− w2 − Λ+ y2 ⎟
= ⎜ ⎟
Θ⎜⎝ Λ+ w2 + σΛ− y2 σΛ− w3 − Λ+ y3 0
⎟
−σΛ− w1 + Λ+ y1 ⎠
Λ+ w3 + σΛ− y3 −σΛ− w2 + Λ+ y2 σΛ− w1 − Λ+ y1 0
M4 (w
, y ) =
⎛ ⎞
t2 w3 y2 − w2 y3 w1 y3 − w3 y1 w2 y1 − w1 y2
⎜ ⎟
2 ⎜ −w3 y2 + w2 y3 w12 + y12 − t2 w1 w2 + y1 y2 w3 w1 + y3 y1 ⎟
= ⎜ ⎟ (A5.21d)
Θ⎜⎝ −w1 y3 + w3 y1 w1 w2 + y1 y2 w22 + y22 − t2
⎟
w2 w3 + y2 y3 ⎠
−w2 y1 + w1 y2 w3 w1 + y3 y1 w2 w3 + y2 y3 w32 + y32 − t2
Equation (A5.20) can also be written in a different form, which involves only
exponential factors (although complex)
−x Z −x [ w0 + Λ+ (w,
y)] −x [ w0 − Λ+ (w,
y)]
e = e N1 (w
, y ) + e N2 (w
, y )
−x [ w0 + i Λ− (w,
y)] −x [ w0 − i Λ− (w,
y)]
+e N3 (w
, y ) + e N4 (w
, y ) , (A5.23)
where
1
N1 (w
, y ) = M1 (w
, y ) + 2 M3 (w
, y ) + M4 (w
, y )
4
1
N2 (w
, y ) = M1 (w
, y ) − 2 M3 (w
, y ) + M4 (w
, y )
4
1
N3 (w
, y ) = M1 (w
, y ) − 2 i M2 (w
, y ) − M4 (w
, y )
4
1
N4 (w
, y ) = M1 (w
, y ) + 2 i M2 (w
, y ) − M4 (w , y )∗ . (A5.24)
, y ) = N3 (w
4
The explicit expressions of the matrices Ni are rather cumbersome and will not
be given here.
It should be noticed that the matrices Mi (w , y ) defined in Eqs. (A5.21b,c,d)
– and hence the expression (A5.20) for exp(−xZ) – are ill-defined when Θ = 0,
that is
Λ+ (w , y ) = Λ− (w
, y ) = 0 ,
which implies (see Eq. (A5.22))
w2 = y 2 and · y ) = 0 .
(w
a2 = b 2 = 0 ,
Substituting into Eq. (A5.7) and applying the same procedure as before, one finds
that Eq. (A5.20) must be replaced, in this particular case, by the expression
−x Z −x w0 1 2
e = e 1 − x G(w
, y ) + x G(w
, y )2 ,
2
where ⎛ ⎞
0 w1 w2 w3
⎜w −y2 ⎟
⎜ 0 y3 ⎟
, y ) = ⎜ 1
G(w ⎟ . (A5.25)
⎝ w2 −y3 0 y1 ⎠
w3 y2 −y1 0
APPENDIX 809
This expression can also be obtained from Eqs. (A5.20)-(A5.21) performing the
, y ) → 0, Λ− (w
limit Λ+ (w , y ) → 0.
X −1 K X = K , (A6.1)
where K is the propagation matrix defined in Eq. (8.3) and K is the diagonal
matrix
Kij = λi δij (i, j = 1, 2, 3, 4) , (A6.2)
λi being the eigenvalues of K (given by Eqs. (8.33)), that we assume to be distinct
(see footnote on p. 358).
The matrices X and X −1 can be found as follows. First we determine the
eigenvectors u(i) of the matrix K , defined by
Using these eigenvectors we construct a matrix Z whose columns are the four
eigenvectors u(i) each multiplied by an arbitrary constant ci ,
(j)
Zij = cj ui . (A6.4)
From Eqs. (A6.2) and (A6.3) it follows that the matrix Z satisfies the equation
K Z = Z K . (A6.5)
Then we find the ‘left’ eigenvectors v (i) of the matrix K , defined by1
(i) † †
v K = λi v (i) , (A6.6)
and construct a matrix Y whose rows are the four eigenvectors (v (i) )† each mul-
tiplied by an arbitrary constant di ,
(i)
Yij = di vj . (A6.7)
Y K = K Y . (A6.8)
1
It can be easily proved that the quantities λi in Eqs. (A6.6) and (A6.3) are the same.
810 APPENDIX
If we now consider the product Y K Z , we have from Eqs. (A6.5) and (A6.8)
Y K Z = Y Z K = K Y Z ,
which proves that the matrix Y Z commutes with the diagonal matrix K . Since
the four eigenvalues of K have been assumed to be non-degenerate, it follows
that Y Z is also diagonal. On the other hand, the diagonal elements of Y Z are
given by
4
4
(i) (i)
(Y Z)ii = Yik Zki = ci di vk uk (i = 1, 2, 3, 4) .
k=1 k=1
Thus we get
Y Z =1
if we choose the coefficients ci , di in such a way that
1
ci di = . (A6.9)
%
4
(i) (i)
vk uk
k=1
X = Z, X −1 = Y ,
with Z and Y given by Eqs. (A6.4) and (A6.7), respectively, and with the coeffi-
cients ci and di satisfying Eq. (A6.9).
Let’s now apply this method to find an explicit expression for the matrices X
and X −1 . The first step is to determine the eigenvectors of the propagation ma-
trix K . From Eq. (A6.3) we have
⎧
⎪ (ηI − λ) u1 + ηQ u2 + ηU u3 + ηV u4 = 0
⎪
⎪
⎨ η u + (η − λ) u + ρ u − ρ u = 0
Q 1 I 2 V 3 U 4
(A6.10)
⎪
⎪ η u − ρ u + (η − λ) u + ρ Q u4 = 0
⎪
⎩
U 1 V 2 I 3
ηV u1 + ρU u2 − ρQ u3 + (ηI − λ) u4 = 0 ,
The left eigenvectors are found in a strictly similar way; their expression is obtained
by substituting ρQ → − ρQ , ρU → − ρU , ρV → − ρV in Eqs. (A6.12).
From the left eigenvectors we then construct the matrix X −1 without any mul-
tiplicative factor (the coefficients di in Eq. (A6.7) are set to 1). This is just the
matrix given in Eq. (8.42a). To construct the matrix X we must first evaluate the
coefficients ci . Using the compact notations of Eqs. (8.31) and (8.41) we have
4
(i) (i) 2
uk vk = (λi − ηI )2 (λi − ηI )2 + ρ2 + (λi − ηI )4 η 2
k=1
− (λi − ηI )2 ζ 2 + 2 (λi − ηI )2 (η · ρ
)2 + (η · ρ
)2 ρ2 ,
4
(i) (i)
uk vk = 2 (λi − ηI )4 η 2 + 2 (λi − ηI )2 (η · ρ
)2 + (η · ρ
)2 ρ2 .
k=1
The coefficients ci then follow from Eq. (A6.9), and this leads to the matrix X as
given in Eq. (8.42b).
It should be emphasized that the expressions for X and X −1 given in Eqs. (8.42)
are valid provided two conditions are satisfied, namely: i) the four eigenvalues of
1
Expressions (A6.12) are correct only if the determinant of the system (A6.11) is non-
zero. However, the system (A6.10) can always be reduced – provided the four eigenvalues λi are
distinct – to a system of three equations in three unknowns having a non-zero determinant. In
other words, the matrix of the coefficients of the system (A6.10) is of rank 3.
812 APPENDIX
the propagation matrix are distinct, and ii) the determinant of the system (A6.11)
is non-zero. As apparent from Eqs. (8.33), the first condition means that both
Λ+ (
η ,ρ
) and Λ− (
η,ρ
) must be non-zero, thus it is equivalent to (see Eqs. (A5.18))
η·ρ
= 0 . (A6.13)
ρ2 − Λ− ( )2 = 0 ,
η ,ρ
η 2 ρ2 = (
η·ρ
)2 . (A6.14)
Expressions (8.42) are correct only if conditions (A6.13) and (A6.14) are satisfied.
To shorten notations we write the coefficients E (k) (s, s ), introduced in Eq. (8.76),
in the form
E (k) (s, s ) = Xk + i Yk (k = 0, 1, 2, 3) , (A7.1)
with Xk and Yk real. Using Eqs. (8.62) and (8.63), the transfer equation (8.78)
becomes
⎛ ⎞ ⎛ ⎞⎛ ⎞
X0 ηI ηQ ηU ηV 0 −ρQ −ρU −ρV X0
⎜X ⎟ ⎜η −ρ −ρ −ηU ⎟ ⎜ ⎟
⎜ 1⎟ ⎜ Q η I ρ V U Q 0 ηV ⎟ ⎜ X1 ⎟
⎜ ⎟ ⎜ ⎟⎜ ⎟
⎜ X2 ⎟ ⎜ ηU −ρV ηI ρQ −ρU −ηV 0 ηQ ⎟ ⎜X ⎟
⎜ ⎟ ⎜ ⎟⎜ 2 ⎟
⎜ X ⎟ ⎜ −ρQ −ρV −ηQ 0 ⎟ ⎜ ⎟
d ⎜ 3⎟ ⎜ ηV ρU ηI ηU ⎟ ⎜ X3 ⎟
2 ⎜ ⎟= − ⎜ ⎟⎜ ⎟.
ds ⎜ Y0 ⎟ ⎜ 0 ρQ ρU ρV ηI ηQ ηU ηV ⎟⎜ Y0 ⎟
⎜ ⎟ ⎜ ⎟⎜ ⎟
⎜Y ⎟ ⎜ρ −ηV −ρU ⎟ ⎜ ⎟
⎜ 1⎟ ⎜ Q 0 ηU ηQ ηI ρV ⎟⎜ Y1 ⎟
⎜ ⎟ ⎜ ⎟⎜ ⎟
⎝ 2⎠
Y ⎝ ρU ηV 0 −ηQ ηU −ρV ηI ρQ ⎠⎝ Y2 ⎠
Y3 ρV −ηU ηQ 0 ηV ρU −ρQ ηI Y3
where K is the propagation matrix (Eq. (8.3)) and K is the matrix obtained
from K setting ηI = 0 and substituting
ηQ → ρQ ρQ → − ηQ
ηU → ρU ρU → − ηU
ηV → ρV ρV → − ηV .
APPENDIX 813
As far as Eq. (8.77) is concerned, the trace of the product of four Pauli spin
matrices is easily evaluated using Eqs. (8.56), (8.57), (8.62) and the relation
ijk lmk = δil δjm − δim δjl .
k
We get
Tr τi τj τk τl = 2 δij δkl − δik δjl + δil δjk (i, j, k, l = 1, 2, 3)
(if one of the indices is zero, the trace is given directly from Eq. (8.62)). Substitution
into Eq. (8.77) yields the expressions of the matrix elements Oij in terms of the
quantities Xk , Yk defined in Eq. (A7.1)
Let us consider two adjacent intervals (τk−1 , τk ) and (τk , τk+1 ), and their mid-
points
τk−1 + τk τk + τk+1
τ̃k−1 = , τ̃k = (k = 2, 3, . . . , N − 1) . (A8.1)
2 2
Using a standard finite-differences formula we can write
dG A−1 −1
k−1 + Ak Gk − Gk−1
A−1 =
dτr τr = τ̃k−1 2 τk − τk−1
dG A−1 −1
k + Ak+1 Gk+1 − Gk
A−1 = , (A8.2)
dτr τr = τ̃k 2 τk+1 − τk
Substitution of Eqs. (A8.1) and (A8.2) into Eq. (A8.3) leads to the linear system
where
A−1
k−1 + Ak
−1
x(k) =
(τk − τk−1 ) (τk+1 − τk−1 )
A−1
k + Ak+1
−1
z (k) =
(τk+1 − τk ) (τk+1 − τk−1 )
u(k) = bk . (A8.5)
Boundary conditions can also be written in the form of difference equations. For
the first interval we can write
dG A−1 + A−1 G2 − G1
A−1 = 1 2
.
dτr τr = τ̃1 2 τ2 − τ1
On the other hand, following a method suggested by Auer (1967) for the scalar
case, we have by a first-order expansion
!
dG dG d dG τ2 − τ1
A−1 = A−1 + A−1 .
dτr τ = τ̃ dτr τ = τ dτr dτr τ =τ 2
r 1 r 1 r 1
APPENDIX 815
Equating the expressions in the right-hand sides and applying the boundary con-
dition (9.185) at τr = τ1 we obtain, with the help of Eq. (9.183)
A−1 + A−1 G2 − G1 τ − τ1
1 2
= G1 + A1 G1 − b1 2 ,
2 τ2 − τ1 2
where
A−1 + A−1
z (1) = 1 2
2 (τ2 − τ1 )2
1 1
y (1) = z (1) + 1 + A1
τ2 − τ1 2
1
u(1) = b . (A8.7)
2 1
Performing analogous calculations at τr = τb = τN , one obtains the equation
where
A−1
N −1 + AN
−1
x(N ) =
2 (τN − τN −1 )2
1 1
y (N ) = x(N ) + 1 + AN
τN − τN −1 2
1 1
u(N ) = b + I (b) . (A8.9)
2 N τN − τN −1
G1 = g1 + Γ1 G2 , (A8.10)
where
−1
g1 = y (1) u(1)
−1
Γ1 = y (1) z (1) . (A8.11)
816 APPENDIX
Substitution of Eq. (A8.10) into the first of Eqs. (A8.4) and left-multiplication by
[ y (2) − x(2) Γ1 ]−1 gives G2 as a function of G3 . Iteration of this procedure leads
to the equations
Gk = gk + Γk Gk+1 , (A8.12)
with
−1
gk = y (k) − x(k) Γk−1 u(k) + x(k) gk−1
−1
Γk = y (k) − x(k) Γk−1 z (k) , (A8.13)
Gk = Γk Gk+1 (k = 1, 2, . . . , N − 1) . (A8.14)
This formula gives a precise physical meaning to the matrix Γk . For the particular
atmosphere that we are considering, the emission vector b(τr ) is zero and – because
of the boundary condition (9.184) – the vector Ĩ is also zero at any optical depth.
Therefore, at any optical depth we have from Eqs. (9.181) G = I / 2, so that
Eq. (A8.14) can be rewritten in the form
Ik = Γk Ik+1 .
Comparison with Eq. (9.165) – written for the case b(τr ) = 0 – shows that Γk gives
a numerical representation of the evolution operator associated with the interval
(τk , τk+1 ), a result that was pointed out by Rees et al. (1989),
Γk ≈ O(τk , τk+1 ) .
APPENDIX 817
Let’s now assume, in addition to the pure absorption hypothesis, that the propa-
gation matrix A is independent of optical depth. It follows from Eqs. (A8.7) and
(A8.11) that
−1
1 2
Γ1 = A−1 + τ2 − τ1 1 + τ2 − τ1 A A−1 ,
2
whence
1 2
Γ1−1 = 1 + τ2 − τ1 A + τ2 − τ1 A2 . (A8.15)
2
Expansion of Γ1 into power series of A
∞
Γ1 = c A
=0
and use of the relation Γ1 Γ1−1 = 1 , with Γ1−1 given by Eq. (A8.15), yields
1 2 1 4
Γ1 = 1 − τ2 − τ1 A + τ2 − τ1 A2 − τ2 − τ1 A4 + · · · (A8.16)
2 4
On the other hand, the analytical expression of the evolution operator associated
with the interval (τ1 , τ2 ) is (cf. Eq. (9.103))
−(τ2 − τ1 ) A
O(τ1 , τ2 ) = e ,
thus the expression for Γ1 provided by the Feautrier method is correct up to second-
order terms.
It can be shown by recursion that this statement is valid for all the matrices
Γk (k = 1, 2, . . . , N − 1). Considering for instance the matrix Γ2 , we have from
Eqs. (A8.5) and (A8.13), with easy algebra
τ3 − τ1 1 τ − τ2
Γ2−1 = 1 + τ3 − τ2 τ3 − τ1 A2 − 3 Γ .
τ2 − τ1 2 τ2 − τ1 1
Substitution of Eq. (A8.16) and expansion of Γ2 into power series of A lead to the
expression
1 2
Γ2 = 1 − τ3 − τ2 A + τ3 − τ2 A2
2
1 3
− τ3 − τ2 τ2 − τ1 + τ3 − τ2 )3 A4 + · · · ,
4
which should be compared with the analytical result
−(τ3 − τ2 ) A
O(τ2 , τ3 ) = e .
818 APPENDIX
so that
d
I(y) = I(y) − s(y) . (A9.2)
dy
Let us introduce a grid of N points (y1 , y2 , . . . , yN ) not necessarily evenly spaced.
Integration of Eq. (A9.2) in the interval (yk , yk+1 ) gives
yk+1
−(y− yk ) −(yk+1 − yk )
I(yk ) = e s(y) dy + e I(yk+1 ) . (A9.3)
yk
(yk+1 − y) sk + (y − yk ) sk+1
s(y) = , (A9.4)
yk+1 − yk
with sk = s(yk ). Substituting Eq. (A9.4) into Eq. (A9.3) and evaluating analyti-
cally the integral, one gets, with the help of Eq. (9.190)
Ik = pk + Πk Ik+1 , (A9.5)
with
∆k + k − 1
∆k = yk+1 − yk mk =
∆k
−∆k 1 − k (∆k + 1)
k = e nk = . (A9.7)
∆k
Comparison of Eqs. (A9.5) and (9.165) shows that Πk gives a numerical represen-
tation of the evolution operator associated with the interval (yk , yk+1 ),
Πk ≈ O(yk , yk+1 ) .
APPENDIX 819
Once the boundary condition IN = I (b) is specified (see footnote 1 on p. 441), the
emerging Stokes vector I1 can be determined by recursive application of Eq. (A9.5).
To test the accuracy of the DELO solution against the step length, we assume
the matrix A to be independent of y in the interval (yk , yk+1 ) and compare Πk
with the exact analytical expression of the evolution operator. From Eqs. (A9.6)
we have −1
Πk = 1 + mk A k 1 − nk A .
1 + mk A )−1 into power series of A yields
Expansion of the matrix (1
∞
−1
1 + mk A =1 + (−1)r mrk A r ,
r=1
whence ∞
nk
Πk = k 1 + (−1) mk +r
mr−1
k A r .
r=1
k
∆k ∆2 ∆3
mk = − k + k + O(∆4k )
2 6 24
∆k −∆k
nk e + e −2 ∆3k
mk + = = ∆k + + O(∆5k ) ,
k ∆k 12
1 2 2 ∆3
Πk = k 1 − ∆k A + ∆k A − k A + 2A 2 + 3A 3 + O(∆4k ) . (A9.8)
2 12
On the other hand, the analytical expression of the evolution operator is (see
Eqs. (9.187), (9.104) and (A9.1))
−(yk+1 − yk ) −(yk+1 − yk ) A −∆k A
O(yk , yk+1 ) = e e = k e
1 2 2 1
= k 1 − ∆k A + ∆k A − ∆3k A 3 + O(∆4k ) .
2 6
Comparison with Eq. (A9.8) shows that the expression for Πk resulting from the
DELO method is correct up to second-order terms in ∆k A . The ‘error’ is
−∆k ∆3k
Ek = O(yk , yk+1 ) − Πk = e A + 2A 2 + A 3 + O(∆4k ) .
12
Since the series expansion in the right-hand side is meaningful only when ∆k 1,
we can replace the exponential by 1. The ‘error’ Ek can be easily expressed in
820 APPENDIX
δk3
Ek ≈ a2 (A − aI 1 ) + 2aI (A − aI 1 )2 + (A − aI 1 )3 , (A9.9)
12 I
with
δk = τk+1 − τk .
Finally, it should be remarked that, for the particular case of the Milne-Eddington
atmosphere, the DELO method provides the exact solution to the transfer equa-
tion for any outward propagation direction. This fact, which may look surprising at
first sight, can be easily understood by the following argument. The basic assump-
tions of the Milne-Eddington model imply that the matrix A and the vector b
in Eq. (9.187) are independent of τr and linear in τr , respectively. On the other
hand, the solution I(τr ) given by Eqs. (9.108) is also linear in τr , thus the ‘modified
source vector’ s(τr ) defined in Eq. (9.190) is in turn linear in τr . Since the quantity
aI is also constant, it turns out that s is linear in y (cf. Eq. (A9.1)). Therefore,
Eq. (A9.4) – which is the only assumption of the DELO method – is exactly sat-
isfied under the Milne-Eddington approximations. It follows, incidentally, that it
doesn’t make sense to test the accuracy of the DELO method by comparing the
numerical result with the Milne-Eddington analytical solution.
Consider the transfer equation for the intensity of a non-magnetic spectral line
formed in a static atmosphere in LTE. From Eqs. (9.29), (9.27), (9.32) and (9.35)
=0
we have, under the limit B
dI
= (1 + k) (I − BP ) , (A10.1)
dτc
where
1
k = κL η = κL √ H(v, a) , (A10.2)
π
with v the reduced wavelength
λ − λ0
v= . (A10.3)
∆λD
APPENDIX 821
Following Sánchez Almeida et al. (1989), we assume that the absorption profile η
is shifted from the wavelength λ0 by a small amount δλ which depends on optical
depth, where small means
δλ(τc ) ∆λD (A10.4)
for any τc . We want to evaluate the effect of this wavelength shift on the equivalent
width of the line.
The variable v in Eq. (A10.2) becomes
λ − λ0 − δλ(τc )
v(τc ) = ,
∆λD
and owing to assumption (A10.4) we can expand k in power series of δλ(τc ) . Up
to the second order we have
1 2
k = κL η − η δλ(τc ) + η δλ(τc ) , (A10.5)
2
where η is the unperturbed profile defined in Eq. (A10.2) and where primes denote
derivatives with respect to λ.
Similarly to Sect. 9.6, we write I in the form
I = I0 + I1 + I2 , (A10.6)
n
where In is of order δλ . Substituting Eqs. (A10.5) and (A10.6) into Eq. (A10.1)
and equating the terms of the same order, we obtain the following set of equations
dI0
= (1 + κL η) (I0 − BP )
dτc
dI1
= (1 + κL η) I1 − κL η δλ (I0 − BP )
dτc
dI2 1
= (1 + κL η) I2 − κL η δλ I1 + κ η δλ2 (I0 − BP ) .
dτc 2 L
To solve these equations we make two further assumptions: i) the atmosphere is
described by the Milne-Eddington model (see Sect. 9.8), and we consider vertical
propagation (µ = 1, tc = τc ); ii) the wavelength shift δλ varies linearly with
optical depth,
δλ(τc ) = b τc . (A10.7)
A straightforward calculation shows that the contributions to the emerging inten-
sity are
β
I0 (0) = B0 1 +
1 + κL η
κL η
I1 (0) = B0 β b
(1 + κL η)3
κL η κ2L η 2
I2 (0) = − B0 β b2 − 3 .
(1 + κL η)4 (1 + κL η)5
822 APPENDIX
Using these expressions, we can calculate the equivalent width according to Eq.
(9.219). We have
∞
1
W = W0 − I1 (0) + I2 (0) dλ ,
B0 (1 + β)
−∞
β
W0 = ∆λD I (κ , a) ,
1+β 1 L
with I1 defined in Eqs. (9.222). Since the profile η is symmetrical about λ0 , the
contribution of I1 (0) to the equivalent width is zero. The contribution of I2 (0) can
be simplified by taking into account that
∞ ∞
κL η κ2L η 2
dλ = 4 dλ .
(1 + κL η)4 (1 + κL η)5
−∞ −∞
where
∞
κ2L η̇ 2
I6 (κL , a) = dv , (A10.9)
(1 + κL η)5
−∞
We consider the simplest case where the blend is formed by two independent,
Zeeman-triplet spectral lines. We assume the atmosphere is plane-parallel and
static and the magnetic field is in the vertical (outward or inward) direction, and
we restrict attention to the radiation flowing in the vertical outward direction.
Use of Eqs. (9.216) and (9.32) allows us to write the net circular polarization
parameter – defined in Eq. (9.235) – in the form (cf. Eq. (9.245))
Wb − Wr
v = σ ,
Wb + Wr
where σ is the sign factor defined in Eq. (9.246) and where Wb and Wr are the
equivalent widths of two ‘fictitious’ lines formed in the same atmosphere (without
APPENDIX 823
magnetic field) and having absorption coefficients given by the blend of the σb and
σr components of the two lines respectively, that is by
Denoting by g1 and g2 the Landé factors of the two lines and by λ1 and λ2 their
central wavelengths, the separation between the σb components is
| λ2 − λ1 − (g2 − g1 ) ∆λB | ,
| λ2 − λ1 + (g2 − g1 ) ∆λB | .
of the order or even larger than 0.2), there is indeed no reason why the blending
line having the larger Landé factor should preferably lie on the red rather than on
the blue side of the other line. A systematic effect could possibly exist for fine-
structure or hyperfine-structure multiplets, but such investigation is beyond the
aims of this book.
First of all we evaluate its expression for a particular realization of the jumping
points (τ1 , τ2 , . . . , τi , . . . ). If the number of such points in the interval (τ0 , τ ) is n,
we have from Eqs. (9.95) and (9.103)
−(τ − τn ) An+1
×e , (A12.1)
where A1 is the (constant) value of A(τr ) in the interval (τ0 , τ1 ), and so on.
Next we integrate over all possible realizations of the jumping points. The com-
bined probability dQn that the jumping points in (τ0 , τ ) are n and that they lie in
the intervals (τ1 , τ1 + dτ1 ), (τ2 , τ2 + dτ2 ), . . . , (τn , τn + dτn ), respectively, is given
by
τ1 − τ0 τ2 − τ1 τn − τn−1 τ − τn
− − − − dτ1 dτ2 dτ
dQn = e τe
e τe
··· e τe
e τe
··· n , (A12.2)
τe τe τe
with
∞
∞ τ τ
dτ1 dτ2
dQn = ···
n=0 n=0
τe τe
τ0 τ1
τ τn − τn−1
dτn − τ1 − τ0 τ −τ
− 2τ 1 − −
τ − τn
··· e τe
e e ··· e τe
e τe
= 1.
τe
τn−1
∞
τ − τ0 ∞ τ τ
− dτ1 dτ2
O(τ0 , τ ) dQn = e τe
···
n=0 n=0
τe τe
τ0 τ1
τ
dτn −(τ1 − τ0 ) A1 −(τ2 − τ1 ) A2 −(τ − τ )A −(τ − τn ) An+1
··· e e · · · e n n−1 n e .
τe
τn−1
To perform this integral we expand the exponentials into power series. Using the
formula1
τ τ τ
dτ1 dτ2 dτn
··· (τ1 − τ0 )i1 (τ2 − τ1 )i2 · · · (τn − τn−1 )in (τ − τn )in+1 =
τe τe τe
τ0 τ1 τn−1
n
τ − τ0 i1 ! i2 ! · · · in ! in+1 !
= (τ − τ0 )i1 + i2 + ··· + in + in+1 ,
τe (i1 + i2 + · · · + in + in+1 + n) !
we have
∞
τ − τ0 ∞ n ∞ ∞
− τ − τ0
O(τ0 , τ ) dQn = e τe
···
n=0 n=0
τe i =0 i =0
1 2
∞
[ − (τ − τ0 ) ]i1 + i2 + ··· + in+1 i i in+1
··· A11 A22 · · · An+1 .
in+1 =0
(i1 + i2 + · · · + in+1 + n) !
Finally, we average over the distribution of the physical parameters and obtain
the following expression for the statistical average of the evolution operator
τ − τ0 ∞ n ∞ ∞
− τ − τ0
O(τ0 , τ ) av
= e τe
···
n=0
τe i =0 i =0 1 2
∞
[ − (τ − τ0 ) ]i1 + i2 + ··· + in+1 i i
··· A 1 A 2 · · · Ain+1 .
in+1 =0
(i1 + i2 + · · · + in+1 + n) !
This equation can be rewritten in a more convenient form by grouping together all
the terms of the form
j j
A 1 A 2 · · · Ajm
1
See e.g. Gradshteyn and Ryzhik (1965), equation n. 4.634, p.620.
826 APPENDIX
∞
k
τ − τ0
=1 + (−1)k τek fkm Aj1 Aj2 · · · Ajm , (A12.3)
m=1
τe j1 j2 ··· jm
k=1
j1 + j2 + · · · + jm = k , (A12.4)
and where the function fkm (x), defined only for k ≥ 1 and 1 ≤ m ≤ k, is given by
∞
−x k xn n+1
fkm (x) = e x . (A12.5)
n=m−1
(n + k) ! m
The function fkm (x) can be expressed in a variety of different forms. A tedious
calculation, which is left as an exercise to the reader, allows one to transform the
infinite sum into two finite sums. The result is the following1
f11 (x) = x
m
xn k+m−2−n
fkm (x) = (−1)m (−1)n
n=0
n! k−2
k−2 !
−x xn k+m−2−n
−e (k ≥ 2) . (A12.6)
n=0
n! m
i) If the physical parameters are deterministic, the symbols · · · can be removed.
k−1
Since for a given k the number of different solutions of Eq. (A12.4) is m−1 , we
obtain
∞
k
k−1 τ − τ0
O(τ0 , τ ) av = 1 + (−1)k τek Ak fkm .
m=1
m−1 τe
k=1
1
To obtain Eqs. (A12.6), use has been made of Eq. 45 on p.166 and Eq. 4 on p.705 in
Prudnikov et al. (1986), and of Eq. 4 on p.8 and Eq. (ii) on p.1 in Riordan (1968). Equation 3
in the latter book is useful to prove Eq. (A12.7) using expression (A12.5).
APPENDIX 827
so that
∞
(τ − τ0 )k k −(τ − τ0 ) A
O(τ0 , τ ) av
=1 + (−1)k A = e ,
k!
k=1
which is the correct expression for the evolution operator in the deterministic case.
ii) Microturbulent limit (τe → 0). According to Eqs. (A12.6), the asymptotic
expression of the function fkm (x) for x → ∞ is
xm
fkm (x) ∼ .
m!
Substitution into Eq. (A12.3) shows that, for an assigned k, the only term in the
summation over m which makes a non-zero contribution for τe → 0 is the one
corresponding to m = k. This implies j1 = j2 = · · · = jm = 1, hence Eq. (A12.3)
reduces to
∞
(τ − τ0 )k k −(τ − τ0 ) A
O(τ0 , τ ) av
=1 + (−1)k A = e . (A12.8)
k!
k=1
iii) Macroturbulent limit (τe → ∞). According to Eq. (A12.5), the asymptotic
expression of fkm (x) for x → 0 is
xk+m−1
fkm (x) ∼ .
(k + m − 1) !
For an assigned k, the only term in the summation over m in Eq. (A12.3) which
makes a non-zero contribution for τe → ∞ is the one with m = 1. This implies
j1 = k, so that Eq. (A12.3) yields
∞
(τ − τ0 )k k
O(τ0 , τ ) av
=1 + (−1)k A
k!
k=1
8 ∞
9 1 −(τ − τ ) A 2
k (τ − τ0 )
k
= 1 + (−1) Ak
= e 0
.
k!
k=1
iv) Scalar case. For non-magnetic media, A has the form A = κ 1 (with κ given
by Eq. (9.278)), thus all the matrices Aji in Eq. (A12.3) commute. Equation
828 APPENDIX
!
× N (p1 , p2 , . . . , pm ) κ p1 κ p2 · · · κ pm 1 ,
p1 p2 ··· pm
where
p1 + p2 + · · · + pm = k , 1 ≤ p1 ≤ p2 ≤ · · · ≤ pm , (A12.9)
and where N (p1 , p2 , . . . , pm ) is an integer defined in the following way. For each
set (p1 , p2 , . . . , pm ) satisfying Eqs. (A12.9) we denote by the number of distinct
pi values contained in the set, and we divide the set into groups by collecting
the pi ’s with the same value into one group. If the number of elements in the i-th
group is ni , then
m!
N (p1 , p2 , . . . , pm ) = .
n1 ! n2 ! · · · n !
x3 −x 3
+ τe3 4 − 3x + x2 − − (4 + x) e A
6
x2 −x 2 2
− τe3 3 − 2x + − (3 + x) e A A + A A
2
−x
+ τe3 2 − x − (2 + x) e A3 ,
where x = (τ − τ0 ) / τe .
1 % 5
Note that N (p1 , p2 , p3 ) = 10 = 2
. In general,
% k−1
N (p1 , p2 , · · · , pm ) = m−1
.
APPENDIX 829
a) Definition
The generalized profile is defined by Eq. (10.40)
ΦKK
Q (J , Ju ; ν) = 3(2Ju + 1)(2K + 1)(2K + 1)
Ju J 1 Ju J 1
× (−1)1+Ju −Mu + q
−M u M −q −M u M −q
Mu Mu M qq
Ju Ju K 1 1 K
×
Mu −Mu −Q q −q −Q
1
× Φ(να J M , α J M − ν) + Φ(να J M , α J M − ν)∗ , (A13.1)
2 u u u u u u
b) Limitation on indices
Owing to the presence of the 3-j symbols, the indices are restricted to the range
K = 0, 1, . . . , 2Ju
K = 0, 1, 2
Q = 0, ±1, ±2 with |Q| ≤ K , |Q| ≤ K .
2
3 if Ju = 0
N= NQ =
Q=−2
18Ju + 1 if Ju = 0 .
c) Conjugation
Taking the complex conjugate of Eq. (A13.1) and interchanging the summation
indices Mu ←→ Mu and q ←
→ q , one can easily prove, using the symmetry properties
of the 3-j symbols (Eqs. (2.24), (2.25)), that
ΦKK
Q (J , Ju ; ν)∗ = ΦKK
−Q (J , Ju ; ν) . (A13.2)
1
Because of the conjugation property proved below (Eq. (A13.2)), N represents also the
number of independent real quantities needed to fully specify all the complex generalized profiles.
830 APPENDIX
where wJ(K)J is the symbol defined in Eq. (10.11). This result can easily be proved
u
with the help of Eqs. (6.59a-c), (2.42) and (2.23a).
να J Mu , α J M = ν0 + νL (gα J Mu − gα J M ) ,
u u u u
and that the profile Φ(ν0 − ν) satisfies the conjugation property (cf. Eq. (6.59a))
Φ(x) = Φ(−x)∗ ,
thus
After some index renaming, Eq. (A13.4) is easily obtained from Eq. (A13.1) with
the help of Eqs. (2.24) and (2.25).
1
lim Φ(να J M , α J M − ν) + Φ(να J M , α J M − ν)∗ = φ(ν0 − ν) ,
νL →0 2 u u u u u u
φ being the real part of the complex profile Φ (see Eq. (6.59a)). Substituting into
Eq. (A13.1) and performing the same calculations as in point d), one obtains
lim ΦKK
Q (J , Ju ; ν) = δKK wJ(K)J φ(ν0 − ν) . (A13.5)
νL →0 u
APPENDIX 831
∂ Φ(να J Mu , α J M − ν)
+ νL (gα J Mu − gα J M ) u u
u u ∂νL νL =0
∂ Φ(ν0 − ν)
= Φ(ν0 − ν) − νL (gα J Mu − gα J. M )
u u ∂ν
After substitution into Eq. (A13.1), we are left with a zero-order and a first-order
contribution. The former has already been calculated (Eq. (A13.5)); the latter
contains summations of the form
Ju J 1 Ju J 1
M (−1)1+Ju −Mu + q
−Mu M −q −Mu M −q
Mu Mu M qq
Ju Ju K 1 1 K
× ,
Mu −Mu −Q q −q −Q
where M is either Mu , or Mu , or M . These can be performed by writing M
in terms of a suitable 3-j symbol (as done, for instance, in Eqs. (3.42)). The
summation involving Mu or Mu can be evaluated by applying twice Eq. (2.42),
while the summation involving M can be evaluated via Eq. (2.52). The result is
the following
ΦKK
Q (J , Ju ; ν) = δKK wJ(K)J φ(ν0 − ν)
u
K K 1
−νL 3(2Ju + 1)(2K + 1)(2K + 1)
Q −Q 0
⎡
! !
K K 1 K 1 1
× ⎣ gα J Ju (Ju + 1)(2Ju + 1) (−1)1+J −Ju + Q
u u Ju Ju Ju J Ju Ju
∗
1 ∂ Φ(ν0 − ν) ∂ Φ(ν − ν)
× + (−1)1+K+K 0
2 ∂ν ∂ν
⎧ ⎫ ⎤
⎨ J Ju 1 ⎬ ∂φ(ν − ν)
+ gα J J (J + 1)(2J + 1) (−1)Q J Ju 1 0 ⎦. (A13.6)
⎩ ⎭ ∂ν
1 K K
h) Special case K = 0
For K = 0 one gets, with the help of Eq. (2.26a)
2
Ju J 1 1 1 K
Φ0K (J , J ; ν) = 3(2K + 1) (−1) 1+ q
0 u −Mu M −q q −q 0
Mu M q
× φ(να J Mu , α J M − ν) . (A13.7)
u u
832 APPENDIX
In particular, for K = 0
J J 1
2
Φ00
0 (J , Ju ; ν) = u φ(να − ν) .
−Mu M q J Mu , α J M
u u
Mu M q
Re T Q
K ρKu (α J ) Φ(ν
(i, Ω) − ν) =
Q u u α u u
J Mu , α J M
1 K Ku
= T (i, Ω) ρQ (αu Ju ) Φ(να J M , α J M − ν)
2 Q u u u
K
+ T−Q
K ρ u (α J ) Φ(ν
(i, Ω) −Q u u α J Mu , α J M − ν)∗ ,
u u
where Eqs. (3.102) and (5.158) have been used. Bearing in mind the properties
of the 3-j symbols (Eqs. (2.24), (2.25)) and the relation between the Einstein
coefficients (Eqs. (7.8)), one easily obtains after some index renaming
hν
=
εi (ν, Ω) N 2Ju + 1 A(αu Ju → α J )
4π
K ρK (α J ) ΦKK (J , J ; ν) .
× TQ (i, Ω) Q u u Q u (A13.8)
KK Q
= hν
ηiA (ν, Ω) N B(α J → αu Ju )
4π
2
Ju J 1 1 1 K
× 3(2K + 1) (−1)1+ q
−Mu M −q q −q 0
K Mu M q
φ(ν
× T 0K (i, Ω) − ν) ,
α u u
J Mu , α J M
hν
=
ηiA (ν, Ω) N B(α J → αu Ju ) Φ0K (J , J ; ν) ,
T 0K (i, Ω) (A13.9)
0 u
4π
K
k) Special case J = 0, Ju = 1
Evaluating the relevant 3-j symbols via Eqs. (2.26), and introducing the compact
notation (consistent with Eqs. (9.6) and (9.8))
Φq = φq + i ψq = Φ(να 1 −q, α 00 − ν) (q = −1, 0, +1) , (A13.10)
u
The expressions of ΦKK
Q
KK
and of i ΨQ are the same except for the opposite sign in
the last square bracket. Because of this close resemblance, the following properties
of the generalized dispersion profiles can be proved by strictly similar arguments.
b ) Limitation on indices
Same as b).
c ) Conjugation
KK
ΨQ (J , Ju ; ν)∗ = Ψ−Q
KK
(J , Ju ; ν) .
where the frequencies ν, ν are symmetrical about the central frequency of the line.
KK
ΨQ (J , Ju ; ν) = δKK wJ(K)J ψ(ν0 − ν)
u
K K 1
− νL 3(2Ju + 1)(2K + 1)(2K + 1)
Q −Q 0
⎡
! !
K K 1 K 1 1
× ⎣ gα J Ju (Ju + 1)(2Ju + 1) (−1) 1+J −Ju + Q
u u Ju Ju Ju J Ju Ju
∗
−i ∂ Φ(ν0 − ν) ∂ Φ(ν − ν)
× + (−1)K+K 0
2 ∂ν ∂ν
⎧ ⎫ ⎤
⎨ J Ju 1 ⎬
∂ψ(ν0 − ν) ⎦
+ gα J J (J + 1)(2J + 1) (−1)Q J Ju 1 .
⎩ ⎭ ∂ν
1 K K
APPENDIX 835
h ) Special case K = 0
2
Ju J 1 1 1 K
Ψ00K (J , Ju ; ν) = 3(2K + 1) (−1) 1+ q
−Mu M −q q −q 0
Mu M q
× ψ(να J Mu , α J M − ν) ,
u u
and for K = 0
J J 1
2
Ψ000 (J , Ju ; ν) = u ψ(να − ν) .
−Mu M q u u
J Mu , α J M
Mu M q
A14. Properties of the Symbol WKK Q (β L Sβu Lu ; B) fs
a) Definition
The symbol, simply denoted by WKK Q in this appendix, is defined by Eq. (10.153)
! !
3(2Lu + 1) 1 1 K 1 1 K
WKK Q =
2S + 1 Lu Lu L Lu Lu L
× (2K + 1)(2K + 1)(2Ju + 1)(2Ju + 1)(2Ju + 1)(2Ju + 1)
Ju Ju Ju Ju Mu Mu
! !
Lu Lu K Lu Lu K Ju Ju K Ju Ju K
×
Ju Ju S Ju Ju S −Mu Mu −Q −Mu Mu −Q
j j j j 1
× CJu (Mu ) CJu (Mu ) CJu (Mu ) CJu (Mu ) , (A14.1)
ju ju
u u u u 1 + 2πi ν(ju Mu , ju Mu ) A
836 APPENDIX
where we have used shorthand notations for the Bohr frequencies νβLS (jM, j M )
and the coefficients CJj (βLS, M ) defined in Eqs. (7.30) and (3.58), respectively, and
for the Einstein coefficient A(βu Lu S → β L S).
b) Limitation on indices
Owing to the presence of the 6-j and 3-j symbols, the indices are confined to the
range
0 if Lu = 0
K, K =
0, 1, 2 if Lu = 0
Q = 0, ±1, ±2 with |Q| ≤ K, |Q| ≤ K . (A14.2)
c) Symmetry
Exchange of the dummy indices Ju →
← Ju , Ju →
← Ju shows that
WKK Q = WK KQ . (A14.3)
d) Conjugation
By exchanging the dummy indices ju → → Mu , Ju ←
← ju , Mu ← → Ju , Ju →
← Ju , and
j
bearing in mind the reality of the CJ coefficients (Sect. 3.4), it can easily be proved,
with the help of Eqs. (2.24) and (2.25), that
∗
WKK Q = WKK −Q . (A14.4)
e) Special cases
Use of Eqs. (2.26a), (2.36a), (3.62a,b) and (2.23a) shows that
From Eqs. (A14.2)-(A14.5) it follows that there are only 8 independent symbols,
namely W000 (= 1), W110 , W210 , W220 , W111 , W211 , W221 , W222 . The first four
are real, the remaining are in general complex.
the last quantity being defined in Eq. (7.43). Performing the summation over Mu
and Mu via Eq. (2.23a), one finds
lim WKK Q = δKK WK (β L Sβu Lu ) fs , (A14.6)
B→0
where the symbol [WK (β L Sβu Lu )]fs is defined in Eq. (10.133).
APPENDIX 837
µ0 B
µ0 B ζ , A, (A14.7)
h
or, equivalently
γ1, γx1,
where γ and x are defined in Eqs. (10.154). It can be proved that under this limit
where the symbol WK (L , Lu ) is defined in Eq. (10.17). This property is at the
basis of the anti-level-crossing effect described in Sect. 10.18.
The proof of Eq. (A14.8) is rather complicated. First of all, we notice that
the first of inequalities (A14.7) implies that the upper term is in the complete
Paschen-Back effect regime. It follows (see Sect. 3.4) that the energy eigenvectors
j
of the upper term are of the form |βu Lu SML MS , hence the coefficients CJu (Mu )
u u
defined in Eq. (3.58) reduce to Clebsh-Gordan coefficients. The magnetic quantum
numbers are related by ML + MS = Mu , thus the index ju can be identified with
u
either ML or MS . Setting ju = MS we can write, with the use of Eq. (2.22)
u
j M
CJu (Mu ) ≡ CJ S (Mu ) = Lu SJu Mu |Lu SML MS =
u u u
Lu−S+Mu
Lu S Ju
= (−1) 2Ju + 1 .
ML MS −Mu
u
Substituting into Eq. (A14.1), and performing the summations over Ju and Ju via
Eq. (2.43), next the summations over Ju and Ju via Eq. (2.23b), one obtains
! !
3(2Lu + 1) 1 1 K 1 1 K
WKK Q = (2K + 1)(2K + 1)
2S + 1 Lu Lu L Lu Lu L
K Lu Lu K Lu Lu
× (−1)K−K −Q −ML ML −Q −ML ML
Mu Mu MS u u u u
1
× , (A14.9)
1 + 2πi ν(MS Mu , MS Mu ) A
µ0 B
ν(MS Mu , MS Mu ) = (Mu − Mu ) + (Mu , Mu , MS ) ,
h
838 APPENDIX
where is a correction of order ζ/h due to fine structure. Taking into account
inequalities (A14.7), it follows that
1
= δM M ,
1 + 2πi ν(MS Mu , MS Mu ) A u u
which implies ML = ML and Q = 0. The triple summation in Eq. (A14.9) reduces
u u
to a double summation over Mu and MS , or equivalently over ML and MS . The
u
former can be performed via Eq. (2.23a), the latter yields a factor (2S+1). Equation
(A14.8) is thus proved.
The Hopf function obeys the integral equation (cf. Eq. (12.24))
∞
1 1
q(τ ) = E3 (τ ) + E1 |t − τ | q(t) dt , (A15.1)
2 2
0
where
∞
I1 = E2 (τ ) q(τ ) dτ
0
∞
I2 = E1 (τ ) q(τ ) dτ
0
∞ ∞
I3 = dτ q(τ ) E1 |t − τ | q (t) dt .
0 0
0 0 0
where Eq. (A15.1) has been used. On the other hand, using again Eq. (A15.2) we
get
∞ ∞
1 1
q (t) E3 (t) dt = −q(0) E3 (0) + q(t) E2 (t) dt = − q(0) + ,
2 3
0 0
thus
∞
1 1
I3 = 2 q(t) q (t) dt + q(0) − . (A15.7)
2 3
0
840 APPENDIX
Substitution of Eqs. (A15.5), (A15.6), and (A15.7) into Eq. (A15.4) yields
∞ ∞
2 1
q(τ ) q (τ ) dτ = q(0) − + q(τ ) q (τ ) dτ ,
3
0 0
whence
2 1
q(0) = .
3
Excluding the negative root which is physically meaningless (leading to a negative
value for the surface source function, see Eqs. (12.20a) and (12.23)), one gets the
final result
1
q(0) = √ .
3
∞
1 1
q(τ ) = E3 (τ ) + E1 |t − τ | q(t) dt ,
2 2
0
b
En (τ ) dτ = En+1 (a) − En+1 (b)
a
b
En (τ ) τ dτ = a En+1 (a) − b En+1 (b) + En+2 (a) − En+2 (b) ,
a
−1
N
qi+1 − qi
E3 |τi − τj | − E3 |τi+1 − τj | = q0 E2 (τj ) − E3 (τj ) ,
i=0
τi+1 − τi
APPENDIX 841
For a given star, and for a given point P in its outer atmosphere, the radiation field
tensor, defined in the comoving frame by Eq. (12.40), is a function of the velocity
v of the frame itself. In this appendix we make a slight change of notations and
K
explicitly attach such a functional dependence to the symbol JQ , that is now
written in the form
K
JQ (ν ; v, θv , χv ) ,
where θv and χv are the angles defining the direction of v according to the geometry
of Fig. 12.10.
If the radiation coming from the stellar surface is cylindrically symmetrical – in
the star’s frame – about the vertical axis through the point P, the radiation field
tensor in the comoving frame satisfies a number of properties that are illustrated
below.
K iQχv K
JQ (ν ; v, θv , χv ) = e JQ (ν ; v, θv , 0) . (A17.1)
γ
K 1
JQ (ν ; v, θv , χv ) = dθ sin θ fKQ (θ)
4π
0
2π
iQχ
× g ν, v, θ, θv , cos(χ − χv ) e dχ , (A17.5)
0
where the angle γ is defined in Eq. (12.32). Performing in the last integral the
substitution (χ − χv ) = χ , one easily obtains the property in Eq. (A17.1).
K
b) All the components of the tensor JQ (ν ; v, θv , 0) are real, and satisfy the relation
K
J−Q (ν ; v, θv , 0) = (−1)Q JQ
K
(ν ; v, θv , 0) . (A17.6)
K
In fact, Eq. (A17.5) shows that the imaginary components of JQ (ν ; v, θv , 0) are
zero, because both fKQ and g are real, and g is an even function of χ in the interval
K
(0, 2π) whereas sin(Qχ) is odd. The real components of JQ (ν ; v, θv , 0) have the
form
γ 2π
K 1
JQ (ν ; v, θv , 0) = dθ sin θ fKQ (θ) g(ν, v, θ, θv , cos χ) cos(Qχ) dχ ,
4π
0 0
c) If the spectrum of the radiation coming from the stellar surface is ‘locally’
symmetrical about a particular frequency ν0 in the star’s frame (as in the case of
an absorption or emission line), namely if
= I(ν + ∆ν, Ω)
I(ν0 − ∆ν, Ω) (A17.7)
0
then
for any direction Ω,
K
JQ (ν0 − ∆ν ; v, π − θv , π + χv ) = JQ
K
(ν0 + ∆ν ; v, θv , χv ) , (A17.8)
and, in particular
K
JQ (ν0 ; v, π − θv , π + χv ) = JQ
K
(ν0 ; v, θv , χv ) . (A17.9)
θv → π − θv , χv → π + χv
APPENDIX 843
Neglecting the ‘second-order’ term ∆ν (v · Ω)/c and taking into account the as-
sumption in Eq. (A17.7), the integrand can be transformed via the substitution
- -
, v · Ω ,
I ν0 − ∆ν 1 − = I ν + ∆ν 1 + v · Ω , Ω
,Ω ,
0
c c
which shows, recalling again Eq. (12.40), the validity of Eq. (A17.8).
= v
v · Ω x
= cos Θ v + sin Θ v .
v · Ω x y
Substituting into Eq. (13.35), the integrals over vz and vx can be easily performed.
Introducing the Doppler width in frequency units, ∆νD = ν0 vT /c (see Eqs. (5.43)),
one gets
844 APPENDIX
ν −ν 2
− 0
; ν, Ω
) = P (Ω, )
Ω 1 ∆νD
Rij (ν, Ω ij e
π vT ∆νD
∞ v 2 , -
− y ν
× dvy e vT δ ν0 − ν + (ν − ν0 ) cos Θ + 0 sin Θ vy ,
c
−∞
; ν, Ω
Rij (ν, Ω ) =
ν −ν 2 ν−ν0 ν −ν0 2
− 0 − −
Ω ) 1 ∆νD ∆νD sin Θ ∆νD tan Θ
= Pij (Ω, 2 sin Θ e e , (A18.1)
π ∆νD
and for Θ = 0
ν −ν 2
− 0
; ν, Ω
Rij (ν, Ω ) = P (Ω, ) √1
Ω 1
e ∆νD
δ(ν − ν) . (A18.2)
ij
π ∆νD
∗
KQQ
K
(RB ) = DQ
K
Q (RB ) Q DQ
K
Q (RB ) , (A19.1)
Q
As proved in Sect. 10.8 (see Eq. (10.74)), Eq. (A19.1) can be rewritten in a
simpler form, involving a single rotation matrix
K K 1
KQQ
K
(RB ) = K(K + 1)(2K + 1) (−1)K−Q D0q
1
(RB ) . (A19.2)
Q −Q q
It can easily be seen, with the help of Eqs. (2.24), (2.25), and (2.73), that
KQQ
K
(RB ) satisfies the symmetry properties
∗
KQ
K
Q (RB ) = KQQ (RB )
K
K−Q−Q
K
(RB ) = (−1)
2K+1+Q+Q
KQ
K
Q (RB ) .
K−2−2
2
= −K22
2
= −2 cos θB
iχB
K−2−1
2 2
= (K−1−2 )∗ = (K21
2 ∗
) = K12
2
= sin θB e
K−1−1
2
= −K11
2
= − cos θB
.
3 iχ
K−10
2
= 2
(K0−1 )∗ = 2 ∗
(K10 ) = K01
2
= sin θB e B .
2
3
× wJ(K)J wJ(KJ) (−1)Q T−Q
K T K (i, Ω)
(i, Ω) , (A20.1)
u u
Q
i=0
where the symbol wJ(K)J is given by Eq. (10.11), the tensor T Q K – given by
(i, Ω)
u
Eq. (5.134) – is defined in the system Σ of Fig. 14.4, the unit vector Ω is given by
(x − x )/|x − x |, p(ν0 − ν) is the Gaussian profile of Eq. (14.6), and τν (x, x ) is
the optical thickness at frequency ν between points x and x (see Eq. (14.17)).
Since T QK is defined for K = 0, 1, 2 and, for a fixed K, for Q = −K, . . . , K,
(i, Ω)
it follows that there are in principle 81 different multipole coupling coefficients
satisfying the limitations
K = 0, 1, 2 K = 0, 1, 2
Q = −K, . . . , K Q = −K , . . . , K .
846 APPENDIX
∞
2 −τ (x,x )
GKQ,K Q (x, x ) = dν p(ν0 − ν) e ν
−∞
,
× wJ(K)J wJ(KJ) ΓKQ,K Q (Ω) (A20.2)
u u
where
3
=
ΓKQ,K Q (Ω) (−1)Q T−Q
K T K (i, Ω)
(i, Ω) .
Q
i=0
=Γ
ΓK Q ,KQ (Ω) ∗
KQ,K Q (Ω) (A20.3)
and
= (−1)Q+Q Γ
ΓK−Q,K −Q (Ω) ∗
KQ,K Q (Ω) . (A20.4)
Obviously, similar relations hold for the multipole coupling coefficients,
and
GK−Q,K −Q (x, x ) = (−1)Q+Q GKQ,K Q (x, x )∗ . (A20.6)
can be expressed in terms of rotation matrices using
The quantity ΓKQ,K Q (Ω)
Eq. (5.159)
3
= (−1)Q
P (i) DP −Q (R) tP (i) DP Q (R) ,
tK K K K
ΓKQ,K Q (Ω) (A20.7)
i=0 P P
e (Ω),
where R ≡ (−γ, −θ, −χ) is the rotation carrying the system (ea (Ω), Ω) into
b
the system Σ where the multipole coupling coefficients are defined (see Fig. 14.4).
On the other hand, Eqs. (5.162) and (5.161) yield
3
3
3
tK K
P (i) tP (i) = (−1)
K+K
ξi2 tK K
P (i) tP (i) = (−1)
K+K
tK K
P (i) tP (i) ,
i=0 i=0 i=0
APPENDIX 847
which implies
= (−1)K+K Γ
ΓKQ,K Q (Ω)
KQ,K Q (Ω)
and
GKQ,K Q (x, x ) = (−1)K+K GKQ,K Q (x, x ) . (A20.8)
and G
As a consequence, the quantities ΓKQ,K Q (Ω) x, x ) vanish unless K
KQ,K Q (
and K are both even or both odd. This reduces the number of non-zero components
from 81 to 45. Taking into account the symmetry properties (A20.3) and (A20.4),
it can easily be seen that the number of non-zero, independent components is 17.
Equation (A20.7) can be simplified via the following transformations. First we
express the coefficients tK P (i) in terms of the Pauli spin matrices σ̂i defined in
Eq. (5.128). Using Eq. (5.160) we have
3
3
tK K
P (i) tP (i) = (2K + 1)(2K + 1)
i=0
4
3
1 1 K 1 1 K
× (σ̂i )αβ (σ̂i )γδ ,
β −α −P δ −γ −P
i=0 αβγδ=±1
where the rows and columns of σ̂i are labelled as in Eqs. (5.129). Using Eq. (8.64)
we get1
3
3
tK K
P (i) tP (i) = (2K + 1)(2K + 1)
i=0
2
1 1 K
1 1 K
× . (A20.9)
β −α −P α −β −P
αβ=±1
This equation shows that P = −P and that the only allowed values for P and P
are 0 and ±2. Substitution of Eq. (A20.9) into Eq. (A20.7) and use of Eq. (2.75)
yields
= (−1)Q 3
ΓKQ,K Q (Ω) (2K + 1)(2K + 1)
2
1 1 K 1 1 K
× (2K + 1)
β −α −P α −β −P
αβ=±1 P P K
K K K K K K
∗
× DSS
K
(R) .
P P S −Q Q S
The sums over α and β cannot be performed by standard sum rules of Racah
algebra because the ‘components’ α and β run on the values ±1 but not on the
1
We recall that Eq. (8.64) is valid for any representation of the Pauli spin matrices.
848 APPENDIX
value 0. To overcome this difficulty, we multiply the right-hand side by the quantity
α2 β 2 which, using Eq. (3.52), can be written as
! !
1 1 f 1 1 f
α2 β 2 = (−1)α−β 36 (2f + 1)(2f + 1)
1 1 1 1 1 1
ff
1 1 f 1 1 f 1 1 f 1 1 f
× .
0 0 0 0 0 0 α −α 0 β −β 0
Now the sum can be extended to the values α = 0 and β = 0, and the summation
over the indices α, β, P , P can be carried out with the help of Eq. (2.52). Changing
the index K into f , and using Eq. (2.73), we obtain the expression
= (−1)K +Q 54
ΓKQ,K Q (Ω) (2K + 1)(2K + 1)
1 1 f 1 1 f
× (2f + 1)(2f + 1)(2f + 1)
0 0 0 0 0 0
ff f
!
f f f K K f 1 1 f
×
0 0 0 −Q Q Q − Q 1 1 1
⎧ ⎫
!⎨f f f ⎬
1 1 f f
× 1 1 K D0,Q −Q (R) . (A20.10)
1 1 1 ⎩ ⎭
1 1 K
It should be noticed that, owing to the symmetry properties of the 3-j symbols,
the sum over f , f , f is in fact restricted to even values, thus it contains 6 terms
at most, corresponding respectively to the following triplets (f, f , f ): (0, 0, 0),
(0, 2, 2), (2, 0, 2), (2, 2, 0), (2, 2, 2), and (2, 2, 4). According to the values of K and
K , some of these terms can however be zero. If, for instance, K and K are both
zero, it follows that f = 0 and the sum contains only two terms.
The explicit expressions of the non-zero Γ ’s can be found from Eq. (A20.10) by
evaluating the various 3-j, 6-j, and 9-j symbols and by using the analytical formulae
for the rotation matrices (Eqs. (2.68) and Table 2.1). The results – that depend
on the angles θ and χ specifying the direction Ω but are obviously independent of
the angle γ – are the following
=1
Γ00,00 (Ω) =
Γ20,20 (Ω) 1
(5 − 12 cos2 θ + 9 cos4 θ)
4
= = iχ
Γ00,20 (Ω) 1
√
2 2
(3 cos2 θ − 1) Γ20,21 (Ω) 1
2
3
2 sin θ cos θ (2 − 3 cos2 θ) e
√
=− iχ = 2iχ
Γ00,21 (Ω) 2
3
sin θ cos θ e Γ20,22 (Ω) 1
4
3
2 sin2 θ (1 + 3 cos2 θ) e
√
= 2iχ = − 3 sin3 θ cos θ e −3iχ
Γ00,22 (Ω) 4
3
sin2 θ e Γ21,2−2 (Ω) 4
= −2iχ
Γ21,2−1 (Ω) 3
4 sin2 θ (1 − 2 cos2 θ) e
APPENDIX 849
=
Γ10,10 (Ω) 3
cos2 θ =
Γ21,21 (Ω) 3
sin2 θ (1 + 2 cos2 θ)
2 4
=− √
3 iχ = 3 iχ
Γ10,11 (Ω) 2 2
sin θ cos θ e Γ21,22 (Ω) 4 sin θ cos θ (1 + cos2 θ) e
= − 3 sin2 θ e−2iχ
Γ11,1−1 (Ω) =
Γ22,2−2 (Ω) 3
sin4 θ e
−4iχ
4 8
=
Γ11,11 (Ω) 3
sin2 θ =
Γ22,22 (Ω) 3
(1 + cos2 θ)2 . (A20.11)
4 8
The remaining Γ ’s can easily be deduced from the symmetry properties (A20.3)
and (A20.4). The multipole coupling coefficients for a given transition (J , Ju ) are
then obtained from Eq. (A20.2).1
is their invariance under
A remarkable property of the quantities ΓKQ,K Q (Ω)
inversion of the direction Ω,
=Γ
ΓKQ,K Q (−Ω)
KQ,K Q (Ω) . (A20.12)
To prove this relation, we recall that the rotation appearing in Eq. (A20.10) carries
e (Ω),
the reference system (ea (Ω), Ω)
into the system Σ of Fig. 14.4,
b
e (−Ω),
The rotation carrying (ea (−Ω), −Ω)
into Σ is
b
where the angle γ specifies the direction of the unit vector ea (−Ω)
in the plane
perpendicular to Ω. Thus the rotation matrix appearing in the expression of
is
ΓKQ,K Q (−Ω)
f
i(π+χ)(Q −Q)
D0,Q −Q (R ) = e df0,Q −Q (θ − π) = (−1)f D0,Q
f
−Q (R) ,
where Eqs. (2.68) and (2.70) have been used. But the summation in Eq. (A20.10)
is limited to even values of the index f , so Eq. (A20.12) is proved. In fact, it
can easily be checked that the analytical expressions of Eqs. (A20.11) are invariant
under the transformation θ → π − θ, χ → π + χ.
Since the optical thickness τν (x, x ) is invariant under exchange of x and x ,
it follows from Eqs. (A20.2) and (A20.12) that the multipole coupling coefficients
satisfy the important relation
Finally, from Eq. (A20.10) one can easily evaluate the solid-angle average of the
defined by
quantity ΓKQ,K Q (Ω),
5
1
ΓKQ,K Q (Ω) = .
dΩ ΓKQ,K Q (Ω)
4π
Since D00
0
(αβγ) = 1, we obtain from the Weyl theorem (Eq. (2.76))
2π π
dγ dβ sin β D0N
J
(αβγ) = 4π δJ0 δN 0 ,
0 0
2 !2 !
1 1 f 1 1 f 1 1 K
× (2f + 1) ,
0 0 0 1 1 1 1 1 f
f =0,2
and evaluating the relevant 3-j and 6-j symbols (Eqs. (2.26a,f), (2.36a,f))
!
2 1 1 K
ΓKQ,K Q (Ω) = δKK δQQ + (−1) K
,
3 1 1 2
=1
Γ1Q,1Q (Ω) (Q = 0, ±1)
2
= 7
Γ2Q,2Q (Ω) (Q = 0, ±1, ±2) .
10
Ω
θ
χ
α
Γ
Fig.A21.1. In the reference system Σ we consider a fixed point P located at height z corresponding
to line optical depth tL . The point P lies on the plane Γ parallel to the x-y plane; its height is
z corresponding to line optical depth tL . The cylindrical coordinates (r, α) define the position of
P in the plane Γ . The angles θ and χ specify the direction Ω.
dx dy = r dr dα .
dx dy
= tan θ dθ dα ,
(x − x )2
and since χ = α + π, the double integral over x and y can be transformed into an
integral over the angles θ and χ specifying the direction Ω,
2π π/2
∆νD .
I= dχ dθ tan θ GKQ,K Q (Ω) (A21.2)
4π
0 0
852 APPENDIX
1
p(ν0 − ν) = ϕ(v) , (A21.4)
∆νD
where
ν0 − ν 1 −v 2
v= , ϕ(v) = √ e , (A21.5)
∆νD π
with
∞
ϕ(v) dv = 1 .
−∞
Substitution of Eqs. (A21.3), (A21.4) into Eq. (A21.2) and use of Eq. (A20.2) leads
to the expression
f iχ(Q −Q)
D0,Q −Q (−γ −θ −χ) = e df0,Q −Q (−θ) ,
hence the integration over χ yields 2π for Q = Q and zero for Q = Q . On the other
are linear combinations
side, Eqs. (A20.11) show that the quantities ΓKQ,K Q (Ω)
2n
of factors of the form [cos θ] with n = 0, 1, 2,
2
=
ΓKQ,K Q (Ω) cn (K, K , Q) [cos θ]2n . (A21.7)
n=0
∞
1 (K) (K ) 2
2
I = δQQ wJ J wJ J cn (K, K , Q) dv ϕ(v) In (tL − tL ) ϕ(v) ,
2 u u n=0
−∞
APPENDIX 853
where
π/2 (t −t ) ϕ(v)
− L L
In (tL − tL ) ϕ(v) = e cos θ [cos θ]2n tan θ dθ .
0
Introducing the variable y = 1/ cos θ, it can readily be seen that the last integral
is nothing but the exponential-integral function of order (2n + 1) – cf. Eq. (12.6)
In the above calculations we assumed tL > tL . The opposite case can be treated
in a strictly similar way. The final result is that, irrespective of the relative height
of the points x and x , the integral in Eq. (A21.1) can be written in the form
1 (K) (K )
I = δQQ w w
2 Ju J Ju J
∞
2
2
× cn (K, K , Q) dv ϕ(v) E2n+1 |tL − tL | ϕ(v) , (A21.8)
n=0 −∞
√
A22. The Generalization of the - Law
One of the few analytical results that can be established in the scalar theory of
radiative transfer concerns the integral equation
∞
(1 + ) S(τ ) = BP + K |τ − τ | S(τ ) dτ , (A22.1)
0
with
= . (A22.2)
1+
This equation is encountered in the non-LTE approach to the problem of line for-
mation for a two-level atom in plane-parallel atmospheres, and relates the source
function at optical depth τ to the same function at different optical depths. BP
854 APPENDIX
is the Planck function at the kinetic temperature of the colliding particles (char-
acterized by a Maxwellian velocity distribution), and is the ratio of collisional to
radiative de-excitation rates for the upper level – a quantity specifying the degree
of coupling between atoms and radiation.
Provided and BP are independent of τ , and the kernel K obeys the integral
property
∞
1
K(t) dt = , (A22.3)
2
0
∞
N
ai Si (τ ) = b δi0 + Kij |τ − τ | Sj (τ ) dτ (i = 0, . . . , N ) , (A22.4)
j=0 0
where:
- ai and b are non-vanishing quantities independent of τ , with
a0 > 1 , b>0;
- the kernels Kij are integrable functions of their argument and satisfy the sym-
metry property
Kij (t) = Kji (t) (i, j = 0, . . . , N ) , (A22.5)
and the integral property
∞
1
K0j (t) dt = δ (j = 0, . . . , N ) . (A22.6)
2 j0
0
N
2 b2
ai Si (0) = . (A22.7)
i=0
a0 − 1
APPENDIX 855
N
∞ τ !
ai Si (∞) = b δi0 + lim Kij (τ − τ ) dτ + Kij (τ − τ ) dτ Sj (∞) ,
τ →∞
j=0 τ 0
N
ai Si (∞) = b δi0 + 2 Aij Sj (∞) (i = 0, . . . , N ) , (A22.8)
j=0
where
∞
Aij = Kij (t) dt . (A22.9)
0
a0 S0 (∞) = b + S0 (∞)
N
ai Si (∞) = 2 Aij Sj (∞) (i = 1, . . . , N ) ,
j=1
whose solution is
b
Si (∞) = δ . (A22.10)
a0 − 1 i0
Next we consider the τ -derivative of Eq. (A22.4),
∞
dS (τ ) d
N
ai i = Kij |τ − τ | Sj (τ ) dτ .
dτ j=0
dτ
0
The derivative in the right-hand side can be evaluated by splitting the integral in
two parts (cf. the derivation of Eq. (A15.3)). One obtains
∞
dSi (τ ) dSj (τ )
N N
ai = Kij (τ ) Sj (0) + Kij |τ − τ |
dτ .
dτ j=0 j=0
dτ
0
√
Following a method developed by Frisch and Frisch (1975) to derive the - law
in the scalar case, we multiply both members by Si (τ ), integrate in dτ , and sum
over i,
856 APPENDIX
N ∞ N N ∞
dS (τ )
ai Si (τ ) i dτ = Sj (0) Kij (τ ) Si (τ ) dτ
i=0
dτ i=0 j=0
0 0
N ∞ ∞
N
dSj (τ )
+ dτ Si (τ ) dτ Kij |τ − τ | . (A22.11)
i=0 j=0
dτ
0 0
The term in the left-hand side is immediately evaluated via integration by parts,
and gives
1
N & 2 2 '
ai Si (∞) − Si (0) .
2 i=0
Using Eqs. (A22.5) and (A22.4), the first term in the right-hand side can be cast
into the form
N
2
ai Si (0) − b S0 (0) .
i=0
For the second term in the right-hand side we obtain, by interchanging the inte-
gration order and by using again Eqs. (A22.5) and (A22.4)
N ∞ ∞
dS (τ ) dS0 (τ )
ai Si (τ ) i dτ − b dτ ,
i=0
dτ dτ
0 0
or
1
N & 2 2 ' & '
ai Si (∞) − Si (0) − b S0 (∞) − S0 (0) .
2 i=0
N
2
ai Si (0) = b S0 (∞) ,
i=0
√
which proves – taking into account Eq. (A22.10) – the generalized - law of
Eq. (A22.7). √
In the following we show that the ‘classical’ - law can be recovered as a special
√
case of Eq. (A22.7), and we deduce the explicit expression of the generalized -
law for some physical situations that are treated in the text.
a) Scalar case
Equation (A22.1) is a special case of Eq. (A22.4) corresponding to
N =0, S0 = S , a0 = 1 + , b = BP .
APPENDIX 857
Since the quantities and BP in Eq. (A22.1) are τ -independent and positive, and
the kernel K obeys Eq. (A22.3), it is easily seen that all the conditions for the
validity of Eq. (A22.7) are met. Performing the above substitutions, we obtain
from Eq. (A22.7)
2
(1 + ) S(0) = BP2 ,
or .
√
S(0) = BP = BP , (A22.12)
1+
√
where Eq. (A22.2) has been used. Equation (A22.12) is just the ‘classical’ - law.
b) Non-magnetic atmosphere
Consider the system of integral equations (14.27) under the assumption that ,
BP , and δu(K) are independent of tL and that Hu = 0: we obtain the system of
Eqs. (14.35) with the only difference that the factor in front of S02 (tL ) in the left-
hand side of the second equation is (1 + + δu(2) ) instead of (1 + ). This system
has the form of Eq. (A22.4) with
and because the multipolar kernels GKQ,K Q (x) defined in Eq. (14.28) are integrable
functions of x satisfying the symmetry properties of Eq. (14.29) and the integral
properties of Eqs. (14.32), Eq. (A22.7) can be applied. We get
2 2
(1 + ) S00 (0) + (1 + + δu(2) ) S02 (0) = BP2 ,
or
0 2 2 √
S0 (0) + [ 1 + (1 − ) δu(2) ] S02 (0) = BP . (A22.13)
√
This is the explicit form of the generalized - law for the physical situation con-
sidered.
Let us assume that , BP , δu(2) , and the magnetic field vector are independent of tL ,
and that the x-axis of the ‘magnetic frame’ lies in the plane containing the magnetic
field and the perpendicular to the atmosphere (RB = RB ≡ (0, −θB , −χB ) in
Eq. (14.40)). Under these assumptions the kernels ĜKQ,K Q (x) obey the properties
858 APPENDIX
of Eqs. (14.41) and (14.42), and the conditions for the validity of Eq. (A22.7) are
satisfied. We get
A
B
B 2
√
C S 0 (0) 2 + [ 1 + (1 − ) (δ (2) + iH Q) ] S 2 (0) 2 =
u BP . (A22.14)
0 M u Q M
Q=−2
The kernels G̃K0,K 0 (x) obey the properties in Eqs. (14.65) and (14.66). Provided ,
BP , and δu(K) are independent of tL , all the conditions for the validity of Eq. (A22.7)
are satisfied, so that
A
B
B 0 2 2 √
C S0 (0) + [ 1 + (1 − ) δu(K) ] S0K (0) M = BP . (A22.15)
M
K=2,4,...,Kmax
√
Finally, it should be remarked that a further generalization of the - law has
been proved by Frisch (1998). This generalization concerns the coupled system of
(N + 1) integral equations of the form
∞
N
ai Si (τ ) = bi + Kij |τ − τ | Sj (τ ) dτ (i = 0, . . . , N ) ,
j=0 0
where:
- the coefficients ai , bi are independent of τ , with
a0 > 1 , b0 > 0 ;
- the kernels Kij have the same properties as the kernels appearing in Eq. (A22.4).
Following a procedure similar to the former, it can be proved that
N
2 b20 N N
ai Si (0) = + bi (X −1 )ij bj ,
i=0
a0 − 1 i=1 j=1
∞
K
G̃K0,K 0 (x, x ) = dν Φ̃KK
0 (J , Ju ; ν) Φ̃K
0 (J , Ju ; ν)
K K −∞
3
3
× O (x, x ; ν) T K (j, Ω)
T 0K (i, Ω) , (A23.1)
ij 0
i=0 j=0
where the profiles Φ̃KK0 (J , Ju ; ν) are given by Eq. (14.48), Oij (x, x ; ν) is the
evolution operator defined in Eq. (14.57), Ω is the direction of the vector joining
point x with point x, and the tensors T 0 are defined in the reference system of the
K
magnetic field. They are real quantities (because the single factors appearing in the
right-hand side are real – see Eqs. (5.158) and (A13.2)),1 and obey the following
properties:
a) Limitation on indices
As apparent from Eq. (14.48), the indices K and K are restricted to the range
K, K = 0, 1, . . . , 2Ju .
where the profile q(ν0 − ν) is given by Eq. (14.53). As a consequence, Eqs. (14.51)
and (14.52) yield
= k A p(ν − ν) δ
lim ηiA (ν, Ω) (i = 0, 1, 2, 3)
νL →0 L 0 i0
=0
lim ρAi (ν, Ω) (i = 1, 2, 3) ,
νL →0
1
Equation (A13.2) refers in fact to the generalized profiles ΦKKQ , but it can easily be seen
that the same relation holds for Φ̃KK
Q . This remark also applies to other properties that will be
needed in the following.
860 APPENDIX
where τν (x, x ) is given by Eq. (14.17). Substitution of Eqs. (A23.2) and (A23.4)
into Eq. (A23.1) yields
∞
2 −τ (x,x ) (K) (K )
lim G̃K0,K 0 (x, x ) = dν p(ν0 − ν) e ν wJ J wJ J
νL →0 u u
−∞
3
× T K (i, Ω)
T 0K (i, Ω) .
0
i=0
This expression formally coincides with the definition of GK0,K 0 (x, x ) given in
Eq. (14.24), but one must bear in mind the difference between the reference systems
where the two quantities are defined.
∞
K
G̃K0,K 0 (x, x ) = dν Φ̃KK
0 (J , Ju ; ν) Φ̃K
0 (J , Ju ; ν)
K K P P −∞
3
3
× tK x, x ; ν) tK
P (i) Oij ( P (j) DP 0 (R) DP 0 (R) ,
K K
(A23.5)
i=0 j=0
e (Ω),
where R is the rotation carrying the reference system (ea (Ω), Ω) into the
b
system of the magnetic field.
d) Vanishing values
G̃K0,K 0 (x, x ) = (−1)K+K G̃K0,K 0 (x, x ) , (A23.6)
a relation showing that all the generalized multipole coupling coefficients connecting
irreducible components of the source function of different parity are identically zero.
To prove this property, we consider two frequencies ν, ν symmetrical about line
center (ν − ν0 = ν0 − ν). From Eqs. (A13.4) and (A13.12) we have
Φ̃0K K 0K
0 (J , Ju ; ν ) = (−1) Φ̃0 (J , Ju ; ν)
Using these relations and the expressions of the tensors T Q K – Table 5.6 – it is
(i, Ω)
easily seen from Eqs. (14.51) and (14.52) that the radiative transfer coefficients η0A ,
η1A , η2A , ρA3 are symmetrical about line center, while η3A , ρA1 , ρA2 are antisymmetrical.
Thus the matrix H of Eq. (14.56) satisfies the symmetry property1
where ξi is the formal vector defined in Eqs. (5.161). Since this property remains
valid for any arbitrary function of the matrix H, it holds also for the evolution
operator,
Oij (x, x ; ν ) = ξi ξj Oij (x, x ; ν) .
Using the first of Eqs. (5.162), it can be shown that the integrand in Eq. (A23.5)
has the parity of (K + K ), so that Eq. (A23.6) follows after integration.
To prove this property, one should bear in mind that the generalized multipole
coupling coefficients do not depend, as obvious on general physical grounds, on the
chosen to define the Stokes parameters of the radiation
reference direction ea (Ω)
flowing in the direction Ω. On the other hand, it can easily be shown that if the
and
reference direction is rotated through an angle α, the components T 02 (1, Ω)
transform according to the equations
T 0 (2, Ω)
2
2
T 0 (1, Ω)
= cos 2α T 02 (1, Ω)
+ sin 2α T 02 (2, Ω)
new old old
2
T 0 (2, Ω)
= − sin 2α T 02 (1, Ω)
+ cos 2α T 02 (2, Ω) .
new old old
van-
This implies that it is possible to find a reference direction such that T 02 (2, Ω)
A
ishes. With this choice, one obtains from Eqs. (14.51) and (14.52) that η2 (ν, Ω) =
A = 0, so that the matrix K defined in Eq. (14.50) takes the form
ρ2 (ν, Ω) A
⎛ ⎞
η0A (ν, Ω)
η1A (ν, Ω) 0
η3A (ν, Ω)
⎜ A ⎟
⎜ η1 (ν, Ω)
η A (ν, Ω)
ρA3 (ν, Ω) 0 ⎟
KA = ⎜
⎜
0 ⎟ .
⎝ 0
−ρA3 (ν, Ω)
η A (ν, Ω) ⎟
ρA1 (ν, Ω) ⎠
0
A
η (ν, Ω) 0
−ρA1 (ν, Ω) A
η (ν, Ω)
3 0
1
This is the well-known property of the absorption matrix deduced in Chap. 9, Eq. (9.51).
We recall that in the non-LTE problem considered in Sect. 14.3 the lower level is assumed to be
unpolarized.
862 APPENDIX
It follows that the matrix H of Eq. (14.56) obeys the symmetry property
where τi is the formal vector defined in Eqs. (5.161). Since this property holds for
any function of the matrix H, it is also valid for the evolution operator,
= T K (i, Ω)
τi T 0K (i, Ω) ,
0
so that
O (x, x ; ν) T K (j, Ω)
T 0K (i, Ω) O (x, x ; ν) T K (j, Ω)
= T K (i, Ω) .
ji 0 0 ij 0
This means that the indices i and j of the evolution operator in Eq. (A23.1) can
be interchanged. This is sufficient to prove Eq. (A23.7) by exchange of the dummy
summation indices K and K in Eq. (A23.1).
To prove this property, we observe that the interchange of x and x implies the
interchange of the direction Ω with the opposite direction −Ω. Since, as already
noticed at point e), the generalized multipole coupling coefficients do not depend
on the reference direction, we choose for the direction −Ω a reference direction
unit vector ea (−Ω) = ea (Ω), as shown in Fig. 5.15. With this choice, one has from
Eq. (5.163)
= ζ T K (i, Ω)
T 0K (i, −Ω) (i = 0, . . . , 3) , (A23.9)
i 0
where ζi is the formal vector defined in Eqs. (5.161). Substitution of Eq. (A23.9)
into Eqs. (14.51) and (14.52) shows that the matrix H of Eq. (14.56) satisfies, at
each point of the medium, the property
= ζ ζ H (ν, Ω)
Hij (ν, −Ω) ,
i j ij
and because the optical depth τ (x, x ) is obviously invariant under interchange of
its arguments, it follows that
Substituting Eqs. (A23.9) and (A23.10) into Eq. (A23.1), property (A23.8) is easily
proved.
APPENDIX 863
where, with obvious notations, (βL0 S0 ) is the set of quantum numbers describing
the parent term of the photoionized level and, at the same time, the term encom-
passing the level where the atom is left after photoionization (this final level being
further described by the quantum number J0 ).
Notwithstanding the simplifications introduced, the evaluation of the matrix el-
ement is rather complicated because of the different coupling schemes of the ‘bra’
and ‘ket’ vectors. A recoupling of the various angular momenta is necessary. With
standard techniques one has
|β L0 S0 , nb ; LSJ = (2L + 1)(2S + 1) (2J0 + 1)(2jb + 1)
J0 jb
⎧ ⎫
⎨ L0 S0 J0 ⎬
× b 1
jb |β (L0 S0 )J0 , nb jb ; J . (A24.2)
⎩ 2 ⎭
L S J
On the other hand, since the operator d only acts on the variables of the optical
electron, using Eq. (2.109) one has
and taking into account that d only acts on the orbital variables of the optical
electron, from Eq. (2.108) one obtains
1
= (−1)J0 +J−b − 2 (2J + 1)(2jb + 1)(2j + 1)(2b + 1)
! !
jb j 1 b 1
× nb d . (A24.3)
J J J0 j jb 21
864 APPENDIX
The reduced matrix element nb d can be evaluated through standard meth-
ods. One has
√ b 1
nb d = e0 (−1) +1
2 + 1 I(nb , ) , (A24.4)
0 0 0
∞
I(nb , ) = Pn (r) r χ (r) dr , (A24.5)
b
0
Pn (r) being the reduced radial wavefunction of the bound electron and χ (r) be-
b
ing the same quantity for the free electron with kinetic energy at infinite distance
from the atomic nucleus.
Substituting Eq. (A24.4) into Eq. (A24.3), and taking into account Eq. (A24.2),
one obtains from Eq. (A24.1)
Equation (A24.6) can be substituted into Eq. (14.83) to find the dichroism coef-
ficient due to photoionization. The summation over , j, J in the denominator of
Eq. (14.83) can be evaluated by summing first over J and then over j. Applying
twice Eq. (2.39) one gets
⎧ ⎫
⎨ L0 S0 J0 ⎬2
M = e20 (2L + 1)(2S + 1)(2J0 + 1) (2jb + 1) b 1
jb
⎩ 2 ⎭
jJ jb L S J
2
b 1 2
× (2 + 1) I(nb , ) . (A24.7)
0 0 0
(K)
For the numerator, recalling the definition of wJJ (see Eq. (10.11)) and applying
M = e0 (2L + 1)(2S + 1)(2J0 + 1)(2b + 1)
(K) 2
wJJ 3(2J + 1)
jJ
⎧ ⎫⎧ ⎫
⎨ L0 S0 J0 ⎬ ⎨ L0 S0 J0 ⎬
× (2jb + 1)(2jb + 1) b 12 jb 1
jb
⎩ ⎭⎩ b 2 ⎭
jb jb L S J L S J
! !
jb jb K jb jb K
×
b b 12 J J J0
! 2
1 1 1 K b 1
× (−1) 2 +J+J0 + (2 + 1)
b b 0 0 0
2
× I(nb , ) . (A24.8)
Equations (A24.7) and (A24.8), when substituted into Eq. (14.83), allow one to
express the dichroism coefficients due to photoionization as functions of the radial
integrals defined in Eq. (A24.5). Obviously, such integrals can be evaluated only
when the wavefunctions of the bound and free electronic states are known. To this
aim, one needs to solve the radial Schrödinger equation, which generally requires
the use of self-consistent Hartree-Fock methods. An approximate method that
can conveniently be used in many cases to express the radial integrals is the so-
called Quantum Defect Method developed by Seaton (1958) and Burgess and Seaton
(1960).
LIST OF TABLES
10.2 Analytical expressions of the quantities WK (J , Ju ), (pQ )max , |pV |max 519
10.3 Atomic polarization induced by an unpolarized radiation beam . . . . . . . 543
10.4 Values of the quantities yJ(K) (K)
J , xJ J , XK (J, J ) for different transitions 575
10.5 Analytical expressions of the quantities XK (J, J ) . . . . . . . . . . . . . . . . . . . . 576
∞
10.6 Values of the factors [DK (LS)]fs for different terms . . . . . . . . . . . . . . . . . . 588
10.7 Values of the quantities WK (L Lu S, J Ju ) for different multiplets . . . . . . 593
∞
10.8 Values of the factors [DK (JI)]hfs for half-integral J values . . . . . . . . . . . 619
REFERENCES
Cannon, C.J.: 1985, The Transfer of Spectral Line Radiation, Cambridge Univer-
sity Press, Cambridge [etc.].
Casini, R., and Judge, P.G.: 1999, Astrophys. J. 522, 524.
Chandrasekhar, S.: 1947, Astrophys. J. 106, 145.
Chandrasekhar, S.: 1950, Radiative Transfer , Clarendon Press, Oxford.
Chandrasekhar, S., and Breen, F.H.: 1947, Astrophys. J. 105, 435.
Charvin, P.: 1965, Ann. Astrophysique 28, 877.
Clarke, D.: 1974, Appl. Opt. 13, 222.
Clarke, D., and Grainger, J.F.: 1971, Polarized Light and Optical Measurement ,
Pergamon Press, Oxford [etc.].
Cohen-Tannoudji, C., Diu, B., and Laloë, F.: 1977, Mécanique Quantique, Her-
mann, Paris.
Cohen-Tannoudji, C., Dupont-Roc, J., and Grynberg, G.: 1987, Photons et Ato-
mes, InterEditions, Editions du CNRS, Paris.
Cohen-Tannoudji, C., Dupont-Roc, J., and Grynberg, G.: 1988, Processus d’Inter-
action entre Photons et Atomes, InterEditions, Editions du CNRS, Paris.
Colegrove, F.D., Franken, P.A., Lewis, R.R., and Sands, H.R.: 1959, Phys. Rev.
Letters 3, 420.
Collatz, L.: 1966, The Numerical Treatment of Differential Equations, Springer-
Verlag, Berlin [etc.].
Condon, E.U., and Odabaşi, H.: 1980, Atomic Structure, Cambridge University
Press, Cambridge [etc.].
Condon, E.U., and Shortley, G.H.: 1935, The Theory of Atomic Spectra, Cambridge
University Press, Cambridge.
Corliss, C., and Sugar, J.: 1982, J. Phys. Chem. Ref. Data 11, No.1, 135.
Cuperman, S., Ofman, L., and Semel, M.: 1990, Astron. Astrophys. 227, 227.
Débarbat, S., Dumont, S., and Pecker, J.C.: 1970, Astron. Astrophys. 8, 231.
Delbouille, L., Neven, L., and Roland, G.: 1973, Photometric Atlas of the Solar
Spectrum from 3000 to 10000 Å, Institut d’Astrophysique de l’Université de
Liège.
Derouich, M., Sahal-Bréchot, S., Barklem, P.S., and O’Mara, B.J.: 2003, Astron.
Astrophys. 404, 763.
Dirac, P.A.M.: 1935, The Principles of Quantum Mechanics, Clarendon Press,
Oxford.
Dolginov, A.Z., and Pavlov, G.G.: 1972, Soviet Astron. 16, 450.
Domke, H.: 1969, Astrofizika 5, 525.
Domke, H.: 1971, Astrofizika 7, 39.
Edmonds, A.R.: 1957, Angular Momentum in Quantum Mechanics, Princeton
University Press, Princeton, New Jersey.
Elliott, J.P., and Flowers, B.H.: 1955, Proc. Roy. Soc. A 229, 536.
Faddeeva, V.N., and Terent’ev, N.M.: 1961, Tables of Values of the Function w(z)
for Complex Argument, Pergamon Press, New York.
Falkoff, D.L., Colladay, G.S., and Sells, R.E.: 1952, Can. J. Phys. 30, 253.
Fano, U.: 1949, J. Opt. Soc. Am. 39, 859.
Faulstich, R.: 1980, J. Quant. Spectrosc. Rad. Transfer 24, 229.
872 REFERENCES
Heading, J.: 1958, Matrix Theory for Physicists, Longmans, Green and Co., Lon-
don [etc.].
Heitler, W.: 1954, The Quantum Theory of Radiation, 3rd ed., Clarendon Press,
Oxford.
Hopf, E.: 1934, Mathematical Problems of Radiative Equilibrium, Cambridge
Tracts, No.31.
House, L.L.: 1970a, J. Quant. Spectrosc. Rad. Transfer 10, 909.
House, L.L.: 1970b, J. Quant. Spectrosc. Rad. Transfer 10, 1171.
House, L.L.: 1971, J. Quant. Spectrosc. Rad. Transfer 11, 367.
Howard, R., and Stenflo, J.O.: 1972, Solar Phys. 22, 402.
Hui, A.K., Armstrong, B.H., and Wray, A.A.: 1978, J. Quant. Spectrosc. Rad.
Transfer 19, 509.
Humlicek, J.: 1982, J. Quant. Spectrosc. Rad. Transfer 27, 437.
Hummer, D.G.: 1962, Monthly Not. R. A. S. 125, 21.
Illing, R.M.E., Landman, D.A., and Mickey, D.L.: 1975, Astron. Astrophys. 41,
183.
Ivanov, V.V.: 1973, Transfer of Radiation in Spectral Lines, National Bureau of
Standards Spec. Publ. No.385, Govt. Printing Office, Whashington D.C.
Ivanov, V.V.: 1990, Soviet Astron. 34, 621.
Ivanov, V.V., Grachev, S.I., and Loskutov, V.M.: 1997, Astron. Astrophys. 321,
968.
Jackson, J.D.: 1962, Classical Electrodynamics, John Wiley & Sons, New York
[etc.].
Jahn, H.A., and Hope, J.: 1954, Phys. Rev. 93, 318.
Jefferies, J.T.: 1968, Spectral Line Formation, Blaisdell, Waltham, Mass. [etc.].
Jefferies, J.T., Lites, B.W., and Skumanich, A.: 1989, Astrophys. J. 343, 920.
Jones, R.C.: 1941, J. Opt. Soc. Am. 31, 488.
Katz, J.M.: 1971, Solar Phys. 20, 362.
Kawakami, H.: 1983, Publ. Astron. Soc. Japan 35, 459.
Kemp, J.C.: 1969, J. Opt. Soc. Am. 59, 950.
Kemp, J.C.: 1970, Astrophys. J. 162, 169.
Kemp, J.C., Henson, G.D., Barbour, M.S., Kraus, D.J., and Collins II, G.W.: 1983,
Astrophys. J. 273, L85.
Kemp, J.C., Macek, J.H., and Nehring, F.W.: 1984, Astrophys. J. 278, 863.
Kerkeni, B.: 2002, Astron. Astrophys. 390, 783.
Kerkeni, B., Spielfiedel, A., and Feautrier, N.: 2003, Astron. Astrophys. 402, 5.
de Kertanguy, A.: 1998, Astron. Astrophys. 333, 1130.
Kjeldseth-Moe, O.: 1968, Solar Phys. 4, 267.
Knölker, M., Schüssler, M., and Weisshaar, E.: 1988, Astron. Astrophys. 194, 257.
Kopfermann, H.: 1958, Nuclear Moments, Academic Press, New York.
Kourganoff, V., and Busbridge, I.W.: 1952, Basic Methods in Transfer Problems,
Clarendon Press, Oxford.
Kponou, A., Hughes, V.W., Johnson, C.E., Lewis, S.A., and Pichanik, F.M.J.:
1971, Phys. Rev. Letters 26, 1613.
Kramers, H.A.: 1924, Nature 113, 783.
874 REFERENCES
Landi Degl’Innocenti, M., and Landi Degl’Innocenti, E.: 1988, Astron. Astrophys.
192, 374.
Landi Degl’Innocenti, E., and Landolfi, M.: 1983, Solar Phys. 87, 221.
Landi Degl’Innocenti, M., Landolfi, M., and Landi Degl’Innocenti, E.: 1976, Il
Nuovo Cimento 35 B, 117.
Landi Degl’Innocenti, E., Bommier, V., and Sahal-Bréchot, S.: 1987, Astron. As-
trophys. 186, 335.
Landi Degl’Innocenti, E., Landi Degl’Innocenti, M., and Landolfi, M.: 1988, As-
tron. Astrophys. 204, 133.
Landi Degl’Innocenti, E., Bommier, V., and Sahal-Bréchot, S.: 1990, Astron. As-
trophys. 235, 459.
Landi Degl’Innocenti, E., Bommier, V., and Sahal-Bréchot, S.: 1991, Astron. As-
trophys. 244, 391.
Landi Degl’Innocenti, E., Landi Degl’Innocenti, M., and Landolfi, M.: 1997, in N.
Mein and S. Sahal-Bréchot (eds.), Forum THEMIS , Publications de l’Obser-
vatoire de Paris, p. 59.
Landolfi, M.: 1987, Solar Phys. 109, 287.
Landolfi, M., and Landi Degl’Innocenti, E.: 1982, Solar Phys. 78, 355.
Landolfi, M., and Landi Degl’Innocenti, E.: 1985, Solar Phys. 98, 53.
Landolfi, M., and Landi Degl’Innocenti, E.: 1986, Astron. Astrophys. 167, 200.
Landolfi, M., and Landi Degl’Innocenti, E.: 1996, Solar Phys. 164, 191.
Landolfi, M., Landi Degl’Innocenti, E., Arena, P.: 1984, Solar Phys. 93, 269.
Lehmann, J.C.: 1964, J. Physique 25, 809.
Lenz, W.: 1933, Z. Phys. 80, 423.
Leroy, J.L.: 1962, Ann. Astrophysique 25, 127.
Leroy, J.L.: 1972, Astron. Astrophys. 19, 287.
Leroy, J.L., Ratier, G., and Bommier, V.: 1977, Astron. Astrophys. 54, 811.
Lin, H., Penn, M.J., and Kuhn, J.R.: 1998, Astrophys. J. 493, 978.
Lin, H., Penn, M.J., and Tomczyk, S.: 2000, Astrophys. J. 541, L83.
Lites, B.W., and Skumanich, A.: 1990, Astrophys. J. 348, 747.
Lites, B.W., Low, B.C., Martı́nez Pillet, V., Seagraves, P., Skumanich, A., Frank,
Z.A., Shine, R.A., and Tsuneta, S.: 1995, Astrophys. J. 446, 887.
Lites, B.W., Leka, K.D., Skumanich, A., Martı́nez Pillet, V., and Shimizu, T.:
1996, Astrophys. J. 460, 1019.
Lorentz, H.A.: 1915, The Theory of Electrons, 2nd ed., Dover Publications, New
York.
Loudon, R.: 1983, The Quantum Theory of Light, 2nd ed., Clarendon Press, Ox-
ford.
Louisell, W.H.: 1973, Quantum Statistical Properties of Radiation, John Wiley
& Sons, New York [etc.].
Lyot, B.: 1948, Comptes Rendues Acad. Sci. Paris 226, 25.
Magain, P.: 1986, Astron. Astrophys. 163, 135.
Makita, M.: 1986, Solar Phys. 106, 269.
Manso Sainz, R.: 2002, Ph.D. Thesis, Universidad de La Laguna.
876 REFERENCES
Manso Sainz, R., and Trujillo Bueno, J.: 2001, in M. Sigwarth (ed.), Advanced
Solar Polarimetry. Theory, Observations and Interpretation, ASP Conf. Ser.
236, p. 213.
Marquardt, D.W.: 1963, J. Soc. Ind. Appl. Math. 11, No.2, p. 431.
Martin, B.R.: 1971, Statistics for Physicists, Academic Press, London [etc.].
Mathys, G., and Stenflo, J.O.: 1987a, Astron. Astrophys. 171, 368.
Mathys, G., and Stenflo, J.O.: 1987b, Astron. Astrophys. Suppl. 67, 557.
Maxwell, J.C.: 1873, A Treatise on Electricity and Magnetism, Oxford University
Press, Oxford.
Mein, P.: 1971, Solar Phys. 20, 3.
Messiah, A.: 1961, Quantum Mechanics, North-Holland, Amsterdam [etc.].
Mickey, D.L.: 1973, Astrophys. J. 181, L19.
Mickey, D.L., and Orrall, F.Q.: 1974, Astron. Astrophys. 31, 179.
Mihalas, D.: 1978, Stellar Atmospheres, 2nd ed., W.H. Freeman, San Francisco.
Mitchell, A.C.J., and Zemansky, M.W.: 1934, Resonance Radiation and Excited
Atoms, Cambridge University Press, Cambridge.
Moore, C.E.: 1949, Atomic Energy Levels, Vol. I, National Bureau of Standards.
Moore, C.E.: 1952, Atomic Energy Levels, Vol. II, National Bureau of Standards.
Moore, C.E.: 1958, Atomic Energy Levels, Vol. III, National Bureau of Standards.
Mueller, H.: 1948, J. Opt. Soc. Am. 38, 661.
Nagendra, K.N.: 1988, Astrophys. J. 335, 269.
Nagendra, K.N.: 1994, Astrophys. J. 432, 274.
Nagendra, K.N., Frisch, H., and Faurobert-Scholl, M.: 1998, Astron. Astrophys.
332, 610.
Öhman, Y.: 1929, Monthly Not. R. A. S. 89, 479.
Omont, A.: 1965, J. Physique 26, 26.
Omont, A.: 1977, Progr. Quant. Electron. 5, 69.
Omont, A., Smith, E.W., Cooper, J.: 1972, Astrophys. J. 175, 185.
Ord-Smith, R.J.: 1954, Phys. Rev. 94, 1227.
Oxenius, J.: 1986, Kinetic Theory of Particles and Photons, Springer-Verlag,
Berlin [etc].
Pauli, W.: 1927, Z. Phys. 43, 601.
Pavlov, G.G., and Shibanov, Yu.A.: 1978, Soviet Astron. 22, 43.
Pecker, J.C.: 1999, Solar Phys. 184, 1.
Perrin, F.: 1942, J. Chem. Phys. 10, 415.
Pichanik, F.M.J., Swift, R.D., Johnson, C.E., and Hughes, V.W.: 1968, Phys. Rev.
169, 55.
Poincaré, H.: 1892, Théorie Mathématique de la Lumière, Gauthiers-Villars, Paris.
Prudnikov, A.P., Brychkov, Yu.A., and Marichev, O.I.: 1986, Integrals and Series,
Vol. I, Gordon and Breach, New York.
Querfeld, C.W., and Smartt, R.N.: 1984, Solar Phys. 91, 299.
Querfeld, C.W., Smartt, R.N., Bommier, V., Landi Degl’Innocenti, E., and House,
L.L.: 1985, Solar Phys. 96, 277.
Racah, G.: 1942, Phys. Rev. 62, 438.
Rachkovsky, D.N.: 1962a, Izv. Krymsk. Astrofiz. Obs. 27, 148.
REFERENCES 877
Stenflo, J.O., Twerenbold, D., and Harvey, J.W.: 1983a, Astron. Astrophys. Suppl.
52, 161.
Stenflo, J.O., Twerenbold, D., Harvey, J.W., and Brault, J.W.: 1983b, Astron.
Astrophys. Suppl. 54, 505.
Stenflo, J.O., Harvey, J.W., Brault, J.W., and Solanki, S.K.: 1984, Astron. Astro-
phys. 131, 33.
Stepanov, V.E.: 1958, Izv. Krymsk. Astrofiz. Obs. 19, 20.
Stokes, G.G.: 1852, Trans. Cambridge Phil. Soc. 9, 399.
Tam, A.C.: 1975, Phys. Rev. A 12, 539.
Tandberg-Hanssen, E.: 1995, The Nature of Solar Prominences, Kluwer, Dordrecht
[etc.].
Ten Bruggencate, P., and von Klüber, H.: 1939, Z. Astrophys. 18, 284.
del Toro Iniesta, J.C., Semel, M., Collados, M., and Sánchez Almeida, J.: 1990,
Astron. Astrophys. 227, 591.
Trujillo Bueno, J.: 1990, in F. Sanchez and M. Vazquez (eds.), New Windows to the
Universe, XI th European Meeting of the IAU, Cambridge University Press,
Cambridge, p. 119.
Trujillo Bueno, J.: 1999, in K.N. Nagendra and J.O. Stenflo (eds.), Solar Polariza-
tion, Kluwer, Dordrecht, p. 73.
Trujillo Bueno, J., and Fabiani Bendicho, P.: 1995, Astrophys. J. 455, 646.
Trujillo Bueno, J., and Landi Degl’Innocenti, E.: 1996, Solar Phys. 164, 135.
Trujillo Bueno, J., and Landi Degl’Innocenti, E.: 1997, Astrophys. J. 482, L183.
Trujillo Bueno, J., and Manso Sainz, R.: 1999, Astrophys. J. 516, 436.
Unno, W.: 1956, Publ. Astron. Soc. Japan 8, 108.
Unsöld, A.: 1955, Physik der Sternatmosphären, 2nd ed., Springer, Berlin [etc.].
Van Ballegooijen, A.A.: 1985, in M.J. Hagyard (ed.), Measurements of Solar Vector
Magnetic Fields, NASA Conf. Publ. 2374, p. 322.
Van Ballegooijen, A.A.: 1987, in W. Kalkofen (ed.), Numerical Radiative Transfer ,
Cambridge University Press, Cambridge, p. 279.
Varshalovich, D.A., Moskalev, A.N., and Khersonskii, V.K.: 1988, Quantum The-
ory of Angular Momentum, World Scientific, Singapore [etc.].
Vernazza, J.E., Avrett, E.H., and Loeser, R.: 1981, Astrophys. J. Suppl. 45, 635.
Walker, M.J.: 1954, American J. Phys. 22, 170.
Warner, B.: 1969, Observatory 89, 11.
Weisskopf, V.F.: 1932, Z. Phys. 75, 296.
Westendorp Plaza, C., del Toro Iniesta, J.C., Ruiz Cobo, B., Martı́nez Pillet, V.,
Lites, B.W., and Skumanich, A.: 1997, Nature 389, 47.
Wieder, I., and Lamb, W.E. Jr.: 1957, Phys. Rev. 107, 125.
Wiehr, E.: 1975, Astron. Astrophys. 38, 303.
Wiehr, E.: 1978, Astron. Astrophys. 67, 257.
Wiese, W.L., Smith, M.W., and Glennon, B.M.: 1966, Atomic Transition Proba-
bilities, Vol. I (NSRD-NBS4), U.S. Department of Commerce.
Wittmann, A.: 1971, Solar Phys. 20, 365.
Wittmann, A.: 1974, Solar Phys. 35, 11.
Zirin, H., and Severny, A.B.: 1961, Observatory 81, 155.
AUTHOR INDEX
Italic numbers refer to ‘et al.’ quotations. Multiple occurrences of an author’s name on the same
page are not indicated.
Lites, B.W., 152, 162, 397, 474, 487, 625, Ord-Smith, R.J., 49
635, 637, 653, 655, 660, 661, 662 Orrall, F.Q., 782
Livingston, W., 635, 646, 658 Oxenius, J., 693
Loeser, R., 437
Lorentz, H.A., 221 Pahlke, K.D., 482
Loskutov, V.M., 754, 756 Pauli, W., 7
Loudon, R., 384 Pavlov, G.G., 482, 505
Louisell, W.H., 245 Pecker, J.C., 775, 782
Low, B.C., 662 Penn, M.J., 712, 720
Lyot, B., 775 Perrin, F., 16
Petford, A.D., 651
Macek, J.H., 607, 608 Pichanik, F.M.J., 713
Magain, P., 456, 457 Poincaré, H., 26
Makita, M., 487 Pouquet, A., 496, 500
Manso Sainz, R., 740, 752, 754, 755, 774 Prudnikov, A.P., 826
Marichev, O.I., 826
Marmolino, C., 456 Querfeld, C.W., 656, 713, 719
Marquardt, D.W., 636
Martin, B.R., 500 Racah, G., 35, 36, 42, 67
Martı́nez Pillet, V., 625, 637, 655, 660, 661, Rachkovsky, D.N., 162, 414
662 Ramsauer, J., 658
Mathys, G., 95 Ratier, G., 703
Maxwell, J.C., 1 Rayrole, J., 639, 641, 643
McClimont, A.N., 662 Reader, J., 78
Mein, P., 450 Rees, D.E., 162, 444, 445, 446, 457, 754, 768,
Messiah, A., 7, 29, 35, 74, 238 789, 816
Metcalf, R.F., 662 Reichel, A., 171
Metropolis, N., 39, 44 Riordan, J., 826
Mickey, D.L., 483, 662, 719, 782 Robinson, R.D. Jr., 90
Mihalas, D., 237, 264, 270, 340, 350, 376, Rogers, S.R., 656
384, 439, 445, 446, 462, 568, 722, 737, 776, Roland, G., 628
854 Rose, M.E., 44
Milkey, R.W., 450 Rotenberg, M., 39, 44
Mitchell, A.C.J., 192, 534 Rüedi, I., 658
Montavon, C.A.P., 487 Ruiz Cobo, B., 452, 455, 655, 656
Moore, C.E., 78, 104, 596, 597, 616 Rust, D.M., 714
Morita, M., 39 Rybicki, G.B., 854
Moskalev, A.N., 29, 35, 39, 44, 47 Ryzhik, I.M., 825
Mueller, H., 364
Murphy, G.A., 444, 445, 446, 457, 816 Sahal-Bréchot, S., 288, 343, 524, 680, 710,
Mürset, U., 658 711, 713, 716, 719, 720, 742, 752, 754,
757, 761
Nagendra, K.N., 789 Saito, R., 39
Nehring, F.W., 607, 608 Saliba, G., 789
Neven, L., 628 Salpeter, E.E., 258
Newkirk, G.N. Jr., 719 Sánchez Almeida, J., 373, 409, 444, 482, 487,
Nieuwenhuijzen, H., 456 625, 654, 660, 661, 821
Noyes, R.W., 437 Sands, H.R., 607
Satchler, G.R., 29, 35, 39, 42, 67
Odabaşi, H., 73 Schiff, L.I., 7, 73
Ofman, L., 662 Schmidt, G.D., 74
Öhman, Y., 710 Schüssler, M., 657
O’Mara, B.J., 343 Seagraves, P., 637, 662
Omont, A., 123, 229, 789 Seares, F.H., 433
884 AUTHOR INDEX
Volume 299: Open Issues in Local Star Fomation, edited by Jacques Lépine, Jane
Gregorio-Hetem
Hardbound, ISBN 1-4020-1755-3, December 2003
Volume 293: Physics of the Solar System, by Bruno Bertotti, Paolo Farinella, David
Vokrouhlický
Hardbound, ISBN 1-4020-1428-7, August 2003
Paperback, ISBN 1-4020-1509-7, August 2003
Volume 287/8/9: The Future of Small Telescopes in the New Millennium, edited by
Terry D. Oswalt
Hardbound Set only of 3 volumes, ISBN 1-4020-0951-8, July 2003
Volume 286: Searching the Heavens and the Earth: The History of Jesuit
Observatories, by Agustín Udías
Hardbound, ISBN 1-4020-1189-X, October 2003
Volume 284: Light Pollution: The Global View, edited by Hugo E. Schwarz
Hardbound, ISBN 1-4020-1174-1, April 2003
Volume 283: Mass-Losing Pulsating Stars and Their Circumstellar Matter, edited by
Y. Nakada, M. Honma, M. Seki
Hardbound, ISBN 1-4020-1162-8, March 2003
Volume 280: Organizations and Strategies in Astronomy III, edited by André Heck
Hardbound, ISBN 1-4020-0812-0, September 2002
Volume 274: New Quests in Stellar Astrophysics: The Link Between Stars and
Cosmology, edited by Miguel Chávez, Alessandro Bressan, Alberto Buzzoni,Divakara
Mayya
Hardbound, ISBN 1-4020-0644-6, June 2002
Volume 270: Dayside and Polar Cap Aurora, by Per Even Sandholt, Herbert C. Carlson,
Alv Egeland
Hardbound, ISBN 1-4020-0447-8, July 2002
Volume 268: Multielement System Design in Astronomy and Radio Science, by Lazarus
E. Kopilovich, Leonid G. Sodin
Hardbound, ISBN 1-4020-0069-3, November 2001
Volume 266: Organizations and Strategies in Astronomy II, edited by André Heck
Hardbound, ISBN 0-7923-7172-0, October 2001
Volume 261: Collisional Processes in the Solar System, edited by Mikhail Ya. Marov,
Hans Rickman
Hardbound, ISBN 0-7923-6946-7, May 2001
Volume 259: The Dynamic Sun, edited by Arnold Hanslmeier, Mauro Messerotti, Astrid
Veronig
Hardbound, ISBN 0-7923-6915-7, May 2001
Volume 257: Stellar Pulsation - Nonlinear Studies, edited by Mine Takeuti, Dimitar D.
Sasselov
Hardbound, ISBN 0-7923-6818-5, March 2001
Volume 255: The Evolution of the Milky Way- Stars versus Clusters, edited by
Francesca Matteucci, Franco Giovannelli
Hardbound, ISBN 0-7923-6679-4, January 2001
Volume 254: Stellar Astrophysics, edited by K.S. Cheng, Hoi Fung Chau, Kwing Lam
Chan, Kam Ching Leung
Hardbound, ISBN 0-7923-6659-X, November 2000
Volume 247: Large Scale Structure Formation, edited by Reza Mansouri, Robert
Brandenberger
Hardbound, ISBN 0-7923-6411-2, August 2000
Volume 246: The Legacy of J.C. Kapteyn, edited by Piet C. van der Kruit, Klaas van
Berkel
Paperback, ISBN 1-4020-0374-9, November 2001
Hardbound, ISBN 0-7923-6393-0, August 2000
Volume 243: Solar Polarization, edited by K.N. Nagendra, Jan Olof Stenflo
Hardbound, ISBN 0-7923-5814-7, July 1999
Volume 237: Post-Hipparcos Cosmic Candles, edited by André Heck, Filippina Caputo
Hardbound, ISBN 0-7923-5348-X, December 1998