Lectures in Classical Mechanics - Richard Fitzpatrick
Lectures in Classical Mechanics - Richard Fitzpatrick
Richard Fitzpatrick
Professor of Physics
The University of Texas at Austin
For Faith
CONTENTS 3
Contents
1 Introduction 9
1.1 Prerequisites . . . . . . . . . . . . . . . . . . . . . . . . . . . 9
1.2 Scope of Book . . . . . . . . . . . . . . . . . . . . . . . . . . 9
1.3 Major Sources . . . . . . . . . . . . . . . . . . . . . . . . . . 11
3 Fundamental Concepts 39
3.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . 39
3.2 Fundamental Assumptions . . . . . . . . . . . . . . . . . . . 39
3.3 Newton’s Laws of Motion . . . . . . . . . . . . . . . . . . . . 41
3.4 Newton’s First Law of Motion . . . . . . . . . . . . . . . . . 41
3.5 Newton’s Second Law of Motion . . . . . . . . . . . . . . . . 44
3.6 Newton’s Third Law of Motion . . . . . . . . . . . . . . . . . 46
3.7 Exercises . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 49
4 One-Dimensional Motion 51
4.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . 51
4.2 Motion in a General One-Dimensional Potential . . . . . . . 51
4.3 Velocity Dependent Forces . . . . . . . . . . . . . . . . . . . 54
4.4 Simple Harmonic Motion . . . . . . . . . . . . . . . . . . . . 57
4.5 Damped Oscillatory Motion . . . . . . . . . . . . . . . . . . 59
4.6 Quality Factor . . . . . . . . . . . . . . . . . . . . . . . . . . 62
4.7 Resonance . . . . . . . . . . . . . . . . . . . . . . . . . . . . 64
4.8 Periodic Driving Forces . . . . . . . . . . . . . . . . . . . . . 66
4 NEWTONIAN DYNAMICS
4.9 Transients . . . . . . . . . . . . . . . . . . . . . . . . . . . . 70
4.10 Simple Pendulum . . . . . . . . . . . . . . . . . . . . . . . . 71
4.11 Exercises . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 74
5 Multi-Dimensional Motion 77
5.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . 77
5.2 Motion in a Two-Dimensional Harmonic Potential . . . . . . 77
5.3 Projectile Motion with Air Resistance . . . . . . . . . . . . . 80
5.4 Charged Particle Motion in Electric and Magnetic Fields . . . 84
5.5 Exercises . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 86
6 Planetary Motion 89
6.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . 89
6.2 Kepler’s Laws . . . . . . . . . . . . . . . . . . . . . . . . . . 89
6.3 Newtonian Gravity . . . . . . . . . . . . . . . . . . . . . . . 89
6.4 Conservation Laws . . . . . . . . . . . . . . . . . . . . . . . 90
6.5 Plane Polar Coordinates . . . . . . . . . . . . . . . . . . . . 91
6.6 Conic Sections . . . . . . . . . . . . . . . . . . . . . . . . . . 94
6.7 Kepler’s Second Law . . . . . . . . . . . . . . . . . . . . . . 97
6.8 Kepler’s First Law . . . . . . . . . . . . . . . . . . . . . . . . 98
6.9 Kepler’s Third Law . . . . . . . . . . . . . . . . . . . . . . . 99
6.10 Orbital Energies . . . . . . . . . . . . . . . . . . . . . . . . . 100
6.11 Kepler Problem . . . . . . . . . . . . . . . . . . . . . . . . . 102
6.12 Motion in a General Central Force-Field . . . . . . . . . . . . 105
6.13 Motion in a Nearly Circular Orbit . . . . . . . . . . . . . . . 106
6.14 Exercises . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 109
1 Introduction
1.1 Prerequisites
The scope of this book is indicated by its title, “Newtonian Dynamics”. Tak-
ing the elements of this title in reverse order: “Dynamics” is the study of the
motions of the various objects in the world around us; and by “Newtonian”,
we understand that the theory which we are actually going to employ in
our investigation of Dynamics is that which was first formulated by Sir Isaac
Newton in 1687. We now know that this theory is only approximately true.
The theory breaks down, due to relativistic effects, when the velocities of
the objects under investigation approach the speed of light in vacuum, and
in regions of space in which the underlying geometry is not approximately
Euclidean. The theory also breaks down on atomic and subatomic length-
scales, and must be replaced by Quantum Mechanics. In this book, we shall
entirely neglect relativistic and quantum mechanical effects. It follows that
we must restrict our investigations to the dynamics of large (compared to an
atom), slowly moving (compared to the velocity of light), objects in Euclidean
space. Fortunately, the vast majority of the motions which we commonly ob-
serve in the world around us fall into this category.
For the sake of simplicity, and brevity, we shall restrict our investigations
10 NEWTONIAN DYNAMICS
2.1 Introduction
This chapter briefly outlines those aspects of Vector Algebra and Vector Cal-
culus which are helpful in the study of Newtonian Dynamics. The essential
purpose of Vector Algebra is to convert the propositions of Euclidean Ge-
ometry in three-dimensional space into a convenient algebraic form. Vector
Calculus allows us to define the instantaneous velocity and acceleration of a
moving point in three-dimensional space, as well as the distance moved by
such a point, in a given time interval, along a general curved trajectory. Vec-
tor Calculus also introduces the concept of a scalar field: e.g., the potential
energy associated with a conservative force.
b
a
c Q
S
a
b
→ → → → → → →
PR = PQ + QR—see Figure 2.2. Suppose that a ≡ PQ ≡ SR, b ≡ QR ≡ PS,
→
and c ≡ PR. It is clear, from Figure 2.2, that vector addition is commutative:
i.e., c = a + b = b + a. It can also be shown that the associative law holds:
i.e., a + (b + c) = (a + b) + c.
There are two main approaches to vector algebra. The geometric ap-
proach is based on line elements in space. The coordinate approach assumes
that space is defined in terms of Cartesian coordinates, and uses these to
characterize vectors. In Newtonian Dynamics, we generally adopt the sec-
ond approach, because it is far more convenient than the first.
In the coordinate approach, a vector is denoted as the row matrix of its
components (i.e., perpendicular projections) along each of three mutually
perpendicular Cartesian axes (the x-, y-, and z-axes, say): e.g.,
If a ≡ (ax, ay, az) and b ≡ (bx, by, bz) then vector addition is defined
We do not need to change our notation for the displacement in the new
basis. It is still denoted r. The reason for this is that the magnitude and
direction of r are independent of the choice of basis vectors. The coordinates
of r do depend on the choice of basis vectors. However, they must depend
in a very specific manner [i.e., Equations (2.8)–(2.10)] which preserves the
magnitude and direction of r.
16 NEWTONIAN DYNAMICS
y
y′ ⊙z
x′
θ x
with analogous transformation rules for rotation about the y- and z-axes. In
the coordinate approach, Equations (2.11)–(2.13) constitute the definition
of a vector. The three quantities (ax, ay, az) are the components of a vector
provided that they transform under rotation like Equations (2.11)–(2.13).
Conversely, (ax, ay, az) cannot be the components of a vector if they do not
transform like Equations (2.11)–(2.13). Scalar quantities are invariant un-
der transformation. Thus, the individual components of a vector (ax, say)
are real numbers, but they are not scalars. Displacement vectors, and all
vectors derived from displacements, automatically satisfy Equations (2.11)–
(2.13). There are, however, other physical quantities which have both mag-
nitude and direction, but which are not obviously related to displacements.
We need to check carefully to see whether these quantities are vectors (see
Section 2.5).
Let us rotate the basis though θ degrees about the z-axis. According to
Equations (2.11)–(2.13), in the new basis a · b takes the form
Thus, a·b is invariant under rotation about the z-axis. It can easily be shown
that it is also invariant under rotation about the x- and y-axes. Clearly, a · b
is a true scalar, so the above definition is a good one. Incidentally, a · b is the
only simple combination of the components of two vectors which transforms
like a scalar. It is easily shown that the dot product is commutative and
distributive: i.e.,
a · b = b · a,
a · (b + c) = a · b + a · c. (2.17)
b
b−a .
θ
O a A
Does this rather unlikely combination transform like a vector? Let us try ro-
tating the basis through θ degrees about the z-axis using Equations (2.11)–
(2.13). In the new basis,
a × b = −b × a, (2.28)
distributive,
a × (b + c) = a × b + a × c, (2.29)
20 NEWTONIAN DYNAMICS
thumb
a×b
middle finger
b
θ
a index finger
Figure 2.5: The right-hand rule for cross products. Here, θ is less that 180◦ .
a × (b × c) 6= (a × b) × c. (2.30)
The cross product transforms like a vector, which means that it must
have a well-defined direction and magnitude. We can show that a × b is
perpendicular to both a and b. Consider a · a × b. If this is zero then the
cross product must be perpendicular to a. Now,
Thus,
|a × b| = |a| |b| sin θ, (2.34)
with the sense obtained from the right-hand grip rule by rotating a on to b.
Suppose that a force F is applied at position r—see Figure 2.7. The
torque about the origin O is the product of the magnitude of the force
and the length of the lever arm OQ. Thus, the magnitude of the torque
is |F| |r| sin θ. The direction of the torque is conventionally the direction of
the axis through O about which the force tries to rotate objects, in the sense
determined by the right-hand grip rule. It follows that the vector torque is
given by
τ = r × F. (2.36)
b b
θ
a
θ
P
O Q
r sin θ
2.5 Rotation
a ′ ≃ a + δθz ez × a, (2.38)
where use has been made of the small angle approximations sin θ ≃ θ and
cos θ ≃ 1. The above equation can easily be generalized to allow small
rotations about the x- and y-axes by δθx and δθy, respectively. We find that
a ′ ≃ a + δθ × a, (2.39)
Vector Algebra and Vector Calculus 23
x
y
z -axis x-axis
x-axis z -axis
where
δθ = δθx ex + δθy ey + δθz ez. (2.40)
Clearly, we can define a rotation vector δθ, but it only works for small angle
rotations (i.e., sufficiently small that the small angle approximations of sine
and cosine are good). According to the above equation, a small z-rotation
plus a small x-rotation is (approximately) equal to the two rotations applied
in the opposite order. The fact that infinitesimal rotation is a vector implies
that angular velocity,
δθ
ω = lim , (2.41)
δt→0 δt
da
= ω × a. (2.42)
dt
24 NEWTONIAN DYNAMICS
c
b
a · b × c = −b · a × c. (2.46)
The scalar triple product is zero if any two of a, b, and c are parallel, or if a,
b, and c are coplanar.
If a, b, and c are non-coplanar, then any vector r can be written in terms
of them:
r = α a + β b + γ c. (2.47)
Vector Algebra and Vector Calculus 25
r · b × c = α a · b × c, (2.48)
so
r·b×c
α= . (2.49)
a·b×c
Analogous expressions can be written for β and γ. The parameters α, β,
and γ are uniquely determined provided a · b × c 6= 0: i.e., provided that the
three basis vectors are non-coplanar.
For three vectors a, b, and c, the vector triple product is defined a × (b × c).
The brackets are important because a × (b × c) 6= (a × b) × c. In fact, it can
be demonstrated that
a × (b × c) ≡ (a · c) b − (a · b) c (2.50)
and
(a × b) × c ≡ (a · c) b − (b · c) a. (2.51)
Let us try to prove the first of the above theorems. The left-hand side and
the right-hand side are both proper vectors, so if we can prove this result
in one particular coordinate system then it must be true in general. Let us
take convenient axes such that the x-axis lies along b, and c lies in the x-y
plane. It follows that b = (bx, 0, 0), c = (cx, cy, 0), and a = (ax, ay, az).
The vector b × c is directed along the z-axis: b × c = (0, 0, bx cy). It follows
that a × (b × c) lies in the x-y plane: a × (b × c) = (ay bx cy, −ax bx cy, 0).
This is the left-hand side of Equation (2.50) in our convenient coordinate
system. To evaluate the right-hand side, we need a · c = ax cx + ay cy and
a · b = ax bx. It follows that the right-hand side is
Suppose that vector a varies with time, so that a = a(t). The time derivative
of the vector is defined
da a(t + δt) − a(t)
= lim . (2.53)
dt δt→0 δt
When written out in component form this becomes
da dax day daz
= , , . (2.54)
dt dt dt dt
Suppose that a is, in fact, the product of a scalar φ(t) and another vector
b(t). What now is the time derivative of a? We have
dax d dφ dbx
= (φ bx) = bx + φ , (2.55)
dt dt dt dt
which implies that
da dφ db
= b+φ . (2.56)
dt dt dt
Moreover, it is easily demonstrated that
d da db
(a · b) = ·b+a· , (2.57)
dt dt dt
and
d da db
(a × b) = ×b+a× . (2.58)
dt dt dt
Hence, it can be seen that the laws of vector differentiation are analogous
to those in conventional calculus.
Q
y f
x P Q l
.
Note that the integral depends on the route taken between the initial and
final points.
The most common type of line integral is that in which the contributions
from dx and dy are evaluated separately, rather that through the path length
dl: Z Q
[f(x, y) dx + g(x, y) dy] . (2.62)
P
y
Q = (1, 1)
1 2
2
P = (0, 0) x
along the two routes indicated in Figure 2.12. Along route 1 we have x =
y + 1 and dx = dy, so
ZQh i Z1 h
3
i 17
y dx + x dy = y dy + (y + 1)3 dy = . (2.64)
P 0 4
Along route 2,
ZQh Z1 Z2
3
i
3
7
y dx + x dy = x dy + y dx = . (2.65)
P 0
x=1 1
y=1
4
y
2 Q = (2, 1)
2 1
P = (1, 0) x
for some function F. Given F(P) for one point P in the x-y plane, then
ZQ
F(Q) = F(P) + (f dx + g dy) (2.67)
P
defines F(Q) for all other points in the plane. We can then draw a contour
map of F(x, y). The line integral between points P and Q is simply the
change in height in the contour map between these two points:
ZQ ZQ
(f dx + g dy) = dF(x, y) = F(Q) − F(P). (2.68)
P P
Thus,
dF(x, y) = f(x, y) dx + g(x, y) dy. (2.69)
For instance, if F = x3 y then dF = 3 x2 y dx + x3 dy and
ZQ h iQ
3 x2 y dx + x3 dy = x3 y (2.70)
P P
A vector field is defined as a set of vectors associated with each point in space.
For instance, the velocity v(r) in a moving liquid (e.g., a whirlpool) consti-
tutes a vector field. By analogy, a scalar field is a set of scalars associated
with each point in space. An example of a scalar field is the temperature
distribution T (r) in a furnace.
Consider a general vector field A(r). Let dr = (dx, dy, dz) be the vector
element of line length. Vector line integrals often arise as
ZQ ZQ
A · dr = (Ax dx + Ay dy + Az dz). (2.71)
P P
For instance, if A is a force-field then the line integral is the work done in
going from P to Q.
30 NEWTONIAN DYNAMICS
Q P
x
a 1 ∞
The second route is, firstly, around a large circle (r = constant) to the point
(a, ∞, 0), and then parallel to the y-axis—see Figure 2.13. In the first part,
no work is done, since F is perpendicular to dr. In the second part,
Z0 " #0
−y dy 1 1
W= 3/2
= = . (2.73)
∞
2 2
(a + y ) (y + a2)1/2
2
∞
a
In this case, the integral is independent of the path. However, not all vector
line integrals are path independent.
The bottom integral is simply the volume of the hemisphere, which is 2π a3/3.
The top integral is most easily evaluated in spherical polar coordinates (r,
θ, φ), for which x = r sin θ cos φ, y = r sin θ sin φ, z = r cos θ, and
dV = r2 sin θ dr dθ dφ. Thus,
ZZZ Za Z π/2 Z 2π
z dV = dr dθ dφ r cos θ r2 sin θ
0 0 0
Za Z π/2 Z 2π
π a4
= r3 dr sin θ cos θ dθ dφ = , (2.76)
0 0 0 4
giving
π a4 3 3a
z= 3
= . (2.77)
4 2π a 8
2.12 Gradient
y contours of h(x, y)
high
θ
direction of steepest ascent
P
low
x
∂h ∂h
grad h = , . (2.79)
∂x ∂y
∂h ∂h
α= , β= , (2.81)
∂x ∂y
∂h ∂h
dh = dx + dy. (2.82)
∂x ∂y
∂T ∂T ∂T
grad T = , , . (2.84)
∂x ∂y ∂z
∂T ∂T ∂T
dT = dx + dy + dz. (2.85)
∂x ∂y ∂z
dT = grad T · dr = 0. (2.87)
This
RQ integral is clearly independent of the path taken between P and Q, so
P grad T · dr must be path independent.
RQ
Consider a vector field A(r). In general, P A · dr depends on path, but
for some special vector fields the integral is path-independent. Such fields
are called conservative fields. It can be shown that if A is a conservative field
34 NEWTONIAN DYNAMICS
T = constant grad T
dr
isotherms
then A = grad V for some scalar field V. The proof of this is straightforward.
Keeping P fixed, we have
ZQ
A · dr = V(Q), (2.89)
P
Hence,
∂V
= Ax, (2.91)
∂x
with analogous relations for the other components of A. It follows that
A = grad V. (2.92)
the conservation of energy (this is why conservative fields are called “con-
servative”). A good example of a non-conservative field is the force due to
friction. HClearly, a frictional system loses energy in going around a closed
cycle, so A · dr 6= 0.
It is useful to define the vector operator
∂ ∂ ∂
∇≡ , , , (2.94)
∂x ∂y ∂z
which is usually called the grad or del operator. This operator acts on every-
thing to its right in a expression, until the end of the expression or a closing
bracket is reached. For instance,
∂f ∂f ∂f
grad f = ∇f = , , . (2.95)
∂x ∂y ∂z
For two scalar fields φ and ψ,
grad (φ ψ) = φ grad ψ + ψ grad φ (2.96)
can be written more succinctly as
∇(φ ψ) = φ ∇ψ + ψ ∇φ. (2.97)
Suppose that we rotate the basis about the z-axis by θ degrees. By anal-
ogy with Equations (2.8)–(2.10), the old coordinates (x, y, z) are related
to the new ones (x ′ , y ′ , z ′ ) via
x = x ′ cos θ − y ′ sin θ, (2.98)
′ ′
y = x sin θ + y cos θ, (2.99)
z = z ′. (2.100)
Now,
∂ ∂x ∂ ∂y ∂ ∂z ∂
= + + , (2.101)
∂x ′ ∂x ′ y′ ,z′ ∂x ∂x ′ y′ ,z′ ∂y ∂x ′ y′ ,z′ ∂z
giving
∂ ∂ ∂
′
= cos θ + sin θ , (2.102)
∂x ∂x ∂y
and
∇x′ = cos θ ∇x + sin θ ∇y. (2.103)
It can be seen, from Equations (2.11)–(2.13), that the differential operator
∇ transforms in an analogous manner to a vector. This is another proof that
∇f is a good vector.
36 NEWTONIAN DYNAMICS
2.13 Exercises
1.2. Demonstrate using vectors that the diagonals of a parallelogram bisect one
another. In addition, show that if the diagonals of a quadrilateral bisect one
another then it is a parallelogram.
|a + b| ≤ |a| + |b|.
(a) |r| = a,
(b) r · n = b,
(c) r · n = c |r|,
(d) |r − (r · n) n| = d.
α a + β b + γ c = 0,
where α, β, and γ are not all zero. Show that the condition for the points
with position vectors u a, v b, and w c to be colinear is
α β γ
+ + = 0.
u v w
1.7. The vectors a, b, and c are non-coplanar, and form a non-orthogonal vector
base. The vectors A, B, and C, defined by
b×c
A= ,
a·b×c
plus cyclic permutations, are said to be reciprocal vectors. Show that
a = (B × C)/(A · B × C),
h = α A + β B + γ C.
In addition, show that the perpendicular distance of the plane from the origin
is |h|−1 .
1.9. Find the gradients of the following scalar functions of the position vector
r = (x, y, z):
(a) k · r,
(b) |r|n ,
(c) |r − k|−n ,
(d) cos(k · r).
Here, k is a fixed vector.
38 NEWTONIAN DYNAMICS
Fundamental Concepts 39
3 Fundamental Concepts
3.1 Introduction
Newton’s first law of motion essentially states that a point object subject
to zero net external force moves in a straight-line with a constant speed
(i.e., it does not accelerate). However, this is only true in special frames of
reference called inertial frames. Indeed, we can think of Newton’s first law
as the definition of an inertial frame: i.e., an inertial frame of reference is
one in which a point object subject to zero net external force moves in a
straight-line with constant speed.
Suppose that we have found an inertial frame of reference. Let us set up
a Cartesian coordinate system in this frame. The motion of a point object
can now be specified by giving its position vector, r = (x, y, z), with respect
to the origin of the coordinate system, as a function of time, t. Consider
a second frame of reference moving with some constant velocity u with re-
spect to the first frame. Without loss of generality, we can suppose that the
Cartesian axes in the second frame are parallel to the corresponding axes in
the first frame, and that u = (u, 0, 0), and, finally, that the origins of the two
frames instantaneously coincide at t = 0—see Figure 3.1. Suppose that the
position vector of our point object is r ′ = (x ′ , y ′ , z ′ ) in the second frame
of reference. It is evident, from Figure 3.1, that at any given time, t, the
coordinates of the object in the two reference frames satisfy
x ′ = x − u t, (3.1)
y ′ = y, (3.2)
z ′ = z. (3.3)
42 NEWTONIAN DYNAMICS
y y′
ut
x x′
origin
vx′ = vx − u, (3.4)
vy′ = vy, (3.5)
vz′ = vz. (3.6)
v ′ = v − u. (3.7)
a ′ = a. (3.11)
is the center of the Sun and whose coordinate axes are fixed with respect to
distant stars is approximately inertial. In this case, any deviations of the or-
bits from the predictions of Newtonian Dynamics due to the orbital motion
of the Sun about the Galactic Center are far too small to be measurable. It
should be noted that it is impossible to identify an absolute inertial frame—
the best approximation to such a frame would be one in which the Cosmic
Microwave Background appears to be isotropic. However, for a given dy-
namical problem, it is always possible to identify an approximate inertial
frame. Furthermore, any deviations of such a frame from a true inertial
frame can be incorporated into the framework of Newtonian Dynamics via
the introduction of so-called fictitious forces—see Chapter 8.
dp
= f, (3.13)
dt
where the momentum, p, is the product of the object’s inertial mass, m,
and its velocity, v. If m is not a function of time then the above expression
reduces to the familiar equation
dv
m = f. (3.14)
dt
Note that this equation is only valid in a inertial frame. Clearly, the inertial
mass of an object measures its reluctance to deviate from its preferred state
of uniform motion in a straight-line (in an inertial frame). Of course, the
above equation of motion can only be solved if we have an independent
expression for the force, f (i.e., a law of force). Let us suppose that this is
the case.
An important corollary of Newton’s second law is that force is a vector
quantity. This must be the case, since the law equates force to the product
of a scalar (mass) and a vector (acceleration). Note that acceleration is
obviously a vector because it is directly related to displacement, which is
the prototype of all vectors—see Chapter 2. One consequence of force being
a vector is that two forces, f1 and f2, both acting at a given point, have the
same effect as a single force, f = f1 + f2, acting at the same point, where
the summation is performed according to the laws of vector addition—see
Fundamental Concepts 45
Chapter 2. Likewise, a single force, f, acting at a given point, has the same
effect as two forces, f1 and f2, acting at the same point, provided that f1 +
f2 = f. This method of combining and splitting forces is known as the
resolution of forces, and lies at the heart of many calculations in Newtonian
Dynamics.
Taking the scalar product of Equation (3.14) with the velocity, v, we
obtain
dv m d(v · v) m dv2
m v· = = = f · v. (3.15)
dt 2 dt 2 dt
This can be written
dK
= f · v. (3.16)
dt
where
1
K = m v2. (3.17)
2
The right-hand side of Equation (3.16) represents the rate at which the force
does work on the object: i.e., the rate at which the force transfers energy
to the object. The quantity K represents the energy the object possesses
by virtue of its motion. This type of energy is generally known as kinetic
energy. Thus, Equation (3.16) states that any work done on point object by
an external force goes to increase the object’s kinetic energy.
Suppose that, under the action of the force, f, our object moves from
point P at time t1 to point Q at time t2. The net change in the object’s
kinetic energy is obtained by integrating Equation (3.16):
Z t2 ZQ
∆K = f · v dt = f · dr, (3.18)
t1 P
f = −∇U, (3.19)
46 NEWTONIAN DYNAMICS
E = K + U = constant. (3.22)
Consider a system of N mutually interacting point objects. Let the ith object,
whose mass is mi, be located at vector displacement ri. Suppose that this
object exerts a force fji on the jth object. Likewise, suppose that the jth
object exerts a force fij on the ith object. Newton’s third law of motion
essentially states that these two forces are equal and opposite, irrespective
of their nature. In other words,
One corollary of Newton’s third law is that an object cannot exert a force on
itself.
It should be noted that Newton’s third law implies action at a distance.
In other words, if the force that object i exerts on object j suddenly changes
then Newton’s third law demands that there must be an immediate change
Fundamental Concepts 47
in the force that object j exerts on object i. Moreover, this must be true
irrespective of the distance between the two objects. However, we now
known that Einstein’s Theory of Relativity forbids information from travel-
ing through the Universe faster than the velocity of light in vacuum. Hence,
action at a distance is also forbidden. In other words, if the force that ob-
ject i exerts on object j suddenly changes then there must be a time delay,
which is at least as long as it takes a light ray to propagate between the
two objects, before the force that object j exerts on object i can respond. Of
course, this means that Newton’s third law is not, strictly speaking, correct.
However, as long as we restrict our investigations to the motions of dynami-
cal systems on time-scales which are long compared to the time required for
light-rays to traverse these systems, Newton’s third law can be regarded as
being approximately correct.
In an inertial frame, Newton’s second law of motion applied to the ith
object yields
j6=i
d2ri X
mi 2 = fij. (3.24)
dt
j=1,N
Note that the summation on the right-hand side of the above equation ex-
cludes the case j = i, since the ith object cannot exert a force on itself. Let
us now take the above equation and sum it over all objects. We obtain
j6=i
X d2ri X
mi 2 = fij. (3.25)
dt
i=1,N i,j=1,N
Consider the sum over forces on the right-hand side of the above equation.
Each element of this sum—fij, say—can be paired with another element—
fji, in this case—which is equal and opposite, according to Newton’s third
law. In other words, the elements of the sum all cancel out in pairs. Thus,
the net value of the sum is zero. It follows that the above equation can be
written
d2rcm
M = 0, (3.26)
dt2
P
where M = N i=1 mi is the total mass. The quantity rcm is the vector dis-
placement of the center of mass of the system, which is an imaginary point
whose coordinates are the mass weighted averages of the coordinates of the
objects which constitute the system. Thus,
PN
mi ri
rcm = Pi=1 N
. (3.27)
i=1 mi
48 NEWTONIAN DYNAMICS
is a constant of the motion. However, the momentum of the ith object takes
the form pi = mi dri/dt. Hence, the total momentum of the system is
written
XN
dri
P= mi . (3.29)
dt
i=1
where
dri
li = mi ri × (3.32)
dt
is the angular momentum of the ith particle about the origin of our coordi-
nate system. The total angular momentum of the system (about the origin)
takes the form X
L= li (3.33)
i=1,N
Fundamental Concepts 49
i6=j
X
dL
= ri × fij. (3.34)
dt
i,j=1,N
Consider the sum on the right-hand side of the above equation. A gen-
eral term, ri × fij, in this sum can always be paired with a matching term,
rj × fji, in which the indices have been swapped. Making use of Equa-
tion (3.23), the sum of a general matched pair can be written
Let us assume that the forces acting between the various components of the
system are central in nature, so that fij is parallel to ri − rj. In other words,
the force exerted on object j by object i either points directly toward, or
directly away from, object i, and vice versa. This is a reasonable assumption,
since most of the forces which we encounter in the world around us are of
this type (e.g., gravity). It follows that if the forces are central in nature then
the vector product on the right-hand side of the above expression is zero.
We conclude that
ri × fij + rj × fji = 0, (3.36)
for all values of i and j. Thus, the sum on the right-hand side of Equa-
tion (3.34) is zero for any kind of central force. We are left with
dL
= 0. (3.37)
dt
3.7 Exercises
1. Consider a system of N mutually interacting point objects. Let the ith object
have mass mi and position vector ri . Suppose that the jth object exerts a
central force fij on the ith. In addition, let the ith object be subject to an
external force Fi . Here, i and j take all possible values. Find expressions
for the rate of change of the total linear momentum P and the total angular
momentum L of the system.
50 NEWTONIAN DYNAMICS
2. Consider an isolated system of N point objects interacting via gravity. Let the
mass and position vector of the ith object be mi and ri , respectively. What
is the vector equation of motion of the ith object? Write expressions for the
total kinetic energy, K, and potential energy, U, of the system. Demonstrate
from the equations of motion that K + U is a conserved quantity.
3. Consider a function of many variables f(x1 , x2 , · · · , xn ). Such a function
which satisfies
f(t x1 , t x2 , · · · , t xn ) = ta f(x1 , x2 , · · · , xn )
for all t > 0, and all values of the xi , is termed a homogenous function of
degree a. Prove the following theorem regarding homogeneous functions:
X ∂f
xi = af
∂xi
i=1,n
1 d2 I
= 2 K + (n − 1) U,
2 dt2
P
where I = i=1,N mi ri2 , and K is the kinetic energy. This result is known
as the Virial theorem. Demonstrate that there are no bound steady-states for
the system when n ≥ 3.
5. Consider a system of N point particles. Let the ith particle have mass mi ,
electric charge qi , and position vector ri . Suppose that the charge to mass
ratio, qi /mi , is the same for all particles. The system is placed in a uniform
magnetic field B. Write the equation of motion of the ith particle. You may
neglect any magnetic fields generated by motion of the particles. Demon-
strate that the total momentum P of the system precesses about B at the
frequency Ω = qi B/mi . Demonstrate that Lk + Ω Ik /2 is a constant of the
motion. Here, Lk is the total angular momentum of the system parallel to the
magnetic field, and Ik is the moment of inertia of the system about an axis
parallel to B which passes through the origin.
One-Dimensional Motion 51
4 One-Dimensional Motion
4.1 Introduction
dU(x)
f(x) = − , (4.1)
dx
where U(x) is the potential energy of the particle at position x.
Let the curve U(x) take the form shown in Figure 4.1. For instance, this
curve might represent the gravitational potential energy of a cyclist free-
wheeling in a hilly region. Observe that we have set the potential energy
at infinity to zero (which we are generally free to do, since potential en-
ergy is undefined to an arbitrary additive constant). This is a fairly common
convention. What can we deduce about the motion of the particle in this
potential?
Well, we know that the total energy, E—which is the sum of the kinetic
energy, K, and the potential energy, U—is a constant of the motion—see
Equation (3.22). Hence, we can write
However, we also know that a kinetic energy can never be negative [since
K = (1/2) m v2, and neither m nor v2 can be negative]. Hence, the above
expression tells us that the particle’s motion is restricted to the region (or
regions) in which the potential energy curve U(x) falls below the value E.
This idea is illustrated in Figure 4.1. Suppose that the total energy of the
system is E0. It is clear, from the figure, that the particle is trapped inside one
or other of the two dips in the potential—these dips are generally referred
to as potential wells. Suppose that we now raise the energy to E1. In this
52 NEWTONIAN DYNAMICS
U→
E2
x→
0
E1
E0
x0 x1 x2
case, the particle is free to enter or leave each of the potential wells, but its
motion is still bounded to some extent, since it clearly cannot move off to
infinity. Finally, let us raise the energy to E2. Now the particle is unbounded:
i.e., it can move off to infinity. In conservative systems in which it makes
sense to adopt the convention that the potential energy at infinity is zero,
bounded systems are characterized by E < 0, whereas unbounded systems
are characterized by E > 0.
The above discussion suggests that the motion of an particle moving in
a potential generally becomes less bounded as the total energy E of the sys-
tem increases. Conversely, we would expect the motion to become more
bounded as E decreases. In fact, if the energy becomes sufficiently small
then it appears likely that the system will settle down in some equilibrium
state in which the particle is stationary. Let us try to identify any prospec-
tive equilibrium states in Figure 4.1. If the particle remains stationary then
it must be subject to zero force (otherwise it would accelerate). Hence,
according to Equation (4.1), an equilibrium state is characterized by
dU
= 0. (4.3)
dx
In other words, a equilibrium state corresponds to either a maximum or a
minimum of the potential energy curve U(x). It can be seen that the U(x)
curve shown in Figure 4.1 has three associated equilibrium states located at
x = x0, x = x1, and x = x2.
Let us now make a distinction between stable equilibrium points and un-
stable equilibrium points. When the particle is slightly displaced from a sta-
One-Dimensional Motion 53
ble equilibrium point then the resultant force f acting on it must always be
such as to return it to this point. In other words, if x = x0 is an equilibrium
point then we require
df
<0 (4.4)
dx x0
for stability: i.e., if the particle is displaced to the right, so that x − x0 > 0,
then the force must act to the left, so that f < 0, and vice versa. Likewise, if
df
>0 (4.5)
dx x0
d2U
> 0. (4.6)
dx2
In other words, a stable equilibrium point corresponds to a minimum of
the potential energy curve U(x). Likewise, an unstable equilibrium point
corresponds to a maximum of the U(x) curve. Hence, we conclude that, in
Figure 4.1, x = x0 and x = x2 are stable equilibrium points, whereas x = x1
is an unstable equilibrium point. Of course, this makes perfect sense if we
think of U(x) as a gravitational potential energy curve, so that U is directly
proportional to height. In this case, all we are saying is that it is easy to
confine a low energy mass at the bottom of a valley, but very difficult to
balance the same mass on the top of a hill (since any slight displacement of
the mass will cause it to fall down the hill). Note, finally, that if
dU d2U
= =0 (4.7)
dx dx2
at any point (or in any region) then we have what is known as a neutral
equilibrium point. We can move the particle slightly away from such a point
and it will still remain in equilibrium (i.e., it will neither attempt to return
to its initial state, nor will it continue to move). A neutral equilibrium point
corresponds to a flat spot in a U(x) curve. See Figure 4.2.
The equation of motion of an particle moving in one dimension under
the action of a conservative force is, in principle, integrable. Since K =
(1/2) m v2, the energy conservation equation (4.2) can be rearranged to
give
2 [E − U(x)] 1/2
v=± , (4.8)
m
54 NEWTONIAN DYNAMICS
U (x) →
x→
where the ± signs correspond to motion to the left and to the right, re-
spectively. However, since v = dx/dt, this expression can be integrated to
give
Z
m 1/2 x dx ′
t=± p , (4.9)
2E x0 1 − U(x ′ )/E
where x(t = 0) = x0. For sufficiently simple potential functions, U(x), the
above equation can be solved to give x as a function of t. For instance, if
U = (1/2) k x2, x0 = 0, and the plus sign is chosen, then
1/2 Z (k/2E)1/2 x 1/2 1/2 !
m dy m k
−1
t= p = sin x , (4.10)
k 0 1 − y2 k 2E
dv
m = f(v). (4.12)
dt
Integrating this equation, we obtain
Zv
dv ′ t
′
= , (4.13)
v0 f(v ) m
where v(t = 0) = v0. In principle, the above equation can be solved to give
v(t). The equation of motion is also written
dv
mv = f(v), (4.14)
dx
since v = dx/dt. Integrating this equation, we obtain
Zv ′ ′
v dv x − x0
′
= , (4.15)
v0 f(v ) m
where x(t = 0) = x0. In principle, the above equation can be solved to give
v(x).
Let us now consider a specific example. Suppose that an object of mass
m falls vertically under gravity. Let x be the height through which the ob-
ject has fallen since t = 0, at which time the object is assumed to be at
rest. It follows that x0 = v0 = 0. Suppose that, in addition to the force of
gravity, m g, where g is the gravitational acceleration, our object is subject
to a retarding air resistance force which is proportional to the square of its
instantaneous velocity. The object’s equation of motion is thus
dv
m = m g − c v2, (4.16)
dt
where c > 0. This equation can be integrated to give
Zv
dv ′
′ 2
= g t, (4.17)
0 1 − (v /vt)
The left-hand side of the above equation is now a standard integral, which
can be solved to give
v gt
tanh−1 = , (4.19)
vt vt
or
gt
v = vt tanh . (4.20)
vt
Thus, when t ≪ vt/g, we obtain the standard result v ≃ g t, since tanh x ≃ x
for x ≪ 1. However, when t ≫ vt/g, we get v ≃ vt, since tanh x ≃ 1 for
x ≫ 1. It follows that air resistance prevents the downward velocity of
our object from increasing indefinitely as it falls. Instead, at large times,
the velocity asymptotically approaches the so-called terminal velocity, vt (at
which the gravitational and air resistance forces balance).
The equation of motion of our falling object is also written
dv
mv = m g − c v2. (4.21)
dx
This equation can be integrated to give
Zv
v ′ dv ′
′ 2
= g x. (4.22)
0 1 − (v /vt)
The above expression tells us that large, dense, streamlined projectiles (e.g.,
medicine balls) tend to have large terminal velocities, and small, rarefied,
non-streamlined projectiles (e.g., feathers) tend to have small terminal ve-
locities. Hence, the former type of projectile is relatively less affected by air
resistance than the latter.
f(0) = 0, (4.28)
and
df(0)
< 0. (4.29)
dx
58 NEWTONIAN DYNAMICS
d2x
m = f(x). (4.30)
dt2
Let us assume that it always stays fairly close to its equilibrium point. In this
case, to a good approximation, we can represent f(x) via a truncated Taylor
expansion about this point. In other words,
df(0)
f(x) ≃ f(0) + x + O(x2). (4.31)
dx
However, according to (4.28) and (4.29), the above expression can be writ-
ten
f(x) ≃ −m ω02 x, (4.32)
where df(0)/dx = −m ω02. Hence, we conclude that our particle satisfies
the following approximate equation of motion,
d2x
+ ω02 x ≃ 0, (4.33)
dt2
provided that it does not stray too far from its equilibrium point: i.e., pro-
vided |x| does not become too large.
Equation (4.33) is called the simple harmonic equation, and governs the
motion of all one-dimensional conservative systems which are slightly per-
turbed from some stable equilibrium state. The solution of Equation (4.33)
is well-known:
x(t) = a sin(ω0 t − φ0). (4.34)
The pattern of motion described by above expression, which is called simple
harmonic motion, is periodic in time, with repetition period T0 = 2π/ω0, and
oscillates between x = ±a. Here, a is called the amplitude of the motion.
The parameter φ0, known as the phase angle, simply shifts the pattern of
motion backward and forward in time. Figure 4.3 shows some examples of
simple harmonic motion. Here, φ0 = 0, +π/4, and −π/4 correspond to the
solid, short-dashed, and long-dashed curves, respectively.
Note that the frequency, ω0—and, hence, the period, T0—of simple har-
monic motion is determined by the parameters appearing in the simple har-
monic equation, (4.33). However, the amplitude, a, and the phase angle,
φ0, are the two integration constants of this second-order ordinary differen-
tial equation, and are, thus, determined by the initial conditions: i.e., by the
particle’s initial displacement and velocity.
One-Dimensional Motion 59
Now, from Equations (4.1) and (4.32), the potential energy of our parti-
cle at position x is approximately
1
U(x) ≃ m ω02 x2. (4.35)
2
Hence, the total energy is written
2
1 dx 1
E=K+U= m + m ω02 x2, (4.36)
2 dt 2
giving
1 1 1
E= m ω02 a2 cos2(ω0 t − φ0) + m ω02 a2 sin2(ω0 t − φ0) = m ω02 a2,
2 2 2
(4.37)
where use has been made of Equation (4.34), and the trigonometric identity
cos2 θ + sin2 θ ≡ 1. Note that the total energy is constant in time, as is to
be expected for a conservative system, and is proportional to the amplitude
squared of the motion.
oscillates about this point with a fixed frequency and a constant amplitude.
In other words, the oscillations never die away. This is not very realistic,
since we know that, in practice, if we slightly perturb a dynamical system
(such as a pendulum) from a stable equilibrium point then it will indeed
oscillate about this point, but these oscillations will eventually die away
due to frictional effects, which are present in virtually all real dynamical
systems. In order to model this process, we need to include some sort of
frictional drag force in our perturbed equation of motion, (4.33).
The most common model for a frictional drag force is one which is al-
ways directed in the opposite direction to the instantaneous velocity of the
object upon which it acts, and is directly proportional to the magnitude of
this velocity. Let us adopt this model. So, our drag force can be written
dx
fdrag = −2 m ν , (4.38)
dt
where ν is a positive constant with the dimensions of frequency. Including
such a force in our perturbed equation of motion, (4.33), we obtain
d2x dx
+ 2ν + ω02 x = 0. (4.39)
dt2 dt
Thus, the positive constant ν parameterizes the strength of the frictional
damping in our dynamical system.
Equation (4.39) is a linear second-order ordinary differential equation,
which we suspect possesses oscillatory solutions. There is a standard trick
for solving such an equation. We search for complex solutions of the form
where the constants ω and a are both, in general, complex. Of course, the
physical solution is the real part of the above expression: i.e.,
q
where ν± = ν ± ν2 − ω02. It can be seen that the solution again decays
without oscillating, except there are now two independent decay rates. The
largest, ν+, is always greater than the critically damped decay rate, ω0,
whereas the smaller, ν−, is always less than this decay rate. This means
that, in general, the critically damped solution is more rapidly damped than
either the underdamped or overdamped solutions.
Figure 4.4 shows typical examples of underdamped (i.e., ν = ω0/4),
critically damped (i.e., ν = ω0), and overdamped (i.e., ν = 4 ω0) solutions,
calculated with the initial conditions x0 = 1 and v0 = 0. Here, T0 = 2π/ω0.
The three solutions correspond to the solid, short-dashed, and long-dashed
curves, respectively.
The total energy of a damped oscillator is the sum of its kinetic and potential
energies: i.e.,
2
1 dx 1
E= m + m ω02 x2. (4.50)
2 dt 2
Differentiating the above expression with respect to time, we obtain
!
dE dx d2x dx dx d2x
=m 2
+ m ω02 x =m + ω02 x . (4.51)
dt dt dt dt dt dt2
One-Dimensional Motion 63
dx
= −x0 ν e−νt cos(ωr t − φ0) − x0 ωr e−νt sin(ωr t − φ0). (4.54)
dt
Thus, making use of Equation (4.52), the energy lost during a single oscil-
lation period is
Z Tr
dE
∆E = − dt (4.55)
0 dt
Z Tr
= 2 m ν x0 2
e−2νt [ν cos(ωr t − φ0) + ωr sin(ωr t − φ0)]2 dt,
0
Thus,
2π m ν x02 2 ν
∆E ≃ (ν + ωr2) = 2π m ω02 x02 , (4.57)
ωr ωr
64 NEWTONIAN DYNAMICS
since cos2 θ and sin2 θ both have the average values 1/2 in the interval 0
to 2π, whereas cos θ sin θ has the average value 0. According to Equa-
tion (4.37), the energy stored in the oscillator (at t = 0) is
1
E= m ω02 x02. (4.58)
2
It follows that
E ωr ω0
Q = 2π = ≃ . (4.59)
∆E 2ν 2ν
4.7 Resonance
d2x dx
2
+ 2ν + ω02 x = ω02 X1 cos(ω t). (4.61)
dt dt
Let us search for a solution of the form (4.40), and represent the right-hand
side of the above equation as ω02 X1 exp(−i ω t). It is again understood that
the physical solutions are the real parts of these expressions. Note that ω is
now a real parameter. We obtain
h i
a −ω2 − i 2 ν ω + ω02 e−i ωt = ω02 X1 e−i ωt. (4.62)
Hence,
ω02 X1
a= . (4.63)
ω02 − ω2 − i 2 ν ω
One-Dimensional Motion 65
a = x1 e i φ1 , (4.64)
where x1 and φ1 are both real. It follows from Equations (4.40), (4.63),
and (4.64) that the physical solution takes the form
where
ω02 X1
x1 = 1/2 , (4.66)
(ω02 − ω2)2 + 4 ν2 ω2
and !
−1 2νω
φ1 = tan . (4.67)
ω02 − ω2
We conclude that, in response to the applied sinusoidal force, (4.60), the
system executes a sinusoidal pattern of motion at the same frequency, with
fixed amplitude x1, and phase-lag φ1 (with respect to the external force).
Let us investigate the variation of x1 and φ1 with the forcing frequency,
ω. This is most easily done graphically. Figure 4.5 shows x1 and φ1 as func-
tions of ω for various values of ν/ω0. Here, ν/ω0 = 1, 1/2, 1/4, 1/8, and
1/16 correspond to the solid, dotted, short-dashed, long-dashed, and dot-
dashed curves, respectively. It can be seen that as the amount of damping
in the system is decreased, the amplitude of the response becomes progres-
sively more peaked at the natural frequency of oscillation of the system, ω0.
66 NEWTONIAN DYNAMICS
where
X(t + T ) = X(t) (4.72)
for all t.
It is convenient to represent X(t) as a Fourier series in time, so that
∞
X
X(t) = Xn cos(n ω t), (4.73)
n=0
We can incorporate the periodic external force (4.71) into our perturbed
equation of motion by writing
∞
d2x dx X
+ 2ν 2 2
+ ω0 x = ω0 Xn e−i nωt, (4.78)
dt2 dt
n=0
where we are again using the convention that the physical solution corre-
sponds to the real part of the complex solution. Note that the above dif-
ferential equation is linear. This means that if xa(t) and xb(t) represent
two independent solutions to this equation then any linear combination of
xa(t) and xb(t) is also a solution. We can exploit the linearity of the above
equation to write the solution in the form
∞
X
x(t) = Xn an e−i nωt, (4.79)
n=0
d2x dx
2
+ 2ν + ω02 x = ω02 e−i nωt. (4.80)
dt dt
In other words, an is obtained by substituting x = an exp(−i n ω t) into the
above equation. Hence, it follows that
ω02
an = . (4.81)
ω02 − n2 ω 2 − i 2 ν n ω
where
an = xn e i φn , (4.83)
and xn and φn are real parameters. It follows from Equation (4.81) that
ω02
xn = 1/2 , (4.84)
(ω02 − n2 ω2)2 + 4 ν2 n2 ω2
and !
−1 2νnω
φn = tan 2
. (4.85)
ω 0 − n2 ω 2
One-Dimensional Motion 69
X0 = 0.2 A, (4.86)
and
2 sin(n π/5) A
Xn = , (4.87)
nπ
for n > 0. Obviously, to obtain an exact solution, we would have to include
every Fourier harmonic in Equation (4.82), which would be impractical.
However, we can obtain a fairly accurate approximate solution by truncating
the Fourier series (i.e., by neglecting all the terms with n > N, where N ≫
1).
Figure 4.6 shows an example calculation in which the Fourier series is
truncated after 100 terms. The parameters used in this calculation are ω =
1.2 ω0 and ν = 0.8 ω0. The left panel shows the Fourier reconstruction
of the driving force, X(t). The glitches at the rising and falling edges of
the pulses are called Gibbs phenomena, and are an inevitable consequence
of attempting to represent a discontinuous periodic function as a Fourier
series. The right panel shows the Fourier reconstruction of the response,
x(t), of the dynamical system to the applied force.
70 NEWTONIAN DYNAMICS
4.9 Transients
d2x dx
+ 2ν + ω02 x = ω02 X1 cos(ω t). (4.88)
dt2 dt
We also found that the solution to this equation which oscillates in sympathy
with the applied force takes the form
where A and B are two arbitrary constants [they are in fact the integration
constants
q of the second-order ordinary differential equation (4.88)], and
ωr = ω02 − ν2. Thus, the most general solution to Equation (4.88) is
written
The first two terms on the right-hand side of the above equation are called
transients, since they decay in time. The transients are determined by the
initial conditions. However, if we wait long enough after setting the system
into motion then the transients will always decay away, leaving the time-
asymptotic solution (4.89), which is independent of the initial conditions.
As an example, suppose that we set the system into motion at time t =
0 with the initial conditions x(0) = dx(0)/dt = 0. Setting x(0) = 0 in
Equation (4.91), we obtain
Thus, we have now determined the constants A and B, and, hence, fully
specified the solution for t > 0. Figure 4.7 shows this solution (solid curve)
calculated for ω = 2 ω0 and ν = 0.2 ω0. Here, T0 = 2π/ω0. The associated
time-asymptotic solution (4.89) is also shown for the sake of comparison
(dashed curve). It can be seen that the full solution quickly converges to the
time-asymptotic solution.
d2θ
I = τ, (4.94)
dt2
72 NEWTONIAN DYNAMICS
fixed support
pivot point
l
θ
T
mg
where I is the moment of inertia of the mass, and τ is the torque acting
about the pivot point. For the case in hand, given that the mass is essentially
a point particle, and is situated a distance l from the axis of rotation (i.e.,
the pivot point), it is easily seen that I = m l2.
The two forces acting on the mass are the downward gravitational force,
m g, where g is the acceleration due to gravity, and the tension, T , in the
string. Note, however, that the tension makes no contribution to the torque,
since its line of action clearly passes through the pivot point. From sim-
ple trigonometry, the line of action of the gravitational force passes a dis-
tance l sin θ from the pivot point. Hence, the magnitude of the gravitational
torque is m g l sin θ. Moreover, the gravitational torque is a restoring torque:
i.e., if the mass is displaced slightly from its equilibrium state (i.e., θ = 0)
then the gravitational torque clearly acts to push the mass back toward that
state. Thus, we can write
τ = −m g l sin θ. (4.95)
Combining the previous two equations, we obtain the following angular
equation of motion of the pendulum:
d2θ
l + g sin θ = 0. (4.96)
dt2
Note that, unlike all of the other equations of motion which we have ex-
amined in this chapter, the above equation is nonlinear [since sin(a θ) 6=
a sin θ, except when θ ≪ 1].
One-Dimensional Motion 73
Let us assume, as usual, that the system does not stray very far from its
equilibrium state (θ = 0). If this is the case then we can make the small
angle approximation sin θ ≃ θ, and the above equation of motion simplifies
to
d2θ
+ ω02 θ ≃ 0, (4.97)
dt2
p
where ω0 = g/l. Of course, this is just the simple harmonic equation.
Hence, we can immediately write the solution as
We conclude that the pendulum swings back and forth at a fixed frequency,
ω0, which depends on l and g, but is independent of the amplitude, θ0, of
the motion.
Suppose, now, that we desire a more accurate solution of Equation (4.96).
One way in which we could achieve this would be to include more terms in
the small angle expansion of sin θ, which is
θ3 θ5
sin θ = θ − + + ···. (4.99)
3! 5!
For instance, keeping the first two terms in this expansion, Equation (4.96)
becomes
d2θ
+ ω02 (θ − θ 3/6) ≃ 0. (4.100)
dt2
By analogy with (4.98), let us try a trial solution of the form
Substituting this into Equation (4.100), and making use of the trigonometric
identity
cos3 u ≡ (3/4) cos u + (1/4) cos(3 u), (4.102)
we obtain
h i
ϑ0 ω02 − ω2 − (1/8) ω02 ϑ02 cos(ω t) − (1/24) ω02 ϑ03 cos(3 ω t) ≃ 0.
(4.103)
It is evident that the above equation cannot be satisfied for all values of t,
except in the trivial case ϑ0 = 0. However, the form of this expression does
suggest a better trial solution, namely
We can only satisfy the above equation at all values of t, and for non-zero
ϑ0, by setting the two expressions in square brackets to zero. This yields
q
ω ≃ ω0 1 − (1/8) ϑ02, (4.106)
and
ω02
α≃− . (4.107)
192
Now, the amplitude of the motion is given by
ω02 3
θ0 = ϑ0 + α ϑ03 = ϑ0 − ϑ . (4.108)
192 0
Hence, Equation (4.106) simplifies to
" #
θ2
ω = ω0 1 − 0 + O(θ04) . (4.109)
16
4.11 Exercises
F = −c vn ,
7. Show that the ratio of two successive maxima in the displacement of a damped
harmonic oscillator is constant.
8. If the amplitude of a damped harmonic oscillator decreases to 1/e of its initial
value after n periods show that the ratio of the period of oscillation to the
period of the same oscillator with no damping is
1/2
1 1
1+ ≃1+ .
4 π2 n2 8 π2 n2
76 NEWTONIAN DYNAMICS
d2 x dx
+ 2ν + ω02 x = F(t).
dt2 dt
Let x = 0 and dx/dt = v0 at t = 0.
(a) Find the solution for t > 0 when F = sin(ω t).
(b) Find the solution for t > 0 when F = sin2 (ω t).
10. Obtain the time asymptotic response of a damped linear oscillator of natural
frequency ω0 and damping coefficient ν to a square-wave periodic forcing
function of amplitude F0 = m ω02 X0 and frequency ω. Thus, F(t) = F0 for
−π/2 < ω t < π/2, 3π/2 < ω t < 5π/2, etc., and F(t) = −F0 for π/2 < ω t <
3π/2, 5π/2 < ω t < 7π/2, etc.
Multi-Dimensional Motion 77
5 Multi-Dimensional Motion
5.1 Introduction
which always points towards the origin, and whose magnitude increases
linearly with increasing distance from the origin. According to Newton’s
second law, the equation of motion of the particle is
d2r
m = f = −k r. (5.3)
dt2
d2x
= −ω02 x, (5.4)
dt2
d2y
= −ω02 y, (5.5)
dt2
p
where ω0 = k/m.
Since Equations (5.4) and (5.5) are both simple harmonic equations, we
can immediately write their general solutions:
Hence, we obtain
x = A cos(ω0 t ′ ), (5.9)
y = B cos(ω0 t ′ − ∆), (5.10)
where ∆ = φ2 − φ1. Note that the motion is clearly periodic in time, with
period T = 2 π/ω0. Thus, the particle must trace out some closed trajectory
in the x-y plane. The question, now, is what does this trajectory look like as
a function of the relative phase-shift, ∆, between the oscillations in the x-
and y-directions?
Using standard trigonometry, we can write Equation (5.10) in the form
x2 xy y2
− 2 cos ∆ + = sin2 ∆. (5.13)
A2 AB B2
Let us see whether Equation (5.13) takes a simpler form when expressed in
terms of our new coordinates. Equations (5.13)–(5.15) yield
"
cos2 θ 2 cos θ sin θ cos ∆ sin2 θ
#
′2
x − +
A2 AB B2
"
sin2 θ 2 cos θ sin θ cos ∆ cos2 θ
#
+y ′ 2 + + (5.16)
A2 AB B2
"
2 sin θ cos θ 2 (sin2 θ − cos2 θ) cos ∆ 2 cos θ sin θ
#
+x ′ y ′ − + + = sin2 ∆.
A2 AB B2
x′ 2 y′ 2
+ 2 = 1, (5.19)
a2 b
where
"
cos2 θ 2 cos θ sin θ cos ∆ sin2 θ
#
1 1
= − + , (5.20)
a2 sin2 ∆ A2 AB B2
"
sin2 θ 2 cos θ sin θ cos ∆ cos2 θ
#
1 1
= + + . (5.21)
b2 sin2 ∆ A2 AB B2
(a) (b)
(c) (d)
proportion to the square of the speed—see Section 4.3), but it does lead to
tractable equations of motion. Hence, by using this model we can, at least,
get some idea of how air resistance modifies projectile trajectories.
Let us adopt a Cartesian coordinate system whose origin coincides with
the launch point, and whose z-axis points vertically upward. Let the initial
velocity of the projectile lie in the x-z plane. Note that, since neither gravity
nor the drag force causes the projectile to move out of the x-z plane, we can
effectively ignore the y coordinate in this problem.
The equation of motion of our projectile is written
dv
m = m g − c v, (5.22)
dt
where v = (vx, vz) is the projectile velocity, g = (0, −g) the acceleration due
to gravity, and c a positive constant. In component form, the above equation
becomes
dvx vx
= −g , (5.23)
dt vt
dvz vz
= −g 1 + . (5.24)
dt vt
Here, vt = m g/c is the terminal velocity: i.e., the velocity at which the
drag force balances the gravitational force (for a projectile falling vertically
downward).
Integrating Equation (5.23), we obtain
Zv
dvx g
= − t, (5.25)
vx 0 vx vt
x = v0 cos θ t, (5.32)
which is the standard result in the absence of air drag. In the opposite limit,
t ≫ vt/g, we get
v0 vt cos θ
x= . (5.33)
g
The above expression clearly sets an effective upper limit on how far the
projectile can travel in the horizontal direction.
Integration of (5.30) gives
vt
z= (v0 sin θ + vt) 1 − e−gt/vt − vt t. (5.34)
g
It is clear, from the previous two equations, that the time of flight of the
projectile (i.e., the time at which z = 0, excluding the trivial result t = 0) is
2 v0 sin θ
tf = (5.37)
g
when t ≪ vt/g, which implies that v0 sin θ ≪ vt, and
v0 sin θ
tf = (5.38)
g
when t ≫ vt/g, which implies that v0 sin θ ≫ vt. It thus follows, from
Equations (5.32) and (5.33), that the horizontal range [i.e., x(tf)] of the
projectile is
v 2 sin(2 θ)
R= 0 (5.39)
g
when v0 sin θ ≪ vt, and
v0 vt cos θ
R= (5.40)
g
when v0 sin θ ≫ vt. Equation (5.39) is, of course, the standard result with-
out air resistance. This result implies that, in the absence of air resistance,
the maximum horizontal range, v02/g, is achieved when the launch angle
θ takes the value 45◦ . On the other hand, Equation (5.40) implies that, in
the presence of air resistance, the maximum horizontal range, v0 vt/g, is
achieved when θ is made as small as possible. However, θ cannot be made
too small, since expression (5.40) is only valid when v0 sin θ ≫ vt. In fact,
assuming that v0 ≫ vt, the maximum horizontal range, v0 vt/g, is achieved
when θ ∼ vt/v0 ≪ 1. We thus conclude that if air resistance is significant
then it causes the horizontal range of the projectile to scale linearly, rather
than quadratically, with the launch velocity, v0. Moreover, the maximum
horizontal range is achieved with a launch angle which is much shallower
than the standard result, 45◦ .
Figure 5.2 shows some example trajectories calculated, from the above
model, with the same launch angle, 45◦ , but with different values of the
ratio v0/vt. Here, X = x/(v02/g) and Z = z/(v02/g). The solid, short-dashed,
long-dashed, and dot-dashed curves correspond to v0/vt = 0, 1, 2, and 4,
respectively. It can be seen that as the air resistance strength increases (i.e.,
as v0/vt increases), the range of the projectile decreases. Furthermore, there
is always an initial time interval during which the trajectory is identical to
that calculated in the absence of air resistance (i.e., v0/vt = 0). Finally, in
the presence of air resistance, the projectile tends to fall more steeply than
84 NEWTONIAN DYNAMICS
it rises. Indeed, in the presence of strong air resistance (i.e., v0/vt = 4), the
projectile falls almost vertically.
f = q (E + v × B) , (5.41)
dv ′
m = q v ′ × B, (5.44)
dt
where we have made use of a standard vector identity (see Section 2.7),
as well as the fact that E · B = 0. Hence, we conclude that the addition of
an electric field perpendicular to a given magnetic field simply causes the
particle to drift perpendicular to both the electric and magnetic field with
the fixed velocity
E×B
vEB = , (5.45)
B2
irrespective of its charge or mass, It follows that the electric field has no
effect on the particle’s motion in a frame of reference which is co-moving
with the so-called E-cross-B velocity given above.
Let us suppose that the magnetic field is directed along the z-axis. As we
have just seen, in the E × B frame, the particle’s equation of motion reduces
to Equation (5.44), which can be written:
dvx′
= Ω vy′ , (5.46)
dt
dvy′
= −Ω vx′ , (5.47)
dt
vz′ = 0. (5.48)
Here,
qB
Ω= (5.49)
m
is the so-called cyclotron frequency. Equations (5.46)–(5.48) can be inte-
grated to give
field with some fixed speed vk . The fact that the cyclotron frequency is pos-
itive for positively charged particles, and negative for negatively charged
particles, just means that oppositely charged particles gyrate in opposite di-
rections in the plane perpendicular to the magnetic field.
Equations (5.50)–(5.52) can be integrated to give
B
ρ
5.5 Exercises
x = a sin(ω t) + b t,
y = c [1 − cos(ω t)],
z = 0.
Find the values of a, b, c, and ω, and sketch the electron’s trajectory in the
x-y plane when v0 < E/B, E/B < v0 < 2 E/B, and v0 > 2 E/B.
2. A particle of mass m and charge q moves in the x-y plane under the in-
fluence of a constant amplitude rotating electric field which is such that
Ex = E0 cos(ω t) and Ey = E0 sin(ω t). The particle starts at rest from
the origin. Determine its subsequent motion. What shape is the particle’s
trajectory?
3. A particle of mass m slides on a frictionless surface whose height is a function
of x only: i.e., z = z(x). The function z(x) is specified by the parametric
equations
x = A [2 φ + sin(2 φ)],
z = A [1 − cos(2 φ)],
where φ is the parameter. Show that the total energy of the particle can be
written 2
m ds 1 mg 2
E= + s ,
2 dt 2 4A
where s = 4 A sin φ. Deduce that the particle undergoes periodic motion
whose frequency is amplitude independent (even when the amplitude is
large). Demonstrate that the frequency of the motion is given by 4π (A/g)1/2 .
88 NEWTONIAN DYNAMICS
Planetary Motion 89
6 Planetary Motion
6.1 Introduction
2. The radius vectors connecting each planet to the Sun sweep out equal
areas in equal time intervals.
The force which maintains the Planets in orbit around the Sun is called grav-
ity, and was first correctly described by Isaac Newton (in 1687). According
to Newton, any two point mass objects (or spherically symmetric objects of
finite extent) exert a force of attraction on one another. This force points
along the line of centers joining the objects, is directly proportional to the
product of the objects’ masses, and inversely proportional to the square of
the distance between them. Suppose that the first object is the Sun, which
is of mass M, and is located at the origin of our coordinate system. Let the
second object be some planet, of mass m, which is located at position vector
r. The gravitational force exerted on the planet by the Sun is thus written
GMm
f=− r. (6.1)
r3
90 NEWTONIAN DYNAMICS
d2r GM
= − 3 r. (6.3)
dt2 r
Note that the planetary mass, m, has canceled out on both sides of the above
equation.
Now gravity is a conservative force. Hence, the gravitational force (6.1) can
be written (see Section 3.5)
f = −∇U, (6.4)
Planetary Motion 91
where the potential energy, U(r), of our planet in the Sun’s gravitational
field takes the form
GMm
U(r) = − . (6.5)
r
It follows that the total energy of our planet is a conserved quantity—see
Section 3.5. In other words,
v2 G M
E= − (6.6)
2 r
is constant in time. Here, E is actually the planet’s total energy per unit
mass, and v = dr/dt.
Gravity is also a central force. Hence, the angular momentum of our
planet is a conserved quantity—see Section 3.6. In other words,
h = r × v, (6.7)
which is actually the planet’s angular momentum per unit mass, is constant
in time. Taking the scalar product of the above equation with r, we obtain
h · r = 0. (6.8)
This is the equation of a plane which passes through the origin, and whose
normal is parallel to h. Since h is a constant vector, it always points in the
same direction. We, therefore, conclude that the motion of our planet is two-
dimensional in nature: i.e., it is confined to some fixed plane which passes
through the origin. Without loss of generality, we can let this plane coincide
with the x-y plane.
We can determine the instantaneous position of our planet in the x-y plane
in terms of standard Cartesian coordinates, (x, y), or plane polar coordi-
nates, (r, θ), as illustrated in Figure 6.1. It is helpful to define two unit
vectors, er ≡ r/r and eθ ≡ ez × er, at the instantaneous position of the
planet. The first always points radially away from the origin, whereas the
second is normal to the first, in the direction of increasing θ. As is easily
demonstrated, the Cartesian components of er and eθ are
eθ er
y
θ
x
b
x
a
respectively.
We can write the position vector of our planet as
r = r er. (6.11)
Thus,
v = ṙ er + r θ̇ eθ. (6.14)
Now, the planet’s acceleration is written
dv d2r
a= = 2 = r̈ er + ṙ ėr + (ṙ θ̇ + r θ̈) eθ + r θ̇ ėθ. (6.15)
dt dt
Again, eθ has a non-zero time-derivative because its direction changes as
the planet moves around. Differentiation of Equation (6.10) with respect to
time yields
ėθ = θ̇ (− cos θ, − sin θ) = −θ̇ er. (6.16)
Hence,
a = (r̈ − r θ̇ 2) er + (r θ̈ + 2 ṙ θ̇) eθ. (6.17)
It follows that the equation of motion of our planet, (6.3), can be written
GM
a = (r̈ − r θ̇ 2) er + (r θ̈ + 2 ṙ θ̇) eθ = − er. (6.18)
r2
Since er and eθ are mutually orthogonal, we can separately equate the co-
efficients of both, in the above equation, to give a radial equation of motion,
GM
r̈ − r θ̇ 2 = − , (6.19)
r2
and a tangential equation of motion,
r θ̈ + 2 ṙ θ̇ = 0. (6.20)
94 NEWTONIAN DYNAMICS
The ellipse, the parabola, and the hyperbola are collectively known as conic
sections, since these three types of curve can be obtained by taking various
different plane sections of a right cone. It turns out that the possible solu-
tions of Equations (6.19) and (6.20) are all conic sections. It is, therefore,
appropriate for us to briefly review these curves.
An ellipse, centered on the origin, of major radius a and minor radius
b, which are aligned along the x- and y-axes, respectively (see Figure 6.2),
satisfies the following well-known equation:
x 2 y2
+ = 1. (6.21)
a2 b2
Likewise, a parabola which is aligned along the +x-axis, and passes
through the origin (see Figure 6.3), satisfies:
y2 − b x = 0, (6.22)
where b > 0.
Finally, a hyperbola which is aligned along the +x-axis, and whose asymp-
totes intersect at the origin (see Figure 6.4), satisfies:
x 2 y2
− = 1. (6.23)
a2 b2
Here, a is the distance of closest approach to the origin. The asymptotes
subtend an angle φ = tan−1(b/a) with the x-axis.
Planetary Motion 95
a
φ
x
asymptote
It is not clear, at this stage, what the ellipse, the parabola, and the hyper-
bola have in common (other than being conic sections). Well, it turns out
that what these three curves have in common is that they can all be repre-
sented as the locus of a movable point whose distance from a fixed point is in
a constant ratio to its perpendicular distance to some fixed straight-line. Let
the fixed point (which is termed the focus of the ellipse/parabola/hyperbola)
lie at the origin, and let the fixed line correspond to x = −d (with d > 0).
Thus, the distance of a general point p (x, y) (which lies to the right of the
line x = −d) from the origin is r1 = x2 + y2, whereas the perpendicular
distance of the point from the line x = −d is r2 = x + d—see Figure 6.5. In
plane polar coordinates, r1 = r and r2 = r cos θ + d. Hence, the locus of a
point for which r1 and r2 are in a fixed ratio satisfies the following equation:
p
r1 x 2 + y2 r
= = = e, (6.24)
r2 x+d r cos θ + d
where e ≥ 0 is a constant. When expressed in terms of plane polar coordi-
nates, the above equation can be rearranged to give
rc
r= , (6.25)
1 − e cos θ
where rc = e d.
When written in terms of Cartesian coordinates, (6.24) can be rear-
ranged to give
(x − xc)2 y2
+ 2 = 1, (6.26)
a2 b
96 NEWTONIAN DYNAMICS
y
r2
r1 = r
θ x
d
rc
a = , (6.27)
1 − e2
r
√ c
p
b = = 1 − e2 a, (6.28)
1 − e2
e rc
xc = = e a. (6.29)
1 − e2
y2 − 2 rc (x − xc) = 0, (6.30)
(x − xc)2 y2
− 2 = 1, (6.31)
a2 b
Planetary Motion 97
d(r2 θ̇)
= 0, (6.37)
dt
which implies that
h = r2 θ̇ (6.38)
is constant in time. It is easily demonstrated that h is the magnitude of the
vector h defined in Equation (6.7). Thus, the fact that h is constant in time
is equivalent to the statement that the angular momentum of our planet is
a constant of its motion. As we have already mentioned, this is the case
because gravity is a central force.
Suppose that the radius vector connecting our planet to the origin (i.e.,
the Sun) sweeps out an angle δθ between times t and t+δt—see Figure 6.6.
98 NEWTONIAN DYNAMICS
r
δθ r δθ
The approximately triangular region swept out by the radius vector has the
area
1
δA ≃ r2 δθ, (6.39)
2
since the area of a triangle is half its base (r δθ) times its height (r). Hence,
the rate at which the radius vector sweeps out area is
dA r2 δθ r2 dθ h
= lim = = . (6.40)
dt δt→0 2 δt 2 dt 2
Thus, the radius vector sweeps out area at a constant rate (since h is con-
stant in time)—this is Kepler’s second law. We conclude that Kepler’s second
law of planetary motion is a direct consequence of angular momentum con-
servation.
Our planet’s radial equation of motion, (6.19), can be combined with Equa-
tion (6.38) to give
h2 GM
r̈ − 3 = − 2 . (6.41)
r r
Suppose that r = u−1. It follows that
u̇ du dθ du
ṙ = − 2
= −r2 = −h . (6.42)
u dθ dt dθ
Likewise,
d2u 2
2 2d u
r̈ = −h θ̇ = −u h . (6.43)
dθ2 dθ2
Planetary Motion 99
d2u GM
+u= 2 . (6.44)
dθ2 h
The general solution to the above equation is
GM
u(θ) = [1 − e cos(θ − θ0)] , (6.45)
h2
where e and θ0 are arbitrary constants. Without loss of generality, we can
set θ0 = 0 by rotating our coordinate system about the z-axis. Thus, we
obtain
rc
r(θ) = , (6.46)
1 − e cos θ
where
h2
rc = . (6.47)
GM
We immediately recognize Equation (6.46) as the equation of a conic section
which is confocal with the origin (i.e., with the Sun). Specifically, for e < 1,
Equation (6.46) is the equation of an ellipse which is confocal with the Sun.
Thus, the orbit of our planet around the Sun in a confocal ellipse—this
is Kepler’s first law of planetary motion. Of course, a planet cannot have
a parabolic or a hyperbolic orbit, since such orbits are only appropriate to
objects which are ultimately able to escape from the Sun’s gravitational field.
We have seen that the radius vector connecting our planet to the origin
sweeps out area at the constant rate dA/dt = h/2 [see Equation (6.40)].
We have also seen that the planetary orbit is an ellipse. Suppose that the
major and minor radii of the ellipse are a and b, respectively. It follows that
the area of the ellipse is A = π a b. Now, we expect the radius vector to
sweep out the whole area of the ellipse in a single orbital period, T . Hence,
A 2πab
T= = . (6.48)
(dA/dt) h
4 π2 a3
T2 = . (6.49)
GM
100 NEWTONIAN DYNAMICS
In other words, the square of the orbital period of our planet is proportional
to the cube of its orbital major radius—this is Kepler’s third law.
Note that for an elliptical orbit the closest distance to the Sun—the so-
called perihelion distance—is [see Equations (6.27) and (6.46)]
rc
rp = = a (1 − e). (6.50)
1+e
Likewise, the furthest distance from the Sun—the so-called aphelion distance—
is
rc
ra = = a (1 + e). (6.51)
1−e
It follows that the major radius, a, is simply the mean of the perihelion and
aphelion distances,
rp + ra
a= . (6.52)
2
The parameter
ra − rp
e= (6.53)
ra + rp
is called the eccentricity, and measures the deviation of the orbit from cir-
cularity. Thus, e = 0 corresponds to a circular orbit, whereas e → 1 corre-
sponds to an infinitely elongated elliptical orbit.
According to Equations (6.6) and (6.14), the total energy per unit mass of
an object in orbit around the Sun is given by
ṙ 2 + r2 θ̇ 2 G M
E= − . (6.54)
2 r
It follows from Equations (6.38), (6.42), and (6.47) that
" 2 #
h2 du 2
E= + u − 2 u uc , (6.55)
2 dθ
We conclude that elliptical orbits (e < 1) have negative total energies, whereas
parabolic orbits (e = 1) have zero total energies, and hyperbolic orbits (e >
1) have positive total energies. This makes sense, since in a conservative
system in which the potential energy at infinity is set to zero [see Equa-
tion (6.5)] we expect bounded orbits to have negative total energies, and
unbounded orbits to have positive total energies—see Section 4.2. Thus, el-
liptical orbits, which are clearly bounded, should indeed have negative total
energies, whereas hyperbolic orbits, which are clearly unbounded, should
indeed have positive total energies. Parabolic orbits are marginally bounded
(i.e., an object executing a parabolic orbit only just escapes from the Sun’s
gravitational field), and thus have zero total energy.
Consider an artificial satellite in an elliptical orbit around the Sun (the
same considerations also apply to satellites in orbit around the Earth). At
perihelion, ṙ = 0, and Equations (6.54) and (6.57) can be combined to give
vt √
= 1 + e. (6.58)
vc
q
Here, vt = r θ̇ is the satellite’s tangential velocity, and vc = G M/rp is the
tangential velocity that it would need in order to maintain a circular orbit
at the perihelion distance. Likewise, at aphelion,
vt √
= 1 − e, (6.59)
vc
p
where vc = G M/ra is now the tangential velocity that the satellite would
need in order to maintain a circular orbit at the aphelion distance.
Suppose that our satellite is initially in a circular orbit of radius r1, and
that we wish to transfer it into a circular orbit of radius r2, where r2 >
r1. We can achieve this by temporarily placing the satellite in an elliptical
orbit whose perihelion distance is r1, and whose aphelion distance is r2. It
follows, from Equation (6.53), that the required eccentricity of the elliptical
orbit is
r2 − r1
e= . (6.60)
r2 + r1
According to Equation (6.58), we can transfer our satellite from its initial
circular orbit into the temporary elliptical orbit by increasing its tangential
velocity (by briefly switching on the satellite’s rocket motor) by a factor
√
α1 = 1 + e. (6.61)
We must next allow the satellite to execute half an orbit, so that it attains
its aphelion distance, and then boost the tangential velocity by a factor [see
102 NEWTONIAN DYNAMICS
final orbit
r1
Equation (6.59)]
1
α2 = √ . (6.62)
1−e
The satellite will now be in a circular orbit at the aphelion distance, r2.
This process is illustrated in Figure 6.7. Obviously, we can transfer our
satellite from a larger to a smaller circular orbit by performing the above
process in reverse. Note, finally, from Equation (6.58), that if we increase
the tangential velocity√of a satellite in a circular orbit about the Sun by
a factor greater than 2 then we will transfer it into a hyperbolic orbit
(e > 1), and it will eventually escape from the Sun’s gravitational field.
r = rp (1 + P2), (6.81)
tan(θ/2) = P. (6.82)
Here, P is termed the parabolic anomaly, and varies between −∞ and +∞,
with the perihelion point corresponding to P = 0. Note that Equation (6.80)
is a cubic equation possessing a single real root which can, in principle, be
solved analytically. However, a numerical solution is generally more conve-
nient.
Finally, for the case of a hyperbolic orbit, characterized by e > 1, we
obtain:
1/2
GM
e sinh H − H = t, (6.83)
a3
r = a (e cosh H − 1), (6.84)
1/2
e+1
tan(θ/2) = tanh(H/2). (6.85)
e−1
Here, H is termed the hyperbolic anomaly, and varies between −∞ and +∞,
with the perihelion point corresponding to H = 0. Moreover, a = rp/(e − 1).
As in the elliptical case, Equation (6.83) is a transcendental equation which
can only be solved numerically.
h = r2 θ̇ (6.86)
r = r0 θ 2. (6.88)
106 NEWTONIAN DYNAMICS
where use has been made of Equation (6.93). Thus, the apsidal angle, ψ, is
given by
T f ′ (rc) −1/2
ψ = θ̇ = π 3 + rc (6.104)
2 f(rc)
For the case of attractive power-law central forces of the form f(r) =
−c rn, where c > 0, the apsidal angle becomes
π
ψ= . (6.105)
(3 + n)1/2
Now, it should be clear that if an orbit is going to close on itself then the
apsidal angle needs to be a rational fraction of 2 π. There are, in fact, only
two small integer values of the power-law index, n, for which this is the
case. As we have seen, for an inverse-square force law (i.e., n = −2), the
apsidal angle is π. For a linear force law (i.e., n = 1), the apsidal angle
is π/2—see Section 5.2. However, for quadratic (i.e., n = 2) or cubic (i.e.,
n = 3) force laws, the apsidal angle is an irrational fraction of 2 π, which
means that non-circular orbits in such force-fields never close on themselves.
Let us, finally, calculate the apsidal angle for a nearly circular orbit of
radius rc in a slightly modified (attractive) inverse-square force law of the
form
k ǫ
f(r) = − 2 − 4 , (6.106)
r r
where ǫ/(k rc2) is small. Substitution into Equation (6.104) yields
" #−1/2 " #−1/2
2 k r−3 −5
c + 4 ǫ rc 1 + 2 ǫ/(k rc2)
ψ = π 3 + rc =π 3−2 .
−k r−2 −4
c − ǫ rc 1 + ǫ/(k rc2)
(6.107)
Expanding to first-order in ǫ/(k rc2), we obtain
−1/2 h i−1/2
ψ ≃ π 3 − 2 [1 + ǫ/(k rc2)] = π 1 − 2 ǫ/(k rc2) ≃ π [1 + ǫ/(k rc2)].
(6.108)
Planetary Motion 109
We conclude that if ǫ > 0 then the perihelion (or aphelion) of the orbit
advances by an angle 2 ǫ/(k rc2) every rotation period. It turns out that the
general relativistic corrections to Newtonian gravity give rise to a small 1/r4
modification (with ǫ > 0) to the Sun’s gravitational field. Hence, these cor-
rections generate a small precession in the perihelion of each planet orbiting
the Sun. This effect is particularly large for Mercury—see Section 13.13.
6.14 Exercises
4. How long (in days) does it take the Sun-Earth radius vector to rotate through
90◦ , starting at the perihelion point? How long does it take starting at the
aphelion point? The period and eccentricity of the Earth’s orbit are T =
365.24 days, and e = 0.01673, respectively.
5. Solve the Kepler problem for a parabolic orbit to obtain Equations (6.80)–
(6.82).
6. Solve the Kepler problem for a hyperbolic orbit to obtain Equations (6.83)–
(6.85).
7. A comet is in a parabolic orbit lying in the plane of the Earth’s orbit. Regard-
ing the Earth’s orbit as a circle of radius a, show that the points where the
comet intersects the Earth’s orbit are given by
2p
cos θ = −1 + ,
a
110 NEWTONIAN DYNAMICS
of a year, and that the maximum value of this time interval is 2/3π year, or
about 11 weeks.
8. The orbit of a particle moving in a central field is a circle passing through the
origin, namely r = r0 cos θ. Show that the force law is inverse-fifth power.
9. A particle moving in a central field describes a spiral orbit r = r0 exp(k θ).
Show that the force law is inverse-cube, and that θ varies logarithmically
with t. Show that there are two other possible types of orbit in this force-
field, and give their equations.
10. A particle moves in a spiral orbit given by r = a θ. Suppose that θ increases
linearly with t. Is the force acting on the particle central in nature? If not,
determine how θ would have to vary with t in order to make the force central.
11. A particle moves in a circular orbit of radius r0 in an attractive central force-
field of the form f(r) = −c exp(−r/a)/r2 , where c > 0 and a > 0. Demon-
strate that the orbit is only stable provided that r0 < a.
12. A particle moves in a circular orbit in an attractive central force-field of the
form f(r) = −c r−3 , where c > 0. Demonstrate that the orbit is unstable to
small perturbations.
13. If the Solar System were embedded in a uniform dust cloud, what would the
apsidal angle of a planet be for motion in a nearly circular orbit? Express
your answer in terms of the ratio of the mass of dust contained in a sphere,
centered on the Sun, whose radius is that of the orbit, to the mass of the Sun.
This model was once suggested as a possible explanation for the advance of
the perihelion of Mercury.
14. The potential energy per unit mass of a particle in the gravitational field of
an oblate spheroid, like the Earth, is
GM ǫ
V(r) = − 1+ 2 ,
r r
where r refers to distances in the equatorial plane, M is the Earth’s mass,
and ǫ = (2/5) R ∆R. Here, R = 4000 mi is the Earth’s equatorial radius, and
∆R = 13 mi the difference between the equatorial and polar radii. Find the
apsidal angle for a satellite moving in a nearly circular orbit in the equatorial
plane of the Earth.
Two-Body Dynamics 111
7 Two-Body Dynamics
7.1 Introduction
Suppose that our first object is of mass m1, and is located at position vector
r1. Likewise, our second object is of mass m2, and is located at position
vector r2. Let the first object exert a force f21 on the second. By Newton’s
third law, the second object exerts an equal and opposite force, f12 = −f21,
on the first. Suppose that there are no other forces in the problem. The
equations of motion of our two objects are thus
d2r1
m1 = −f, (7.1)
dt2
d2r2
m2 = f, (7.2)
dt2
where f = f21.
Now, the center of mass of our system is located at
m1 r1 + m2 r2
rcm = . (7.3)
m1 + m2
Hence, we can write
m2
r1 = rcm − r, (7.4)
m1 + m2
m1
r2 = rcm + r, (7.5)
m1 + m2
where r = r2 −r1. Substituting the above two equations into Equations (7.1)
and (7.2), and making use of the fact that the center of mass of an isolated
system does not accelerate (see Section 3.6), we find that both equations
yield
d2r
µ 2 = f, (7.6)
dt
112 NEWTONIAN DYNAMICS
where
m1 m2
µ= (7.7)
m1 + m2
is called the reduced mass. Hence, we have effectively converted our original
two-body problem into an equivalent one-body problem. In the equivalent
problem, the force f is the same as that acting on both objects in the original
problem (modulo a minus sign). However, the mass, µ, is different, and is
less than either of m1 or m2 (which is why it is called the “reduced” mass).
Approximately half of the stars in our galaxy are members of so-called binary
star systems. Such systems consist of two stars orbiting about their common
center of mass. The distance separating the stars is always much less than
the distance to the nearest neighbour star. Hence, a binary star system can
be treated as a two-body dynamical system to a very good approximation.
In a binary star system, the gravitational force which the first star exerts
on the second is
G m1 m2
f=− r, (7.8)
r3
where r = r2 − r1. As we have seen, a two-body system can be reduced to an
equivalent one-body system whose equation of motion is of the form (7.6),
where µ = m1 m2/(m1 + m2). Hence, in this particular case, we can write
m1 m2 d2r G m1 m2
2
=− r, (7.9)
m1 + m2 dt r3
which gives
d2r GM
2
= − 3 r, (7.10)
dt r
where
M = m1 + m2. (7.11)
Equation (7.10) is identical to Equation (6.3), which we have already
solved. Hence, we can immediately write down the solution:
where
a (1 − e2)
r= , (7.13)
1 − e cos θ
Two-Body Dynamics 113
and
dθ h
= 2, (7.14)
dt r
with
h2
a= . (7.15)
(1 − e2) G M
Here, h is a constant, and we have aligned our Cartesian axes so that the
plane of the orbit coincides with the x-y plane. According to the above
solution, the second star executes a Keplerian elliptical orbit, with major
radius a and eccentricity e, relative to the first star, and vice versa. From
Equation (6.49), the period of revolution, T , is given by
s
4 π2 a3
T= . (7.16)
GM
In the inertial frame of reference whose origin always coincides with the
center of mass—the so-called center of mass frame—the position vectors of
the two stars are
m2
r1 = − r, (7.17)
m1 + m2
m1
r2 = r, (7.18)
m1 + m2
where r is specified above. Figure 7.1 shows an example binary star orbit, in
the center of mass frame, calculated with m1/m2 = 0.5 and e = 0.2. Here,
the triangles and squares denote the positions of the first and second star,
respectively (which are always diagrammatically opposite one another, as
indicated by the arrows). It can be seen that both stars execute elliptical
orbits about their common center of mass.
Binary star systems have been very useful to astronomers, since it is
possible to determine the masses of both stars in such a system by careful
observation. The sum of the masses of the two stars, M = m1 + m2, can
be found from Equation (7.16) after a measurement of the major radius, a
(which is the mean of the greatest and smallest distance apart of the two
stars during their orbit), and the orbital period, T . The ratio of the masses of
the two stars, m1/m2, can be determined from Equations (7.17) and (7.18)
by observing the fixed ratio of the relative distances of the two stars from the
common center of mass about which they both appear to rotate. Obviously,
given the sum of the masses, and the ratio of the masses, the individual
masses themselves can then be calculated.
114 NEWTONIAN DYNAMICS
closest approach
particle
β
α θ
b Θ
origin
Figure 7.3: The one-body equivalent to the previous figure.
symmetry, the angle α in Figure 7.3 is equal to the angle β. However, from
simple geometry, α = Θ. Hence,
θ = π − 2 Θ. (7.19)
Now, by analogy with Equation (6.55), the conserved total energy E in the
equivalent problem, which can easily be shown to be the same as the total
energy in the original problem, is given by
" 2 #
µ h2 du 2
E= +u + U(u), (7.20)
2 dϑ
where u = r−1, and h is the angular momentum per unit mass in the equiv-
alent problem. It is easily seen that
1/2
2E
h = b v∞ = b , (7.21)
µ
where v∞ is the approach velocity in the equivalent problem at large r. It
follows that " #
du 2
2 2
E = Eb + u + U(u). (7.22)
dϑ
The above equation can be rearranged to give
dϑ b
=q . (7.23)
du 1 − b2 u2 − U(u)/E
Two-Body Dynamics 117
Integration yields
Z umax
b du
Θ= q . (7.24)
0 1 − b2 u2 − U(u)/E
Here, umax = 1/rmin, where rmin is the distance of closest approach. Since,
by symmetry, (du/dϑ)umax = 0, it follows from Equation (7.23) that
Equations (7.19) and (7.24) enable us to calculate the function b(θ) for
a given interaction potential, U(r), and a given total energy, E, of the two
particles in the center of mass frame. The function b(θ) tells us which impact
parameter corresponds to which scattering angle, and vice versa.
Instead of two particles, suppose that we now have two counter-propag-
ating beams of identical particles (with the same properties as the two parti-
cles described above) which scatter one another via binary collisions. What
is the angular distribution of the scattered particles? Consider pairs of par-
ticles whose impact parameters lie in the range b to b + db. These particles
are scattered in such a manner that their scattering angles lie in the range θ
to θ + dθ, where θ is determined from inverting the function b(θ), and
db
dθ = . (7.26)
|db(θ)/dθ|
angle. For two uniform beams scattering off one another, the differential
scattering cross-section thus effectively specifies the probability of scattering
into a given range of solid angle. The total scattering cross-section is the
integral of the differential cross-section over all solid angles,
Z
dσ
σ= dΩ, (7.30)
dΩ
and measures the effective target area for scattering in any direction. Thus,
if the flux of particles per unit area per unit time, otherwise known as the
intensity, of the two beams is I, then the number of particles of a given type
scattered per unit time is simply I σ.
Let us now calculate the scattering cross-section for the following very
simple interaction potential:
0 r>a
U(r) = . (7.31)
∞ r≤a
Note that
db
1 db2 a2 a2
b = = sin(θ/2) cos(θ/2) = sin θ. (7.34)
dθ 2 dθ 2 4
dσ a2
= . (7.35)
dΩ 4
Two-Body Dynamics 119
Integration yields
π α
Θ = − sin−1 √ . (7.40)
2 4 + α2
Hence, from Equation (7.19), we obtain
α
−1
θ = 2 sin √ . (7.41)
4 + α2
The above equation can be rearranged to give
k2
b2 = cot2(θ/2). (7.42)
4E2
Thus,
db 2 sin θ
= k
2 b . (7.43)
dθ 8 E sin4(θ/2)
2
There are a number of things to note about the above formula. First, the
scattering cross-section is proportional to k2. This means that repulsive
(k > 0) and attractive (k < 0) inverse-square interaction forces of the same
strength give rise to identical angular distributions of scattered particles.
Second, the scattering cross-section is proportional to E−2. This means that
inverse-square interaction forces are much more effective at scattering low
energy, rather than high energy, particles. Finally, the differential scatter-
ing cross-section is proportional to sin−4(θ/2). This means that, with an
inverse-square interaction force, the overwhelming majority of “collisions”
consist of small angle scattering events (i.e., θ ≪ 1).
Let us now consider a specific case. Suppose that we have particles of
electric charge q scattering off particles of the same charge. The interaction
potential due to the Coulomb force between the particles is simply
q2
U(r) = . (7.45)
4π ǫ0 r
Thus, it follows from Equation (7.44) [with k = q2/(4π ǫ0)] that the differ-
ential scattering cross-section takes the form
dσ q4 1
= 2 2 4
. (7.46)
dΩ 16 (4 π ǫ0) E sin (θ/2)
This very famous formula is known as the Rutherford scattering cross-section,
since it was first derived by Earnst Rutherford for use in his celebrated α-
particle scattering experiment.
Note, finally, that if we try to integrate the Rutherford formula to obtain
the total scattering cross-section then we find that the integral is divergent,
due to the very strong increase in dσ/dΩ as θ → 0. This implies that the
Coulomb potential (or any other inverse-square-law potential) has an effec-
tively infinite range. In practice, however, an electric charge is generally
surrounded by charges of the opposite sign which shield the Coulomb po-
tential of the charge beyond a certain distance. This shielding effect allows
the charge to have a finite total scattering cross-section (for the scattering
of other electric charges). However, the total scattering cross-section of
the charge depends (albeit, logarithmically) on the shielding distance, and,
hence, on the nature and distribution of the charges surrounding it.
We have seen that two-particle scattering looks fairly simple when viewed
in the center of mass frame. Unfortunately, we are not usually in a position
Two-Body Dynamics 121
V1′
y v1′
v1 θ v2 V1 ψ
x
ζ
v2′
V2′
of reference are:
v1 = v1 (1, 0), (7.50)
v2 = (m1/m2) v1 (−1, 0), (7.51)
v1′ = v1 (cos θ, sin θ), (7.52)
v2′ = (m1/m2) v1(− cos θ, − sin θ), (7.53)
V1 = (1 + m1/m2) v1 (1, 0), (7.54)
V1′ = v1 (cos θ + m1/m2, sin θ), (7.55)
V2′ = (m1/m2) v1 (1 − cos θ, − sin θ). (7.56)
In the center of mass frame, let E be the total energy, let E1 = (1/2) m1 v12
and E2 = (1/2) m2 v22 be the kinetic energies of the first and second parti-
cles, respectively, before the collision, and let E1′ = (1/2) m1 v1′ 2 and E2′ =
(1/2) m2 v2′ 2 be the kinetic energies of the first and second particles, respec-
tively, after the collision. Of course, E = E1 + E2 = E1′ + E2′ . In the laboratory
frame, let E be the total energy. This is, of course, equal to the kinetic energy
of the first particle before the collision. Likewise, let E1′ = (1/2) m1 V1′ 2 and
E2′ = (1/2) m2 V2′ 2 be the kinetic energies of the first and second particles,
respectively, after the collision. Of course, E = E1′ + E2′ .
The following results can easily be obtained from the above definitions
and Equations (7.50)–(7.56). First,
m1 + m2
E= E. (7.57)
m2
Hence, the total energy in the laboratory frame is always greater than that
in the center of mass frame. In fact, it can be demonstrated that the total
energy in the center of mass frame is less than the total energy in any other
inertial frame. Second,
m2
E1 = E1′
= E, (7.58)
m1 + m2
m1
′
E2 = E2 = E. (7.59)
m1 + m2
These equations specify how the total energy in the center of mass frame
is distributed between the two particles. Note that this distribution is un-
changed by the collision. Finally,
" #
m12 + 2 m1 m2 cos θ + m22
E1′ = E, (7.60)
(m1 + m2)2
Two-Body Dynamics 123
2 m1 m2 (1 − cos θ)
E2′ = E. (7.61)
(m1 + m2)2
These equations specify how the total energy in the laboratory frame is dis-
tributed between the two particles after the collision. Note that the energy
distribution in the laboratory frame is different before and after the collision.
Equations (7.50)–(7.56), and some simple trigonometry, yield
sin θ
tan ψ = , (7.62)
cos θ + m1/m2
and
sin θ π θ
tan ζ = = tan − . (7.63)
1 − cos θ 2 2
The last equation implies that
π θ
ζ= − . (7.64)
2 2
Differentiating Equation (7.62) with respect to θ, we obtain
m2/m1
tan ψmax = q , (7.67)
1 − (m2/m1)2
which reduces to
m2
−1
ψmax = sin . (7.68)
m1
Hence, we conclude that when m1 > m2 there is a maximum possible value
of the scattering angle, ψ, in the laboratory frame. This maximum value
is always less than π/2, which implies that there is no backward scattering
(i.e., ψ > π/2) at all when m1 > m2. For the special case when m1 = m2,
the maximum scattering angle is π/2. However, for m1 < m2 there is no
124 NEWTONIAN DYNAMICS
maximum value, and the scattering angle in the laboratory frame can thus
range all the way to π.
Equations (7.57)–(7.64) enable us to relate the particle energies and
scattering angles in the laboratory frame to those in the center of mass
frame. In general, these relationships are fairly complicated. However, there
are two special cases in which the relationships become much simpler.
The first special case is when m2 ≫ m1. In this limit, it is easily seen
from Equations (7.57)–(7.64) that the second mass is stationary both before
and after the collision, and that the center of mass frame coincides with
the laboratory frame (since the energies and scattering angles in the two
frames are the same). Hence, the simple analysis outlined in Section 7.4 is
applicable in this case.
The second special case is when m1 = m2. In this case, Equation (7.62)
yields
sin θ
tan ψ = = tan(θ/2). (7.69)
cos θ + 1
Hence,
θ
ψ= . (7.70)
2
In other words, the scattering angle of the first particle in the laboratory
frame is half of the scattering angle in the center of mass frame. The above
equation can be combined with Equation (7.64) to give
π
ψ+ζ= . (7.71)
2
Thus, in the laboratory frame, the two particles move off at right-angles to
one another after the collision. Equation (7.57) yields
E = 2 E. (7.72)
In other words, the total energy in the laboratory frame is twice that in the
center of mass frame. According to Equations (7.58) and (7.59),
E
E1 = E1′ = E2 = E2′ = . (7.73)
2
Hence, the total energy in the center of mass frame is divided equally be-
tween the two particles. Finally, Equations (7.60) and (7.61) give
1 + cos θ
E1′ = E = cos2(θ/2) E = cos2 ψ E, (7.74)
2
1 − cos θ
E2′ = E = sin2(θ/2) E = sin2 ψ E. (7.75)
2
Two-Body Dynamics 125
Thus, in the laboratory frame, the unequal energy distribution between the
two particles after the collision is simply related to the scattering angle ψ.
What is the angular distribution of scattered particles when a beam of
particles of the first type scatter off stationary particles of the second type?
Well, we can define a differential scattering cross-section, dσ(ψ)/dΩ ′ , in
the laboratory frame, where Ω ′ = 2π sin ψ dψ is an element of solid angle
in this frame. Thus, (dσ(ψ)/dΩ ′ ) dΩ ′ is the effective cross-sectional area
in the laboratory frame for scattering into the range of scattering angles ψ
to ψ + dψ. Likewise, (dσ(θ)/dΩ) dΩ is the effective cross-sectional area in
the center of mass frame for scattering into the range of scattering angles θ
to θ + dθ. Note that dΩ = 2π sin θ dθ. However, a cross-sectional area is
not changed when we transform between different inertial frames. Hence,
we can write
dσ(ψ) dσ(θ)
dΩ ′ = dΩ, (7.76)
dΩ ′ dΩ
provided that ψ and θ are related via Equation (7.62). This equation can be
rearranged to give
dσ(ψ) dΩ dσ(θ)
′
= , (7.77)
dΩ dΩ ′ dΩ
or
dσ(ψ) sin θ dθ dσ(θ)
′
= . (7.78)
dΩ sin ψ dψ dΩ
The above equation allows us to relate the differential scattering cross-
section in the laboratory frame to that in the center of mass frame. In
general, this relationship is extremely complicated. However, for the special
case where the masses of the two types of particles are equal, we have seen
that ψ = θ/2 [see Equation (7.70)]. Hence, it follows from Equation (7.78)
that
dσ(ψ) dσ(θ = 2 ψ)
′
= 4 cos ψ . (7.79)
dΩ dΩ
Let us now consider some specific examples. We saw earlier that, in the
center of mass frame, the differential scattering cross-section for impenetra-
ble spheres is [see Equation (7.35)]
dσ(θ) a2
= , (7.80)
dΩ 4
where a is the sum of the radii. According to Equation (7.79), the differen-
tial scattering cross-section (for equal mass spheres) in the laboratory frame
is
dσ(ψ)
= a2 cos ψ. (7.81)
dΩ ′
126 NEWTONIAN DYNAMICS
Note that this cross-section is negative for ψ > π/2. This just tells us that
there is no scattering with scattering angles greater than π/2 (i.e., there is no
backward scattering). Comparing Equations (7.80) and (7.81), we can see
that the scattering is isotropic in the center of mass frame, but appears con-
centrated in the forward direction in the laboratory frame. We can integrate
Equation (7.81) over all solid angles to obtain the total scattering cross-
section in the laboratory frame. Note that we only integrate over angular
regions where the differential scattering cross-section is positive. Doing this,
we get
σ = π a2, (7.82)
which is the same as the total scattering cross-section in the center of mass
frame [see Equation (7.36)]. This is a general result. The total scattering
cross-section is frame independent, since a cross-sectional area is not modi-
fied by switching between different frames of reference.
As we have seen, the Rutherford scattering cross-section takes the form
[see Equation (7.46)]
dσ q4 1
= 2 2 4
(7.83)
dΩ 16 (4 π ǫ0) E sin (θ/2)
in the center of mass frame. It follows, from Equation (7.79), that the
Rutherford scattering cross-section (for equal mass particles) in the labo-
ratory frame is written
dσ q4 cos ψ
′
= . (7.84)
dΩ (4 π ǫ0) E sin4 ψ
2 2
Here, we have made use of the fact that E = 2 E for equal mass particles
[see Equation (7.72)]. Note, again, that this cross-section is negative for
ψ > π/2, indicating the absence of backward scattering.
7.6 Exercises
1. Using the notation of Section 7.2, show that the angular momentum of a
two-body system takes the form
where M = m1 + m2 .
Two-Body Dynamics 127
dσ k 1 (1 − θ/π)
= .
dΩ E π sin θ (θ/π)2 (2 − θ/π)2
128 NEWTONIAN DYNAMICS
Non-Inertial Reference Frames 129
8.1 Introduction
Suppose that a given object has position vector r in some non-rotating iner-
tial reference frame. Let us observe the motion of this object in a non-inertial
reference frame which rotates with constant angular velocity Ω about an
axis passing through the origin of the inertial frame. Suppose, first of all,
that our object appears stationary in the rotating reference frame. Hence, in
the non-rotating frame, the object’s position vector r will appear to precess
about the origin with angular velocity Ω. It follows, from Equation (2.42),
that in the non-rotating reference frame
dr
= Ω × r. (8.1)
dt
Suppose, now, that our object appears to move in the rotating reference
frame with instantaneous velocity v ′ . It is fairly obvious that the appropriate
generalization of the above equation is simply
dr
= v ′ + Ω × r. (8.2)
dt
Let and d/dt and d/dt ′ denote apparent time derivatives in the non-
rotating and rotating frames of reference, respectively. Since an object
130 NEWTONIAN DYNAMICS
dr dr
= ′ + Ω × r, (8.3)
dt dt
or
d d
= ′ + Ω×, (8.4)
dt dt
since r is a general position vector. Equation (8.4) expresses the relationship
between apparent time derivatives in the non-rotating and rotating refer-
ence frames.
Operating on the general position vector r with the time derivative (8.4),
we get
v = v ′ + Ω × r. (8.5)
Here, m is the mass of our object, and f is the (non-fictitious) force acting on
it. Note that these quantities are the same in both reference frames. Making
use of Equation (8.7), the apparent equation of motion of our object in the
rotating reference frame takes the form
m a ′ = f − m Ω × (Ω × r) − 2 m Ω × v ′ . (8.9)
Non-Inertial Reference Frames 131
λ
Equator
The last two terms in the above equation are so-called “fictitious forces”.
Such forces are always needed to account for motion observed in non-
inertial reference frames. Note that fictitious forces can always be distin-
guished from non-fictitious forces in Newtonian Dynamics because the for-
mer have no associated reactions. Let us now investigate the two fictitious
forces appearing in Equation (8.9).
Let our non-rotating inertial frame be one whose origin lies at the center
of the Earth, and let our rotating frame be one whose origin is fixed with
respect to some point, of latitude λ, on the Earth’s surface—see Figure 8.1.
The latter reference frame thus rotates with respect to the former (about an
axis passing through the Earth’s center) with an angular velocity vector, Ω,
which points from the center of the Earth toward its North Pole, and is of
magnitude
2π
Ω= = 7.27 × 10−5 rad./s. (8.10)
24 hrs
m a ′ = f − m Ω × (Ω × r). (8.11)
Let the non-fictitious force acting on our object be the force of gravity,
f = m g. Here, the local gravitational acceleration, g, points directly to-
ward the center of the Earth. It follows, from the above, that the apparent
gravitational acceleration in the rotating frame is written
g ′ = g − Ω × (Ω × R), (8.12)
where R is the displacement vector of the origin of the rotating frame (which
lies on the Earth’s surface) with respect to the center of the Earth. Here, we
are assuming that our object is situated relatively close to the Earth’s surface
(i.e., r ≃ R).
It can be seen, from Equation (8.12), that the apparent gravitational
acceleration of a stationary object close to the Earth’s surface has two com-
ponents. First, the true gravitational acceleration, g, of magnitude g ≃
9.8 m/s2 , which always points directly toward the center of the Earth. Sec-
ond, the so-called centrifugal acceleration, −Ω × (Ω × R). This acceleration
is normal to the Earth’s axis of rotation, and always points directly away
from this axis. The magnitude of the centrifugal acceleration is Ω2 ρ =
Ω2 R cos λ, where ρ is the perpendicular distance to the Earth’s rotation
axis, and R = 6.37 × 106 m is the Earth’s radius—see Figure 8.2.
It is convenient to define Cartesian axes in the rotating reference frame
such that the z ′ -axis points vertically upward, and x ′ - and y ′ -axes are hori-
zontal, with the x ′ -axis pointing directly northward, and the y ′ -axis pointing
directly westward—see Figure 8.1. The Cartesian components of the Earth’s
angular velocity are thus
rotation axis
centrifugal accn.
Earth ρ
Ω2 R
θdev ≃ − sin(2 λ) ≃ −0.1◦ sin(2 λ). (8.18)
2g
134 NEWTONIAN DYNAMICS
Here, a positive angle denotes a northward deflection, and vice versa. Thus,
the deflection is southward in the Northern Hemisphere (i.e., λ > 0) and
northward in the Southern Hemisphere (i.e., λ < 0). The deflection is zero at
the Poles and at the Equator, and reaches its maximum magnitude (which is
very small) at middle latitudes.
x ′ ≃ 0, and
t3
y ′ ≃ −g Ω cos λ. (8.23)
3
In other words, the particle is deflected eastward (i.e., pin the negative y ′ -
direction). Now, the particle hits the ground when t ≃ 2 h/g. Hence, the
net eastward deflection of the particle as strikes the ground is
!1/2
Ω 8 h3
deast = cos λ . (8.24)
3 g
Note that this deflection is in the same direction as the Earth’s rotation (i.e.,
West to East), and is greatest at the Equator, and zero at the Poles. A particle
dropped from a height of 100 m at the Equator is deflected by about 2.2 cm.
Consider a particle launched horizontally with some fairly large velocity
Here, θ is the compass bearing of the velocity vector (so North is 0◦ , East is
90◦ , etc.). Neglecting any vertical motion, Equations (8.19) and (8.20) yield
If follows that the Coriolis force causes the compass bearing of the particle’s
velocity vector to rotate steadily as time progresses. The rotation rate is
dθ
≃ 2 Ω sin λ. (8.32)
dt
Hence, the rotation is clockwise (looking from above) in the Northern Hemi-
sphere, and counter-clockwise in the Southern Hemisphere. The rotation rate
is zero at the Equator, and greatest at the Poles.
136 NEWTONIAN DYNAMICS
N
high λ>0
E low W
Furthermore, as air flows from high to low pressure regions, the Coriolis
force deflects the air in a clockwise/counter-clockwise manner in the North-
ern/Southern Hemisphere, producing cyclonic rotation—see Figure 8.3. It
follows that cyclonic rotation is counter-clockwise in the Northern Hemi-
sphere, and clockwise in the Southern Hemisphere. Thus, this is the direc-
tion of rotation of tropical storms (e.g., hurricanes, typhoons) in each hemi-
sphere.
Non-Inertial Reference Frames 137
T ′
ẍ ′ = − x + 2 Ω sin λ ẏ ′ , (8.34)
lm
T ′
ÿ ′ = − y − 2 Ω sin λ ẋ ′ , (8.35)
lm
T
0 = − g − 2 Ω cos λ ẏ ′ . (8.36)
m
To lowest order in Ω (i.e., neglecting Ω), the final equation, which is just
vertical force balance, yields T ≃ m g. Hence, Equations (8.34) and (8.35)
reduce to
g
ẍ ′ ≃ − x ′ + 2 Ω sin λ ẏ ′ , (8.37)
l
g
ÿ ′ ≃ − y ′ − 2 Ω sin λ ẋ ′ . (8.38)
l
Let
s = x ′ + i y ′. (8.39)
138 NEWTONIAN DYNAMICS
T cos θ
T
origin r′
m
T sin θ
Figure 8.4: The Foucault pendulum.
8.6 Exercises
ẍ = 3 ω2 x + 2 ω ẏ,
ÿ = −2 ω ẋ,
assuming that |x|/a ≪ 1 and |y|/a ≪ 1. You may neglect the gravitational
attraction between the satellite and the mass. Show that the mass executes
a retrograde (i.e., in the opposite sense to the satellite’s orbital rotation)
elliptical orbit about the satellite whose period matches that of the satellite’s
orbit, and whose major and minor axes are in the ratio 2 : 1, and are aligned
along the y- and x-axes, respectively.
Rigid Body Rotation 141
9.1 Introduction
Here, fij is the internal force exerted on the ith element by the jth element,
and Fi the external force acting on the ith element. The internal forces fij
represent the stresses which develop within the body in order to ensure that
its various elements maintain a constant spatial relationship with respect
to one another. Of course, fij = −fji, by Newton’s third law. The external
forces represent forces which originate outside the body.
Generalizing the analysis contained in Section 3.6, we can sum Equa-
tion (9.1) over all mass elements to obtain
d2rcm
M = F. (9.2)
dt2
P
Here, M = i=1,N mi is the total mass, P rcm the position vector of the center
of mass [see Equation (3.27)], and F = i=1,N Fi the total external force. It
can be seen that the center of mass of a rigid body moves under the action
of the external forces like a point particle whose mass is identical with that
of the body.
Again generalizing the analysis of Section 3.6, we can sum ri× Equa-
tion (9.1) over all mass elements to obtain
dL
= T. (9.3)
dt
142 NEWTONIAN DYNAMICS
P
Here, L = i=1,N mi ri × driP /dt is the total angular momentum of the body
(about the origin), and T = i=1,N ri × Fi the total external torque (about
the origin). Note that the above equation is only valid if the internal forces
are central in nature. However, this is not a particularly onerous constraint.
Equation (9.3) describes how the angular momentum of a rigid body evolves
in time under the action of the external torques.
In the following, we shall only consider the rotational motion of rigid
bodies, since their translational motion is similar to that of point particles
[see Equation (9.2)], and, therefore, fairly straightforward in nature.
Consider a rigid body rotating with fixed angular velocity ω about an axis
which passes through the origin—see Figure 9.1. Let ri be the position
vector of the ith mass element, whose mass is mi. We expect this position
vector to precess about the axis of rotation (which is parallel to ω) with
angular velocity ω. It, therefore, follows from Equation (2.42) that
dri
= ω × ri. (9.4)
dt
Thus, the above equation specifies the velocity, vi = dri/dt, of each mass
element as the body rotates with fixed angular velocity ω about an axis
passing through the origin.
The total angular momentum of the body (about the origin) is written
X dri X X h i
L= mi ri × = mi ri ×(ω×ri) = mi ri2 ω − (ri · ω) ri ,
dt
i=1,N i=1,N i=1,N
(9.5)
where use has been made of Equation (9.4), and some standard vector iden-
tities (see Section 2.7). The above formula can be written as a matrix equa-
tion of the form
Lx Ixx Ixy Ixz ωx
Ly = Iyx Iyy Iyz ωy , (9.6)
Lz Izx Izy Izz ωz
where
X Z
Ixx = (yi2 + zi2) mi = (y2 + z2) dm, (9.7)
i=1,N
Rigid Body Rotation 143
O
ri
X Z
Iyy = (xi2 + zi2) mi = (x2 + z2) dm, (9.8)
i=1,N
X Z
Izz = (xi2 + yi2) mi = (x2 + y2) dm, (9.9)
i=1,N
X Z
Ixy = Iyx = − xi yi mi = − x y dm, (9.10)
i=1,N
X Z
Iyz = Izy = − yi zi mi = − y z dm, (9.11)
i=1,N
X Z
Ixz = Izx = − xi zi mi = − x z dm. (9.12)
i=1,N
Here, Ixx is called the moment of inertia about the x-axis, Iyy the moment of
inertia about the y-axis, Ixy the xy product of inertia, Iyz the yz product of
inertia, etc. The matrix of the Iij values is known as the moment of inertia
tensor.1 Note that each component of the moment of inertia tensor can
be written as either a sum over separate mass elements, or as an integral
over infinitesimal mass elements. In the integrals, dm = ρ dV, where ρ is
the mass density, and dV a volume element. Equation (9.6) can be written
1
A tensor is the two-dimensional generalization of a vector. However, for present pur-
poses, we can simply think of a tensor as another name for a matrix.
144 NEWTONIAN DYNAMICS
more succinctly as
L = Ĩ ω. (9.13)
Here, it is understood that L and ω are both column vectors, and Ĩ is the
matrix of the Iij values. Note that Ĩ is a real symmetric matrix: i.e., Iij∗ = Iij
and Iji = Iij.
In general, the angular momentum vector, L, obtained from Equation
(9.13), points in a different direction to the angular velocity vector, ω. In
other words, L is generally not parallel to ω.
Finally, although the above results were obtained assuming a fixed angu-
lar velocity, they remain valid at each instant in time if the angular velocity
varies.
Making use of Equation (9.4), and some vector identities (see Section 2.6),
the kinetic energy takes the form
1 X 1 X
K= mi (ω × ri) · (ω × ri) = ω · mi ri × (ω × ri). (9.15)
2 2
i=1,N i=1,N
1 T
K= ω Ĩ ω. (9.17)
2
Here, ωT is the row vector of the Cartesian components ωx, ωy, ωz, which
is, of course, the transpose (denoted T ) of the column vector ω. When
written in component form, the above equation yields
1
K= Ixx ωx2 + Iyy ωy2 + Izz ωz2 + 2 Ixy ωx ωy + 2 Iyz ωy ωz + 2 Ixz ωx ωz .
2
(9.18)
Rigid Body Rotation 145
A xi = λi xi, (9.27)
A xj = λj xj, (9.28)
where λi 6= λj. Taking the transpose of the first equation and right multiply-
ing by xj, and left multiplying the second equation by xTi , we obtain
It should be clear that this argument can be generalized to deal with any
number of eigenvalues which take the same value.
In conclusion, a real symmetric n-dimensional matrix possesses n real
eigenvalues, with n associated real eigenvectors, which are, or can be cho-
sen to be, mutually orthogonal.
Rigid Body Rotation 147
We have seen that the moment of inertia tensor, Ĩ, defined in Section 9.3,
takes the form of a real symmetric three-dimensional matrix. It therefore fol-
lows, from the matrix theory which we have just reviewed, that the moment
of inertia tensor possesses three mutually orthogonal eigenvectors which are
associated with three real eigenvalues. Let the ith eigenvector (which can be
normalized to be a unit vector) be denoted ω ^ i, and the ith eigenvalue λi. It
then follows that
^ i = λi ω
Ĩ ω ^ i, (9.34)
for i = 1, 3.
The directions of the three mutually orthogonal unit vectors ω ^ i define
the three so-called principal axes of rotation of the rigid body under inves-
tigation. These axes are special because when the body rotates about one
of them (i.e., when ω is parallel to one of them) the angular momentum
vector L becomes parallel to the angular velocity vector ω. This can be seen
from a comparison of Equation (9.13) and Equation (9.34).
Suppose that we reorient our Cartesian coordinate axes so the they co-
incide with the mutually orthogonal principal axes of rotation. In this new
reference frame, the eigenvectors of Ĩ are the unit vectors, ex, ey, and ez,
and the eigenvalues are the moments of inertia about these axes, Ixx, Iyy,
and Izz, respectively. These latter quantities are referred to as the principal
moments of inertia. Note that the products of inertia are all zero in the new
reference frame. Hence, in this frame, the moment of inertia tensor takes
the form of a diagonal matrix: i.e.,
Ixx 0 0
Ĩ = 0 Iyy 0 . (9.35)
0 0 Izz
Incidentally, it is easy to verify that ex, ey, and ez are indeed the eigenvectors
of the above matrix, with the eigenvalues Ixx, Iyy, and Izz, respectively, and
that L = Ĩ ω is indeed parallel to ω whenever ω is directed along ex, ey, or
ez.
When expressed in our new coordinate system, Equation (9.13) yields
111111111111111
000000000000000
000000000000000
111111111111111
000000000000000
111111111111111
000000000000000
111111111111111
b 000000000000000
111111111111111
000000000000000
111111111111111
000000000000000
111111111111111
000000000000000
111111111111111
000000000000000
111111111111111 x
a
Figure 9.2: A uniform rectangular laminar.
have now determined the directions of all three principal axes. Incidentally,
once we have determined the orientation angle, α, of a principal axis, we
can then substitute back into Equation (9.40) to obtain the corresponding
principal moment of inertia, λ.
As an example, consider a uniform rectangular lamina of mass m and
sides a and b which lies in the x-y plane, as shown in Figure 9.2. Suppose
that the axis of rotation passes through the origin (i.e., through a corner
of the lamina). Since z = 0 throughout the lamina, it follows from Equa-
tions (9.11) and (9.12) that Ixz = Iyz = 0. Hence, the z-axis is a principal
axis. After some straightforward integration, Equations (9.7)–(9.10) yield
1
Ixx = m b2, (9.44)
3
1
Iyy = m a2, (9.45)
3
1
Ixy = − m a b. (9.46)
4
Thus, it follows from Equation (9.43) that
1 3 ab
α= tan−1 . (9.47)
2 2 a2 − b2
The above equation specifies the orientation of the two principal axes which
lie in the x-y plane. For the special case where a = b, we get α = π/4, 3π/4:
150 NEWTONIAN DYNAMICS
i.e., the two in-plane principal axes of a square lamina (at a corner) are
parallel to the two diagonals of the lamina.
where ˙ = d/dt. Here, we have made use of the fact that the moments of
inertia of a rigid body are constant in time in the co-rotating body frame.
The above equations are known as Euler’s equations.
Rigid Body Rotation 151
where
Ik
Ω = ω cos α −1 . (9.68)
I⊥
We conclude that, in the body frame, the angular velocity vector precesses
about the symmetry axis (i.e., the z ′ -axis) with the angular frequency Ω.
Now, the components of the angular momentum vector are
Thus, in the body frame, the angular momentum vector is also of constant
length, and precesses about the symmetry axis with the angular frequency
Ω. Furthermore, the angular momentum vector makes a constant angle θ
with the symmetry axis, where
I⊥
tan θ = tan α. (9.72)
Ik
Note that the angular momentum vector, the angular velocity vector, and
the symmetry axis all lie in the same plane: i.e., ez′ · L × ω = 0, as can
easily be verified. Moreover, the angular momentum vector lies between the
angular velocity vector and the symmetry axis (i.e., θ < α) for a flattened
(or oblate) body (i.e., I⊥ < Ik ), whereas the angular velocity vector lies
between the angular momentum vector and the symmetry axis (i.e., θ > α)
for an elongated (or prolate) body (i.e., I⊥ > Ik ).
Rigid Body Rotation 153
Let us now consider the most general motion of a freely rotating asym-
metric rigid body, as seen in the body frame. Since a freely rotating body
experiences no external torques, its angular momentum vector L is a con-
stant of the motion in the inertial fixed frame. In general, the direction of
this vector varies with time in the non-inertial body frame, but its length
remains fixed. This can be seen from Equation (9.49): if dL/dt = 0 then the
scalar product of this equation with L implies that dL2/dt ′ = 0. It follows
from Equation (9.57) that
The above constraint can also be derived directly from Euler’s equations,
(9.51)–(9.53), by setting Tx′ = Ty′ = Tz′ = 0. A freely rotating body sub-
ject to no external torques clearly has a constant rotational kinetic energy.
Hence, from Equation (9.16),
This constraint can also be derived directly from Euler’s equations. We con-
clude that, in the body frame, the components of ω must simultaneously
satisfy the two constraints (9.73) and (9.74). These constraints are the
equations of two ellipsoids whose principal axes coincide with the principal
axes of the body, and whose principal radii are in the ratio I−1 −1 −1
x ′ x ′ : Iy ′ y ′ : Iz ′ z ′
−1/2 −1/2 −1/2
and Ix′ x′ : Iy′ y′ : Iz′ z′ , respectively. In general, the intersection of these
two ellipsoids is a closed curve. Hence, we conclude that the most general
motion of a freely rotating asymmetric body, as seen in the body frame, is a
form of irregular precession in which the tip of the angular velocity vector ω
periodically traces out the aforementioned closed curve. It is easily demon-
strated that the tip of the angular momentum vector L periodically traces
out a different closed curve.
We have seen how we can solve Euler’s equations to determine the proper-
ties of a rotating body in the co-rotating body frame. Let us now investigate
how we can determine the same properties in the inertial fixed frame.
The fixed frame and the body frame share the same origin. Hence, we
can transform from one to the other by means of an appropriate rotation of
our coordinate axes. In general, if we restrict ourselves to rotations about
one of the Cartesian axes, three successive rotations are required to trans-
form the fixed frame into the body frame. There are, in fact, many different
154 NEWTONIAN DYNAMICS
The angular velocity vector associated with φ has the magnitude φ̇, and is
directed along ez (i.e., along the axis of rotation). Hence, we can write
ωφ = φ̇ ez. (9.76)
Clearly, φ̇ is the precession rate about the ez axis, as seen in the fixed frame.
The second rotation is counterclockwise (looking down the axis) through
an angle θ about the x ′′ -axis. The new frame has coordinates x ′′′ , y ′′′ , z ′′′ ,
and unit vectors ex′′′ , ey′′′ , ez′′′ . By analogy with Equation (9.75), the trans-
formation of coordinates can be represented as follows:
x ′′′ 1 0 0 x ′′
′′′ ′′
y = 0 cos θ sin θ y . (9.77)
z ′′′ 0 − sin θ cos θ z ′′
The angular velocity vector associated with θ has the magnitude θ̇, and is
directed along ex′′ (i.e., along the axis of rotation). Hence, we can write
ωθ = θ̇ ex′′ . (9.78)
The angular velocity vector associated with ψ has the magnitude ψ̇, and is
directed along ez′′ (i.e., along the axis of rotation). Note that ez′′′ = ez′ ,
since the third rotation is about ez′′′ . Hence, we can write
ωψ = ψ̇ ez′ . (9.80)
Clearly, ψ̇ is minus the precession rate about the ez′ axis, as seen in the body
frame.
The full transformation between the fixed frame and the body frame is
rather complicated. However, the following results can easily be verified:
ω = ωφ + ωθ + ωψ, (9.83)
The angles φ, θ, and ψ are termed Eulerian angles. Each has a clear
physical interpretation: φ is the angle of precession about the ez axis in the
fixed frame, ψ is minus the angle of precession about the ez′ axis in the
body frame, and θ is the angle of inclination between the ez and ez′ axes.
Moreover, we can express the components of the angular velocity vector ω
in the body frame entirely in terms of the Eulerian angles, and their time
derivatives [see Equations (9.84)–(9.86)].
Consider a rigid body which is constrained to rotate about a fixed axis
with the constant angular velocity ω. Let the fixed angular velocity vector
point along the z-axis. In the previous section, we saw that the angular
momentum and the torque were both steady in the body frame. Since there
is no precession of quantities in the body frame, it follows that the Eulerian
angle ψ is constant. Furthermore, since the angular velocity vector is fixed
in the body frame, as well as the fixed frame [as can be seen by applying
156 NEWTONIAN DYNAMICS
constant angle α with the symmetry axis. It follows that ωy′ = ω sin α and
ωz′ = ω cos α. Equation (9.84) yields ψ = 0. Hence, Equation (9.85) yields
ω sin α = sin θ φ̇. (9.90)
This can be combined with Equation (9.72) to give
#1/2
Ik2
" !
2
φ̇ = ω 1 + − 1 cos α . (9.91)
I⊥2
Finally, Equations (9.86), together with (9.72) and (9.90), yields
tan α Ik
ψ̇ = ω cos α − cos θ φ̇ = ω cos α 1 − = ω cos α 1 − .
tan θ I⊥
(9.92)
A comparison of the above equation with Equation (9.68) gives
ψ̇ = −Ω. (9.93)
Thus, as expected, ψ̇ is minus the precession rate (of the angular momentum
and angular velocity vectors) in the body frame. On the other hand, φ̇ is
the precession rate (of the angular velocity vector and the symmetry axis)
in the fixed frame. Note that φ̇ and Ω are quite dissimilar. For instance,
Ω is negative for elongated bodies (Ik < I⊥ ) whereas φ̇ is positive definite.
It follows that the precession is always in the same sense as Lz in the fixed
frame, whereas the precession in the body frame is in the opposite sense
to Lz′ for elongated bodies. We found, in the previous section, that for
a flattened body the angular momentum vector lies between the angular
velocity vector and the symmetry axis. This means that, in the fixed frame,
the angular velocity vector and the symmetry axis lie on opposite sides of
the fixed angular momentum vector. On the other hand, for an elongated
body we found that the angular velocity vector lies between the angular
momentum vector and the symmetry axis. This means that, in the fixed
frame, the angular velocity vector and the symmetry axis lie on the same side
of the fixed angular momentum vector. (Recall that the angular momentum
vector, the angular velocity vector, and the symmetry axis, are coplanar.)
As an example, consider the free rotation of a thin disk. It is easily
demonstrated (from the perpendicular axis theorem) that
Ik = 2 I⊥ (9.94)
for such a disk. Hence, from Equation (9.68), the precession rate in the
body frame is
Ω = ω cos α. (9.95)
158 NEWTONIAN DYNAMICS
In the limit in which α is small (i.e., in which the angular velocity vector is
almost parallel to the symmetry axis), we obtain
Ω ≃ ω, (9.97)
φ̇ ≃ 2 ω. (9.98)
Thus, the symmetry axis precesses in the fixed frame at approximately twice
the angular speed of rotation. This precession is manifest as a wobbling
motion.
It is known that the axis of rotation of the Earth is very slightly inclined
to its symmetry axis (which passes through the two geographic poles). The
angle α is approximately 0.2 seconds of an arc. It is also known that the
ratio of the terrestrial moments of inertia is about Ik /I⊥ = 1.00327, as deter-
mined from the Earth’s oblateness—see Section 13.7. Hence, from (9.68),
the precession rate of the angular velocity vector about the symmetry axis,
as viewed on Earth, is
Ω = 0.00327 ω, (9.99)
giving a precession period of
2π
T′ = = 305 days. (9.100)
Ω
(Of course, 2π/ω = 1 day.) The observed period of precession is about
440 days. The disagreement between theory and observation is attributed
to the fact that the Earth is not perfectly rigid. The (theoretical) preces-
sion rate of the Earth’s symmetry axis, as viewed from space, is given by
Equation (9.91):
φ̇ = 1.00327 ω. (9.101)
The associated precession period is
2π
T= = 0.997 days. (9.102)
φ̇
The symmetry axis subtends an angle θ ≃ α = 0.2 ′′ with the Earth’s fixed an-
gular momentum vector. The free precession of the Earth’s symmetry axis in
space is superimposed on a much slower precession, with a period of about
26,000 years, due to the small gravitational torque exerted on the Earth by
the Sun and the Moon, as a consequence of the Earth’s slight oblateness—
see Section 13.10.
Rigid Body Rotation 159
z′ y′
C
θ
l
mg y
O
Figure 9.3: A symmetric top.
Let us now study the motion of a rigid rotationally symmetric top which is
free to turn about a fixed point (without friction), but which is subject to
a gravitational torque—see Figure 9.3. Suppose that the z ′ -axis coincides
with the symmetry axis. Let the principal moment of inertia about the sym-
metry axis be Ik , and let the other principal moments both take the value I⊥ .
Suppose that the z-axis points vertically upward, and let the common origin,
O, of the fixed and body frames coincide with the fixed point about which
the top turns. Suppose that the center of mass of the top lies a distance l
along its symmetry axis from point O, and that the mass of the top is m.
Let the symmetry axis of the top subtend an angle θ (which is an Eulerian
angle) with the upward vertical.
Consider an instant in time at which the Eulerian angle ψ is zero. This
implies that the x ′ -axis is horizontal [see Equation (9.81)], as shown in the
diagram. The gravitational force, which acts at the center of mass, thus
exerts a torque m g l sin θ in the x ′ -direction. Hence, the components of the
torque in the body frame are
The components of the angular velocity vector in the body frame are given
by Equations (9.84)–(9.86). Thus, Euler’s equations (9.51)–(9.53) take the
form:
where
Lψ = Ik (cos θ φ̇ + ψ̇) = Ik Ω, (9.109)
and Ω = ωz′ is the angular velocity of the top. Multiplying Equation (9.107)
by sin θ, we obtain
L̇φ = 0, (9.110)
where
Lφ = I⊥ sin2 θ φ̇ + Lψ cos θ. (9.111)
According to Equations (9.108) and (9.110), the two quantities Lψ and Lφ
are constants of the motion. These two quantities are the angular momenta
of the system about the z ′ - and z-axis, respectively. (To be more exact,
they are the generalized momenta conjugate to the coordinates ψ and φ,
respectively—see Section 10.8.) They are conserved because the gravita-
tional torque has no component along either the z ′ - or the z-axis. (Alterna-
tively, they are conserved because the Lagrangian of the top does not depend
explicitly on the coordinates ψ and θ—see Section 10.8.)
If there are no frictional forces acting on the top then the total energy,
E = K + U, is also a constant of the motion. Now,
1
E= I⊥ ωx2′ + I⊥ ωy2′ + Ik ωz2′ + m g l cos θ. (9.112)
2
When written in terms of the Eulerian angles (with ψ = 0), this becomes
1
E= I⊥ θ̇ 2 + I⊥ sin2 θ φ̇ 2 + Lψ
2
/Ik + m g l cos θ. (9.113)
2
Let
2
1 Lψ
E′ = E − , (9.115)
2 Ik
and u = cos θ. It follows that
u̇ 2 = 2 (E ′ − m g l u) (1 − u2) I−1 2 −2
⊥ − (Lφ − Lψ u) I⊥ , (9.116)
or
u̇ 2 = f(u), (9.117)
where f(u) is a cubic polynomial. In principal, the above equation can be
integrated to give u (and, hence, θ) as a function of t:
Zu
du ′
t= p . (9.118)
u0 f(u ′ )
Fortunately, we do not have to perform the above integration (which
is very ugly) in order to discuss the general properties of the solution to
Equation (9.117). It is clear, from Equation (9.118), that f(u) needs to
be positive in order to obtain a physical solution. Hence, the limits of the
motion in θ are determined by the three roots of the equation f(u) = 0.
Since θ must lie between 0 and π/2, it follows that u must lie between 0
and 1. It can easily be demonstrated that f → ±∞ as u → ±∞. It can
also be shown that the largest root u3 lies in the region u3 > 1, and the
two smaller roots u1 and u2 (if they exist) lie in the region −1 ≤ u ≤ +1.
It follows that, in the region −1 ≤ u ≤ 1, f(u) is only positive between
u1 and u2. Figure 9.4 shows a case where u1 and u2 lie in the range 0 to
1. The corresponding values of θ—θ1 and θ2, say—are then the limits of
the vertical motion. The axis of the top oscillates backward and forward
between these two values of θ as the top precesses about the vertical axis.
This oscillation is called nutation. Incidentally, if u1 becomes negative then
the nutation will cause the top to strike the ground (assuming that it is
spinning on a level surface).
If there is a double root of f(u) = 0 (i.e., if u1 = u2) then there is no
nutation, and the top precesses steadily. However, the criterion for steady
precession is most easily obtained directly from Equation (9.106). In the
absence of nutation, θ̇ = θ̈ = 0. Hence, we obtain
m g l = −I⊥ cos θ φ̇ 2 + Lψ φ̇, (9.119)
or
m g l I⊥
Ω= + cos θ φ̇. (9.120)
Ik φ̇ Ik
162 NEWTONIAN DYNAMICS
f (u)
+1
u
u1 u2 u3
The above equation is the criterion for steady precession. Since the right-
hand√side of Equation (9.120) possesses a minimum value, which is given
by 2 m g l I⊥ cos θ/Ik , it follows that
√
2 m g l I⊥ cos θ
Ω > Ωcrit = (9.121)
Ik
Consider a rigid body for which all of the principal moments of inertia are
distinct. Let Iz′ z′ > Iy′ y′ > Ix′ x′ . Suppose that the body is rotating freely
Rigid Body Rotation 163
about one of its principal axes. What happens when the body is slightly
disturbed?
Let the body be initially rotating about the x ′ -axis, so that
Thus, the body oscillates sinusoidally about its initial state with the angu-
lar frequency Ωx′ . It follows that the body is stable to small perturbations
when rotating about the x ′ -axis, in the sense that the amplitude of such
perturbations does not grow in time.
Suppose that the body is initially rotating about the z ′ -axis, and is sub-
ject to a small perturbation. A similar argument to the above allows us
to conclude that the body oscillates sinusoidally about its initial state with
angular frequency
" #1/2
(Iz′ z′ − Ix′ x′ ) (Iz′ z′ − Iy′ y′ )
Ωz′ = ωz ′ . (9.134)
Ix ′ x ′ Iy ′ y ′
Hence, the body is also stable to small perturbations when rotating about
the z ′ -axis.
Suppose, finally, that the body is initially rotating about the y ′ -axis, and
is subject to a small perturbation, such that
ω = λ ex′ + ωy′ ey′ + µ ez′ . (9.135)
It is easily demonstrated that λ satisfies the following differential equation:
(Iy′ y′ − Ix′ x′ ) (Iz′ z′ − Iy′ y′ )
λ̈ − ωy2′ λ = 0. (9.136)
Ix ′ x ′ Iz ′ z ′
Note that the term in square brackets is positive. Hence, the above equation
is not the simple harmonic equation. Indeed its solution takes the form
λ = A e kt + B e−kt. (9.137)
Here, A and B are constants of integration, and
1/2
(Iy′ y′ − Ix′ x′ ) (Iz′ z′ − Iy′ y′ )
k= ωy ′ . (9.138)
Ix ′ x ′ Iz ′ z ′
In this case, the amplitude of the perturbation grows exponentially in time.
Hence, the body is unstable to small perturbations when rotating about the
y ′ -axis.
In conclusion, a rigid body with three distinct principal moments of in-
ertia is stable to small perturbations when rotating about the principal axes
with the largest and smallest moments, but is unstable when rotating about
the axis with the intermediate moment.
Finally, if two of the principal moments are the same then it can be
shown that the body is only stable to small perturbations when rotating
about the principal axis whose moment is distinct from the other two.
Rigid Body Rotation 165
9.11 Exercises
1. Find the principal axis of rotation and the principal moments of inertia for
a thin uniform rectangular plate of mass m and dimensions 2 a by a for
rotation about axes passing through (a) the center of mass, and (b) a corner.
2. A rigid body having an axis of symmetry rotates freely about a fixed point
under no torques. If α is the angle between the symmetry axis and the in-
stantaneous axis of rotation, show that the angle between the axis of rotation
and the invariable line (the L vector) is
−1 (Ik − I⊥ ) tan α
tan
Ik − I⊥ tan2 α
where Ik (the moment of inertia about the symmetry axis) is greater than I⊥
(the moment of inertia about an axis normal to the symmetry axis).
3. Since the greatest value of Ik /I⊥ is 2 (symmetrical lamina) show from the
previous result that the angle between the√angular velocity and angular mo-
mentum vectors cannot exceed tan−1 (1/ 8) ≃ 19.5◦ . Find the correspond-
ing value of α.
4. A thin uniform rod of length l and mass m is constrained to rotate with
constant angular velocity ω about an axis passing through the center of the
rod, and making an angle α with the rod. Show that the angular momentum
about the center of the rod is perpendicular to the rod, and is of magnitude
(m l2 ω/12) sin α. Show that the torque is perpendicular to both the rod and
the angular momentum vector, and is of magnitude (m l2 ω2 /12) sin α cos α.
5. A thin uniform disk of radius a and mass m is constrained to rotate with
constant angular velocity ω about an axis passing through its center, and
making an angle α with the normal to the disk. Find the angular momentum
about the center of the disk, as well as the torque acting on the disk.
6. Demonstrate that for an isolated rigid body which possesses an axis of sym-
metry, and rotates about one of its principal axes, the motion is only stable
to small perturbations if the principal axis is that which corresponds to the
symmetry axis.
166 NEWTONIAN DYNAMICS
Lagrangian Dynamics 167
10 Lagrangian Dynamics
10.1 Introduction
the time, t, explicitly. This would be the case if the dynamical system were
subject to time varying constraints. For instance, a system consisting of a
particle constrained to move on a surface which is itself moving. Finally, by
the chain rule, the variation of the xj due to a variation of the qi (at constant
t) is given by
X ∂xj
δxj = δqi, (10.2)
∂qi
i=1,F
for j = 1, F.
The work done on the dynamical system when its Cartesian coordinates
change by δxj is simply X
δW = fj δxj (10.3)
j=1,F
Here, the fj are the Cartesian components of the forces acting on the various
particles making up the system. Thus, f1, f2, f3 are the components of the
force acting on the first particle, f4, f5, f6 the components of the force acting
on the second particle, etc. Using Equation (10.2), we can also write
X X ∂xj
δW = fj δqi. (10.4)
∂qi
j=1,F i=1,F
where
X ∂xj
Qi = fj . (10.6)
∂qi
j=1,F
Here, the Qi are termed generalized forces. Note that a generalized force
does not necessarily have the dimensions of force. However, the product
Qi qi must have the dimensions of work. Thus, if a particular qi is a Carte-
sian coordinate then the associated Qi is a force. Conversely, if a particular
qi is an angle then the associated Qi is a torque.
Suppose that the dynamical system in question is conservative. It follows
that
∂U
fj = − , (10.7)
∂xj
Lagrangian Dynamics 169
for i = 1, F.
for j = 1, F, where m1, m2, m3 are each equal to the mass of the first par-
ticle, m4, m5, m6 are each equal to the mass of the second particle, etc.
Furthermore, the kinetic energy of the system can be written
1 X
K= mj ẋj2. (10.10)
2
j=1,F
for j = 1, F. Hence, it follows that ẋj = ẋj(q̇1, q̇2, · · · , q̇F , q1, q2, · · · , qF , t).
According to the above equation,
∂ẋj ∂xj
= , (10.12)
∂q̇i ∂qi
Furthermore,
1 ∂ẋj2 ∂ẋj
= ẋj , (10.15)
2 ∂q̇i ∂q̇i
and
1 ∂ẋj2 ∂ẋj ∂ X ∂xj ∂xj
= ẋj = ẋj q̇k +
2 ∂qi ∂qi ∂qi ∂qk ∂t
k=1,F
X ∂2xj ∂2xj
= ẋj q̇k +
∂qi ∂qk ∂qi ∂t
k=1,F
d ∂xj
= ẋj , (10.16)
dt ∂qi
where use has been made of Equation (10.14). Thus, it follows from Equa-
tions (10.13), (10.15), and (10.16) that
Let us take the above equation, multiply by mj, and then sum over all j.
We obtain
d ∂K
X ∂xj ∂K
= fj + , (10.18)
dt ∂q̇i ∂qi ∂qi
j=1,F
where use has been made of Equations (10.9) and (10.10). Thus, it follows
from Equation (10.6) that
d ∂K ∂K
= Qi + . (10.19)
dt ∂q̇i ∂qi
d ∂K ∂U ∂K
=− + . (10.20)
dt ∂q̇i ∂qi ∂qi
L = K − U. (10.21)
Lagrangian Dynamics 171
Since the potential energy U is clearly independent of the q̇i, it follows from
Equation (10.20) that
d ∂L ∂L
− = 0, (10.22)
dt ∂q̇i ∂qi
for i = 1, F. This equation is known as Lagrange’s equation.
According to the above analysis, if we can express the kinetic and po-
tential energies of our dynamical system solely in terms of our generalized
coordinates and their time derivatives then we can immediately write down
the equations of motion of the system, expressed in terms of the generalized
coordinates, using Lagrange’s equation, (10.22). Unfortunately, this scheme
only works for conservative systems. Let us now consider some examples.
v2 = ṙ 2 + (r θ̇)2. (10.23)
1
L= m (ṙ 2 + r2 θ̇ 2) − U(r). (10.24)
2
Note that
∂L ∂L dU
= m ṙ, = m r θ̇ 2 − , (10.25)
∂ṙ ∂r dr
∂L ∂L
= m r2 θ̇, = 0. (10.26)
∂θ̇ ∂θ
d ∂L ∂L
− = 0, (10.27)
dt ∂ṙ ∂r
d ∂L ∂L
− = 0. (10.28)
dt ∂θ̇ ∂θ
172 NEWTONIAN DYNAMICS
Hence, we obtain
d dU
(m ṙ) − m r θ̇ 2 + = 0, (10.29)
dt dr
d
m r2 θ̇ = 0, (10.30)
dt
or
dV
r̈ − r θ̇ 2 = − , (10.31)
dr
r2 θ̇ = h, (10.32)
1 1 1 ẋ 2
K= m1 ẋ 2 + m2 ẋ 2 + I 2 . (10.33)
2 2 2 a
The potential energy of the system takes the form
a
I
x l−x .
m2
m1
thus yields
I
m1 + m2 + 2 ẍ − g (m1 − m2) = 0, (10.37)
a
or
g (m1 − m2)
ẍ = , (10.38)
m1 + m2 + I/a2
which is the correct answer.
Consider the dynamical system drawn in Figure 10.2. This is an Atwood
machine in which one of the weights has been replaced by a second Atwood
machine with a cord of length l ′ . The system now has two degrees of free-
dom, and its instantaneous position is specified by the two coordinates x
and x ′ , as shown.
For the sake of simplicity, let us neglect the masses of the two pulleys.
Thus, the kinetic energy of the system is written
1 1 1
K= m1 ẋ 2 + m2 (−ẋ + ẋ ′ )2 + m3 (−ẋ − ẋ ′ )2, (10.39)
2 2 2
whereas the potential energy takes the form
U = −m1 g x − m2 g (l − x + x ′ ) − m3 g (l − x + l ′ − x ′ ). (10.40)
174 NEWTONIAN DYNAMICS
x l−x .
x′
m1
l ′ − x′.
m2
m3
1 1 1
L = m1 ẋ 2 + m2 (−ẋ + ẋ ′ )2 + m3 (−ẋ − ẋ ′ )2
2 2 2
+g (m1 − m2 − m3) x + g (m2 − m3) x ′ + const. (10.41)
d ∂L ∂L
− = 0, (10.42)
dt ∂ẋ ∂x
d ∂L ∂L
′
− = 0, (10.43)
dt ∂ẋ ∂x ′
yield
x′
M y
x θ x
The accelerations ẍ and ẍ ′ can be obtained from the above two equations
via simple algebra.
dpi ∂L
= , (10.61)
dt ∂qi
dpk ∂L
= = 0. (10.62)
dt ∂qk
Hence,
pk = const. (10.63)
∂L
pθ = = m r2 θ̇ (10.64)
∂θ̇
∂L
px = = M ẋ + m (ẋ + ẋ ′ cos θ) (10.65)
∂ẋ
U = m g z = −m g l cos θ. (10.69)
Also,
v2 = ẋ 2 + ẏ 2 + ż 2 = l2 (θ̇ 2 + sin2 θ φ̇ 2). (10.70)
Thus, the Lagrangian of the system is written
1
L= m l2 (θ̇ 2 + sin2 θ φ̇ 2) + m g l cos θ. (10.71)
2
Note that the Lagrangian is independent of the angular coordinate φ. It
follows that
∂L
pφ = = m l2 sin2 θ φ̇ (10.72)
∂φ̇
is a constant of the motion. Of course, pφ is the angular momentum of the
system about the z-axis. This is conserved because neither the tension in the
string nor the force of gravity exert a torque about the z-axis. Conservation
of angular momentum about the z-axis implies that
sin2 θ φ̇ = h, (10.73)
Lagrangian Dynamics 179
where h is a constant.
The equation of motion of the system,
d ∂L ∂L
− = 0, (10.74)
dt ∂θ̇ ∂θ
yields
g
θ̈ + sin θ − sin θ cos θ φ̇ 2 = 0, (10.75)
l
or
g cos θ
θ̈ + sin θ − h2 = 0, (10.76)
l sin3 θ
where use has been made of Equation (10.73).
Suppose that φ = φ0 = const. It follows that φ̇ = h = 0. Hence,
Equation (10.76) yields
g
θ̈ + sin θ = 0. (10.77)
l
This, of course, is the equation of a simple pendulum whose motion is re-
stricted to the vertical plane φ = φ0—see Section 4.10.
Suppose that θ = θ0 = const. It follows from Equation (10.73) that
φ̇ = φ̇0 = const.: i.e., the pendulum bob rotates uniformly in a horizontal
plane. According to Equations (10.73) and (10.76),
r
g
φ̇0 = , (10.78)
d
where d = l cos θ0 is the vertical distance of the plane of rotation below
the pivot point. This type of pendulum is usually called a conical pendulum,
since the string attached to the pendulum bob sweeps out a cone as the bob
rotates.
Suppose, finally, that the motion is almost conical: i.e., the value of θ
remains close to the value θ0. Let
θ = θ0 + δθ. (10.79)
Taylor expanding Equation (10.76) to first order in δθ, the zeroth order
terms cancel out, and we are left with
where q
Ω = φ̇0 1 + 3 cos2 θ0. (10.82)
Thus, the angle θ executes simple harmonic motion about its mean value θ0
at the angular frequency Ω.
Now the azimuthal angle, φ, increases by
π π
∆φ ≃ φ̇0 =p (10.83)
Ω 1 + 3 cos2 θ0
10.10 Exercises
1. A double pendulum consists of two simple pendula, with one pendulum sus-
pended from the bob of the other. If the two pendula have equal lengths, l,
and have bobs of equal mass, m, and if both pendula are confined to move
in the same vertical plane, find Lagrange’s equations of motion for the sys-
tem. Use θ and φ—the angles the upper and lower pendulums make with
the downward vertical (respectively)—as the generalized coordinates. Do
not assume small angles.
ωt
x
P
11 Hamiltonian Dynamics
11.1 Introduction
B
y
a b x
Figure 11.1: Different paths between points A and B.
Now, in order to find the shortest path between points A and B, we need
to minimize the functional l with respect to small variations in the function
184 NEWTONIAN DYNAMICS
y(x), subject to the constraint that the end points, A and B, remain fixed. In
other words, we need to solve
δl = 0. (11.2)
The meaning of the above equation is that if y(x) → y(x) + δy(x), where
δy(x) is small, then the first-order variation in l, denoted δl, vanishes. In
other words, l → l + O(δy 2). The particular function y(x) for which δl = 0
obviously yields an extremum of l (i.e., either a maximum or a minimum).
Hopefully, in the case under consideration, it yields a minimum of l.
Consider a general functional of the form
Zb
I= F(y, y ′ , x) dx, (11.3)
a
where the end points of the integration are fixed. Suppose that y(x) →
y(x) + δy(x). The first-order variation in I is written
Zb
∂F ∂F
′
δI = δy + δy dx, (11.4)
a ∂y ∂y ′
where δy ′ = d(δy)/dx. Setting δI to zero, we obtain
Zb
∂F ∂F
′
δy + δy dx = 0. (11.5)
a ∂y ∂y ′
This equation must be satisfied for all possible small perturbations δy(x).
Integrating the second term in the integrand of the above equation by
parts, we get
Zb b
∂F d ∂F ∂F
− δy dx + δy = 0. (11.6)
a ∂y dx ∂y ′ ∂y ′ a
d ∂F ∂F
′
y − y′ = 0. (11.10)
dx ∂y ′ ∂y
However,
d ∂F d ∂F ∂F
y′ =y ′
+ y ′′ . (11.11)
dx ∂y ′ dx ∂y ′ ∂y ′
Thus, we get
d ∂F ∂F ∂F
y′ = y′ + y ′′ . (11.12)
dx ∂y ′ ∂y ∂y ′
Now, if F is not an explicit function of x then the right-hand side of the above
equation is the total derivative of F, namely dF/dx. Hence, we obtain
d ∂F dF
y′ = , (11.13)
dx ∂y ′ dx
which yields
∂F
y′ − F = const. (11.14)
∂y ′
p
Returning to the case under consideration, we have F = 1 + y ′ 2, ac-
cording to Equation (11.1) and (11.3). Hence, F is not an explicit function
of y, so Equation (11.9) yields
∂F y′
= = c, (11.15)
∂y ′
p
1 + y′ 2
Suppose that we wish to find the function y(x) which maximizes or mini-
mizes the functional Zb
I= F(y, y ′ , x) dx, (11.17)
a
d ∂F ∂F d ∂[λ G] ∂[λ G]
− + − = 0. (11.19)
dx ∂y ′ ∂y dx ∂y ′ ∂y
In principle, the above equation, together with the constraint (11.18), yields
the functions λ(x) and y(x). Incidentally, λ is generally termed a Lagrange
multiplier. If F and G have no explicit x-dependence then λ is usually a
constant.
As an example, consider the following famous problem. Suppose that
a uniform chain of fixed length l is suspended by its ends from two equal-
height fixed points which are a distance a apart, where a < l. What is the
equilibrium configuration of the chain?
Suppose that the chain has the uniform density per unit length ρ. Let the
x- and y-axes be horizontal and vertical, respectively, and let the two ends
of the chain lie at (±a/2, 0). The equilibrium configuration of the chain is
specified by the function y(x), for −a/2 ≤ x ≤ +a/2, where y(x) is the
vertical distance of the chain below its end points at horizontal position x.
Of course, y(−a/2) = y(+a/2) = 0.
According to the discussion in Section 4.2, the stable equilibrium state
of a conservative dynamical system is one which minimizes the system’s po-
tential energy. Now, the potential energy of the chain is written
Z Z a/2
U = −ρ g y ds = −ρ g y [1 + y ′ 2]1/2 dx, (11.20)
−a/2
Hamiltonian Dynamics 187
p
where ds = dx2 + dy2 is an element of length along the chain, and g is the
acceleration due to gravity. Hence, we need to minimize U with respect to
small variations in y(x). However, the variations in y(x) must be such as to
conserve the fixed length of the chain. Hence, our minimization procedure
is subject to the constraint that
Z Z a/2
l = ds = [1 + y ′ 2]1/2 dx (11.21)
−a/2
remains constant.
It follows, from the above discussion, that we need to minimize the func-
tional
Z a/2
K = U + λl = (−ρ g y + λ) [1 + y ′ 2]1/2 dx, (11.22)
−a/2
dz
= −h−1. (11.26)
dx
Hence,
x
z=− + c, (11.27)
h
where c is a constant. It follows from Equation (11.25) that
Finally, the F Lagrangian equations of motion of the system take the form
d ∂L ∂L
− = 0, (11.38)
dt ∂q̇i ∂qi
for i = 1, F.
Note that the above equations of motion have exactly the same math-
ematical form as the Euler-Lagrange equations (11.34). Indeed, it is clear,
from Section 11.4, that the F Lagrangian equations of motion (11.38) can
all be derived from a single equation: namely,
Z t2
δ L(q1, q2, · · · , qF , q̇1, q̇2, · · · , q̇F , t) dt = 0. (11.39)
t1
In other words, the motion of the system in a given time interval is such
as to maximize or minimize the time integral of the Lagrangian, which is
know as the action integral. Thus, the laws of Newtonian Dynamics can be
summarized in a single statement:
(In practice, the action integral is almost always minimized.) This statement
is known as Hamilton’s principle, and was first formulated in 1834 by the
Irish mathematician William Hamilton.
∂f
Q̃1 = λ(t) , (11.49)
∂q1
d ∂L ∂L ∂f
− − λ(t) = 0, (11.51)
dt ∂q̇i ∂qi ∂qi
for i = 1, F. As before,
∂f
Q̃i = λ(t) (11.52)
∂qi
is the generalized constraint force conjugate to qi. Finally, the generaliza-
tion to multiple holonomic constraints is straightforward.
Consider the following example. A cylinder of radius a rolls without slip-
ping down a plane inclined at an angle θ to the horizontal. Let x represent
the downward displacement of the center of mass of the cylinder parallel
to the surface of the plane, and let φ represent the angle of rotation of the
cylinder about its symmetry axis. The fact that the cylinder is rolling without
slipping implies that x and φ are interrelated via the well-known constraint
f = x − a φ = 0. (11.53)
1 1
L= m ẋ 2 + I φ̇ 2 + m g x sin θ, (11.54)
2 2
192 NEWTONIAN DYNAMICS
φ
x a
θ
Figure 11.2: A cylinder rolling down an inclined plane.
where m is the cylinder’s mass, I its moment of inertia, and g the accelera-
tion due to gravity.
Note that ∂f/∂x = 1 and ∂f/∂φ = −a. Hence, Equation (11.51) yields
the following Lagrangian equations of motion:
m ẍ − m g sin θ − λ = 0, (11.55)
I φ̈ + λ a = 0. (11.56)
This represents the frictional force acting parallel to the plane which im-
pedes the downward acceleration of the cylinder, causing it to be less than
Hamiltonian Dynamics 193
m r̈ − m r θ̇ 2 − m g cos θ − λ = 0, (11.64)
m r2 θ̈ + m g r sin θ = 0. (11.65)
The second of these equations can be integrated (by multiplying by θ̇), sub-
ject to the constraint (11.62), to give
2g
θ̇ 2 = cos θ + c, (11.66)
a
where c is a constant. Let v0 be the tangential velocity of the bead at the
bottom of the hoop (i.e., at θ = 0). It follows that
2g v2
θ̇ 2 = (cos θ − 1) + 02 . (11.67)
a a
Equations (11.62), (11.64), and (11.67) can be combined to give
!
v2
λ = −m 3 g cos θ − 2 g + 0 . (11.68)
a
194 NEWTONIAN DYNAMICS
This represents the radial reaction exerted on the bead by the hoop. Of
course, there is no constraint force conjugate to θ (since ∂f/∂θ = 0) because
the bead slides without friction.
where the mij depend on the qi, but not on the q̇i. It is easily demonstrated
from the above equation that
X ∂K
q̇i = 2 K. (11.71)
∂q̇i
i=1,F
∂L ∂K
pi = = , (11.72)
∂q̇i ∂q̇i
If all of the conditions discussed above are satisfied then Equations (11.71)
and (11.72) yield
H = K + U. (11.74)
Hamiltonian Dynamics 195
In other words, the function H is equal to the total energy of the system.
Consider the variation of the function H. We have
X ∂L ∂L
δH = δq̇i pi + q̇i δpi − δq̇i − δqi . (11.75)
∂q̇i ∂qi
i=1,F
The first and third terms in the bracket cancel, because pi = ∂L/∂q̇i. Fur-
thermore, since Lagrange’s equation can be written ṗi = ∂L/∂qi (see Sec-
tion 10.8), we obtain
X
δH = (q̇i δpi − ṗi δqi) . (11.76)
i=1,F
Suppose, now, that we can express the total energy of the system, H, solely
as a function of the qi and the pi, with no explicit dependence on the q̇i. In
other words, suppose that we can write H = H(qi, pi). When the energy is
written in this fashion it is generally termed the Hamiltonian of the system.
The variation of the Hamiltonian function takes the form
X ∂H ∂H
δH = δpi + δqi . (11.77)
∂pi ∂qi
i=1,F
∂H
q̇i = , (11.78)
∂pi
∂H
ṗi = − , (11.79)
∂qi
1 p2 1
H=K+U= + k x2. (11.82)
2 m 2
Thus, Hamilton’s equations, (11.78) and (11.79), yield
∂H p
ẋ = = , (11.83)
∂p m
∂H
ṗ = − = −k x. (11.84)
∂x
Of course, the first equation is just a restatement of Equation (11.80), whereas
the second is Newton’s second law of motion for the system.
Consider a particle of mass m moving in the central potential U(r). In
this case,
1
K = m (ṙ 2 + r2 θ̇ 2), (11.85)
2
where r, θ are plane polar coordinates. The generalized momenta conjugate
to r and θ are
∂K
pr = = m ṙ, (11.86)
∂ṙ
∂K
pθ = = m r2 θ̇, (11.87)
∂θ̇
respectively. Hence, we can write
!
1 p2
K= pr2 + 2θ . (11.88)
2m r
∂H pr
ṙ = = , (11.90)
∂pr m
∂H pθ
θ̇ = = , (11.91)
∂pθ m r2
Hamiltonian Dynamics 197
11.8 Exercises
k e−β t
f=− er ,
r2
where k and β are positive constants. Find the Hamiltonian of the particle.
Compare the Hamiltonian to the total energy of the particle. Is the energy of
the particle conserved?
Coupled Oscillations 199
12 Coupled Oscillations
12.1 Introduction
1 X
K= mij(q1, q2, · · · , qF ) q̇i q̇j. (12.1)
2
i,j=1,F
Without loss of generality, we can specify that the weights mij in the above
form are invariant under interchange of the indices i and j: i.e.,
d ∂K ∂K ∂U
− + = 0, (12.3)
dt ∂q̇i ∂qi ∂qi
of the system. Hence, the first two terms in the above equation are zero,
and the condition for equilibrium reduces to
∂U(q10, q20, · · · , qF 0)
Qi0 = − = 0, (12.4)
∂qi
1 X
K≃ Mij δq̇i δq̇j, (12.6)
2
i,j=1,F
where
Mij = mij(q10, q20, · · · , qF 0), (12.7)
and
Mij = Mji. (12.8)
Note that the weights Mij in the quadratic form (12.6) are now constants.
Taylor expanding the potential energy function about the equilibrium
state, up to second-order in the δqi, we obtain
X 1 X
U ≃ U0 − Qi0 δqi − Gij δqi δqj, (12.9)
2
i=1,F i,j=1,F
where U0 = U(q10, q20, · · · , qF 0), the Qi0 are specified in Equation (12.4),
and
∂2U(q10, q20, · · · , qF 0)
Gij = − . (12.10)
∂qi ∂qj
Coupled Oscillations 201
Note that, since ∂2U/∂qi ∂qj ≡ ∂2U/∂qj ∂qi, the constants weights Gij in
the above quadratic form are invariant under interchange of the indices i
and j: i.e.,
Gij = Gji. (12.12)
With K and U specified by the quadratic forms (12.6) and (12.11), re-
spectively, Lagrange’s equations of motion (12.3) reduce to
X
(Mij δq̈j − Gij δqj) = 0, (12.13)
j=1,F
for i = 1, F. Note that the above coupled differential equations are linear
in the δqi. It follows that the solutions are superposable. Let us search for
solutions of the above equations in which all of the perturbed coordinates
δqi have a common time variation of the form
(G − γ2 M) δq = 0. (12.16)
Here, G is the real [see Equation (12.10)] symmetric [see Equation (12.12)]
F × F matrix of the Gij values. Furthermore, M is the real [see Equa-
tion (12.1)] symmetric [see Equation (12.8)] F × F matrix of the Mij val-
ues. Finally, δq is the 1 × F vector of the δqi values, and 0 is a null vector.
202 NEWTONIAN DYNAMICS
(G − λ M) x = 0. (12.17)
Here, G and M are both real symmetric matrices, whereas λ is termed the
eigenvalue, and x the associated eigenvector. The above matrix equation is
essentially a set of F homogeneous simultaneous algebraic equations for
the components of x. As is well-known, a necessary condition for such a set
of equations to possess a non-trivial solution is that the determinant of the
matrix must be zero: i.e.,
|G − λ M| = 0. (12.18)
The above formula reduces to an Fth-order polynomial equation for λ. Hence,
we conclude that Equation (12.17) is satisfied by F eigenvalues, and F as-
sociated eigenvectors.
We can easily demonstrate that the eigenvalues are all real. Suppose
that λk and xk are the kth eigenvalue and eigenvector, respectively. Then
we have
G xk = λk M xk. (12.19)
Taking the transpose and complex conjugate of the above equation, and
right multiplying by xk, we obtain
Since x∗kT M xk is not generally zero, except in the trivial case where xk
is a null vector, we conclude that λ∗k = λk for all k. In other words, the
eigenvalues are all real. It immediately follows that the eigenvectors can
also be chosen to be all real.
Coupled Oscillations 203
Consider two distinct eigenvalues, λk and λl, with the associated eigen-
vectors xk and xl, respectively. We have
G xk = λk M xk, (12.24)
G xl = λl M xl. (12.25)
Right multiplying the transpose of Equation (12.24) by xl, and left multi-
plying Equation (12.25) by xTk, we obtain
where δkl = 1 when k = l, and δkl = 0 otherwise. Note, finally, that since
the xk, for k = 1, F, are mutually orthogonal, they are independent (i.e., one
eigenvector cannot be expressed as a linear combination of the others), and
completely span F-dimensional vector space.
It follows from Equation (12.14) and (12.15), plus the mathematical results
contained in the previous section, that the most general solution to Equa-
tion (12.13) can be written
X
δq(t) = δqk(t), (12.31)
k=1,F
204 NEWTONIAN DYNAMICS
where √ √
δqk(t) = αk e+ λk t
+ βk e− λk t
xk. (12.32)
Here, the λk and the xk are the eigenvalues and eigenvectors obtained by
solving Equation (12.17). Moreover, the αk and βk are arbitrary constants.
Finally,√we have made use of the fact that the two roots of γ2 = λk are
γ = ± λk.
According to Equation (12.31), the most general perturbed motion of
the system consists of a linear combination of F different modes. These
modes are generally termed normal modes, since they are mutually orthogo-
nal (because the xk are mutually orthogonal). Furthermore, it follows from
the independence of the xk that the normal modes are also independent (i.e.,
one mode cannot be expressed as a linear combination of the others). The
kth normal mode has a specific pattern of motion which is specified by the
kth eigenvector, xk. Moreover, the kth mode has a specific time variation
which is determined by the associated eigenvalue, λk. Recall that λk is real.
Hence, there are only two possibilities. Either λk is positive, in which case
we can write
δqk(t) = αk e+γk t + βk e−γk t xk, (12.33)
and √
xTk M δq̇(0) = λk (αk − βk). (12.36)
Hence,
√
xTk M δq(0) + xTk M δq̇(0)/ λk
αk = , (12.37)
2
√
xTk M δq(0) − xTk M δq̇(0)/ λk
βk = . (12.38)
2
Note, finally, that since there are 2F arbitrary constants (two for each of
the F normal modes), we can be sure that Equation (12.31) represents the
most general solution to Equation (12.13).
Let us now try to express K, U, and the equations of motion in terms of the
ηk.
The kinetic energy can be written
1 T
K= δq̇ M δq̇, (12.41)
2
where use has been made of Equation (12.6). It follows from (12.39) that
1 X
K= η̇k η̇l xTk M xl. (12.42)
2
k,l=1,F
Hence, the kinetic energy K takes the form of a diagonal quadratic form
when expressed in terms of the normal coordinates.
The potential energy can be written
1
U = − δqT G δq, (12.44)
2
where use has been made of Equations (12.11). It follows from (12.39) that
1 X
U=− ηk ηl xTk G xl. (12.45)
2
k,l=1,F
k′ k k′
m m
q1 q2
1h ′ 2 i
U= k q1 + k (q2 − q1)2 + k ′ q22 . (12.51)
2
The above expression can be rearranged to give
1h i
U= (k + k ′ ) q12 − 2 k q1 q2 + (k + k ′ ) q22 . (12.52)
2
A comparison of Equations (12.50) and (12.52) with the standard forms
(12.6) and (12.11) yields the following expressions for the mass matrix, M,
and the force matrix, G:
!
m 0
M = , (12.53)
0 m
!
−k − k ′ k
G = . (12.54)
k −k − k ′
208 NEWTONIAN DYNAMICS
Now, the equation of motion of the system takes the form [see Equation
(12.17)]
(G − λ M) x = 0, (12.55)
where x is the column vector of the q1 and q2 values. The solubility condi-
tion for the above equation is
|G − λ M| = 0, (12.56)
or
−k − k ′ − λ m k
= 0, (12.57)
−k − k ′ − λ m
k
m2 λ2 + 2 m (k + k ′ ) λ + k ′ (k ′ + 2 k) = 0. (12.58)
k′
λ1 = − , (12.59)
m
(2 k + k ′ )
λ2 = − . (12.60)
m
The fact that the roots are negative implies that both normal modes are os-
cillatory in nature: i.e., the original equilibrium is stable. The characteristic
oscillation frequencies of the modes are
s
p k′
ω1 = −λ1 = , (12.61)
m
s
p 2 k + k′
ω2 = −λ2 = . (12.62)
m
q1 k
= . (12.63)
q2 k + k′ + λ m
Consider the simple model of a linear triatomic molecule (e.g., carbon diox-
ide) illustrated in Figure 12.2. The molecule consists of a central atom of
mass M flanked by two identical atoms of mass m. The atomic bonds are
represented as springs of spring constant k. The linear displacements of the
flanking atoms are q1 and q2, whilst that of the central atom is q3. Let us
investigate the linear modes of oscillation our model molecule.
210 NEWTONIAN DYNAMICS
m k M k m
q1 q3 q2
Figure 12.2: A model triatomic molecule.
m 2 M 2
K= (q̇1 + q̇22) + q̇ , (12.71)
2 2 3
whereas the potential energy takes the form
k k
U= (q3 − q1)2 + (q2 − q3)2. (12.72)
2 2
Clearly, we have a three degree of freedom dynamical system. However,
we can reduce this to a two degree of freedom system by only considering
oscillatory modes of motion, and, hence, neglecting translational modes. We
can achieve this by demanding that the center of mass of the system remains
stationary. In other words, we require that
Now, the equation of motion of the system takes the form [see Equation
(12.17)]
(G − λ M) x = 0, (12.79)
where x is the column vector of the q1 and q2 values. The solubility condi-
tion for the above equation is
|G − λ M| = 0, (12.80)
1 2
K = (η̇ + η̇22), (12.90)
2 1
1
U = (ω12 η12 + ω22 η22), (12.91)
2
respectively.
12.9 Exercises
slides without friction along the hoop. Consider the case of small oscillations.
Calculate the characteristic frequencies, and describe the associated normal
modes.
4. A simple pendulum of mass m and length l is suspended from a block of mass
M which is constrained to slide along a frictionless horizontal track. Consider
the case of small oscillations. Calculate the characteristic frequencies, and
describe the associated normal modes.
5. A thin uniform rod of mass m and length a is suspended from one end by
a light string of length l. The other end of the string is attached to a fixed
support. Consider the case of small oscillations in a vertical plane. Calculate
the characteristic frequencies, and describe the associated normal modes.
6. A triatomic molecule consists of three atoms of equal mass. Each atom is
attached to the other two atoms via identical chemical bonds. The equilib-
rium state of the molecule is such that the masses are at the vertices of an
equilateral triangle of side a. Modeling the chemical bonds as springs of
spring constant k, and only considering motion in the plane of the molecule,
find the vibrational frequencies and normal modes of the molecule. Exclude
translational and rotational modes.
214 NEWTONIAN DYNAMICS
Gravitational Potential Theory 215
13.1 Introduction
(r ′ − r)
g = G m′ . (13.1)
|r ′ − r| 3
(x ′ − x)
gx = G m ′ , (13.2)
[(x ′ − x)2 + (y ′ − y)2 + (z ′ − z)2] 3/2
(x ′ − x)
≡
[(x ′ − x)2 + (y ′ − y)2 + (z ′ − z)2] 3/2
!
∂ 1
. (13.3)
∂x [(x − x) + (y − y)2 + (z ′ − z)2] 1/2
′ 2 ′
Hence,
∂ 1
gx = G m ′ ′
, (13.4)
∂x |r − r|
with analogous expressions for gy and gz. It follows that
g = −∇Φ, (13.5)
where
G m′
Φ=− (13.6)
|r ′ − r|
216 NEWTONIAN DYNAMICS
where the integral is taken over all space. This is the general expression for
the gravitational potential, Φ(r), generated by a continuous mass distribu-
tion, ρ(r).
Hence,
|r ′ − r|−1 = (r2 − 2 r r ′ F + r ′ 2)−1/2. (13.17)
Suppose that r > r ′ . In this case, we can expand |r ′ −r|−1 as a convergent
power series in r ′ /r, to give
" 2 3#
1 r′ 1 r′ r′
′ −1 2
|r − r| = 1+ F+ (3 F − 1) + O . (13.18)
r r 2 r r
Let us now average this expression over the azimuthal angle, φ ′ . Since
h1i = 1, hcos φ ′ i = 0, and hcos2 φ ′ i = 1/2, it is easily seen that
Hence,
1 r′
D E
′
|r − r| −1
= 1+ cos θ cos θ ′ (13.21)
r r
2 3#
r′ 3 1 3 1 r′
+ cos2 θ − cos2 θ ′ − +O .
r 2 2 2 2 r
P0(x) = 1, (13.23)
P1(x) = x, (13.24)
1
P2(x) = (3 x2 − 1), (13.25)
2
etc. The Legendre polynomials are mutually orthogonal: i.e.,
Z1 Zπ
δnm
Pn(x) Pm(x) dx = Pn(cos θ) Pm(cos θ) sin θ dθ = .
−1 0 (n + 1/2)
(13.26)
Here, δnm is 1 if n = m, and 0 otherwise. The Legendre polynomials also
form a complete set: i.e., any well-behaved function of x can be represented
as a weighted sum of the Pn(x). Likewise, any well-behaved (even) function
of θ can be represented as a weighted sum of the Pn(cos θ).
A comparison of Equation (13.21) and Equations (13.23)–(13.25) makes
it reasonably clear that, when r > r ′ , the complete expansion of h|r ′ − r|−1i
is
1 X r′ n
D E
|r ′ − r|−1 = Pn(cos θ) Pn(cos θ ′ ). (13.27)
r r
n=0,∞
where
Zr Zπ
2π G
Φn(r) = − n+1 r ′ n+2 ρ(r ′ , θ ′ ) Pn(cos θ ′ ) sin θ ′ dr ′ dθ ′
r 0 0
Z∞ Zπ
−2 π G r n
r ′ 1−n ρ(r ′ , θ ′ ) Pn(cos θ ′ ) sin θ ′ dr ′ dθ ′ .
r 0
(13.30)
Now, given that the Pn(cos θ) form a complete set, we can always write
X
ρ(r, θ) = ρn(r) Pn(cos θ). (13.31)
n=0,∞
This expression can be inverted, with the aid of Equation (13.26), to give
Zπ
ρn(r) = (n + 1/2) ρ(r, θ) Pn(cos θ) sin θ dθ. (13.32)
0
for r ≤ a, and Za
4π G γ
Φ0(r) = − r ′ 2 dr ′ (13.36)
r 0
220 NEWTONIAN DYNAMICS
2π G γ (3 a2 − r2)
Φ(r) = − (3 a2 − r2) = −G M (13.37)
3 2 a3
for r ≤ a, and
4π G γ a3 GM
Φ(r) = − =− (13.38)
3 r r
for r > a. Here, M = (4π/3) a3 γ is the total mass of the sphere.
According to Equation (13.38), the gravitational potential outside a uni-
form sphere of mass M is the same as that generated by a point mass M
located at the sphere’s center. It turns out that this is a general result for any
finite spherically symmetric mass distribution. Indeed, from the previous
analysis, it is clear that ρ(r) = ρ0(r) and Φ(r) = Φ0(r) for a spherically sym-
metric mass distribution. Suppose that the mass distribution extends out to
r = a. It immediately follows, from Equation (13.33), that
Z
G a GM
Φ(r) = − 4π r ′ 2 ρ(r ′ ) dr ′ = − (13.39)
r 0 r
for r > a, where M is the total mass of the distribution. We, thus, conclude
that Newton’s laws of motion, in their primitive form, apply not just to point
masses, but also to extended spherically symmetric masses. In fact, this is
something which we have implicitly assumed all along in this course.
According to Equation (13.37), the gravitational potential inside a uni-
form sphere is quadratic in r. This implies that if a narrow shaft were
drilled though the center of the sphere then a test mass, m, moving in
this shaft would experience a gravitational force acting toward the cen-
ter which scales linearly in r. In fact, the force in question is given by
fr = −m ∂Φ/∂r = −(G m M/a3) r. It follows that a test mass dropped into
the shaft executes simple harmonic motion about the center of the sphere
with period
a
r
T = 2π , (13.40)
g
where g = G M/a2 is the gravitational acceleration at the sphere’s surface.
axis of rotation
prolate oblate
produced by rotating an ellipse about a major or minor axis. Let the axis of
rotation coincide with the z-axis, and let the outer boundary of the spheroid
satisfy
2
r = aθ(θ) = a 1 − ǫ P2(cos θ) , (13.41)
3
where ǫ is the termed the ellipticity. Here, we are assuming that |ǫ| ≪ 1, so
that the spheroid is very close to being a sphere. If ǫ > 0 then the spheroid
is slightly squashed along its symmetry axis, and is termed oblate. Likewise,
if ǫ < 0 then the spheroid is slightly elongated along its axis, and is termed
prolate—see Figure 13.1. Of course, if ǫ = 0 then the spheroid reduces to a
sphere.
Now, according to Equation (13.29) and (13.30), the gravitational po-
tential generated outside an axially symmetric mass distribution can be writ-
ten
X Pn(cos θ)
Φ(r, θ) = Jn , (13.42)
rn+1
n=0,∞
where
ZZ
Jn = −2π G r 2+n ρ(r, θ) Pn(cos θ) sin θ dr dθ. (13.43)
Here, the integral is taken over the whole cross-section of the distribution
in r–θ space.
222 NEWTONIAN DYNAMICS
Hence, Zπ
2π G γ
Jn = − Pn(cos θ) a3+n
θ (θ) sin θ dθ, (13.45)
(3 + n) 0
giving
Zπ
2π G γ a3+n 2
Jn ≃ − Pn(cos θ) P0(cos θ) − (3 + n) ǫ P2(cos θ) sin θ dθ,
(3 + n) 0 3
(13.46)
to first-order in ǫ. It is thus clear, from Equation (13.26), that, to first-order
in ǫ, the only non-zero Jn are
4π G γ a3
J0 = − = −G M, (13.47)
3
8π G γ a5 ǫ 2
J2 = = G M a2 ǫ, (13.48)
15 5
since M = (4π/3) a3 γ.
Thus, the gravitational potential outside a uniform spheroid of total mass
M, mean radius a, and ellipticity ǫ, is
G M 2 G M a2
Φ(r, θ) = − + ǫ P2(cos θ) + O(ǫ2). (13.49)
r 5 r3
In particular, the gravitational potential on the surface of the spheroid is
G M 2 G M a2
Φ(aθ, θ) = − + ǫ P2(cos θ) + O(ǫ2), (13.50)
aθ 5 aθ3
which yields
GM 4
Φ(aθ, θ) ≃ − 1+ ǫ P2(cos θ) + O(ǫ2) , (13.51)
a 15
where use has been made of Equation (13.41).
So, let us now consider a self-gravitating spheroid of mass M, mean ra-
dius a, and ellipticity ǫ: e.g., a star, or a planet. Assuming, for the sake of
simplicity, that the spheroid is composed of uniform density incompressible
fluid, the gravitational potential on its surface is given by Equation (13.51).
Gravitational Potential Theory 223
However, the condition for an equilibrium state is that the potential be con-
stant over the surface. If this is not the case then there will be gravitational
forces acting tangential to the surface. Such forces cannot be balanced by in-
ternal pressure, which only acts normal to the surface. Hence, from (13.51),
it is clear that the condition for equilibrium is ǫ = 0. In other words, the
equilibrium configuration of a self-gravitating mass is a sphere. Deviations
from this configuration can only be caused by forces in addition to self-
gravity and internal pressure: e.g., centrifugal forces due to rotation, or
tidal forces due to orbiting masses.
Ω2 r2 Ω2 r2
χ(r, θ) = − sin2 θ = − [1 − P2(cos θ)] (13.53)
2 3
As before, the criterion for an equilibrium state is that the surface lie at
a constant total potential, so as to eliminate tangential surface forces which
224 NEWTONIAN DYNAMICS
GM 4 Ω2 a2
− 1+ ǫ P2(cos θ) − [1 − P2(cos θ)] ≃ c, (13.56)
a 15 3
which yields
5 Ω2 a3
ǫ= . (13.57)
4 GM
We conclude, from the above expression, that the equilibrium configu-
ration of a (relatively slowly) rotating self-gravitating mass distribution is
an oblate spheroid: i.e., a sphere which is slightly flattened along its axis of
rotation. The degree of flattening is proportional to the square of the ro-
tation rate. Now, from (13.41), the mean radius of the spheroid is a, the
radius at the poles (i.e., along the axis of rotation) is ap = a (1 − 2 ǫ/3),
and the radius at the equator (i.e., perpendicular to the axis of rotation) is
ae = a (1+ǫ/3)—see Figure 13.2. Hence, the degree of rotational flattening
can be written
ae − ap 5 Ω2 a3
=ǫ= . (13.58)
a 4 GM
Now, for the Earth, a = 6.37 × 106 m, Ω = 7.27 × 10−5 rad./s, and
M = 5.97 × 1024 kg. Thus, we predict that
ǫ = 0.00429, (13.59)
axis of rotation
gc
ap
ae
ǫ = 0.101. (13.61)
Note that this degree of flattening is much larger than that of the Earth, due
to Jupiter’s relatively large radius (about 10 times that of Earth), combined
with its relatively short rotation period (about 0.4 days). In fact, the polar
flattening of Jupiter is clearly apparent from images of this planet. The
observed degree of polar flattening of Jupiter is actually ǫ = 0.065. Our
estimate of ǫ is probably slightly too large because Jupiter, which is mostly
gaseous, does not rotate as a solid body.
where the integral is over the whole volume of the Earth, and I0 = (2/5) M a2
would be the Earth’s moment of inertia were it exactly spherical. Now, the
226 NEWTONIAN DYNAMICS
Here, use has been made of Equations (13.9)–(13.11). Likewise, the Earth’s
moment of inertia about an axis perpendicular to its axis of rotation (and
passing through the Earth’s center) is
Z Z
I⊥ = (y + z ) γ d r = r2 γ (sin2 θ sin2 φ + cos2 θ) d3r
2 2 3
Z Z
1 1 2
2 2 2 3
= r γ sin θ + cos θ d r = r γ (1 + cos2 θ) d3r,
2 2
(13.64)
G M G (Ik − I⊥ )
Φ(r, θ) = − + P2(cos θ). (13.66)
r r3
This is the general expression for the gravitational potential generated out-
side an axially symmetric mass distribution. The first term on the right-hand
side is the monopole gravitational field which would be generated if all of the
mass in the distribution were concentrated at its center of mass, whereas the
second term is the quadrupole field generated by any deviation from spheri-
cal symmetry in the distribution.
Consider two point masses, m and m ′ , executing circular orbits about their
common center of mass, C, with angular velocity ω. Let R be the distance
Gravitational Potential Theory 227
R
C
′
m m
ρ
.
between the masses, and ρ the distance between point C and mass m—see
Figure 13.3. We know, from Section 7.3, that
GM
ω2 = , (13.67)
R3
and
m′
ρ= R, (13.68)
M
where M = m + m ′ .
Let us transform to a non-inertial frame of reference which rotates, about
an axis perpendicular to the orbital plane and passing through C, at the
angular velocity ω. In this reference frame, both masses appear to be sta-
tionary. Consider mass m. In the rotating frame, this mass experiences a
gravitational acceleration
G m′
ag = (13.69)
R2
directed toward the center of mass, and a centrifugal acceleration (see Chap-
ter 8)
ac = ω2 ρ (13.70)
directed away from the center of mass. However, it is easily demonstrated,
using Equations (13.67) and (13.68), that
ac = ag. (13.71)
D
′
R
r
θ
A R C B
z
Figure 13.4: Calculation of tidal forces.
It follows that
gc = −∇χ, (13.73)
where
χ = ω2 ρ z (13.74)
is the centrifugal potential, and z = r cos θ. The centrifugal potential can
also be written
G m′ r
χ= P1(cos θ). (13.75)
R R
The gravitational acceleration at point D due to mass m ′ is given by
gg = −∇Φ ′ , (13.76)
G m′
Φ′ = − . (13.77)
R′
Here, R ′ is the distance between points A and D. Note that the gravita-
tional potential generated by the mass distribution m ′ is the same as that
Gravitational Potential Theory 229
D
B
ρ
D C′ D
B C ρ B
D
B
Figure 13.5: The center B of the mass distribution m orbits about the center
of mass C in a circle of radius ρ. If the mass distribution is non-rotating
then a non-central point D must maintain a constant spatial relationship to
B. It follows that point D orbits some point C ′ , which has the same spatial
relationship to C that D has to B, in a circle of radius ρ.
230 NEWTONIAN DYNAMICS
Hence,
" #
G m′
′ r r2
Φ ≃− 1 + P1(cos θ) + 2 P2(cos θ) (13.81)
R R R
to second-order in r/R.
Adding χ and Φ ′ , we obtain
" #
G m′′ r2
χ+Φ ≃− 1 + 2 P2(cos θ) (13.82)
R R
a−
R a+
m′ m
G m G m′
χ + Φ′ + Φ ≃ − −
a R
4 Gm G m ′ a2
− ǫ P2(cos θ) − P2(cos θ), (13.83)
15 a R3
where we have treated ǫ and a/R as small quantities. As before, the condi-
tion for equilibrium is that the total potential be constant over the surface
of the spheroid. Hence, we obtain
15 m ′
3
a
ǫ=− (13.84)
4 m R
as our prediction for the ellipticity induced in a self-gravitating spherical
mass distribution of total mass m and radius a by a second mass, m ′ , which
is in a circular orbit of radius R about the distribution. Thus, if a+ is the
maximum radius of the distribution, and a− the minimum radius (see Fig-
ure 13.6), then
a+ − a− 15 m ′ a 3
= −ǫ = . (13.85)
a 4 m R
Consider the tidal elongation of the Earth due to the Moon. In this case,
we have a = 6.37 × 106 m, R = 3.84 × 108 m, m = 5.97 × 1024 kg, and
m ′ = 7.35 × 1022 kg. Hence, we calculate that −ǫ = 2.1 × 10−7, or
∆a = a+ − a− = −ǫ a = 1.34 m. (13.86)
232 NEWTONIAN DYNAMICS
We, thus, predict that tidal forces due to the Moon cause the Earth to elon-
gate along the axis joining its center to the Moon by about 1.3 meters. Since
water is obviously more fluid than rock (especially on relatively short time-
scales) most of this elongation takes place in the oceans rather than in the
underlying land. Hence, the oceans rise, relative to the land, in the region
of the Earth closest to the Moon, and also in the region furthest away. Since
the Earth is rotating, whilst the tidal bulge of the oceans remains relatively
stationary, the Moon’s tidal force causes the ocean at a given point on the
Earth’s surface to rise and fall, by about a meter, twice daily, giving rise to
the phenomenon known as the tides.
Consider the tidal elongation of the Earth due to the Sun. In this case,
we have a = 6.37 × 106 m, R = 1.50 × 1011 m, m = 5.97 × 1024 kg, and
m ′ = 1.99 × 1030 kg. Hence, we calculate that −ǫ = 9.6 × 10−8, or
∆a = a+ − a− = −ǫ a = 0.61 m. (13.87)
Thus, the tidal elongation due to the Sun is about half that due to the Moon.
It follows that the tides are particularly high when the Sun, the Earth, and
the Moon lie approximately in a straight-line, so that the tidal effects of the
Sun and the Moon reinforce one another. This occurs at a new moon, or at
a full moon. These type of tides are called spring tides (note that the name
has nothing to do with the season). Conversely, the tides are particularly
low when the Sun, the Earth, and the Moon form a right-angle, so that the
tidal effects of the Sun and the Moon partially cancel one another. These
type of tides are called neap tides. Generally speaking, we would expect two
spring tides and two neap tides per month.
In reality, the amplitude of the tides varies significantly from place to
place on the Earth’s surface, due to the presence of the continents, which
impede the flow of the oceanic tidal bulge around the Earth. Moreover,
there is a time-lag of approximately 12 minutes between the Moon being
directly overhead (or directly below) and high tide, because of the finite
inertia of the oceans. Similarly, the time-lag between a spring tide and a full
moon, or a new moon, can be up to 2 days.
moon then the center of mass lies close to the planet’s center.) According to
the analysis in the previous section, a constituent element of the moon ex-
periences a force per unit mass, due to the gravitational field of the planet,
which takes the form
g ′ = −∇(χ + Φ ′ ), (13.88)
where
G m′ 2
χ + Φ′ = − (z − x2/2 − y2/2) + const. (13.89)
R3
Here, (x, y, z) is a Cartesian coordinate system whose origin is the center of
the moon, and whose z-axis always points toward the center of the planet.
It follows that
2 G m′ x y
′
g = − ex − ey + z ez . (13.90)
R3 2 2
This so-called tidal force is generated by the spatial variation of the planet’s
gravitational field over the interior of the moon, and acts to elongate the
moon along an axis joining its center to that of the planet, and to compress
it in any direction perpendicular to this axis. Note that the magnitude of the
tidal force increases strongly as the radius, R, of the moon’s orbit decreases.
Now, if the tidal force is sufficiently strong then it can overcome the moon’s
self-gravity, and thereby rip the moon apart. It follows that there is a mini-
mum radius, generally referred to as the Roche radius, at which a moon can
orbit a planet without being destroyed by tidal forces.
Let us derive an expression for the Roche radius. Consider a small mass
element at the point on the surface of the moon which lies closest to the
planet, and at which the tidal force is consequently largest (i.e., x = y = 0,
z = a). According to Equation (13.90), the mass experiences an upward
(from the moon’s surface) tidal acceleration due to the gravitational attrac-
tion of the planet of the form
2 G m′ a
g′ = ez. (13.91)
R3
The mass also experiences a downward gravitational acceleration due to the
gravitational influence of the moon which is written
Gm
g=− ez. (13.92)
a2
Thus, the effective surface gravity at the point in question is
!
Gm m ′ a3
geff = 2 1−2 . (13.93)
a m R3
234 NEWTONIAN DYNAMICS
It follows that if the planet and the moon have the same mean density then
the Roche radius is 2.44 times the planet’s radius. Note that small orbital
bodies such as rocks, or even very small moons, can survive intact within the
Roche radius because they are held together by internal tensile forces rather
than gravitational attraction. However, this mechanism becomes progres-
sively less effective as the size of the body in question increases. Not sur-
prisingly, virtually all large planetary moons occurring in the Solar System
have orbital radii which exceed the relevant Roche radius, whereas virtually
all planetary ring systems (which consist of myriads of small orbiting rocks)
have radii which lie inside the relevant Roche radius.
φ y
Ω
θ
Earth
Sun
L η
In Astronomy, the plane of the Sun’s apparent orbit relative to the Earth is
known as the ecliptic plane. Let us define Cartesian coordinates centered
on the Earth such that the x- and y-axes lie in the ecliptic plane, and the
z-axis is normal to this plane (in the sense such that the Earth’s north pole
lies at positive z). It follows that the z-axis points toward a fixed point in
the sky (in the constellation Draco) known as the north ecliptic pole. It is
convenient to parameterize the instantaneous position of the Sun in terms
of a counter-clockwise azimuthal angle η—see Figure 13.7.
Let Ω be the Earth’s angular velocity vector due to its daily rotation. This
vector makes an angle θ with the z-axis, where θ = 23.44◦ is the inclination
of the ecliptic to the Earth’s equatorial plane. Suppose that the projection
of Ω onto the ecliptic plane makes an angle φ with the x-axis, where φ is
measured in a counter-clockwise sense—see Figure 13.7. The orientation
of the Earth’s axis of rotation (which is, of course, parallel to Ω) is thus
specified by the two angles θ and φ. Note, however, that these two angles
are also Euler angles, in the sense specified in Chapter 9. Let us examine the
Earth-Sun system at an instant in time when φ = 0: i.e., when Ω lies in the
x-z plane.
Now, according to Equation (13.66), the potential energy of the Earth-
Sun system is written
G Ms Me G Ms (Ik − I⊥ )
U = Ms Φ = − + P2(γ), (13.97)
L L3
where Ms is the mass of the Sun, Me the mass of the Earth, Ik the Earth’s
moment of inertia about its axis of rotation, and I⊥ the Earth’s moment of
236 NEWTONIAN DYNAMICS
and
r = L (cos η, sin η, 0). (13.99)
Hence,
cos γ = cos η sin θ, (13.100)
giving
G Ms Me G Ms (Ik − I⊥ )
U=− + (3 cos2 η sin2 θ − 1). (13.101)
L 2 L3
Now, on time-scales much longer than a year, we can average the above
expression over the Sun’s orbit to give
G Ms Me G Ms (Ik − I⊥ ) 3
U=− + sin2 θ − 1 (13.102)
L 2 L3 2
U = U0 − α cos2 θ, (13.103)
3 G Ms
α= (Ik − I⊥ ). (13.104)
4 L3
Here, Equation (13.103) represents the mean potential energy of the Earth
due to its gravitational interaction with the Sun.
Now, according to Section 9.9, the rotational kinetic energy of the Earth
can be written
1
K= I⊥ θ̇2 + I⊥ sin2 θ φ̇2 + Ik Ω2 , (13.105)
2
where the Earth’s angular velocity
Ω = cos θ φ̇ + ψ̇ (13.106)
Gravitational Potential Theory 237
is a constant of the motion. Here, ψ is the third Euler angle. Hence, the
Earth’s Lagrangian takes the form
1
L=K−U= I⊥ θ̇2 + I⊥ sin2 θ φ̇2 + Ik Ω2 + α cos2 θ. (13.107)
2
One equation of motion which can immediately be derived from this La-
grangian is
d ∂L ∂L
− = 0, (13.108)
dt ∂θ̇ ∂θ
which reduces to
∂L
I⊥ θ̈ − = 0. (13.109)
∂θ
Consider steady precession, in which θ̇ = 0, and φ̇ and ψ̇ are constants.
It follows, from the above equation, that such motion must satisfy the con-
straint
∂L
= 0. (13.110)
∂θ
Thus, we obtain
where use has been made of Equations (13.106) and (13.107). It follows
that
I⊥ cos θ φ̇ 2 − Ik Ω φ̇ − 2 α cos θ = 0. (13.112)
Now, |φ̇| ≪ Ω, so the above equation yields
!
2 α cos θ 3 Ms G Ik − I⊥
φ̇ ≃ − =− cos θ
Ik Ω 2 L3 Ω Ik
3 Ms G
= − ǫ cos θ, (13.113)
2 L3 Ω
where ǫ = 0.00335 is the Earth’s ellipticity, and use has been made of Equa-
tions (13.65) and (13.104). According to the above expression, the mutual
interaction between the Sun and the quadrupole gravitational field gener-
ated by the Earth’s slight oblateness causes the Earth’s axis of rotation to
precess steadily about the normal to the ecliptic plane at the rate φ̇. The fact
that φ̇ is negative implies that the precession is in the opposite direction to
the directions of the Earth’s rotation and the Sun’s apparent orbit about the
Earth. Incidentally, the interaction causes a precession of the Earth’s rota-
tional axis, rather than the plane of the Sun’s orbit, because the Earth’s axial
moment of inertia is much less than the Sun’s orbital moment of inertia.
238 NEWTONIAN DYNAMICS
Now, the Sun’s apparent orbital angular velocity, ωs, about the Earth
satisfies
G Ms
ωs2 = . (13.114)
L3
Hence,
φ̇ 3 ωs
=− ǫ cos θ, (13.115)
ωs 2 Ω
and the precession period in years is given by
ωs 2 To(day)
Ts(yr) = = , (13.116)
|φ̇| 3 ǫ cos θ
where To(day) = Ω/ωs = 365.24 is the Sun’s orbital period in days. Thus,
given that ǫ = 0.00335 and θ = 23.44◦ , we obtain
Using analogous arguments, we can also show that the Earth’s axis of
rotation precesses due to the interaction between the Moon and the Earth’s
quadrupole gravitational field. The precession rate is given by
3 Mm G
φ̇ = − ǫ cos θ, (13.118)
2 R3 Ω
where Mm is the mass of the Moon, and R the mean distance between the
Moon and Earth. Now, the orbital angular velocity, ωm, of the Moon about
the Earth satisfies
2 G Me
ωm = . (13.119)
R3
Hence,
φ̇ 3 ωm 2 M
m
=− cos θ, (13.120)
ωs 2 ωs Ω Me
and the precession period in years is given by
ωs Me
Tm(yr) = = T02 Ts, (13.121)
|φ̇| Mm
where Me/Mm = 81.3 is the ratio of the mass of the Earth to that of the
Moon, and To = ωs/ωm = 0.081 is the Moon’s (synodic) orbit period in
years. It follows that
Tm = 0.53 Ts. (13.122)
Gravitational Potential Theory 239
Of course, the precession rates induced by the Sun and Moon are addi-
tive, giving rise to a net precession period, T , which satisfies
1 1 1
= + . (13.123)
T Ts Tm
Thus, we finally obtain
Ts
T= = 27, 000 years. (13.124)
2.88
According to the above analysis, the combined gravitational interaction
of the Sun and the Moon with the quadrupole field generated by the Earth’s
slight oblateness causes the Earth’s axis of rotation to precess steadily about
the normal to the ecliptic plane with a period of about 27,000 years. This
effect is known as the precession of the equinoxes. The observed period for
the precession of the equinoxes is actually 26,000 years. Our estimate is
slightly off because we neglected to take into account the small eccentricities
of the Earth’s orbit around the Sun, and the Moon’s orbit around the Earth.
We also neglected the fact that the Moon’s orbit is inclined by about 5◦ to
the ecliptic plane.
Now, the point in the sky toward which the Earth’s axis of rotation points
is known as the north celestial pole. Currently, this point lies within about
a degree of the fairly bright star Polaris, which is consequently sometimes
known as the north star or the pole star. It follows that Polaris appears to be
almost stationary in the sky, always lying due north, and can thus be used
for navigational purposes. Indeed, mariners have relied on the north star
for many hundreds of years to determine direction at sea. Unfortunately,
because of the precession of the equinoxes, the north celestial pole is not a
fixed point in the sky, but instead traces out a circle, of angular radius 23.44◦ ,
about the north ecliptic pole, with a period of 26,000 years. Hence, a few
thousand years from now, the north celestial pole will no longer coincide
with Polaris, and there will be no convenient way of telling direction from
the stars.
The projection of the ecliptic plane onto the sky is called the ecliptic,
and coincides with the apparent path of the Sun against the backdrop of
the stars. Furthermore, the projection of the Earth’s equator onto the sky
is known as the celestial equator. The ecliptic is inclined at 23.44◦ to the
celestial equator. The two points in the sky at which the ecliptic crosses the
celestial equator are called the equinoxes, since night and day are equally
long when the Sun lies at these points. Thus, the Sun reaches the spring
equinox on about March 21st, and this traditionally marks the beginning of
240 NEWTONIAN DYNAMICS
Spring. Likewise, the Sun reaches the autumn equinox on about September
22nd, and this traditionally marks the beginning of Autumn. However, the
precession of the Earth’s axis of rotation causes the celestial equator (which
is always normal to this axis) to precess in the sky, and thus also produces
a precession of the equinoxes along the ecliptic—strictly speaking, it is this
effect which is known as the precession of the equinoxes. The direction of
the precession is opposite to the direction of the Sun’s apparent motion
around the ecliptic. Thus, in about 2000 BC, when the science of Astronomy
originated in ancient Egypt and Babylonia, the spring equinox lay in the
constellation Aries. Indeed, the spring equinox is still sometimes called the
first point of Aries in astronomical texts. About 90 BC, the spring equinox
moved into the constellation Pisces, where it still remains. The equinox will
move into the constellation Aquarius (marking the beginning of the much
heralded “Age of Aquarius”) in about 2600 AD. Incidentally, the position of
the spring equinox in the sky is of great importance in Astronomy, since it
is used as the zero of celestial longitude (much as Greenwich is used as the
zero of terrestrial longitude).
The Solar System consists of eight major planets (Mercury to Neptune) mov-
ing around the Sun in slightly elliptical orbits which are approximately co-
planar with one another. According to Chapter 6, if we neglect the relatively
weak interplanetary gravitational interactions then the perihelia of the var-
ious planets (i.e., the points on their orbits at which they are closest to the
Sun) remain fixed in space. However, once these interactions are taken into
account, it turns out that the planetary perihelia all slowly precess around
the Sun. We can calculate the approximate rate of perihelion precession of
a given planet by treating the other planets as uniform concentric rings, cen-
tered on the Sun, of mass equal to the planetary mass, and radius equal to
the mean orbital radius.1 This is equivalent to averaging the interplanetary
gravitational interactions over the orbits of the other planets. It is reason-
able to do this, since the precession period in question is very much longer
than the orbital period of any planet in the Solar System. Thus, by treating
the other planets as rings, we can calculate the mean gravitational pertur-
bation due to these planets, and, thereby, determine the desired precession
rate.
We can conveniently index the planets in the Solar System such that
Mercury is planet 1, and Neptune planet 8. Let the Mi and the Ri, for
i = 1, 8, be the planetary masses and orbital radii, respectively. Furthermore,
let M0 be the mass of the Sun. It follows, from the previous section, that the
gravitational potential generated at the ith planet by the Sun and the other
planets is
" 2 4 #
G M0 X Mj 1
Rj 9
Rj
Φ(Ri) = − −G 1+ + + ···
Ri Ri 4 Ri 64 Ri
j<i
!2 !4
X Mj 1 Ri 9 Ri
−G 1 + + + · · · . (13.129)
Rj 4 Rj 64 Rj
j>i
Now, the radial force per unit mass acting on the ith planet is written f(Ri) =
−dΦ(Ri)/dr, giving
" 2 4 #
G M0 G X 3 Rj 45 Rj
f(Ri) = − 2 − 2 Mj 1 + + + ···
Ri Ri j<i 4 Ri 64 Ri
1
M.G. Stewart, American Jou. Physics 73, 730 (2005).
242 NEWTONIAN DYNAMICS
! !2 !4
G X Ri 1 Ri 9 Ri
+ 2 Mj + + · · · . (13.130)
Ri j>i Rj 2 Rj 16 Rj
Hence, we obtain
" 2 4 #
2 G M0 G X Rj 135 Rj
Ri f ′ (Ri) = 2
+ 2
Mj 2 + 3 + + ···
Ri Ri j<i Ri 32 Ri
! !2 !4
G X Ri 1 Ri 27 Ri
+ 2 Mj + + · · · ,
Ri j>i Rj 2 Rj 16 Rj
(13.131)
where ′ ≡ d/dr. It follows that
−1/2
Ri f ′ (Ri)
3+ = (13.132)
f(Ri)
2 " 2 4 #
3 X Mj Rj 15 Rj 175 Rj
1+ 1+ + + ···
4 M0 Ri 8 Ri 64 Ri
j<i
!3 !2 !4
3 X Mj Ri 15 Ri 175 Ri
+ 1 + + + · · · .
4 M0 Rj 8 Rj 64 Rj
j>i
Thus, according to Equation (6.104), the apsidal angle for the ith planet is
3 X Mj
2 "
Rj 15 Rj 2 175 Rj 4
#
ψi = π 1 + 1+ + + ···
4 M0 Ri 8 Ri 64 Ri
j<i
!3 !2 !4
3 X
Mj
Ri 15 Ri 175 Ri
+ 1+ + + ··· .
4 M0 Rj 8 Rj 64 Rj
j>i
(13.133)
Hence, the perihelion of the ith planet advances by
2 " 2 4 #
3π X Mj Rj 15 Rj 175 Rj
δψi = 1+ + + ···
2 M0 Ri 8 Ri 64 Ri
j<i
!3 !2 !4
3π X Mj Ri 15 Ri 175 Ri
+ 1 + + + · · ·
2 M0 Rj 8 Rj 64 Rj
j>i
(13.134)
Gravitational Potential Theory 243
Table 13.1: Data for the major planets in the Solar System, giving the plane-
tary mass relative to that of the Sun, the orbital period in years, and the mean
orbital radius relative to that of the Earth.
radians per revolution around the Sun. Now, the time for one revolution is
Ti = 2π/ωi, where ωi2 = G M0/Ri3. Thus, the rate of perihelion precession,
in arc seconds per year, is given by
75 X Mj
2 " #
Rj 15 Rj 2 175 Rj 4
δψ̇i = 1+ + + ···
Ti(yr) M0 Ri 8 Ri 64 Ri
j<i
!
X Mj Ri 3
!2 !4
15 Ri 175 Ri
+ 1+ + + ··· .
M0 Rj 8 Rj 64 Rj
j>i
(13.135)
Table 13.2 and Figure 13.8 compare the observed perihelion precession
rates with the theoretical rates calculated from Equation (13.135) and the
planetary data given in Table 13.1. It can be seen that there is excellent
agreement between the two, except for the planet Venus. The main reason
for this is that Venus has an unusually low eccentricity (e = 0.0068), which
renders its perihelion point extremely sensitive to small perturbations.
If the calculation described in the previous section is carried out more accu-
rately, taking into account the slight eccentricities of the planetary orbits, as
244 NEWTONIAN DYNAMICS
Table 13.2: The observed perihelion precession rates of the planets compared
with the theoretical precession rates calculated from Equation (13.135) and
Table 13.1. The precession rates are in arc seconds per year.
Figure 13.8: The triangular points show the observed perihelion precession
rates of the major planets in the Solar System, whereas the square points show
the theoretical rates calculated from Equation (13.135) and Table 13.1. The
precession rates are in arc seconds per year.
Gravitational Potential Theory 245
well as their small mutual inclinations, and retaining many more terms in
the expansions (13.126) and (13.128), then the perihelion precession rate
of the planet Mercury is found to be 5.32 arc seconds per year. However, the
observed precession rate is 5.75 arc seconds per year. It turns out that the
cause of this discrepancy is the general relativistic correction to Newtonian
gravity.
General Relativity gives rise to a small correction to the force per unit
mass exerted by the Sun (mass M0) on a planet in a circular orbit of radius
r, and angular momentum per unit mass h. In fact, the modified formula
for f is
G M 0 3 G M 0 h2
f≃− 2 − , (13.136)
r c2 r4
where c is the velocity of light in vacuum. It follows that
!
r f′ 3 h2
= −2 1 + 2 2 + · · · . (13.137)
f c r
6π G M0
δψ ≃ (13.139)
c2 r
radians per revolution due to the general relativistic correction to Newto-
nian gravity. Here, use has been made of h2 = G M0 r. It follows that the
rate of perihelion precession due to the general relativistic correction is
0.0383
δψ̇ ≃ (13.140)
RT
arc seconds per year, where R is the mean orbital radius in mean Earth or-
bital radii, and T is the orbital period in years. Hence, from Table 13.1, the
general relativistic contribution to δψ̇ for Mercury is 0.41 arc seconds per
year. It is easily demonstrated that the corresponding contribution is negli-
gible for the other planets in the Solar System. If the above calculation is
carried out sightly more accurately, taking the eccentricity of Mercury’s orbit
into account, then the general relativistic contribution to δψ̇ becomes 0.43
arc seconds per year. It follows that the total perihelion precession rate for
246 NEWTONIAN DYNAMICS
Mercury is 5.32+0.43 = 5.75 arc seconds per year. This is in exact agreement
with the observed precession rate. Indeed, the ability of General Relativity
to explain the discrepancy between the observed perihelion precession rate
of Mercury, and that calculated from Newtonian Dynamics, was one of the
first major successes of this theory.
13.14 Exercises
1. The Moon’s orbital period about the Earth is approximately 27.3 days, and is
in the same direction as the Earth’s axial rotation (whose period is 24 hours).
Use this data to show that high tides at a given point on the Earth occur every
12 hours and 26 minutes.
2. Consider an artificial satellite in a circular orbit of radius L about the Earth.
Suppose that the normal to the plane of the orbit subtends an angle θ with
the Earth’s axis of rotation. By approximating the orbiting satellite as a uni-
form ring, demonstrate that the Earth’s oblateness causes the plane of the
satellite’s orbit to precess about the Earth’s rotational axis at the rate
2
φ̇ 1 R
≃− ǫ cos θ.
ω 2 L
14.1 Introduction
GM
ω2 = , (14.1)
R3
r1 m2
= , (14.2)
r2 m1
248 NEWTONIAN DYNAMICS
η m3
R r2 m2
r1 ωt
ξ
C
m1
Figure 14.1: The circular restricted three-body problem.
where M = m1 + m2.
It is convenient to choose our unit of length such that R = 1, and our unit
of mass such that G M = 1. It follows, from Equation (14.1), that ω = 1.
However, we shall continue to retain ω in our equations, for the sake of
clarity. Let µ1 = G m1, and µ2 = G m2 = 1 − µ1. It is easily demonstrated
that r1 = µ2, and r2 = 1 − r1 = µ1. Hence, the two orbiting masses, m1 and
m2, have position vectors r1 = (ξ1, η1, 0) and r2 = (ξ2, η2, 0), respectively,
where (see Figure 14.1)
r1 = µ2 (− cos ωt, − sin ωt, 0), (14.3)
r2 = µ1 (cos ωt, sin ωt, 0). (14.4)
Let the third mass have position vector r = (ξ, η, ζ). The Cartesian compo-
nents of the equation of motion of this mass are thus
(ξ − ξ1) (ξ − ξ2)
ξ̈ = −µ1 3
− µ2 , (14.5)
ρ1 ρ23
(η − η1) (η − η2)
η̈ = −µ1 3
− µ2 , (14.6)
ρ1 ρ23
ζ ζ
ζ̈ = −µ1 3
− µ2 3 , (14.7)
ρ1 ρ2
where
ρ12 = (ξ − ξ1)2 + (η − η1)2 + ζ2, (14.8)
ρ22 = (ξ − ξ2)2 + (η − η2)2 + ζ2. (14.9)
The Three-Body Problem 249
Ċ = 0. (14.14)
the planet Jupiter, and mass m3 a comet. Since the mass of a comet is very
much less than that of the Sun or Jupiter, and the Sun and Jupiter are in
(almost) circular orbits about their common center of mass, the dynamical
system in question satisfies all of the necessary criteria to be considered an
example of a restricted three-body problem.
Now, the mass of the Sun is much greater than that of Jupiter. It follows
that the gravitational effect of Jupiter on the cometary orbit is negligible un-
less the comet makes a very close approach to Jupiter. Hence, as described
in Chapter 6, before and after such an approach the comet executes a stan-
dard elliptical orbit about the Sun with fixed orbital parameters: i.e., fixed
major radius, eccentricity, and inclination to the ecliptic plane. However, in
general, the orbital parameters before and after the close approach will not
be the same as one another. Let us investigate further.
Now, since m1 ≫ m2, we have µ1 = G m1 ≃ G (m1 + m2) = 1, and ρ1 ≃
r. Hence, according to Equations (6.51) and (6.57), the (approximately)
conserved energy (per unit mass) of the comet before and after its close
approach to Jupiter is
1 2 1 1
E≡ ξ̇ + η̇ 2 + ζ̇ 2 − = − . (14.16)
2 r 2a
Note that the comet’s orbital energy is entirely determined by its major ra-
dius, a. (Incidentally, we are working in units such that the major radius of
Jupiter’s orbit is unity.) Furthermore, the (approximately) conserved angu-
lar momentum (per unit mass) of the comet before and after its approach to
Jupiter is written h, where h is directed normal to the comet’s orbital plane,
and, from Equations (6.27) and (6.47),
h2 = a (1 − e2). (14.17)
Here, e is the comet’s orbital eccentricity. It follows that
q
ω · h = ω h cos I = a (1 − e2) cos I, (14.18)
since ω = 1 in our adopted system of units. Here, I is the angle of inclination
of the normal to the comet’s orbital plane to that of Jupiter’s orbital plane.
Let a, e, and I be the major radius, eccentricity, and inclination angle of
the cometary orbit before the close encounter with Jupiter, and let a ′ , e ′ ,
and I ′ be the corresponding parameters after the encounter. It follows from
Equations (14.15), (14.16), and (14.18), and the fact that C is conserved
during the encounter, whereas E and h are not, that
1 q
1 q
+ a (1 − e2) cos I = + a ′ (1 − e ′ 2) cos I ′ . (14.19)
2a 2 a′
The Three-Body Problem 251
This result is known as the Tisserand criterion, and restricts the possible
changes in the orbital parameters of a comet due to a close encounter with
Jupiter (or any other massive planet).
The Tisserand criterion is very useful. For instance, whenever a new
comet is discovered, astronomers immediately calculate its Tisserand pa-
rameter,
1 q
TJ = + 2 a (1 − e2) cos I. (14.20)
a
If this parameter has the same value as that of a previously observed comet
then it is quite likely that the new comet is, in fact, the same comet, but that
its orbital parameters have changed since it was last observed, due to a close
encounter with Jupiter. Incidentally, the subscript J in the above formula is
to remind us that we are dealing with the Tisserand parameter for close
encounters with Jupiter. (The parameter is, thus, evaluated in a system of
units in which the major radius of Jupiter’s orbit is unity). Obviously, it
is also possible to calculate Tisserand parameters for close encounters with
other planets.
The Tisserand criterion is also applicable to so-called gravity assists, in
which a space-craft gains energy due to a close encounter with a moving
planet. Such assists are often employed in missions to the outer planets to
reduce the amount of fuel which the space-craft must carry in order to reach
its destination. In fact, it is clear, from Equations (14.16) and (14.19), that
a space-craft can make use of a close encounter with a moving planet to
increase (or decrease) its orbital major radius a, and, hence, to increase (or
decrease) its total orbital energy.
m3
r
x
m1 C m2
µ2 µ1
.
(r − r1) (r − r2)
r̈ + 2 ω × ṙ = −µ1 3
− µ2 − ω × (ω × r), (14.21)
ρ1 ρ23
Here, the second term on the left-hand side of Equation (14.21) is the Cori-
olis acceleration, whereas the final term on the right-hand side is the cen-
trifugal acceleration. The components of Equation (14.21) reduce to
µ1 (x + µ2) µ2 (x − µ1)
ẍ − 2 ω ẏ = − − + ω2 x, (14.24)
ρ13 ρ23
µ1 y µ2 y
ÿ + 2 ω ẋ = − 3 − 3 + ω2 y, (14.25)
ρ1 ρ2
µ1 z µ2 z
z̈ = − 3 − 3 , (14.26)
ρ1 ρ2
which yield
∂U
ẍ − 2 ω ẏ = − , (14.27)
∂x
The Three-Body Problem 253
∂U
ÿ + 2 ω ẋ = − , (14.28)
∂y
∂U
z̈ = − , (14.29)
∂z
where
µ1 µ2 ω2 2
U=− − − (x + y2) (14.30)
ρ1 ρ2 2
is the sum of the gravitational and centrifugal potentials.
Now, it follows from Equations (14.27)–(14.29) that
∂U
ẍ ẋ − 2 ω ẋ ẏ = −ẋ , (14.31)
∂x
∂U
ÿ ẏ + 2 ω ẋ ẏ = −ẏ , (14.32)
∂y
∂U
z̈ ż = −ż . (14.33)
∂z
Summing the above three equations, we obtain
d 1 2
ẋ + ẏ2 + ż2 + U = 0. (14.34)
dt 2
In other words,
C = −2 U − v2 (14.35)
is a constant of the motion, where v2 = ẋ2 + ẏ2 + ż2. In fact, C is the
Jacobi integral introduced in Section 14.3 [it is easily demonstrated that
Equations (14.10) and (14.35) are identical]. Note, finally, that the mass
m3 is restricted to regions in which
−2 U ≥ C, (14.36)
Let us search for possible equilibrium points of the mass m3 in the rotat-
ing reference frame. Such points are termed Lagrange points. Thus, in the
rotating frame, the mass m3 would remain at rest if placed at one of the
Lagrange points. It is, thus, clear that these points are fixed in the rotat-
ing frame. Conversely, in the inertial reference frame, the Lagrange points
254 NEWTONIAN DYNAMICS
rotate about the center of mass with angular velocity ω, and the mass m3
would consequently also rotate about the center of mass with angular ve-
locity ω if placed at one of these points (with the appropriate velocity). In
the following, we shall assumed, without loss of generality, that m1 ≥ m2.
The Lagrange points satisfy ṙ = r̈ = 0 in the rotating frame. It thus
follows, from Equations (14.27)–(14.29), that the Lagrange points are the
solutions of
∂U ∂U ∂U
= = = 0. (14.37)
∂x ∂y ∂z
Now, it is easily seen that
!
∂U µ1 µ2
= + z. (14.38)
∂z ρ13 ρ23
Since the term in brackets is positive definite, we conclude that the only
solution to the above equation is z = 0. Hence, all of the Lagrange points lie
in the x-y plane.
If z = 0 then it is readily demonstrated that
where use has been made of the fact that µ1 + µ2 = 1. Hence, Equa-
tion (14.30) can also be written
! !
1 ρ2 1 ρ2 µ1 µ2
U = −µ1 + 1 − µ2 + 2 + . (14.40)
ρ1 2 ρ2 2 2
∂U ∂U ∂ρ1 ∂U ∂ρ2
= + = 0, (14.41)
∂x ∂ρ1 ∂x ∂ρ2 ∂x
∂U ∂U ∂ρ1 ∂U ∂ρ2
= + = 0, (14.42)
∂y ∂ρ1 ∂y ∂ρ2 ∂y
which reduce to
! !
1 − ρ13 x + µ2 1 − ρ23 x − µ1
µ1 + µ2 = 0, (14.43)
ρ12 ρ1 ρ22 ρ2
! !
1 − ρ13 y 1 − ρ23 y
µ1 + µ2 = 0. (14.44)
ρ12 ρ1 ρ22 ρ2
The Three-Body Problem 255
µ2 ρ23 (1 − ρ2 + ρ22/3)
= . (14.45)
3 µ1 (1 + ρ2 + ρ22) (1 − ρ2)3
α2 α3 23 α4
ρ2 = α − − − + O(α5), (14.47)
3 9 81
where 1/3
µ2
α= . (14.48)
3 µ1
is assumed to be a small parameter.
At the L2 point, we have x = −µ2 + ρ1 = µ1 + ρ2 and ρ1 = 1 + ρ2. Hence,
from Equation (14.43),
µ2 ρ 3 (1 + ρ2 + ρ22/3)
= 2 . (14.49)
3 µ1 (1 + ρ2)2 (1 − ρ23)
µ2 (1 − ρ 3) (1 + ρ1)2
= 3 21 . (14.52)
µ1 ρ1 (ρ1 + 3 ρ1 + 3)
256 NEWTONIAN DYNAMICS
µ2 12 β 144 β2 1567 β3
= + + + O(β4), (14.53)
µ1 7 49 343
2 3 4
7 µ2 7 µ2 13223 µ2 µ2
β = − + +O ,
12 µ1 12 µ1 20736 µ1 µ1
(14.54)
ρ1 = ρ2 = 1, (14.56)
or
(x + µ2)2 + y2 = (x − 1 + µ2)2 + y2 = 1, (14.57)
since µ1 = 1 − µ2. The two solutions of the above equation are
1
x = − µ2, (14.58)
2
√
3
y = ± , (14.59)
2
and specify the positions of the Lagrange points designated L4 and L5. Note
that point L4 and the masses m1 and m2 lie at the apexes of an equilateral
triangle. The same is true for point L5. We have now found all of the possible
Lagrange points.
Figure 14.3 shows the positions of the two masses, m1 and m2, and the
five Lagrange points, L1 to L5, calculated for the case where µ2 = 0.1.
L4
m1 m2
L3 L1 L2
L5
Figure 14.3: The masses m1 and m2, and the five Lagrange points, L1 to L5,
calculated for µ2 = 0.1.
Note that V ≥ 0. It follows, from Equation (14.35), that if the mass m3 has
the Jacobi integral C, and lies on the surface specified in Equation (14.60),
then it must have zero velocity. Hence, such a surface is termed a zero-
velocity surface. The zero-velocity surfaces are important because they form
the boundary of regions from which the mass m3 is dynamically excluded:
i.e., regions in which V < C. Generally speaking, the regions from which
m3 is excluded grow in area as C increases, and vice versa.
Let Ci be the value of V at the Li Lagrange point, for i = 1, 5. When
µ2 ≪ 1, it is easily demonstrated that
2/3
C1 ≃ 3 + 34/3 µ2 − 10 µ2/3, (14.62)
2/3
C2 ≃ 3 + 34/3 µ2 − 14 µ2/3, (14.63)
C3 ≃ 3 + µ2, (14.64)
C4 ≃ 3 − µ2, (14.65)
C4 ≃ 3 − µ2. (14.66)
Note that C1 > C2 > C3 > C4 = C5.
Figures 14.4–14.8 show the intersection of the zero-velocity surface V =
C with the x-y plane for various different values of C, and illustrate how the
region from which m3 is dynamically excluded—which we shall term the
258 NEWTONIAN DYNAMICS
m1
m2
Figure 14.4: The zero-velocity surface V = C, where C > C1, calculated for
µ2 = 0.1. The mass m3 is excluded from the region lying between the two inner
curves and the outer curve.
L1
m1
m2
about m1 (in the inertial frame) with C3 being directly opposite m2, L4 (by
convention) 60◦ ahead of m2, and L5 60◦ behind.
We have seen that the five Lagrange points, L1 to L5, are the equilibrium
points of mass m3 in the co-rotating frame. Let us now determine whether
or not these equilibrium points are stable to small displacements.
Now, the equations of motion of mass m3 in the co-rotating frame are
specified in Equations (14.27)–(14.29). Note that the motion in the x-y
plane is complicated by presence of the Coriolis acceleration. However, the
motion parallel to the z-axis simply corresponds to motion in the potential
U. Hence, the condition for the stability of the Lagrange points (which all
lie at z = 0) to small displacements parallel to the z-axis is simply (see
Section 4.2) !
∂2U µ1 µ2
= + > 0. (14.67)
∂z2 z=0 ρ13 ρ23
This condition is satisfied everywhere in the x-y plane. Hence, the Lagrange
260 NEWTONIAN DYNAMICS
m1
L1
m2
m1
L3
m2
L4
.
m1
m2
.
L5
Figure 14.8: The zero-velocity surface V = C, where C4 < C < C3, calculated
for µ2 = 0.1. The mass m3 is excluded from the regions lying inside the two
curves.
points are all stable to small displacements parallel to the z-axis. It, thus,
remains to investigate their stability to small displacements lying within the
x-y plane.
Suppose that a Lagrange point is situated in the x-y plane at coordinates
(x0, y0, 0). Let us consider small amplitude x-y motion in the vicinity of this
point by writing
x = x0 + δx, (14.68)
y = y0 + δy, (14.69)
z = 0, (14.70)
L4
M2
L3 L1 L2
M1
L5
Figure 14.9: The zero-velocity surfaces and Lagrange points calculated for
µ2 = 0.01.
The Three-Body Problem 263
expansion simplifies to
1 1
U = U0 + Uxx (δx)2 + Uxy δx δy + Uyy (δy)2. (14.72)
2 2
Finally, substitution of Equations (14.68)–(14.70), and (14.72) into the
equations of x-y motion, (14.27) and (14.28), yields
since ω = 1.
Let us search for a solution of the above pair of equations of the form
δx(t) = δx0 exp(γ t) and δy(t) = δy0 exp(γ t). We obtain
γ2 + Uxx −2 γ + Uxy
! ! !
δx0 0
= . (14.75)
2 γ + Uxy γ2 + Uyy δy0 0
This equation only has a non-trivial solution if the determinant of the matrix
is zero. Hence, we get
where all terms are evaluated at the point (x0, y0, 0). It thus follows that
Uxx = A − D − 1, (14.81)
Uyy = A − B − 1, (14.82)
Uxy = −C. (14.83)
264 NEWTONIAN DYNAMICS
Consider the co-linear Lagrange points, L1, L2, and L3. These all lie
on the x-axis, and are thus characterized by y = 0, ρ12 = (x + µ2)2, and
ρ22 = (x − µ1)2. It follows, from the above equations, that B = C = 0 and
D = 3 A. Hence, Uxx = −1 − 2 A, Uyy = A − 1, and Uxy = 0. Equation
(14.76) thus yields
Γ 2 + (2 − A) Γ + (1 − A) (1 + 2 A) = 0, (14.84)
We thus conclude that the L4 and L5 Lagrange points are stable equilib-
rium points, in the co-rotating frame, provided that mass m2 is less than
about 4% of mass m1. If this is the case then mass m3 can orbit around
these points indefinitely. In the inertial frame, the mass will share the or-
bit of mass m2 about mass m1, but will stay approximately 60◦ ahead of
mass m2, if it is orbiting the L4 point, or 60◦ behind, if it is orbiting the L5
point—see Figure 14.9. This type of behavior has been observed in the Solar
System. For instance, there is a sub-class of asteroids, known as the Trojan
asteroids, which are trapped in the vicinity of the L4 and L5 points of the
Sun-Jupiter system (which easily satisfies the above stability criterion), and
consequently share Jupiter’s orbit around the Sun, staying approximately
60◦ ahead of, and 60◦ behind, Jupiter, respectively. Furthermore, the L4 and
L5 points of the Sun-Earth system are occupied by clouds of dust.
266 NEWTONIAN DYNAMICS
The Chaotic Pendulum 267
15.1 Introduction
Up until now, we have only dealt with dynamical problems which are ca-
pable of analytic solution. Let us now investigate a problem which is quite
intractable analytically, and in which meaningful progress can only be made
via numerical methods.
d2θ
ml + m g sin θ = 0, (15.1)
dt2
d2θ dθ
ml 2
+ν + m g sin θ = 0, (15.2)
dt dt
fixed support
pivot point
l
θ
T
mg
drive (which could arise, for instance, via periodic oscillations of the pen-
dulum’s pivot point). Thus, the final equation of motion of the pendulum is
written
d2θ dθ
ml 2 + ν + m g sin θ = A cos(ω t), (15.3)
dt dt
where A and ω are constants parameterizing the amplitude and frequency
of the external driving torque, respectively.
Let
r
g
ω0 = . (15.4)
l
^t = ω0 t, (15.5)
ω
^ =
ω , (15.6)
ω0
mg
Q = , (15.7)
ω0 ν
^ = A ,
A (15.8)
mg
The Chaotic Pendulum 269
Suppose that the pendulum’s position, θ(0), and velocity, v(0), are specified
at time t = 0. As is well-known, in this case, the above equations of motion
can be solved analytically to give:
A (1 − ω2)
θ(t) = θ(0) − e−t/2Q cos(ω∗ t)
[(1 − ω2)2 + ω2/Q2]
1 θ(0) A (1 − 3 ω2)/2Q
+ v(0) + − e−t/2Q sin(ω∗ t)
ω∗ 2Q [(1 − ω2)2 + ω2/Q2]
h i
A (1 − ω2) cos(ω t) + (ω/Q) sin(ω t)
+ , (15.15)
[(1 − ω2)2 + ω2/Q2]
A ω2/Q
v(t) = v(0) − e−t/2Q cos ω∗ t
[(1 − ω2)2 + ω2/Q2]
1 v(0) A [(1 − ω2) − ω2/2Q2]
− θ(0) + − e−t/2Q sin(ω∗ t)
ω∗ 2Q [(1 − ω2)2 + ω2/Q2]
h i
ω A −(1 − ω2) sin(ω t) + (ω/Q) cos(ω t)
+ . (15.16)
[(1 − ω2)2 + ω2/Q2]
Here, s
1
ω∗ = 1− , (15.17)
4Q2
and it is assumed that Q > 1/2. It can be seen that the above expressions
for θ and v both consist of three terms. The first two terms clearly represent
transients, since they depend on the initial conditions, and decay exponen-
tially in time—see Section 4.9. In fact, the e-folding time for the decay of
these terms is 2 Q (in normalized time units). The final term represents the
time-asymptotic motion of the pendulum, and is manifestly independent of
the initial conditions—see Section 4.5.
It is often convenient to visualize the motion of a dynamical system as
an orbit, or trajectory, in phase-space, which is defined as the space of all
of the dynamical variables required to specify the instantaneous state of the
system. For the case in hand, there are two dynamical variables, v and θ, and
so phase-space corresponds to the θ-v plane. Note that each different point
in this plane corresponds to a unique instantaneous state of the pendulum.
[Strictly speaking, we should also consider t to be a dynamical variable,
since it appears explicitly on the right-hand side of Equation (15.11).]
It is clear, from Equations (15.15) and (15.16), that if we wait long
enough for all of the transients to decay away then the motion of the pen-
The Chaotic Pendulum 271
v/A
θ /A
Figure 15.2: A phase-space plot of the periodic attractor for a linear damped
periodically driven pendulum. Data calculated analytically for Q = 4 and
ω = 2.
As illustrated in Figure 15.2, this curve is an ellipse whose principal axes are
aligned with the v and θ coordinate axes. Observe that the curve is closed,
which suggests that the associated motion is periodic in time. In fact, the
motion repeats itself exactly every
2π
τ= (15.22)
ω
272 NEWTONIAN DYNAMICS
A/A
~
v/A
θ /A
θ /A
For the sake of definiteness, let us fix the normalized amplitude and fre-
quency of the external drive to be A = 1.5 and ω = 2/3, respectively.2
Furthermore, let us investigate any changes which may develop in the na-
ture of the pendulum’s time-asymptotic motion as the quality-factor Q is
varied. Of course, if Q is made sufficiently small (i.e., if the pendulum is
1
W.H. Press, S.A. Teukolsky, W.T. Vetterling, and B.P. Flannery, Numerical recipes in C: The
art of scientific computing, 2nd Edition (Cambridge University Press, Cambridge UK, 1992),
Section 16.1.
2
G.L. Baker, Control of the chaotic driven pendulum, Am. J. Phys. 63, 832 (1995).
The Chaotic Pendulum 275
ω t = φ + k 2π (15.23)
where k is any integer, and φ is referred to as the Poincaré phase. For period-
1 motion, in which the motion repeats itself exactly every period of the
external drive, we expect the Poincaré section to consist of only one point in
phase-space (i.e., we expect all of the points to plot on top of one another).
Likewise, for period-2 motion, in which the motion repeats itself exactly
every two periods of the external drive, we expect the Poincaré section to
consist of two points in phase-space (i.e., we expect alternating points to plot
on top of one another). Finally, for period-n motion we expect the Poincaré
section to consist of n points in phase-space.
Figure 15.7 displays the Poincaré section of the orbit shown in Fig-
ure 15.6. The fact that the section consists of a single point confirms that
the motion displayed in Figure 15.6 is indeed period-1 motion.
The Chaotic Pendulum 277
3
E.S. Albers and B.W. Lee, Phys. Rep. 9C, 1 (1973).
4
P. Coles, and F. Lucchin, Cosmology: The origin and evolution of cosmic structure, (J. Wiley
& Sons, Chichester UK, 1995).
The Chaotic Pendulum 281
We have seen that when Q = 1.3, A = 1.5, and ω = 2/3 there are two
co-existing period-1 attractors in θ–v phase-space. The time-asymptotic tra-
jectory of the pendulum through phase-space converges on one or other of
these attractors depending on the initial conditions: i.e., depending on the
values of θ(0) and v(0). Let us define the basin of attraction of a given at-
tractor as the locus of all points in the θ(0)–v(0) plane which lead to motion
which ultimately converges on that attractor. We have seen that in the low-
amplitude (i.e., linear) limit (see Section 15.3) there is only a single period-
1 attractor in phase-space, and all possible initial conditions lead to motion
which converges on this attractor. In other words, the basin of attraction
for the low-amplitude attractor constitutes the entire θ(0)–v(0) plane. The
present case, in which there are two co-existing attractors in phase-space, is
somewhat more complicated.
Figure 15.13 shows the basins of attraction, in θ(0)–v(0) space, of the
asymmetric mirror image attractors pictured in Figures 15.10 and 15.11.
The basin of attraction of the left-favoring attractor shown in Figure 15.10
is coloured black, whereas the basin of attraction of the right-favoring at-
tractor shown in Figure 15.11 is coloured white. It can be seen that the two
basins form a complicated interlocking pattern. Since we can identify the
angles π and −π, the right-hand edge of the pattern connects smoothly with
its left-hand edge. In fact, we can think of the pattern as existing on the
surface of a cylinder.
Suppose that we take a diagonal from the bottom left-hand corner of
Figure 15.13 to its top right-hand corner. This diagonal is intersected by a
number of black bands of varying thickness. Observe that the two narrowest
bands (i.e., the fourth band from the bottom left-hand corner and the second
band from the upper right-hand corner) both exhibit structure which is not
very well resolved in the present picture.
Figure 15.14 is a blow-up of a region close to the lower left-hand corner
of Figure 15.13. It can be seen that the unresolved band in the latter figure
(i.e., the second and third bands from the right-hand side in the former
figure) actually consists of a closely spaced pair of bands. Note, however,
that the narrower of these two bands exhibits structure which is not very
well resolved in the present picture.
Figure 15.15 is a blow-up of a region of Figure 15.14. It can be seen that
the unresolved band in the latter figure (i.e., the first and second bands from
the left-hand side in the former figure) actually consists of a closely spaced
pair of bands. Note, however, that the broader of these two bands exhibits
282 NEWTONIAN DYNAMICS
Figure 15.13: The basins of attraction for the asymmetric, mirror image,
attractors pictured in Figures 15.10 and 15.11. Regions of θ(0)–v(0) space
which lead to motion converging on the left-favouring attractor shown in Fig-
ure 15.10 are coloured white: regions of θ(0)–v(0) space which lead to motion
converging on the right-favouring attractor shown in Figure 15.11 are coloured
black. Data calculated numerically for Q = 1.3, A = 1.5, ω = 2/3, and φ = 0.
The Chaotic Pendulum 283
Figure 15.14: Detail of the basins of attraction for the asymmetric, mirror
image, attractors pictured in Figures 15.10 and 15.11. Regions of θ(0)–v(0)
space which lead to motion converging on the left-favouring attractor shown
in Figure 15.10 are coloured white: regions of θ(0)–v(0) space which lead to
motion converging on the right-favouring attractor shown in Figure 15.11 are
coloured black. Data calculated numerically for Q = 1.3, A = 1.5, ω = 2/3,
and φ = 0.
284 NEWTONIAN DYNAMICS
Figure 15.15: Detail of the basins of attraction for the asymmetric, mirror
image, attractors pictured in Figures 15.10 and 15.11. Regions of θ(0)–v(0)
space which lead to motion converging on the left-favouring attractor shown
in Figure 15.10 are coloured white: regions of θ(0)–v(0) space which lead to
motion converging on the right-favouring attractor shown in Figure 15.11 are
coloured black. Data calculated numerically for Q = 1.3, A = 1.5, ω = 2/3,
and φ = 0.
The Chaotic Pendulum 285
Figure 15.16: Detail of the basins of attraction for the asymmetric, mirror
image, attractors pictured in Figures 15.10 and 15.11. Regions of θ(0)–v(0)
space which lead to motion converging on the left-favouring attractor shown
in Figure 15.10 are coloured white: regions of θ(0)–v(0) space which lead to
motion converging on the right-favouring attractor shown in Figure 15.11 are
coloured black. Data calculated numerically for Q = 1.3, A = 1.5, ω = 2/3,
and φ = 0.
286 NEWTONIAN DYNAMICS
Let us now return to Figure 15.8. Recall, that as the quality-factor Q is grad-
ually increased, the time-asymptotic orbit of the pendulum through phase-
space undergoes a sudden transition, at Q ≃ 1.245, from a left-right sym-
metric period-1 orbit to a left-favouring period-1 orbit. What happens if
5
B.B. Mandelbrot, The fractal geometry of nature, (W.H. Freeman, New York NY, 1982).
The Chaotic Pendulum 287
hand side of the plot), and takes the form of a closed curve consisting of two
interlocked loops in phase-space. Recall that for period-1 orbits there was
only a single closed loop in phase-space. Figure 15.19 shows the Poincaré
section of the orbit shown in Figure 15.18. The fact that the section consists
of two points confirms that the orbit does indeed correspond to period-2
motion.
A period-doubling bifurcation is an example of temporal symmetry break-
ing. The equations of motion of the pendulum are invariant under the trans-
formation t → t + τ, where τ is the period of the external drive. In the low
amplitude (i.e., linear) limit, the time-asymptotic motion of the pendulum
always respects this symmetry. However, as we have just seen, in the non-
linear regime it is possible to obtain solutions which spontaneously break
this symmetry. Obviously, motion which repeats itself every two periods of
the external drive is not as temporally symmetric as motion which repeats
every period of the drive.
Figure 15.20 is essentially a continuation of Fig 15.12. Data is shown
for two sets of initial conditions which lead to motions converging on left-
favouring (lower branch) and right-favouring (upper branch) periodic at-
tractors. We have already seen that the left-favouring periodic attractor
undergoes a period-doubling bifurcation at Q = 1.348. It is clear from Fig-
ure 15.20 that the right-favouring attractor undergoes a similar bifurcation
The Chaotic Pendulum 289
at almost exactly the same Q-value. This is hardly surprising since, as has
already been mentioned, for fixed physical parameters (i.e., Q, A, ω), the
left- and right-favouring attractors are mirror-images of one another.
for n > 1. Note that the distance (in Q) between bifurcations decreases
rapidly as n increases. In fact, the above formula predicts an accumulation
of period-doubling bifurcations at Q = Q∞ , where
∞
X 1 F
Q∞ = Q1 + (Q2 − Q1) j
≡ Q1 + (Q2 − Q1) = 1.3758. (15.26)
F F−1
j=0
292 NEWTONIAN DYNAMICS
Bifurcation n Qn Qn − Qn−1 Fn
period-1→period-2 1 1.34870 - -
period-2→period-4 2 1.37003 0.02133 -
period-4→period-8 3 1.37458 0.00455 4.69 ± 0.01
period-8→period-16 4 1.37555 0.00097 4.69 ± 0.04
period-16→period-32 5 1.37575 0.00020 4.9 ± 0.20
peats itself every eight periods. Figure 15.25 shows period-16 motion: as
expected, the Poincaré section consists of sixteen points, and the rotation
sequence repeats itself every sixteen periods. Finally, Figure 15.26 shows
chaotic motion. Note that the Poincaré section now consists of a set of four
continuous line segments, which are, presumably, made up of an infinite
number of points (corresponding to the infinite period of chaotic motion).
Note, also, that the associated sequence of net rotations per period shows
no obvious sign of ever repeating itself. In fact, this sequence looks rather
like one of the previously shown periodic sequences with the addition of
a small random component. The generation of apparently random motion
from equations of motion, such as Equations (15.10) and (15.11), which
contain no overtly random elements is one of the most surprising features
of non-linear dynamics.
Many non-linear dynamical systems found in nature exhibit a transition
294 NEWTONIAN DYNAMICS
quite different physical systems.8 Note that our best estimate of the Feigen-
baum ratio (see Table 15.1) is 4.69 ± 0.01, in good agreement with Feigen-
baum’s prediction.
The existence of a universal ratio characterizing the transition to chaos
via period-doubling bifurcations is one of many pieces of evidence indicating
that chaos is a universal phenomenon (i.e., the onset and nature of chaotic
motion in different dynamical systems has many common features). This
observation encourages us to believe that in studying the chaotic motion of
a damped periodically driven pendulum we are learning lessons which can
be applied to a wide range of non-linear dynamical systems.
Suppose that we launch our pendulum and then wait until its motion has
converged onto a particular attractor. The subsequent motion can be visual-
ized as a trajectory θ0(t), v0(t) through phase-space. Suppose that we then
somehow perturb the pendulum, at time t = t0, such that its position in
phase-space is instantaneously changed from θ0(t0), v0(t0) to θ0(t0) + δθ0,
v0(t0) + δv0. The subsequent motion can be visualized as a second trajec-
tory θ1(t), v1(t) through phase-space. What is the relationship between the
original trajectory θ0(t), v0(t) and the perturbed trajectory θ1(t), v1(t)? In
particular, does the phase-space separation between the two trajectories,
whose components are
δθ(∆t) = θ1(t0 + ∆t) − θ0(t0 + ∆t), (15.28)
δv(∆t) = v1(t0 + ∆t) − v0(t0 + ∆t), (15.29)
grow in time, decay in time, or stay more or less the same? What we are
really investigating is how sensitive the time-asymptotic motion of the pen-
dulum is to initial conditions.
According to the linear analysis of Section 15.3,
δθ(∆t) = δθ0 cos(ω∗ ∆t) e−∆t/2Q
1 δθ0
+ δv0 + sin(ω∗ ∆t) e−∆t/2Q, (15.30)
ω∗ 2Q
δv(∆t) = δv0 cos(ω∗ ∆t) e−∆t/2Q
1 δv0
− δθ0 + sin(ω∗ ∆t) e−∆t/2Q, (15.31)
ω∗ 2Q
8
P. Citanovic, Universality in chaos, (Adam Hilger, Bristol UK, 1989).
The Chaotic Pendulum 297
within the attractor itself. Obviously, this exponential divergence must come
to an end when the phase-space separation of the trajectories becomes com-
parable to the extent of the attractor.
A dynamical system characterized by a positive Liapunov exponent, λ,
has a time horizon beyond which regular deterministic prediction breaks
down. Suppose that we measure the initial conditions of an experimental
system very accurately. Obviously, no measurement is perfect: there is al-
ways some error δ0 between our estimate and the true initial state. After a
time t, the discrepancy grows to δ(t) ∼ δ0 exp(λ t). Let a be a measure of
our tolerance: i.e., a prediction within a of the true state is considered ac-
ceptable. It follows that our prediction becomes unacceptable when δ ≫ a,
which occurs when
1 a
t > th ∼ ln . (15.33)
λ δ0
Note the logarithmic dependence on δ0. This ensures that, in practice, no
matter how hard we work to reduce our initial measurement error, we can-
not predict the behaviour of the system for longer than a few multiples of
1/λ.
It follows, from the above discussion, that chaotic attractors are associ-
ated with motion which is essentially unpredictable. In other words, if we at-
tempt to integrate the equations of motion of a chaotic system then even the
slightest error made in the initial conditions will be amplified exponentially
over time, and will rapidly destroy the accuracy of our prediction. Eventu-
ally, all that we will be able to say is that the motion lies somewhere on the
chaotic attractor in phase-space, but exactly where it lies on the attractor at
any given time will be unknown to us.
The hyper-sensitivity of chaotic systems to initial conditions is sometimes
called the butterfly effect. The idea is that a butterfly flapping its wings in a
South American rain-forest could, in principle, affect the weather in Texas
(since the atmosphere exhibits chaotic dynamics). This idea was first pub-
licized by the meteorologist Edward Lorenz, who constructed a very crude
model of the convection of the atmosphere when it is heated from below
by the ground.9 Lorenz discovered, much to his surprise, that his model
atmosphere exhibited chaotic motion—which, at that time, was virtually
unknown. In fact, Lorenz was essentially the first scientist to fully under-
stand the nature and ramifications of chaotic motion in physical systems.
In particular, Lorenz realized that the chaotic dynamics of the atmosphere
spells the doom of long-term weather forecasting: the best one can hope to
9
E. Lorenz, Deterministic nonperiodic flow, J. Atmospheric Science 20, 130 (1963).
302 NEWTONIAN DYNAMICS
achieve is to predict the weather a few days in advance (1/λ for the atmo-
sphere is of order a few days).
Let us return to Figure 15.22. Recall, that this figure shows the onset of
chaos, via a cascade of period-doubling bifurcations, as the quality-factor
Q is gradually increased. Figure 15.31 is essentially a continuation of Fig-
ure 15.22 which shows the full extent of the chaotic region (in Q-v space). It
can be seen that the chaotic region ends abruptly when Q exceeds a critical
value, which is about 1.4215. Beyond this critical value, the time-asymptotic
motion appears to revert to period-1 motion (i.e., the solid black region col-
lapses to a single curve). It can also be seen that the chaotic region contains
many narrow windows in which chaos reverts to periodic motion (i.e., the
solid black region collapses to n curves, where n is the period of the mo-
tion) for a short interval in Q. The four widest windows are indicated in the
figure.
The Chaotic Pendulum 303
motion. Figure 15.34 shows that just at the appearance of the window the
attractor loses much of its chaotic nature (i.e., its Poincaré section breaks
up into a series of points), and the chaotic intervals become shorter and
much less frequent. Finally, Figure 15.35 shows that just after the appear-
ance of the window the attractor collapses to a period-3 attractor, and the
chaotic intervals cease altogether. All of the other periodic windows seen in
Figure 15.31 appear in an analogous manner to that just described.
According to the above discussion, the typical time-asymptotic motion
seen just prior to the appearance of a period-n window consists of inter-
vals of period-n motion interspersed with intervals of chaotic motion. This
type of behaviour is called intermittency, and is observed in a wide variety
of non-linear systems. As we move away from the window, in parameter
space, the intervals of periodic motion become gradually shorter and more
infrequent. Eventually, they cease altogether. Likewise, as we move towards
the window, the intervals of periodic motion become gradually longer and
more frequent. Eventually, the whole motion becomes periodic.
In 1973, Metropolis and co-workers investigated a class of simple math-
ematical models which all exhibit a transition to chaos, via a cascade of
period-doubling bifurcations, as some control parameter r is increased.10
They were able to demonstrate that, for these models, the order in which
stable periodic orbits occur as r is increased is fixed. That is, stable periodic
attractors always occur in the same sequence as r is varied. This sequence is
called the universal or U-sequence. It is possible to make a fairly convincing
argument that any physical system which exhibits a transition to chaos via a
sequence of period-doubling bifurcations should also exhibit the U-sequence
of stable periodic attractors. Up to period-6, the U-sequence is
1, 2, 2 × 2, 6, 5, 3, 2 × 3, 5, 6, 4, 6, 5, 6.
Figure 15.36 shows the complete bifurcation diagram for the damped, peri-
odically driven, pendulum (with A = 1.5 and ω = 2/3). It can be seen that
the chaotic region investigated in the previous section is, in fact, the first,
and least extensive, of three different chaotic regions.
The interval between the first and second chaotic regions is occupied
by the period-1 orbits shown in Figure 15.37. Note that these orbits differ
somewhat from previously encountered period-1 orbits in that the pendu-
11
R.H. Simoyi, A. Wolf, and H.L. Swinney, One-dimensional dynamics in a multi-component
chemical reaction, Phys. Rev. Lett. 49, 245 (1982).
308 NEWTONIAN DYNAMICS
lum executes a complete rotation (either to the left or to the right) every
period of the external drive. Now, an n, l periodic orbit is defined such that
θ(t + n τ) = θ(t) + 2π l
for all t (after the transients have died away). It follows that all of the pe-
riodic orbits which we encountered in previous sections were l = 0 orbits:
i.e., their associated motions did not involve a net rotation of the pendulum.
The orbits show in Figure 15.37 are n = 1, l = −1 and n = 1, l = +1 orbits,
respectively. The existence of periodic orbits in which the pendulum under-
goes a net rotation, either to the left or to the right, is another example of
spatial symmetry breaking—there is nothing in the pendulum’s equations of
motion which distinguishes between the two possible directions of rotation.
Figure 15.38 shows the Poincaré section of a typical attractor in the sec-
ond chaotic region shown in Figure 15.36. It can be seen that this attractor
is far more convoluted and extensive than the simple 4-line chaotic attrac-
tor pictured in Figure 15.26. In fact, the attractor shown in Figure 15.38 is
clearly a fractal curve. It turns out that virtually all chaotic attractors exhibit
fractal structure.
The interval between the second and third chaotic regions shown in Fig-
ure 15.36 is occupied by n = 3, l = 0 periodic orbits. Figure 15.39 shows
the Poincaré section of a typical attractor in the third chaotic region. It can
be seen that this attractor is even more overtly fractal in nature than that
The Chaotic Pendulum 309
pictured in the previous figure. Note that the fractal nature of chaotic attrac-
tors is closely associated with some of their unusual properties. Trajectories
on a chaotic attractor remain confined to a bounded region of phase-space,
and yet they separate from their neighbours exponentially fast (at least, ini-
tially). How can trajectories diverge endlessly and still stay bounded? The
basic mechanism is described below. If we imagine a blob of initial condi-
tions in phase-space then these undergo a series of repeated stretching and
folding episodes, as the chaotic motion unfolds. The stretching is what gives
rise to the divergence of neighbouring trajectories. The folding is what en-
sures that the trajectories remain bounded. The net result is a phase-space
structure which looks a bit like filo pastry—in other words, a fractal struc-
ture.
310 NEWTONIAN DYNAMICS