Lag Lectures00-18 2008-09
Lag Lectures00-18 2008-09
Lecture 0: Introduction
Synopsis I From the Programme Guide
The principles of classical dynamics, in the Newtonian formulation, are expressed in terms
of (vectorial) equations of motion. These principles are recapitulated and extended to cover
systems of many particles. The laws of dynamics are then reformulated in the Lagrangian
framework, in which a scalar quantity (the Lagrangian) takes centre stage. The equations
of motion then follow by differentiation, and can be obtained directly in terms of whatever
generalised coordinates suit the problem at hand. These ideas are encapsulated in Hamiltons principle, a statement that the motion of any classical system is such as to extremise
the value of a certain integral. The laws of mechanics are then obtained by a method known
as the calculus of variations. As a problem-solving tool, the Lagrangian approach is especially useful in dealing with constrained systems, including (for example) rotating rigid
bodies, and one aim of the course is to gain proficiency in such methods. At the same time,
we examine the conceptual content of the theory, which reveals the deep connection between
symmetries and conservation laws in physics. Hamiltons formulation of classical dynamics
(Hamiltonian Dynamics) is introduced, and some of its consequences and applications are
explored.
Syllabus
1. Introduction: scope of the course. Newtons formulation: its strengths and weaknesses.
Its relation to 20th Century physics. Inertial frames. Dynamics of a particle: torque,
angular momentum, conservation laws. Examples.
2. Dynamics of a system of particles. Central and noncentral forces the third law in
strong and weak form. Centre of mass motion, angular momentum, its independence
of origin in the centre of momentum frame. Rigid body as a collection of particles.
3. Constraints and their classification. Generalised displacements, generalised forces.
From Newtons laws to Lagranges equations (the hard way - superseded later). Conservative forces, the Lagrangian function. Absence of friction. Examples. Revision of
rotating frames and fictitious forces: derivation of centrifugal and Coriolis forces as an
illustration of the Lagrangian approach.
4. Functions and functionals. Calculus of variations. The Euler-Lagrange equation and its
multi-variable extension. Examples of variational calculus from other areas of physics.
Constrained variations. Hamiltons principle as an embodiment of Lagrangian dynamics. Its coordinate invariance. From Newton to Lagrange (the easy way).
5. Lagrangian Mechanics. Changes of variable, changes of Lagrangian. Ignorable coordinates, first integrals. Generalised momenta. The energy (or Hamiltonian) function
and its relation to the total energy of the system. Examples. Conservation of angular
and linear momentum deduced from the homogeneity and isotropy of space. General
relation between symmetries of the Lagrangian and conservation laws.
6. Extensions of the Lagrangian method. Velocity-dependent forces; the Lagrangian of
a charged particle in an external electromagnetic field, distinction between canonical
and mechanical momentum. The Lagrangian for a relativistic particle in an external
electromagnetic field (briefly).
7. Hamiltonian Dynamics. The Legendre transformation. Derivation of Hamiltons equations of motion (from Lagranges equations). Poisson brackets. Hamiltonian of a
charged particle in an external electromagnetic field. The connection with Quantum
Mechanics.
8. Rigid body motion. Angular velocity, angular momentum; their noncollinearity. The
moment of inertia tensor. Principal axes. Constrained motion about a general axis.
Spherical, symmetrical and asymmetrical tops defined. Rigid body kinematics: Eulerian approach axes in the moving frame of the body. Torque-free motion of a
symmetric top. Stability of steady motion in the asymmetric case. Limitations. Rigid
body dynamics: Lagrangian method axes fixed in space. The Euler angles as generalised coordinates. The Lagrangian of a symmetrical top under gravity. The gyroscope:
precession, nutation. Examples.
9. Small oscillation theory. Normal modes studied by Lagrangian methods. Simultaneous
diagonalisation of two matrices. Examples.
Textbooks
R D Gregory, Classical Mechanics (Cambridge). A fairly new book which covers the whole
of classical mechanics up to and including the level of this course. Recommended.
H Goldstein, C P Poole, J L Safko, Classical Mechanics (Addison-Wesley). The Bible
of Classical Mechanics. Well balanced between formalism and examples, but a little longwinded and a bit esoteric in parts. We only cover a small part of whats there, but it will
be useful for anyone planning to take Hamiltonian Dynamics next year.
J Leech, Classical Mechanics (Chapman and Hall). Very concise, good for revision if you
already have seen the material. Emphasises concepts rather than problem solving.
M Spiegel, Theoretical Mechanics (Schaum Series, McGraw Hill). Teaches by worked
example, emphasises problem solving rather than concepts.
V Barger and M Olsson, Classical Mechanics, A Modern Perspective (McGraw Hill,
second edition. 1995). A very unusual book. There is a concise introduction to Lagrangian
methods in Chapter 3; thereafter the book is strong on rigid body motion but weak on other
parts of the course. Worth looking at for its discussions of esoterica such as the superball,
the tippy-top, boomerangs, etc. . .
W Greiner, Classical Mechanics: Systems of Particles and Hamiltonian Dynamics (SpringerVerlag, 2003). A relatively recent book. It lies somewhere between Barger & Olsson and
Goldstein et al in both level & content.
T W B Kibble, F H Berkshire, Classical Mechanics (Imperial College Press, fourth or
fifth Edition). A well-established text which covers most aspects of classical dynamics. Its
coverage of Lagrangian Dynamics is a bit thin, but you might already have a copy, so dont
throw it away!
K F Riley, M P Hobson and S J Bence, Mathematical Methods for Physics and Engineering (third edition, 2006). See Chapter 20 for mathematical background on variational
calculus.
dr
r
dt
where r is the position vector relative to some origin O. [Note: In this course, f is always
the total derivative df /dt not the partial derivative f /t.]
If no forces act on a body, it remains at rest or moves with constant velocity: v = 0. (N1)
Second Law (N2): Define the momentum of a particle as
p = mv
In the presence of a force F ,
p = F
(N2)
and if m is constant, F = m
r ma.
In general Newton did not specify F a , except to say that the force F ab on a due to b obeys
F ab = F ba
The strong form of N3 states in addition that F ab is directed along the line r ab = r a r b
between the two particles.
There are exceptions to N3 in both forms, for example the EM force between current elements
is not directed along the line between them. (Indeed, to treat electromagnetic forces within
the Newtonian framework is rather clumsy: momentum and angular momentum are only
conserved if the contributions of the electric and magnetic fields are included.) We shall
return to this later.
In Newtonian mechanics, we need N3 to derive conservation laws. Later we shall rederive
these without appeal to N3 using Lagrangian methods.
1
F ab
which says that the force on a particle can be decomposed into the sum of contributions
from other particles two-body forces. This is not always true.
Hypothetical Example: 3 particles in one dimension with interaction potential V (x1 , x2 , x3 ) =
k x1 x2 x3 . The force on particle 1 is:
V
F1 =
x1
x2 ,x3
= k x2 x3
p = F
What distinguishes the inertial frames from the others? To get to the bottom of this really requires
general relativity (GR). Consider Machs famous thought experiment: a bucket of water rotating about its
axis has a curved top surface (meniscus). If we choose to rotate with the bucket, we see that the meniscus
of the (stationary) water is not flat, and thus the frame is not inertial. However, we also notice the fact
that distant gravitational bodies are rotating around us. Mach suggested that the gravity of these distant
rotating bodies causes the curvature. In fact, it turns out that Einsteins equations for GR possess a
solution representing a rotating body in an otherwise empty universe and a test-particle on the surface of
the body would feel a centrifugal force, so Machs principle is not obeyed and the concept of a non-inertial
frame is a fundamental.
F = p
Take the cross product of r with this equation, ie take r(both sides) to obtain
r F = r p =
d(r p)
r p
dt
dL
(1)
dt
where G r F is the moment of force about O (also known as the torque or couple) and
G=
Lrp
is the moment of momentum (usually known as the angular momentum) about O.
G is the turning effect of the force, which tries to rotate the particle about an axis through
The magnitude
O perpendicular to both r and F (the axis defined by the unit vector G).
of the moment of the force is |G| = rF sin where is the angle between the force and the
position vector. Likewise L is a vector perpendicular to r and p of magnitude mvr sin 0
where 0 is the angle between these vectors.
G and L are both axial vectors (aka pseudovectors): they change sign in transforming from
a right to a left-handed coordinate system. Both depend on the choice of O.
Conservation of angular momentum
From equation (1), if G = 0, then L is conserved. This is clearly true componentwise, eg if
Gx = 0 then Lx is constant, (irrespective of whether Gy 6= 0 or Gz 6= 0.)
Conservation of energy
Start with N2: F = mv , and define the work done by the force F in moving a particle from
point A to point B
WBA
Z B
A
F dr =
Z tB
tA
Z tB
Z tB
m d(v v)
dr
dt =
mv v dt =
dt
dt
dt
tA
tA 2
WBA =
1 2
mv
2
tB
tA
TB TA
which we identify as the change in the kinetic energy T 12 mv 2 . This may be regarded as
the definition of the kinetic energy in Newtonian Mechanics.
If F is conservative, we can (almost2 ) always write F = V , where V is the potential
energy. Then
Z B
Z B
TB TA =
(V ) dr =
dV = VA VB
A
independent of the path taken. We have thus VB + TB = VA + TA which expresses the law
of conservation of energy E T + V .
2
The important case of conservative velocity-dependent forces and potentials is treated in Lecture 12.
Coordinates: Choose circular cylindrical coordinates (r, , z) (aka cylindrical polars) with
the z axis parallel to L, with the motion confined to the plane z = 0.
er
cos i + sin j
e = sin i + cos j
where i and j are the usual basis vectors in the
x and y directions. The position vector r(t) of a
particle which is at the point P at time t is then
r = x i + yj = r e r
1
1
1
T = mv 2 = mr r = m(r 2 + r2 2 )
2
2
2
Alternatively, we may evaluate T by differentiating x = r cos and y = r sin :
x = r cos r sin
and
This gives
y = r sin + r cos
1
1
1
T = mv 2 = m(x 2 + y 2 ) = m(r 2 + r2 2 )
2
2
2
as before. We will usually use vr and vt to simply write down the kinetic energy when using
polar coordinates, but this second method is much more general and well use it a lot.
4
L = r mr = m r e r (r e r + r e )
= 0 + mr2 e r e = Le z
1
L2
1
m(r 2 + r2 2 ) + V (r) = mr 2 +
+ V (r) = constant
2
2
2mr2
(2)
Equation for the radial velocity: Solving equation (2) for r gives:
"
L2
2
E V (r)
r =
m
2mr2
!#1/2
L /2mr
(3)
r2 r1
r2
r
Veff(r)
E2
E1
V(r)
d 1 2
dE
=
mr + Veff (r)
dt
dt 2
= mr
r +
dVeff (r)
r = 0
dr
(4)
dVeff (r)
dr
(5)
dVeff (r)
= 0.
dr r=r1
(6)
If E = E2 > E1 , the particle is confined to the zone between the two radii r2 and r20 : at
these points E = Veff (r2 ) = Veff (r20 ) = E2 and the radial component of velocity r = 0 there.
5
2 d2 Veff (r)
dVeff (r)
+
Veff (r) = Veff (r1 + ) = Veff (r1 ) +
+ O(3 )
2
dr r=r1
2
dr
r=r1
k
= constant + 0 + 2 + O(3 )
2
d2 Veff (r)
00
(r1 ).
where we have defined the constant k
Veff
dr2 r=r1
For small (and k 6= 0), we can ignore the terms of order O(3 ), and the total energy
becomes3
1
k
E = m2 + 2 ,
(7)
2
2
If r1 is a local minimum of the effective potential, then k > 0, and equation (7) is the
total energy of a harmonic oscillator of mass m and spring-constant
q k. Thus our small
perturbation leads to radial oscillations with angular frequency = k/m about r1 .
If r1 is a local maximum of the effective potential, then k < 0, and the circular orbit is
unstable. If k = 0, we have to go beyond O(2 ) in the Taylor expansion of V (r).
The Energy Method Zones
The above method, which was based on using the energy directly to obtain an equation of
motion, is especially powerful for systems with only one or two degrees of freedom. Rather
than three second order ODEs, the application of conservation laws gives one first order
ODE a major simplification. In the energy method, as with the Lagrangian methods
covered later, it is extremely important to write down the kinetic and potential energies
correctly(!) as a function of the chosen coordinates, whatever they may be. Practice makes
perfect see the problem sheets.
Zones:
In our example, if 0 > E > Veff (r1 ) then the particle is confined to a zone:
rmin r rmax , where rmin,max are the roots of Veff (r) = E. Put differently, equation (3) for
r is of the form
q
r = g(r)
with, in this case, g(r) = 2(E Veff (r))/m. Then the fact that r must be real requires that
g(r) must be positive. So, whatever happens in the subsequent motion, r cannot escape the
zone rmin r rmax where these are the roots of g(r) = 0. (Eg: if E = E2 , r2 r r20 .)
The use of zones is an important method which can be applied to more complicated problems
(eg spinning tops) to find partial information without solving for the full trajectory which
in many cases is extremely difficult. If asked to show that some coordinate q
is confined to a
zone, your first reaction should be to look for a velocity equation like r = g(r).
3
We have dropped the constant term, Veff (r1 ) = E1 , from the RHS of the expression for E. E1 is just
the total energy of the original circular orbit, and it doesnt affect the oscillation frequency.
ma r a =
X
a
Fa =
X
a
+
F ext
a
F ab
a,b
F ab =
a,b
(F ab + F ba ) = 0
pairs
ma r a =
X
a
F ext
= F ext
a
(1)
P
Differentiating this twice with respect to time and substituting into the LHS of equation (1),
we deduce that the centre of mass moves as a particle of mass M under the resultant F ext :
= F ext
MR
The total linear momentum is
P =
X
a
pa =
X
a
ma r a = M R
P = F ext
r a p a =
ra F a
(2)
dL a
d
(r a p a ) r a p a =
r a p a
dt
dt
P
where L = Pa L .
The last term is zero (since p a = mr a ) and so a r a p a = L,
a
r a p a =
ra F a =
X
a
+
r a F ext
a
X
a,b
r a F ab
Using N3 (weak) the last term becomes pairs (r a r b )F ab . This vanishes if we also assume
P
N3 (strong), namely that F ab is along r ab . Hence, given N3 (strong), L = a L a obeys
X
L =
ext
=
r a F ext
a
= Gext
Gext
a
where we have defined G as the total external couple acting on the system. Note that
L and G are additive over particles (ie, they obey vector addition rules, as axial vectors
should). Both still depend on the origin O chosen.
Energy
For one particle, we showed previously that
1 2
mv
= dT
2
where T is the kinetic energy. For N particles, it is straightforward to show that
F dr = d
dT =
X
a
dTa =
X
a
F a dr a =
X
a
F ext
dr a +
a
pairs
F ab (dr a dr b )
X
a
F ext
dr a
a
In words: the change in KE + the change in internal energy = work done by external forces.
If the external forces are also conservative, there is a potential function V (r 1 , r 2 , , r N )
such that
= a V
F ext
a
where a denotes the gradient wrt to r a , hence
F ext
dr a = dV . If so
a
dT + dU + dV = 0
X
a
pa =
X
a
ma r a = 0
Now
X
a
L a = r a p a = L0a + b p a
L = L0 + b P
where we summed both sides over a. Hence L depends on the choice of origin unless P = 0.
Let us now define the intrinsic angular momentum J as the value of L in the centre of
momentum frame; the quantity J is automatically independent of the choice of origin.
Technical Note: In contrast, Gext = (Gext )0 + b F ext where the extra term from shifting O
does not vanish in the centre of momentum frame. The origin for G must therefore always
be specified.
Galilean transformation
This is the name for a transformation from one inertial frame to another. Suppose frame S 0
moves at constant velocity V wrt frame S. For definiteness we choose O0 = O + b at time
t = 0.
In frame S, a position vector is r . In S 0 , the corresponding r0 obeys r = r0 + V t + b. Hence
the linear momentum of a particle obeys the transformation law:
p a = p0a + ma V
0
So that p a = p0a = F a . Likewise P = P = F ext . Newtons equations are therefore unaffected
by any Galilean transformation.
X
a
(r0a + b + V t) (p0a + ma V )
X
a
expanding:
X1
1
ma v 2a =
ma (v 0a + V ) (v 0a + V )
2
a 2
T = T0 +
X
a
1
m a v 0a V + M V 2
2
If S is again chosen as the centre of momentum frame, the middle term vanishes (since
P
0
0
a ma v a = P = 0) to give
1
T = T 0 + MV 2
2
that is, the kinetic energy in frame S equals the value in the centre-of-momentum frame, plus
the contribution of the motion in S of the centre of mass (again treated as a point particle).
This result is a great help in writing down the kinetic energy T of a system in arbitrary
motion. We shall use it a lot in problem solving.
momentum Lz about the axis of the tube, and the kinetic energy T , each as a function of .
Method: First draw a diagram:
V, e
R
C of Mom Frame
Lab Frame
The centre of mass (CM) of the hoop moves on a circle of radius Ra, with velocity V = vt e
where vt = (R a) (the subscript t stands for tangential). The angular momentum and
ez
kinetic energy associated with this motion, in the lab frame, are respectively LCM = LCM
z
1
CM
2
CM
2 2
with Lz = m(R a) and T
= 2 m(R a) where e and e z are the usual unit
vectors, with e z pointing out of the page.
To these we must add the contributions from the hoop in its centre-of-momentum frame.
This frame is an inertial frame instantaneously co-moving with the centre of mass of the
hoop; hence it is not rotating relative to lab coordinates (just translating with speed vt in
a direction tangential to the radius vector at which the frame is defined instantaneously).
The motion of the hoop in its c-of-mom (centre of momentum) frame consists of a rotation
about the centre of mass at some angular velocity . To find , we note that the (lab-frame)
velocity of the point on the hoop in instantaneous contact with the tube is v = vt +a. Since
the hoop is rolling (not sliding) with respect to the stationary tube, we must have v = 0, or
= vt /a = (R a)/a.
Note that is negative: the hoop is rotating clockwise.
The intrinsic angular momentum is Jz = ma2 = ma(R a) (again, note the sign), and
the (intrinsic) contribution to T is T int = 12 ma2 2 = 12 m(R a)2 2 .
Adding these to the previous results for the angular momentum and kinetic energy for
centre of mass motion, we find the total angular momentum, Lz = LCM
+ Jz , and total
z
CM
int
kinetic energy, T = T
+ T , respectively, in the lab frame are
Lz = m(R 2a)(R a)
and
1
T = m(R a)2 2 .
This calculation is not easy it requires clear thinking, but not much more than that. Also,
its rather easier than a direct evaluation in the lab frame (try it if you are sceptical).
To get the equation of motion, one uses the conservation laws to get an expression for as
a function of .
Noninertial/Rotating frames
This topic was covered in Junior Honours Dynamics & Relativity; it is included here for
completeness as we shall use it later in the course. See D&R notes for a more detailed
discussion.
Noninertial frames are frames which, with respect to an inertial frame, are accelerating. The
most important case is a frame rotating at constant angular velocity. We will sometimes
encounter noninertial frames in this course (although there are pitfalls in applying Lagrangian
methods in such frames). Something we will need later is the equation of motion (which is
not N2) for a particle in a rotating frame.
Consider an inertial frame S0 and a noninertial frame S rotating with constant angular
velocity with respect to such a frame, but instantaneously coincident with it (so the origin
and cartesian axes are lined up). We use notation [f ]X to denote a quantity measured in
frame X. Now, a particle with position vector [R]S , stationary in S, has velocity in S0
[v]S0 = [R]S
where [R]S = [R]S0 since S and S0 instantaneously coincide. Adding any velocity the particle
may have in S itself, we have
S0 = [R]
S + [R]S
(1)
[R]
We derived this for a position vector R but if it holds for one of these it must also apply to
any other vector A (since all vectors transform between frames according to the same rules):
S0 = [A]
S + [A]S
[A]
(2)
S0 = m d [R]
S + [R]S
m[R]
dt
S + [R]S
+ m [R]
The left hand side is the acceleration in the inertial frame S0 , and since Newtons second
law holds in this frame, we can write this term as F , the force on the particle. Expanding
the RHS gives:
S + m [R]
S + m [R]
s + m ( [R]S )
F = m[R]
Rearranging, we find the equation of motion in the rotating frame S to be:
S = F m ( [R]S ) 2m [R]
S
m[R]
2
to calculate these. Even when few in number, constraint forces are often uninteresting and
end up being finally eliminated. Lagrangian methods allow one to do this elimination at the
outset.
Conservation laws: In Newtons scheme, conservation of energy, and of linear and angular
momentum are derived consequences, not fundamental principles. Since the derivation
requires N3 (strong), the fundamental status of angular momentum conservation, in particular, is rather unclear. Even linear momentum conservation, derived via N3 (weak), assumes
pairwise additive forces which is not very general. The Lagrangian approach completely
avoids N3 and lays bare the deep relationship between conservation laws and symmetries.
It turns out that N3 is a red herring. (This relationship extends beyond strictly classical
mechanics into relativity and quantum theory, too.)
Relation to modern physics: The real world is described (as far as we know) by relativistic quantum field theory (or perhaps (super)string theory or even M -theory). Classical
mechanics should emerge from this as the limiting case of macroscopic (ie, non-quantum),
slow-moving (ie, nonrelativistic) objects. Already you have learned something about (nonquantum) relativity, and about (non-relativistic) quantum mechanics. In the first case,
the emergence of Newtonian physics for small v/c is fairly natural. (In fact, Lagrangian
methods can be extended to include relativity and we explore this briefly). For quantum
mechanics, the limiting process is harder to tease out, but a careful pursuit of a correspondence principle between classical and quantum regimes leads to the scalar (Lagrangian
& Hamiltonian) formulations of dynamics, rather than the Newtonian one.
F ab
(N2)
z
Axes fixed
in LAB
y0
y
R
x0
Technical Note: The limit of a continuous material can be taken by sending N ; for a
rigid body, the number of degrees of freedom is still 6.
Even given that we have a rigid body, there may be further constraints. For example, a disc
confined to a plane has 2 centre of mass coordinates and one rotation angle giving a total of
3 dof. For our previous example of a hoop rolling inside a cylinder, the hoop rolls without
slipping so that the tangential velocity of the hoop and the cylinder at the point of contact
must be equal. Moreover the centre of the hoop is confined to a circle. This reduces the
problem to one dof. In problem solving, the first step is very often to identify the number
and type of degrees of freedom.
What is a constraint? For all the examples we will meet in this course, a constraint is a
limiting approximation valid for the problem at hand.
Example: Two particles a, b are connected by a light inextensible rod of length ba . In
reality the rod has a large stiffness k there is a harmonic restoring force (Hookes Law)
when the rod is stretched to length rba = |r ba | :
F ba (r ba ) = k(r ba ba ) r ba
1
U (r ba ) = k (rba ba )2 + constant
2
U
^r
ma
ba
ba
mb
ba
rba
In the limit k , rba = ba always, and the corresponding degree of freedom is eliminated.
1
r
F ext
ma
mb
(held fixed)
In equilibrium
F constraint (r) = k r = F ext
with r = (r ba ) r ba . The work done by the constraint force in reaching this state is
Z
1
1
F constraint (r) dr = k(r)2 = (F ext )2 /k
2
2
which vanishes in the limit of large k. In general:
U =
Constraint forces do no work in any small instantaneous displacement of the system consistent with the constraints themselves.
A small instantaneous displacement of this kind is called a virtual displacement. The point
is that constraint forces are there to prevent certain types of virtual displacement; only if
they fail to do this can work be done. And if they fail, they are no longer constraints.
Technical Note: The above result does not mean that the constraint forces can do no work
during the actual motion of a system. Consider, for example, a particle constrained to lie
on a surface which is itself moving: there may then be a component of the actual particle
velocity in the direction of the constraint force, so that work is done. We return to this later.
Holonomic Constraints
|r ab | = ab
Rigid body:
y
a
This system has 3 dofs: (x0 , y 0 , ). If we impose rolling conditions (no-slide and no-bounce)
on the centre of mass of the log,
x0 = a + constant
and
y0 = a ,
ie two holonomic constraints, then only one dof remains. We can choose either or x0 as
the independent variable. The hoop rolling in the cylinder is similar.
Algebraic constraints mean that we can eliminate variables before solving the equations of
motion. This wouldnt be true if the log started to slide, or to bounce.
Nonholonomic constraints
Common examples of these are: (i) inequalities (ii) differential equations of constraint:
Example of an inequality constraint: a particle constrained to move inside a rectangular box:
The constraints:
(
0 x a
0 y b
dx = a cos d
dy = a sin d
x
These constraint equations cannot be integrated to eliminate a dof. The physical interpretation is that in the 4-D coordinate space, any point (x, y, , ), can be reached from any other
(try it on your unicycle). Hence there can be no real reduction in the number of degrees of
freedom and variables cannot be eliminated. Differential constraint equations like these can
only be solved after a full general solution of the unconstrained problem has been found.
From now on, we consider only holonomic systems but keep an eye out for places where our
constraints might break down or cease to be holonomic.
3
Technical Note: The rolling constraint for a log (see above) can also be viewed as a differential
however this can be integrated immediately (x0 = a + constant) to
constraint (x 0 = a),
give an algebraic relation, which is holonomic. Almost all the rolling constraints met later
in the course are holonomic in the same way.
ma r a = F a
where i = 1, 2, , 3N
Here and below, the partial derivatives are performed with all other variables held fixed, ie,
xi /qj means that all the {q} (except qj of course), and also t, are held constant during
the differentiation.
In a virtual displacement (an instantaneous small change consistent with the constraints),
which we denote {x}, there is no contribution proportional to dt :
xi =
X
j
xi
qj + 0
qj
Fi xi
Fio xi
(1)
since the constraint forces do no work in a virtual displacement (see Lecture 4). In terms of
{q}, we have
XX
xi
qj
(2)
Fio
W =
qj
j
i
which can be written
W =
Qj qj
X
i
Fi
xi X o xi
=
Fi
qj
qj
i
Note that the constraint forces do not contribute to this expression because they do no work
in a virtual displacement consistent with the constraints (equation (1) above).
A given Qj is called the generalised force conjugate to the corresponding generalised coordinate qj .
1
Example: A particle of mass m slides under gravity on a fixed vertical hoop of radius a.
z
x
In cartesians (x, z), there is one holonomic constraint and therefore just one degree of freedom
remains. We choose , with the origin at the centre of the hoop:
x = a sin
and
z = a cos
x
z
Fxo
+ Fzo
= {0 + (mg) (a sin )} Q
X
i
1
mi x 2i
2
Since xi = xi ({q}, t), in generalised coordinates the expression for T can depend explicitly
on both the coordinates {q} and the velocities {q},
as well as on time t. Thus we have
T = T ({q}, {q},
t). (For example, in plane polars T = 12 m(r 2 + r2 2 ) which depends on r in
addition to r and .)
For such functions, the increment df can be written
df =
X
j
X f
f
f
dqj +
dqj +
dt
qj
t
j qj
Here the partial derivatives are performed at fixed values of all the other variables. For
example
f
qj
means the derivative of f wrt qj at fixed values of all the other {q}, and fixed values of all
the velocities {q},
including qj , and fixed t. Because all remaining variables are always held
fixed, there is no need to put subscripts on the partial derivatives.
2
Technical Note: Formally, this procedure consists of treating the coordinates {q} and the
velocities {q}
as independent variables. This is legitimate since we are so far considering
arbitrary motions of the system we have not yet applied (or even found!) the equations of
motion which relate the velocities to the coordinates.
Special Case: Consider a function f = f ({q}, {q},
t) for which there happens to be no
dependence on the velocities {q}.
(Important examples include the cartesian coordinates
xi = xi ({q}, t).) To emphasise this independence we will sometimes write in such cases
f = f ({q}, {6 q},
t)
For such a function, clearly f / qj = 0, and so
X f dqj
f
df
=
+ 0 +
f
dt
t
j qj dt
f =
f
f
q j +
qj
t
(3)
ie f = f({q}, {q},
t)
Moreover the partial derivatives of f appearing in this expression will themselves not depend
on velocities (think about it):
f
qj
f
t
f
({q}, {6 q},
t)
qj
f
({q}, {6 q},
t)
t
Thus the only explicit velocity dependence in f comes from the q j shown in bold in equation (3).
By partially differentiating the expression for f wrt qj in equation (3) we therefore obtain
f
f
=
+ 0
qj
qj
(4)
This is called the cancellation of dots rule and holds for functions f ({q}, {6 q},
t) ONLY.
Another result for such functions, needed below, is that
d
dt
f
qj
=
qj
df
dt
The proof of this commuting derivatives rule is left as an exercise (see Tutorial Sheet).
NB the above results are needed now for the derivation of Lagranges equations, but only
occasionally thereafter.
Lagranges equations
We shall now find the equations of motion in generalised coordinates. The derivation (from
Newtons laws) is a bit technical and not very illuminating. A more attractive one (using
the calculus of variations) is given later in the course.
3
which holds for all i and j. Now multiply by mi x i and take the time derivative of each side:
!
x i
d
mi x i
dt
qj
d
xi
=
mi x i
dt
qj
Expanding the RHS and applying the commuting derivatives rule gives
d
x i
mi x i
dt
qj
d
xi
x i
(mi x i )
+ mi x i
dt
qj
qj
Finally, we sum on i:
"
d
dt qj
X
i
1
mi x 2i
2
!#
X
i
xi
Fi
+
qj
qj
X
i
1
mi x 2i
2
Qj ({q}, {q},
t) =
to obtain
1
mi x 2i
2
X
i
Fi
xi
qj
(5)
d T
T
= Qj +
dt qj
qj
There is one such equation for each generalised coordinate: j = 1, , 3N M . These
3N M equations, usually written
d
dt
T
qj
T
= Qj
qj
As shown previously, there is one generalised coordinate, , and the generalised force con so the kinetic
jugate to this is Q = mga sin . The particles velocity along the hoop is a,
energy is
t) = 1 ma2 2
T (, ,
2
2
Hence Lagranges equation of motion is
for which T / = 0 and T / = ma .
d
= mga sin
(ma2 )
dt
which can easily be checked by other means do it!
Conservative forces
In an instantaneous virtual displacement
W =
Qj qj
If (as in the example above) the forces are conservative, then there exists a potential energy
function V such that
W = V
We suppose that V = V ({q}, {6 q},
t) (where we have allowed V to be explicitly timedependent, but so-called velocity dependent potentials will be treated separately later),
in which case this may be written
X
j
Qj qj = V =
X
j
V
qj
qj
T
qj
T
V
=
qj
qj
(1)
L
qj
L
= 0
qj
L({q}, {q},
t) = T ({q}, {q},
t) V ({q}, t)
L is called the Lagrangian. These equations are the key to the entire course.
L=T V.
Note also the total time derivative in Lagranges equations, and be sure you know what it
means, viz:
X f
X f
f
df
=
qj +
qj +
dt
t
j qj
j qj
(This should become second nature after enough examples.) Finally, note that since L is
always differentiated to get the equations of motion, any additive constant in the potential
energy V can be neglected.
Technical Note: In the above we have allowed for explicit time-dependence in the potentials
V = V ({q}, {6 q},
t). These are not conservative in the usual everyday sense because total
energy is not conserved (see later).
Generalisations: a partial list.
1. Our constraints are holonomic. Any additional nonholonomic constraints would have
to be imposed explicitly after solving Lagranges equations. We do not pursue this,
but bear it in mind.
2. Not all forces are conservative, for example friction. We can separate {Q} into a
conservative part (derived from V ) and a non-conservative part. This gives
d
dt
L
qj
L
= Qnoncon
j
qj
T
qj
T
= Qj ({q}, {q},
t)
qj
d
dt
L
xj
= 0
d
V
(mx j ) =
dt
xj
which is the usual Newtonian result. (Note that if m is time-dependent, this is handled
correctly.)
Single Particle in plane polars: Find the equations of motion of a particle moving in a
conservative potential in two dimensions, in polar coordinates.
There are two dofs and we choose generalised coordinates as suggested: q1 = r, q2 = .
hence
The radial and tangential components of the particles velocity are vr = r,
v = r,
1
2
2 2
T = 2 m(r + r ). The Lagrangian is therefore
1
L = m(r 2 + r2 2 ) V (r, )
2
Lagranges equation for q1 = r is
d
dt
L
r
L
r
d
V
(mr)
= mr2
dt
r
V
m(
r r2 ) =
r
The term in parentheses on the LHS is the usual expression for the radial acceleration ar ;
the RHS is the radial force component see, eg, MP2B or FoMP lectures on central forces.
L
L
= 0
V
d 2
mr =
dt
And taking the total time derivative (care needed here!) gives
V
We have thus obtained correctly the two equations of motion for a 2-D particle in plane polar
coordinates (or a 3-D particle confined to the xy plane in circular cylindrical coordinates).
Comments:
1. Is it easier than Newton? Even for this very simple problem, perhaps yes. In generalised
coordinates it is usually easier to write down the velocity components and hence the
scalars T and V , and then to differentiate than it is to write down by inspection (or
memory) the correct expressions for the acceleration.
2. Care is needed with differentiation (practice makes perfect).
3. Lagrangian methods help us formulate the equations of motion. We still have to solve
them!
Atwoods machine
This is a fancy name for a smooth pulley, a light
inextensible string (a rope!) and two masses M
& m.
The frictionless pulley is attached to a fixed
support. The rope slides without friction on the
pulley.
h
M
z1
m
z2
L
q
L
q
(M + m)
q = (M m)g
Note that the constraint force (the string tension) is not mentioned at all.
Exercise: Generalise to compound pulleys (see Tutorial Sheet).
(t).
Find the motion.
z1
m
z2
T =
V
After we impose the constraint
z1 + z2 + l (t) = 2h
there will be one degree of freedom left: choose q = z1 . Differentiating the constraint wrt t
Hence
gives z2 = z1 + .
1
1
2 M gz1 mg(2h l + (t) z1 )
L(z1 , z1 , t) = M z12 + m(z1 )
2
2
which is explicitly time-dependent. Lagranges equation is
d
dt
L
z1
L
= 0
z1
d
mg + M g = 0
M z1 + m(z1 )
dt
(M + m)
z1 = (M m)g + m
This is our equation of motion. Integrating twice gives the general solution
M m gt2
m
z1 (t) = At + B
+
(t)
M +m 2
M +m
with A and B fixed by the initial conditions.
Comments:
1. The tension in the rope is never mentioned.
2. L is explicitly time-dependent.
3. The monkey does work on the system, so energy is not conserved.
1
Particle and wedge: A point particle moves without friction down a wedge that is free to
slide on a frictionless table. Find the acceleration of the wedge.
v1
m
v2
There are two dofs; we are free to choose our generalised coordinates. Lets take x, y as
shown in the figure (Note: these are not a conventional orthonormal cartesian pair for the
particle; we choose our set to include directly the position of the block, x, which is what
were asked to find. This is often a good strategy.) We have two contributions to T :
1
M x 2
2
i
1 2
1 h
=
m v1 + v22 = m (x + y cos )2 + (y sin )2
2
2
Tblock =
Tparticle
(The Lagrange EoMs are not trivial in these coordinates.) For the potential energy we have
= mgy sin + constant
i
1 h
1
M x 2 + m x 2 + y 2 + 2x y cos + mgy sin
L =
2
2
V
Hence
L
x
L
x
( = 0)
d
(M x + m(x + y cos )) = (M + m)
x + m cos y = 0
dt
(1)
L
y
L
y
d
(m [y + x cos ]) = m(
x cos + y)
dt
Substituting the expression for y from the first Lagrange equation, (1), into (2) gives
"
m+M
L
m cos
x =
= mg sin
m cos
y
and therefore the acceleration of the block is
x =
g sin
cos (m + M )/(m cos )
2
(2)
Simple Pendulum:
The Lagrangian is
L = T V
d
dt
1 2 2
ma + mga cos + constant
2
Spherical Pendulum:
Such a pendulum is free to rotate about the vertical
as well as swing in a plane. There is one more dof,
described by the rotation angle .
The kinetic energy is
1 2
m v + v2
2
= a and v = a sin
T =
where v
Therefore
1 2 2 2 1 2 2
ma sin + ma
2
2
= mga cos + constant
T =
V
1 2 2 2 1 2 2
ma sin + ma + mga cos
2
2
!
L
L
=
d
dt
L =
d
dt
a sin
(3)
(4)
The right hand side of the latter is zero, so the EoM for is
d 2 2
ma sin = ma2 sin2 + 2ma2 sin cos = 0
dt
(5)
We have two coupled nonlinear differential equations of second order. A direct attack is
pretty difficult. However, conservation laws come to the rescue. We deal with these
generally later on, but in the present example we notice from the equation
d
dt
L
=0
L
= constant
Or
(6)
This is telling us about conservation of angular momentum (Lz ) about the vertical axis.
Substituting for from equation (6) into the EoM for (equation (3)) gives
ma2 = ma2 sin cos
Lz
ma2 sin2
2
mga sin
d =
d
1
dt = 2 ) to give
dt
2
1 2 2
L2z
ma = mga cos
+E
2
2ma2 sin2
where E is an integration constant. Rearranging we have
L2z
1 2 2
E = ma +
mga cos
2
2ma2 sin2
where the first two terms are the and parts of the KE, and the last term is the potential
energy V . So E is the total energy and this equation tells us about conservation of energy.
Conceptual Lessons:
1. Conservation of Lz arises because L is invariant under rotations about the z axis:
L
L
=0
= Lz = const.
We return later to the generic relation between symmetry and conservation laws.
2. Conservation of E: we show later that this arises similarly from
L
=0
t
which is also a symmetry (symmetry under time-translation).
Problem-Solving Lessons:
1. The explicit integration (done above) to get the conservation of energy was tricky and
unpleasant. In practice, unless asked for proof, one should write down this and any
other applicable conservation laws by inspection where possible. For the above example
we could (and should!) have written directly the conservation laws
ma2 sin2 = Lz = constant
L2z
1 2 2
ma +
mga cos = E = constant
2
2ma2 sin2
With Lz and E fixed by the initial conditions.
2. These are first order DEs and, as we showed explicitly, are the first integrals of our
two Lagranges equations. Generically, each symmetry gives a first integral, which is a
big step on the way to solving the equations. In some cases, like this one, there are as
many symmetries as dofs in which case we may be able to write down a complete set
of first order equations of motion without explicitly mentioning Lagranges equation
at all. Such a system is said to be integrable.
4
Inertial frame S 0
q2
(Looking down)
(x, y, z)
(x, y, z)
q1
y
t
x
Clearly T = 12 m|r|
requires care, but the
2 . To write this correctly in terms of the qs and qs
required result is given in equation (1) of Lecture 3.
r = [q]
S + q = ((q1 q2 ), (q2 + q1 ), q3 )
Hence the kinetic energy in the inertial frame S0 may be written as
i
1 h
T = m (q1 q2 )2 + (q2 + q1 )2 + q32
2
and the potential is V = V (q, t) an explicit t dependence is more than likely here if V is
time-independent in the lab frame.
Constructing L = T V and applying Lagranges equations gives, for q1
V
d
m(q1 q2 ) = m(q2 + q1 )
dt
q1
q2
V
q3
m
q1 = m 2 q1 + 2m q2
m
q2 = m 2 q2 2m q1
m
q3 = 0
These are, componentwise, the correct equations of motion for a particle in the frame rotating
with angular velocity = e z , as derived earlier (see Lecture 3, pages 2-3). Check this claim!
1
(wrong)
The reason this fails is that the information we need lies in the functional form of the
Lagrangian, not in its numerical value. To replace it with something else of equal value
in the actual motion, but of a different functional form, is not permitted.
L
qj
L
=0
qj
j = 1, , 3N M
Is there any way of stating the laws of dynamics which is manifestly independent of the choice
of the qs? The answer is yes, by means of a variational principle. The result (Hamiltons
principle) has many advantages, including a more direct proof of the equivalence of the
Lagrangian and Newtonian formulations.
To study this we need to develop the general theory of variational calculus. Since this is
not taught in any detail elsewhere in our (Mathematical) Physics degree programme, we do
it from first principles. Though not in itself part of mechanics, the variational calculus is a
powerful and important part of the physicists toolkit.
y (s)
Z s2
s1
|y(s)|2 ds
2
s1
s2
(1)
This is an example of a functional of the path y(s); we write it J[y(s)] or simply J[y]. The
square brackets remind us that the argument is already a function. J depends on the shape
of the function y throughout its range, not just on its value in a particular place.
Another example:
Z s2
L[y(s)] =
s1
dy
1 +
ds
!2 1/2
ds
(2)
This is the length of the curve y(s). Why? (Convince yourself we shall use this later.)
The following are not functionals of y(s), but merely functions:
y 0 (s)
sin y(s)
dy(s)
ds
These are merely functions because each depends on the value of y only in some infinitesimal
neighbourhood (near the point s).
Eulers Equation: We are often interested in functionals of the form
I[y(s)] =
Z b
a
F (y(s), y 0 (s), s) ds
F = [1 + y 02 ]1/2
y (s)
Z b
a
Z b
F
F (y, y 0 , s) + (s)
y
s1 = a
s2 = b
I =
(s)
F
+ 0 (s)
y 0
y0 , s
#
F
F
(s)
+ 0 (s) 0 ds
y
y
3
y, s
ds
where we no longer need subscripts on the partial derivatives (since all other variables are
held constant). The second term is now integrated by parts to give:
I =
Z b
a
"
F
F
(s)
ds + (s) 0
y
y
#b
Z b
a
d
(s)
ds
F
y 0
ds
Z b"
F
a
y
ds
F
y 0
!#
(s) ds
For I[y] to be stationary as required, I must vanish for any arbitrary small change (s).
This is only possible if the square bracket vanishes identically. Therefore
F
d
y
ds
F
y 0
=0
F
y 0
F
y
The obvious one: If F (y, y 0 , s) does not depend explicitly on y, so that F/y = 0, then
F
= constant (independent of s)
y 0
The less-obvious one: If F (y, y 0 , s) does not depend explicitly on s, so that F/s = 0,
then
F
y 0 0 F = constant (independent of s)
y
Proof: The total derivative of F (y, y 0 , s) with respect to s is:
dF
ds
F 0 F 00 F
y + 0y +
y
y
s
"
d
ds
d
=
ds
F
y 0
!#
y0 +
F 00 F
y +
y 0
s
F
F 0
y +
0
y
s
F
s
So if F/s = 0 as assumed, the left hand side is also zero. This proves the result.
This first integral is very important but much less obvious.
4
A Theorem of Euclid:
(x 1 , y1)
y (x)
(x 2 , y2)
x
Z x2
x1
dy
1 +
dx
!2 1/2
dx
F (y, y 0 , x) = [1 + y 02 ]1/2
F
y 0
d
=
dx
y0
[1 + y 02 ]1/2
F
=0
y
y0
= constant p
1 + y 02
y0 =
p
= constant, ie independent of x
1 p2
Hence the shortest path is a straight line. Obvious! But note the method !
Brachistochrone
(x 1 , y1 )
v=
2g(y1 y(x))
The time d required to fall an increment of arc-length ds = (1 + y 02 )1/2 dx along the curve
is d = ds/v, hence the transit time is
[y] =
where
Z x2
x1
Z x2
ds
F (y, y 0 , x) dx
=
v
x1
1 + y 02
2g(y1 y)
F (y, y , x) =
!1/2
F
F = constant
y 0
F y 02
F
=
y 0
1 + y 02
2g(y1 y) (1 +
y 02 )
y0
F
F = F/(1 + y 02 ) = constant
y 0
= constant
ds
(z)
z
d
a
dz
d
We have
A[(z)] =
Z d
z=d
2(z) ds
where
2
ds = 1 + (d/dz)2
i1/2
dz
Hence A = 2
Z d
F (, 0 , z) dz where
F = (1 + 02 )1/2
d
=
dz
2
ds = L.
In general this involves a version of the method of Lagrange multipliers: the procedure is to
minimise I J with an undetermined multiplier. This gives an Euler equation involving
, whose value is finally fixed by imposing the constraint itself. For examples, see text books.
Calculus of Variations for Many Variables: We now consider
I[{y(s)}] =
Z b
a
F ({y}, {y 0 }, s) ds
where {y} = y1 (s), y2 (s), , yp (s) is a set of p linearly independent functions of the (single)
variable s.
3
For a small variation in the paths {y(s)} {y(s)} + {(s)} we can expand
F ({y + }, {y 0 + 0 }, s) = F ({y}, {y 0 }, s) +
p
X
p
X
F
F 0
j (s) +
(s)
0 j
j=1 yj
j=1 yj
Demanding I = 0, and following exactly the derivation in the one dimensional case (and
using the fact that all the variations {} must vanish at the endpoints) one obtains a set of
p equations, the Euler-Lagrange equations
d
ds
F
yj0
F
=0
yj
j = 1, , p
F
yj0
F
= const.
yj0
=0
This is identical to the single-variable case. There is one such equation for each variable that
F does not depend on.
The less obvious one: Now consider the case when F does not depend explicitly on s.
Not surprisingly this leads to only one first integral (not one for each of our p independent
functions {y}). The result, whose proof is left as an exercise (see Tutorial Sheet), is
X
F
yj0 0 F = const.
yj
j
Hamiltons Principle
The above results show that Lagranges equations for dynamics, namely
d
dt
L
qj
L
=0
qj
j = 1 , 3N M
are neither more nor less than the Euler-Lagrange equations found by demanding that the
following integral is stationary:
S[{q(t)}] =
Z t2
t1
L({q}, {q},
t) dt
To make the correspondence, we just switch notation from {y(s)} to {q(t)} (bearing in mind
that dq/dt is conventionally written q rather than q 0 ).
Hence Lagranges equations are equivalent to Hamiltons Principle:
The quantity S =
Z t2
t1
S = 0
L dt is usually called the Action.
Z t2
t1
L({q}, {q},
t) dt is stationary under small variations.
Technical Note: Hamiltons principle involves choosing two fixed times t1 and t2 , and
(since the end points of the paths are not allowed to vary) specifying the initial and the final
coordinates, {q(t1 )} and {q(t2 )}. It is formally under these conditions that S is stationary.
This is a legitimate, but unusual choice of boundary conditions in a system with one second
order DE for each degree of freedom (which is the case in classical mechanics). More normal
would be to specify the initial coordinates and velocities, {q(t1 )} and {q(t
1 )}. Fortunately,
starting from Hamiltons principle, we can derive Lagranges equations and then solve them
for any set of boundary conditions we choose. Hence Hamiltons principle is true of any
possible motion of the system (ie any that obeys Lagranges equations of motion), between
any pair of times (t1 , t2 ), regardless of what boundary conditions are, in practice, applied.
Coordinate Invariance: Hamiltons principle states that
L dt = 0
where L = T V . The choice of generalised coordinates {q} need not be specified. Hence
Hamiltons principle is a coordinate-invariant statement of the laws of mechanics. As
such it is very powerful. For example it allows, for conservative systems, a much nicer proof
of the equivalence of Newtonian and Lagrangian schemes, as follows.
From Newton to Lagrange via Hamiltons Principle
1. Consider first a system of N particles with no constraints (3N dofs). The kinetic
energy is (in cartesians)
X1
mi x 2i
T =
2
i
Observe that T / x i = mi x i and that T /xi = 0. Using the first of these, Newtons
equations may be written
!
V
d T
=
dt x i
xi
where V = V ({x}, {6 x},
t) is a potential (which obeys V / x i = 0).
T
x i
V
T
d
=
+
+
xi xi dt
V
x i
where we have added two terms to the RHS, both of which are zero.
1
=0
dt x i
xi
2. We construct the action, S =
L({x}, {x},
t) dt, and deduce that S = 0 in cartesians.
f ({x}, t) = 0
4. (The clever bit.) We are free now to specify our choice of 3N coordinates {
q} =
(
q1 , q2 , , q3N ) as we wish. We make the following choice
{
q } = (q1 , q2 , , q3N M , f1 , f2 , , fM ) = {q, f }
In other words, we choose M of our coordinates to be precisely those functions that
are held constant by the constraints. What remains is a set of 3N M generalised
coordinates {q} which are linearly independent of each other and of the constraints.
5. Hamiltons principle states that, in an unconstrained motion, S = 0 or
L({q, f }, {q,
f}, t) dt = 0
To describe a constrained motion, we now demand that all the f s (and therefore the
fs) are zero identically. Accordingly these variables can just be dropped.
Indeed, defining L({q}, {q},
t) = L({q, 0}, {q,
0}, t), we are left with
L({q}, {q},
t) dt = 0
where {q} are our set of 3N M of independent generalised coordinates (one for each
dof that remains). Writing the Euler-Lagrange equations for this variational principle
gives Lagranges equations of motion for the {q} variables in their usual form.
Lesson: This shows the conceptual power of the variational formulation of mechanics. To
most people, the above proof is more appealing than the one starting from the cancellation
of dots rule of Lecture 5 though it is slightly less general (it doesnt include forces, such as
friction, which cant be written in terms of derivatives of a potential).
Technical Note: In Step 5 we could, if desired, impose the constraints explicitly, by adding
P
to the Lagrangian a term V constraint = 21 k f2 . Constraint forces will then appear in the
Lagrange equations for the {f } variables in such a way that, if we let each k (see
Lecture 4) the constraints will take hold and f 0. The trick here is that, by choosing
our generalised coordinates appropriately, we have arranged that V constraint has no effect on
the Lagrange equations for any of the remaining {q} variables.
2
L
qj
L
=0
qj
j = 1, , 3N M
S =
L dt = 0
L
qj
"
L X d
=
qj
dt
i
L0
qi0
L0 qi0
0
qi qj
If the LHS = 0, then [ ] = 0 i. Hence, if Lagranges equations hold for one set of
coordinates, they do for any other. Happily, the algebraic proof can be entirely bypassed by
invoking Hamiltons principle, whose coordinate invariance guarantees the result.
Changes of Lagrangian: If two Lagrangians L1 and L2 are related as follows
d
F ({q}, {6 q},
t)
dt
then L1 and L2 are equivalent Lagrangians, ie they give the same equations of motion. There
are two proofs (direct and via Hamiltons principle) see Tutorial Sheet .
L1 ({q}, {q},
t) = L2 ({q}, {q},
t) +
1X
mi x 2i V ({x}, {6 x},
t)
2 i
pj is conserved
dt
qj
The corresponding qj is called a cyclic or ignorable coordinate. A conserved momentum
(or other conserved quantity) is called a constant of the motion. In the language of Lecture 8,
a conserved momentum corresponds to an obvious first integral.
The statement that L/qj = 0 can be viewed as a symmetry property of L:
In other words, L is invariant under changes in qj . For the examples above, these changes
correspond to a translation (xi xi + ) or, for an angle, a rotation.
The energy function: We now define the energy function
X L
h({q}, {q},
t) =
qj L({q}, {q},
t)
j qj
Or equivalently:
h =
pj qj L
dt
ie h is a constant of the motion. This is an example of the less obvious first integral of
the Euler-Lagrange equations (see Lecture 8), whose proof is on the Tutorial Sheet. Hence
we obtain one of the most important results of the whole course:
Time Translational Symmetry of L
Conservation of h
The energy function h is a generalisation of the notion of energy. For many (but not all)
conservative systems we will show that h is identical to the total energy E = T + V .
Technical Note: h is numerically equal to the Hamiltonian (see later). There one expresses
the velocities in terms of the canonical momenta and coordinates, so that the Hamiltonian
becomes a function of the coordinates and their conjugate momenta:
H({q}, {p}, t) =
X
j
pj qj L({q}, {q},
t)
Beware: Prior to 2006/07, the energy function h was called the Hamiltonian function,
denoted by H. Both names can be confusing we now use the more conventional one.
4
h =
X
i
so that h = E
pi qi L =
X
ij
Tij qi qj T + V = 2T T + V = T + V
.
v = r
m
.
vr = r
h = pr r + p L
L
= mr
r
v = r
F constr
t
This constraint force acts perpendicular to the wire, in the direction of the velocity v = r.
Hence the constraint force does work on the bead at a rate
d
F constr v = 2m 2 rr = (mr2 2 )
dt
This is just
d
dE
(E h) =
dt
dt
(since h is conserved) which precisely accounts for the change in E.
Technical Query: But didnt we say earlier in Lecture 4 that constraint forces do no
work? Not quite, we said that they do no work in any virtual displacement consistent with
the constraints. The only virtual (that is, instantaneous) displacement possible in this system
is for the bead to move along r, which is perpendicular to F constr and hence no work is done.
This does not rule out constraint forces doing work in a non-instantaneous change, including
the actual motion, as the above example shows.
More on Symmetries and Conservation Laws
We now go back to first principles. Consider an isolated system of N particles in cartesians
r a = (xa , ya , za ), where a = 1, , N . (This explicit notation is more suited to our current purposes than the {x} notation.) We now prove the conservation of the total linear
momentum and total angular momentum for this system.
Homogeneity of Space: We demand that space is homogeneous, ie its the same everywhere. If all particles are subjected to the same small displacement (keeping the velocities
r a fixed):
ra ra +
then L is invariant. This is symmetry under translations of the entire system a bit different
from varying one coordinate on its own. If so, we must have (using Taylors theorem to
expand L):
!
L =
X
a
(r ) L r a =
a
X
a
(r ) L = 0
a
where (r ) L is the gradient of L wrt r a , ie its components are (L/xa , L/ya , L/za )
a
and the dot product is the usual three-dimensional one. Since is arbitrary, we have
X
a
(r ) L
a
= 0
dp a
dP
=
=0
dt
dt
(1)
Isotropy of space: We now demand that space is isotropic: ie, that L is invariant
under rotations of the entire system of N particles about an arbitrary axis. For a rotation
b through a small angle about an axis ,
b we have
r a = r a
r a = r a
which are the usual transformation laws for vectors under infinitesimal rotation. Now
L =
X
a
(r ) L r a +
a
X
a
(r ) L r a = 0
a
X
d
(r ) L ( r a ) +
(r ) L ( r a ) = 0
a
a
dt
a
X
a
d
ra
(r ) L
a
dt
or
"
r a (r ) L
a
!#
d X
r (r ) L
a
dt a a
#
= 0
= 0
This holds for any small rotation , hence the term in square brackets vanishes identically.
Hence
X
X
=
r a p a L = constant
r a (r ) L
a
The above proofs make no use of Newtons third law in either form! The third law is thereby
exposed as a red herring.
Symmetries and Conservation Laws: Summary
L invariant under:
qj qj +
Conserved Quantity
pj
P
tt+
h=
Global Translations
Global Rotations
Poincare Transformations
Energy-momentum Tensor
Gauge Transformations
Electric charge
pj qj L
In the last two examples we have strayed rather beyond the course boundaries. . .
4
L
L
=0
qi
qi
T
qi
T
= Qi
qi
in which there will now be an explicit dependence on the velocities. Suppose, however, we
can find a function V ({q}, {q},
t) such that
d
Qi =
dt
V
qi
V
qi
L
L
=0
qi
qi
but now with L = T V . Clearly, V is not a conventional potential energy. However, the
formal structure of the theory, including Hamiltons principle and any relevant conservation
laws, carries over.
Lorentz Force: The construction above works for a very important special case: a particle
of charge e in an EM field:
mr = F = e(E + r B)
where E & B are the electric and magnetic fields respectively.
A
t
and
1
B =A
Here is the electrostatic potential, and A is the magnetic vector potential. Since this is not
a course on electromagnetism, we take these results on trust (see EM lectures or handout).
We shall show explicitly that the following function
V (r, r , t) = e [(r, t) r A(r, t)]
does exactly the job we require.
Proof: The derivation is a little complicated, but the result is sufficiently important to
include it here. We will work in Cartesians using suffix notation, r = xi e i (i = 1, 2, 3), and
(for this section only) use the summation convention. We also need the standard result
ijk klm = il jm im jl
where ijk is the usual Levi-Civita or epsilon symbol (so (a b)i = ijk aj bk ).
Writing E & B in terms of the potentials, the ith component of the Lorentz force becomes
!
A
Fi = e
+ r ( A)
t
Am
xl
Am
= (il jm im jl ) x j
= x j
xl
so the Lorentz force becomes
"
Ai
Ai
Aj
+ x j
Fi = e
xi
t
xi
xj
Our task is to identify a function V (r, r,
t) such that
!
V
d V
Fi =
dt x i
xi
Aj
Ai
xi
xj
!#
(1)
V
+
(x j Aj ) =
with
V e ((r, t) x j Aj (r, t))
e
xi xi
xi
(2)
Now for the less-obvious bit. Recalling that r(t) is the position of a moving particle, so that
Ai (r, t) really means Ai (r(t), t), we may verify that
!
"
#
d V
d
= e
(r, t) x j Aj (r, t)
dt x i
dt x i
d
= e
0 Ai (r, t)
dt
"
Ai Ai
= e
+
x j
t
xj
which we identify with the second and fourth terms in equation (1). We have thus shown
that V as defined in equation (2) has the desired property that:
!
!
A
d V
V
e
+ r ( A)
=
with
V = e ( x j Aj )
t
dt
x
i
i
i
This completes the proof.
2
X
1 X 2
L= m
x i e
x i Ai
2
i
i
L
= mx j + eAj
x j
h =
1 2
mv + e(r, t)
2
1 2
mv e( v A)
2
This is conserved only if L has no explicit time dependence, which requires that both and
A are not explicitly time dependent (which in turn implies E & B fields which dont have
explicit time dependence.)
Technical Note: Under such conditions, h can be viewed as the sum of the kinetic and
electrostatic potential energies. This is because the magnetic force v B is perpendicular
to the velocity and therefore does no work. If B is time dependent, the latter is still true,
but now E = A/t, which includes a part that does not derive from .
Example: A particle moves in a steady toroidal B field for which = 0 and
A = (0, 0, Az ) = (0, 0, B(x2 + y 2 )1/2 ) = (0, 0, B)
y
x
y
x
B = A = (Bx , By , Bz ) = B
, 2
,0 =B
, ,0
2
2
1/2
2
1/2
(x + y )
(x + y )
so the magnetic field has constant magnitude B. The field lines are circles in the xy plane:
A
y
x
B
The Lagrangian is
1
L = m(x 2 + y 2 + z 2 ) + ezA
z
2
has no explicit z dependence, then nor does L, and so
L
= mz + eB(x2 + y 2 )1/2
z
is conserved. This is our first constant of the motion.
pz =
On the other hand, there is explicit x and y dependence (via Az ) so px and py are not
conserved.
But since L has no explicit time dependence h = 21 m(x 2 + y 2 + z 2 ) + 0 is conserved (as
discussed above); this is the second constant of the motion.
Special Relativity
Lagrangian methods can be used in relativity. We have time for a brief treatment of this
important topic here.
Lagrangian for a Relativistic Particle
Suppose we postulate the following Lagrangian for a particle of mass m moving with velocity
u in a potential V (r)
L = mc2 [1 u2 /c2 ]1/2 V (r) =
1
mc2 V (r)
(u)
with u2 = x 2 + y 2 + z 2 . Then
px =
or in vector notation
L
mx
=
= (u)mx
x
[1 u2 /c2 ]1/2
p = (u) mu
This is the correct relativistic expression for momentum. Lagranges equations are
p = L = V
which is the expected result. Since this Lagrangian gives the correct equations of motion, it
is an acceptable choice for a relativistic particle.
4
d
dt
(2)
electromagnetic four-vector potential A = (/c, A). Relativity then tells us the form of
the Lagrangian because the only Lorentz-invariant quantity we can construct from these two
four-vectors is
dt
(3)
u A = (u)( u A) =
( u A)
d
Hence, from equations (1), (2) and (3), the action S may be written in the simple form
S =
L dt =
2 d
d
mc
eu A
dt
dt
dt =
Z
mc2 + eA u d
Because relativity can be cast as a Lagrangian theory, the fundamental connections between
symmetries of the Lagrangian and conservation laws still apply.
Hamiltonian Dynamics from Lagrange to Hamilton
We only have time for an elementary introduction to Hamiltonian Dynamics. However, this
should be sufficient to give a flavour of its formulation and to illustrate its connection to
quantum mechanics. For a full treatment, see the year 4/5 course Hamilton Dynamics.
For a general system with 3N M dofs, we defined the energy function
h({q}, {q},
t) =
where
pi =
X
i
pi qi L({q}, {q},
t)
L({q}, {q},
t)
qi
1
(4)
We first solve equation (4) for the generalised velocities in terms of the generalised coordinates
and their conjugate momenta, ie qi = qi ({q}, {p}). We then define the Hamiltonian as a
function of ({q}, {p}) by the so-called Legendre transformation:
H({q}, {p}, t) =
X
i
pi qi L({q}, {q},
t) .
(5)
The change of variables looks harmless but has deep consequences. The (3N M ) generalised
coordinates {q} and their (3N M ) conjugate momenta {p} now enter on an equal footing,
and can more obviously be varied independently.
Example: For a particle moving in one dimension with a conservative potential V (x),
H = E = T + V and the Hamiltonian is (exercise)
H(x, p) =
p2
+ V (x) .
2m
H =
"
H
H
H
qi +
pi +
t
qi
pi
t
(6)
X
i
X
i
"
L
L
L
qi
qi
qi pi + pi qi
t
qi
qi
t
[qi pi pi qi ]
L
t
t
(7)
(8)
to cancel the 2nd and 4th terms in the square bracket, and the Euler-Lagrange equations
in the form pi = L/qi to obtain the final result. Comparing equations (6) and (7), which
must hold for all qi and pi , we obtain Hamiltons Equations of motion
qi =
H
,
pi
pi =
H
qi
and
L
H
=
t
t
Essentially, we have converted 3N M Lagranges equations (2nd order DEs for {q}) into
exactly twice as many Hamiltons equations (1st order DEs for {q} and {p}), plus the
(almost trivial) last equation for H/t.
Hamiltons equations may also be derived directly by applying Hamiltons Principle, S = 0,
to the action in the form (tutorial)
S =
Z (X
i
pi qi H({q}, {p}, t)
2
dt
X
i
"
A
A H A H
+
.
qi pi
pi qi
t
.
qi pi pi qi
i
The derivative wrt time of any quantity A({q}, {p}, t) may then be written in terms of the
Poisson bracket of A with the Hamiltonian
dA
A
= {A, H}PB +
.
(9)
dt
t
In particular, Hamiltons equations may be expressed as
qi = {qi , H}PB
Notice the similarity in form to the Heisenberg equations of motion in quantum mechanics.
Lorentz Force: For a non-relativistic charged particle in an EM field, we showed that the
energy function expressed in terms of the particles position and velocity is
1
t) = p r L = (mr + eA) v [ mv 2 e( v A)]
h(r, r,
2
1
2
m|v| + e
=
2
To obtain the Hamiltonian, we recall that the canonical momentum is p = mv + eA We then
1
solve for v in terms of p to obtain (trivially) v =
p eA , so the Hamiltonian is simply
m
1
H(r, p, t) =
|p eA|2 + e
2m
which is probably not what you would first think of for H. It is left as a tutorial problem to
check that Hamiltons equations lead to the correct Lorentz force.
From Classical to Quantum: H({q}, {p}, t) corresponds very closely to the Hamiltonian
operator in quantum mechanics. Specifically, to get quantum mechanics for a system with
one generalised coordinate q we replace
!
q, ih , t
H(q, p, t) H
q
keeping the functional form of H the same. Thereby a classical function is converted into a
quantum operator which acts on the wavefunction (q, t) obeying the Schrodinger equation.1
1
For a system with more than one degree of freedom, the naive quantisation prescription pi ih/qi
works for Cartesian coordinates, but it must be modified slightly for general curvilinear coordinates.
p2
h
2 2
H=
+ V (x) H =
+ V (x)
2m
2m x2
1
|p
2m
eA|2 + e
h2 2
e
h
e2
=
H
[2A + ( A)] +
|A|2 + e
2m
2im
2m
Canonical Quantisation: A more general form of quantisation begins with the Poisson
brackets for the classical quantities A and B, say, from which the canonical commutation
are obtained by the following prescription
relations between the quantum operators A and B
i
h{A, B}PB
B]
[A,
Using this method, the quantum equation of motion for the operator (in the Heisenberg
picture see SH Quantum Theory) may be postulated almost trivially from equation (9)
ih
h
i
dA
H
+ ih A
= A,
dt
t
The same equation gives directly the expectation values of operators in the Schrodinger
picture see Junior Honours Quantum Mechanics, Section 6, Page 1.
Feynman Path Integrals: Finally, there is a formulation of quantum mechanics due to
Feynman which is based on the Lagrangian rather than the Hamiltonian. It says. . .
The quantum amplitude for a particle in one dimension to propagate from position xi at
time ti to position xf at time tf is the weighted sum over all paths between the two points:
Draw all possible continuous paths the particle can take between xi at ti and xf at tf .
Calculate the classical action S =
Z tf
ti
L(x, x)
dt for each path.
The amplitude hxf , tf |xi , ti i for the particle to propagate from position xi at time ti to
position xf at time tf is the weighted sum or integral over all possible paths:
hxf , tf |xi , ti i = N
exp (iS[x(t)]/
h) = N
all paths
Dx(t) exp
i Z tf
L(x, x)
dt
h
ti
where N is a normalisation constant, and the path integral is over all continuous functions
x(t). All paths are treated on an equal footing one simply sums the phases for all paths. The
particle does whatever it likes and the th path contributes a factor Z = exp(iS[x (t)]/h)
to the amplitude. Hamiltons principle emerges in the limit h
0, but to actually show this
is beyond the scope of this course. See Senior Honours Quantum Theory for details.
= F ext
MR
L = G ext
We also showed, for angular momentum L about some stated axis and for kinetic energy T
L = J +RP
1
T = Tc-of-mom + M V 2
2
(1)
(2)
ie, the value in a general (inertial) frame is the value in the centre of momentum (c-of-mom)
frame plus a contribution from the motion of the centre of mass, treated as though it were
a point particle.
NB Remember that in the c-of-mom frame, the intrinsic angular momentum J does not
depend on the origin chosen (though, because of the second term in equation (1) this will
not be true for L in a general frame). Recall that the c-of-mom frame is an inertial frame (not
a rotating frame) in which the centre of mass is stationary, but not necessarily at the origin.
In such a frame, at least instantaneously, the motion of a rigid body is purely rotational.
Eulers Theorem: Any displacement of a rigid body with one point fixed in space can be
described as a rotation about some single axis
This is surely obvious(!), and the formal proof is not illuminating. To specify a finite rotation
requires an axis (unit vector, two degrees of freedom) and a magnitude of the rotation (one
dof). Hence we will need 3 generalised coordinates to specify the centre of mass motion
and three for the angular motion. In practice, there are many ways to choose three angular
coordinates: we do not make an explicit choice at this stage.
Technical Note: In choosing angular coordinates, thereR is some subtlety since finite rotations about different axes do not commute. Hence (t) = 0t (t0 ) dt0 is not a valid generalised
coordinate; the same value of this quantity can be obtained for different sequences of rotations and hence different actual orientations of the system. Hence the need for Euler
angles introduced later.
The Inertia Tensor: Working in the c-of-mom frame and choosing our origin at the centre
of mass, the instantaneous velocity of the ath particle is
r a = (t) ra
r a pa =
ma (ra ( ra )) =
ma (ra r a )
X
a
ma ra2
X
a
ma ra (ra )
(3)
(using a (b c) = (a c)b (a b)c). The first term is a vector parallel to , the second
term is a vector that is, in general, not parallel to .
The intrinsic angular momentum J is thus a vector which depends linearly on (for example,
J(2) = 2J()) but need not be parallel to it. So there must be a relationship
J = I
(4)
X
a
X
a
ma
(ya2 + za2 )
xa ya
xa za
xa ya
(x2a + za2 )
ya za
(5)
xa za
ya za
2
2
(xa + ya )
(6)
The expression for I explicitly involves the particle coordinates ra , which are measured
relative to the centre of mass. For a rigid body, these coordinates can be taken as constants,
though this applies only in a frame of reference in which the body is completely stationary
(ie, r a = 0, a). For a body that is actually rotating, this is a noninertial frame, the body
frame which rotates and translates with the body. Despite this inconvenience, one defines
the inertia tensor I to be evaluated in such a frame, that is, with respect to a set of x, y, z
axes fixed in the body. The inertia tensor is therefore a time independent characteristic of
how the mass is distributed in the body.
Kinetic Energy: In the centre of momentum frame, the KE is
X1
T =
ma ( ra ) ( ra )
a 2
Principal Axes: In this course, we will generally not need the explicit form of I found
above; we take I as given. But it is important that (however irregular the body) the tensor
I is symmetric. Hence there exist three real eigenvalues, I1 , I2 , I3 , and three mutually
perpendicular eigenvectors e 1 , e 2 , e 3 .
Choosing these to define new axes, the Principal Axes (PA), fixed in the body, we have the
diagonal form
I1 0 0
I = 0 I2 0
0 0 I3
Where (I1 , I2 , I3 ) are the Principal Moments of Inertia. Moreover, in this coordinate system
J = (I1 1 , I2 2 , I3 3 )
and
T =
1 2
I1 1 + I2 22 + I3 32
2
Here lies the central problem: These nice axes are fixed in the body. But the body is
rotating. Hence in lab coordinates (or any other inertial frame), the principal axes have timedependent orientations. This is why rigid body motion is not a straightforward application
of linear algebra.
Shift of Origin: We have placed the origin of our coordinate system for the rs at the
centre of mass O of the body. This is the best choice for a body undergoing arbitrary motion,
or whose centre of mass is constrained to be fixed in space. In systems where some other
point O0 is constrained to be fixed instead (for a spinning top, this is usually the point of
contact with the table) one can use the usual formula L = J + M R P to calculate the
angular momentum L with respect to O0 .
Centre
of Mass
O
Pivot at O
Euler versus Lagrange: At this point there are two divergent approaches:
Eulerian Approach: Study equations in a noninertial frame (the body frame) rotating
with the body, so that principal axes are constant in time. Difficulty:
J = G ext
where G ext is (usually) specified wrt axes fixed in space (ie in the lab). We cant translate
G back into the body frame until after we have found the motion! In practice, the Eulerian
approach is therefore useful only for kinematics (motion without external forces): G ext = 0.
Lagrangian Approach: Choose a convenient set of generalised coordinates (in practice,
three angles specifying the orientation of the principal body axes with respect to another set,
fixed in space). Then construct L and differentiate as usual to obtain equations of motion.
This is more powerful but less intuitive; we follow Eulers approach first.
Eulers Equations Of Motion: These describe motion in the principal axes frame S
rotating with the body this is not an inertial frame. Consider S0 , an inertial frame instaneously coinciding with S. As we saw previously, if the instantaneous angular velocity of
the body is , then for any vector A
S0 = [A]
S + [A]S
[A]
Now, in the inertial frame S0 , we have the usual equation of motion
S0 = G ext G
[L]
where G is the external torque on the system. (The label ext is implicit from now on.)
Accordingly in the body frame S
S + [L]S
G = [L]
Now, since S is the frame of the principal axes (see above)
[L]S = J = (I1 1 , I2 2 , I3 3 )
In this coordinate frame, the equation for G therefore reads
(G1 , G2 , G3 ) = (I1 1 , I2 2 , I3 3 ) + (1 , 2 , 3 ) (I1 1 , I2 2 , I3 3 )
or, expanding this out
G1 = I1 1 + (I3 I2 ) 2 3
G2 = I2 2 + (I1 I3 ) 3 1
G3 = I3 3 + (I2 I1 ) 1 2
These are Eulers Equations of Motion.
For the reasons stated above, the useful case is when G = 0; then Eulers equations become
I1 1 = (I2 I3 ) 2 3
I2 2 = (I3 I1 ) 3 1
I3 3 = (I1 I2 ) 1 2
4
1 =
(1)
(2)
1 = 2 1
2 = 2 2
Both of these are equations for SHM at angular frequency . Since 2 = 1 , there is a
/2 phase difference between 1 and 2 (eg, if 2 = a cos(t + ), then 1 = a sin(t + )).
This corresponds to circular motion of in the
(1,2) plane, with a constant 3 .
Hence precesses about the 3 axis with precession frequency = (I1 I3 )3 /I1 . This precession
of in the body frame traces out the body cone.
Since 3 is constant, and so is 12 + 22 (circular
motion in the 1, 2 plane) we have || = constant.
Since J = (I1 1 , I2 2 , I3 3 ), and I1 = I2 , |J| is
constant for the same reason, and so is J .
The fact that J , J e 3 and e 3 are all constants means that J, and e 3 make constant
angles with each other, and are coplanar because I1 = I2 . Hence and J precess together
at frequency about the 3 axis at inclination angles = cos1 (3 /||), = cos1 (I3 3 /|J|)
which are fixed by the ICs. Here, > because I3 < I1 = I2 .
1
e3
Note: If all three principal moments are equal (isotropic body), Eulers equations reduce to
= 0. This is trivially true because J and are parallel for this case.
Symmetric Top in the Lab Frame: We now have to translate these results into the
(inertial) lab frame. We assumed G = 0 and therefore, in the lab, J is constant. Clearly, J,
and e3 remain coplanar in this frame. Hence we have the picture:
space
cone
e3
body
cone
So, traces out the surface of a cone (the space cone) about J. Since simultaneously
executes a cone about e3 (see above), we find that the body cone rolls without slipping
around the outside of the space cone a nice picture if you like geometry! If I3 > I1 = I2 ,
the body cone rolls inside the space cone.
An example is the free precession of the earth (wobble): 3 = 2 per day; the asphericity
(I1 I3 )/I3 ' 0.3% giving a precession period of about 300 days. The observed value is 430
days because the earth is not quite a rigid body (it has a molten core), and the amplitude
is about 10m. (Note: this phenomena is not precession of the equinoxes.)
Asymmetric Top: the Tennis Racquet theorem: When all three principal moments of
inertia are different, the general solution of Eulers equations for torque-free motion involves
elliptic integrals and is not illuminating.
We instead ask the following: Can one have motions where is constant in the body frame?
Inspection of Eulers equations shows that this is only possible if two of the components of
(say, 1 and 2 ) are zero. Hence
An asymmetric body can only rotate with constant angular velocity if the rotation axis is one
of the principal axes.
Is the resulting motion stable? To investigate, write 1 = 1 , 2 = 2 and 3 = + 3 where
the s are small. Expanding Eulers equations to first order in s gives
I1 1 = (I2 I3 ) 2
I2 2 = (I3 I1 ) 1
2
I3 3 = 0
Eliminating 2 (say) gives the second order equation
I1 I2 1 = (I2 I3 )(I3 I1 ) 2 1
The same equation holds for 2 . The solution is
1 = Aeit + Beit
with
(I1 I3 )(I2 I3 ) 2
I1 I2
We conclude that if I3 is either the smallest or the largest moment of inertia, then the RHS
is positive and is real. Hence perturbations to steady motion about this axis oscillate but
do not grow. In contrast, if I3 is the middle axis, is imaginary and perturbations grow
exponentially. This is the tennis racquet theorem (easily demonstrated with said object or
similar):
2 =
Steady rotation about the intermediate axis is unstable; about the major or minor axis, it is
stable.
Lagrangian Dynamics of Rigid Bodies
We now turn to the Lagrangian approach, which is suitable for systems in which there is an
external torque acting.
Choice of Origin: We will be concerned almost exclusively with spinning objects (tops)
with a point O held fixed:
3
Centre
of Mass
Pivot at O, fixed
The fixed point is usually not the centre of mass. From now on we denote by I the inertia
tensor about O (before this was called I 0 ). The angular momentum (about O) of such a
body is
L = I
and its kinetic energy (including centre of mass motion remember Eulers theorem) is
T =
1
1
I = (I1 12 + I2 22 + I3 32 )
2
2
y
x
This sequence defines (, , ) the three Euler angles specifying the orientation of an arbitrary
rigid body. The main advantage of the Euler angles over other choices is that the various
conservation laws emerge more tidily.
4
1
1
I = (I1 12 + I 2 22 + I 3 32 )
2
2
e2
e2
e
e1
e 1 e01 =
cos
e 2 e01 = sin
e 1 e02 = sin
e 2 e02 = cos
e3
^z
e1
The primed set are an orthonormal basis of unit vectors along the , and r directions
respectively (just as one would define by ordinary spherical polar coordinates for a single
particle); e02 is horizontal whereas the e01 e03 plane is vertical.
Now lets try to get the components of in the primed basis set. From the definition of the
Euler angles, we identify three contributions to the angular velocity
= rate of rotation about the 20
axis
= rate of rotation about the z
axis
= rate of rotation about the 30 = 3 axis
The problem here is that the 20 , 30 and z axes are not all orthogonal and so these three terms
are not directly the vector components of in any orthonormal basis. Nonetheless we can
add the three contributions to get
= e02 + z + e03
and decompose
z e01 ) e01 = e03 cos e01 sin
z e03 ) e03 + (
z = (
X
i
i0 e0i
Finally, since e03 = e 3 we have e 1 e03 = 0, and e 2 e03 = 0. Hence (using also the results for
e 1 e01 , etc, given above)
1 = e 1 = sin cos + sin
2 = e2 =
sin sin + cos
3 = e =
cos +
3
This completes the construction of the angular velocity components referred to body axes;
these can be substituted into the usual expression
1
T = (I 1 21 + I 2 22 + I3 32 )
2
to give
,
,
t) =
T (, , , ,
1
I 1 ( sin cos + sin )2
2
1
1
2
+ I 2 ( sin sin + cos )2 + I3 ( cos + )
2
2
2
Tij qi qj
ij
but, unlike earlier examples, the matrix Tij is explicitly dependent on the coordinates {q}.
Symmetric Top: Life is a lot simpler if the 3 axis that passes through the pivot is a
symmetry axis of the body. Then I 1 = I 2 = A (say). Denoting I3 = C we have
i
1h
A( 21 + 22 ) + C32
2
i
1h
2
2
=
A( 0 1 + 0 2 ) + C32
2
i
1 h 2 2 2
2
=
A( + sin ) + C( cos + )
2
T =
where we noted that the 1 and 2 axes differ from 10 and 20 by rotation in the (1, 2) plane, so
12 + 22 = 0 1 2 + 0 2 2 . Hence the degeneracy of the two moments of inertia, I1 = I2 , means
that the explicit transformation from e01, 2 to e 1, 2 is not required in this case. An explicit
calculation gives the same result of course check it! Equivalently, we may assert that T is
independent of by symmetry, so that we may evaluate it for = 0.
Accordingly for the symmetric top pivoted at O
i
1 h 2 2 2
2 M gl cos
A( + sin ) + C( cos + )
(1)
2
Note: In examination questions, this result would either be given, or you would be asked to
quote it. Although a full derivation along the lines given above will not normally be asked
for, an explanation or interpretation of the various contributions may be required.
L=
Conservation Laws and Equations of Motion: We consider the generalised coordinates in turn:
1. L does not depend explicitly on . Hence
p
L
= C( + cos ) = I3 3
3
is a conserved quantity. The reason for this conservation law is the symmetry of L
under rotations of the body about its 3 axis. Since the centre of mass lies on the 3
axis, the gravitational torque cannot change 3 . (This is a good reason for choosing
as one of our Euler angles.) It is conventional to write this as conservation of
n 3 = ( + cos )
where n is called the spin of the top.
(2)
2
X
i=1
qi pi L
is a third conserved quantity. For this system, T is quadratic in the velocities (see
above), in which case h coincides with E = T +V , the total energy, as proved generally
in Lecture 11. (Anyway it is clear that E is conserved!) To get information about the
motion, we first deduce, using (say) equation (1), that
A
C
E = (2 + 2 sin2 ) + n2 + M gl cos
2
2
This can be written
2E Cn2 2M gl cos
2 + 2 sin2 =
A
A
2
C
A 2 2 2
( + sin ) + n2 M gl cos
2
2
and then obtain Lagranges equation for by differentiation as usual, treating n as constant.
(A similar mistake was discussed in Lecture 8, but this case is more dangerous: the result is
wrong, but not obviously so.) Of course, you can always use n as shorthand for ( + cos )
in Lagranges equations, but then it is vital to remember that some of the n/qi and n/ qi
are non-zero in order to get the right answers. To be safe, it is probably best to write out L
longhand in terms of the chosen generalised coordinates and velocities.
Zone Function: The fact that is real requires 2 > 0 (remember Lecture 1). This defines
for given n, Lz , E a zone of allowed values of the quantity
u cos
u = sin
u = f (u)
(Lz Cnu)2
2M gl
2E Cn2
(1 u2 )
u(1 u2 )
2
A
A
A
u2
allowed zone
U ()
min
Umin
Zone
o
min
steady precession
Note that U () becomes very large as 0, , and that it is bounded from below, as in the
figure. Obviously, the potential U is different for each value of Lz and n. This particle
analogy gives a physically appealing alternative to the zone-function approach.
We deduce that steady precession (a motion with = 0) is possible, but that this requires
E 0 = Umin (Lz , n) and = min (Lz , n). For this case = constant, and since
Lz = A sin2 + Cn cos
n = + cos
are both conserved, then and are also constants for this particular motion.
To find 0 min , we set dU/d = 0, giving (after some algebra and substitutions):
h
sin 0 2 A cos 0 Cn
+ M gl
= 0
(1)
This result may also be obtained trivially from the Lagrange equation for (equation (2) of
the previous lecture) by noting that = 0 implies = 0.
Equation (1) has two solutions:
1. sin 0 = 0: an upright top. This is called the sleeping top; we return to it later.
2. 0 6= 0, in which case, the term in square brackets in equation (1) is zero and
Cn C 2 n2 4AM gl cos 0
=
2A cos 0
so that if cos 0 > 0 (corresponding as usual to a top that does not hang down below
its support), the existence of steady precession requires
C 2 n2 4AM gl cos 0
In other words, the top must be spinning fast enough if it is to show steady precession
at an upward tilting angle 0 .
2
Fast Top (Gyroscope): For large enough spin we can binomial expand our equation for
the angular velocity of steady precession
Cn [C 2 n2 4AM gl cos 0 ]
=
2A cos 0
"
1/2
2AM gl cos 0
Cn
1 1
'
2A cos 0
C 2 n2
#!
Cn
A cos 0
These limiting forms can readily be checked by other methods: for slow precession, one can
balance the gravitational torque against a constant L ; for free precession the result can be
obtained from Eulers equations of motion (with zero torque). Note that in real gyroscopes,
the free precession is rapidly damped by air friction so the slow precession is what dominates
the observed behaviour.
Nutation: Suppose now we increase the energy (at fixed Lz ) so that E 0 is slightly above
Umin ; in this case (by analogy with our 1-D particle) we have oscillations superposed on the
steady precession.
The path traced by the 3 axis looks like
Z
Z
Zone
Path traced
by 3 axis
OR
etc.
nut = (U 00 (0 )/A)
If the spin is large enough (the formal requirement is Cn (4AM gl cos 0 )1/2 ) one has
nut and there are many nutational wobbles per precession period (ie, per complete
circuit of the 3 axis about the vertical).
3
Sleeping Top: We finally return to the case 0 = 0, that is, we look for steady motion of
an upright top. Lagranges equation for the motion is (see Lecture 16, page 4, item 3)
A = A 2 sin cos Cn sin + M gl sin
Now expand for small
A = (A 2 Cn + M gl)
(2)
2
Cn2 /2 = A
Cn
=
2A
This is the precession frequency of a sleeping top which is perturbed infinitesimally away
from the vertical. Substituting back into equation (2) for gives
A =
C 2 n2
+ M gl
4A
For n2 > 4M glA/C 2 , this is SHM and the sleeping top is stable. If the spin drops below
this value, any perturbation in will grow exponentially and steady motion in an upright
position is unstable.
The sleeping top will therefore spin happily in an upright position until such time as its spin
drops below the threshold for stability. (The spin decreases slowly due to air friction.) Then,
since there are always perturbations from somewhere, it suddenly wakes up and performs
an elaborate motion it goes haywire!
{q} = (q1 , q2 , , qD )
Static equilibrium (no motion) is possible only if all the generalised forces Qj vanish together
at some set of coordinates {q}eq . This defines the equilibrium position. (There may be more
than one such position.)
Let us redefine our coordinates: {q} {q} {q}eq so that the equilibrium position is at
the origin; then
V ({q})
= 0
j
Qj ({0}) =
qj {q}=0
If we Taylor expand V about this point, all terms linear in {q} vanish, and we have
1 X 2V
V ({q}) = V ({0}) +
qi qj + O({q 3 })
2 ij qi qj
where V ({0}) is just a constant (which we can ignore from now on). We write
1
V ({q}) = qi Vij qj
2
where from now on we use the summation convention (repeated indices are summed over)
and we have defined
2V
({0})
Vij =
qi qj
which is a constant D D matrix of second derivatives (evaluated at the origin). Since
V is assumed differentiable, this matrix must be symmetric. Moreover if {q = 0} is a
local minimum, then the matrix Vij is positive semi-definite: all the eigenvalues are zero
or positive. (Otherwise we could lower the potential energy by moving slightly along any
direction in q-space corresponding to a negative eigenvalue.)
Kinetic Energy: We assume that the kinetic energy is a time-independent quadratic form
in the velocities (thus ruling out some systems with time-dependent constraints). Then
1
T = T ({q}, {q},
6 t) = qi Tij qj
2
where in general the matrix Tij will depend in some arbitrary way on the coordinates {q}.
However, for small displacements about the equilibrium position {q}eq = {0} we can again
take Tij Tij ({q = 0}) to be a constant matrix. It is convenient to choose Tij to be
symmetric. This is always possible since any asymmetric form Tij can be symmetrised to
(Tij + Tji )/2 without changing the above result for T . (Check for yourself.) Since kinetic
energies are generally positive, the matrix Tij is positive definite: all the eigenvalues are
greater than zero.
The Lagrangian: For small displacements, we have the Lagrangian
1
L = T V = (qi Tij qj qi Vij qj )
2
Lagranges equations become
d
dt
L
qi
L
qi
Tij qj = Vij qj
i = 1, , D
By assuming small displacements they have been linearised; accordingly they can be solved
by matrix methods.
Normal Modes: These are solutions of the equations of motion in which all the generalised
coordinates oscillate with the same angular frequency :
qj = aj cos(t + j )
In such a motion, the equations of motion read
or, setting 2 ,
2 Tij aj + Vij aj = 0
(Vij Tij )aj = 0
This is a generalised eigenvalue/eigenvector problem, ie its not the standard one, which
would be to solve (Mij ij )aj = 0 where Mij is symmetric and ij is the ij th identity
of the identity matrix see, eg MP2B or FoMP lecture notes. Fortunately, the method of
solution is the same:
1. Find the eigenvalues (1) , (2) , , (D) by solving
det (V T ) = 0
where the D D matrices V and T have components Vij and Tij respectively.
2. For a given (k) , write
(k)
This defines an (un-normalised) eigenvector e(k) for the k th mode, with j th component
(k)
ej . There is a special feature here: since Tij is not ij , these eigenvectors are in general
not orthogonal. However, they are complete (proved later).
2
x1
x2
x3
1
1
m(x 21 + x 23 ) + M x 22
2
2
1
1
1
=
K(x1 x2 )2 + K(x3 x2 )2 = K(x21 + 2x22 + x23 2x1 x2 2x3 x2 )
2
2
2
T =
V
Choosing (q1 , q2 , q3 ) = (x1 , x2 , x3 ), T and V take the required matrix form with
K K
0
Vij = K 2K K
0 K
K
m 0 0
Tij = 0 M 0
0 0 m
Our eigenproblem is then
K m
K
0
2K M
K
det K
=0
0
K
K m
Expanding the determinant (exercise) gives
(K m) (mM (M + 2m)K) = 0
which has roots
(1) = 0
(2) = K/m
(3) = K(2m + M )/M m
Now to find the eigenvectors e(k) we solve
(k)
(2)
(3)
where A and v are integration constants. To avoid this pathology one normally restricts attention to positive definite Vij , for which all the frequencies are strictly positive. Zero modes,
which usually correspond to conserved momenta associated with symmetries (displacement
of the entire molecule, in this case) can usually be spotted and the relevant coordinates
treated separately.
Simultaneous Diagonalisation of the Matrices T and V : The matrix T is positive
definite. We now show that a coordinate transformation exists that diagonalises both T and
V (both symmetric) simultaneously. The argument is formal but quite powerful.
1. We apply an orthogonal transformation C to T and V , such that T becomes diagonal:
T 0 = CT T C
V 0 = CT V C
where C T is the transpose of C (recall CijT Cji , and, since C is orthogonal, we also
have C T = C 1 ). This diagonalisation is always possible because T is symmetric. The
matrix T 0 is now
T1 .
.
.
. T
.
.
2
T0 =
.
. Tk .
.
.
. TD
4
K=
(T1 )1/2
.
.
.
.
(T2 )1/2
.
.
.
.
(Tk )1/2
.
.
.
.
(TD )1/2
This is possible so long as T is positive definite, as assumed. After this has been done,
we have
T 00 = K T C T T CK = I
V 00 = K T C T V CK
so that T has become the unit matrix, and V 00 is still symmetric (exercise: prove it).
3. Finally, apply a second orthogonal transformation D to diagonalise V . Since T 00 = I
is already the unit matrix, this is unaffected and we have T 000 = I also, whereas
V 000 =
1 .
.
.
. 2 .
.
.
. k .
.
.
. D
V 000 = M T V M
with qi = Mij qj
which is not an orthogonal transformation (due to the second step) but is invertible.
Hence the set {
q } are an acceptable set of generalised coordinates: no information
has been lost during the transformations. (This is the proof promised earlier of the
completeness of such a set.)
Normal Coordinates: In terms of the new set of transformed qs (we now drop the tilde),
we have Tij = ij and V is diagonal with eigenvalues (k) . Hence the Lagrangian reads
X 1
1
1
1
L = qi Tij qj qi Vij qj =
qi2 i qi2
2
2
2
2
i
where in the last form, and from now on, the summation convention is abandoned. This is
called the normal form and the transformed coordinates are normal coordinates. Lagranges
equations read simply
qi = i qi
i = 1, , D
with general solution
qi (t) = Ai cos(i t + i )
1/2
where i = i , and Ai and i are integration constants (fixed by the initial conditions).
There is no summation over modes left to do here: each normal coordinate executes pure
SHM independent of the others. All of the effort in solving for the motion is thereby moved
5
formally into the task of finding the transformation laws between these normal coordinates
and some natural ones (ie, whatever set we would like to have the answer in terms of).
In practice, one does not usually go through this formal sequence of transformations separately for each problem encountered. The transformation between natural ({q}) and normal
({
q }) coordinates boils down to qi = Mij qj , where (as would be expected from examples of
single rather than double diagonalisation) M is the matrix of eigenvectors with components
Mij :
(j)
Mij = ei
Hence the task of finding the normal coordinates and the normal form of L is neither
more, nor less, than finding the eigenvalues and eigenvectors of (Vij Tij ) aj = 0 as was
done for the triatomic molecule above.