03a PDF
03a PDF
Constraints
Often times we consider dynamical systems which are defined using some kind of
restrictions on the motion. For example, the spherical pendulum can be defined as a
particle moving in 3-d such that its distance from a given point is fixed. Thus the true
configuration space is defined by giving a simpler (usually bigger) configuration space along
with some constraints which restrict the motion to some subspace. Constraints provide a
phenomenological way to account for a variety of interactions between systems. We now
give a systematic treatment of this idea and show how to handle it using the Lagrangian
formalism.
For simplicity we will only consider holonomic constraints, which are restrictions which
can be expressed in the form of the vanishing of some set of functions – the constraints –
on the configuration space and time:
Cα (q, t) = 0, α = 1, 2, . . . m.
We assume these functions are smooth and independent so that if there are n coordinates
q i , then at each time t the constraints restrict the motion to a nice n − m dimensional
space. For example, the spherical pendulum has a single constraint on the three Cartesian
configruation variables (x, y, z):
C(x, y, z) = x2 + y 2 + z 2 − l2 = 0.
This constraint restricts the configuration to a two dimensional sphere of radius l centered
at the origin. To see another example of such constraints, see our previous discussion of
the double pendulum and pendulum with moving point of support.
We note that the constraints will restrict the velocities:
d ∂Cα i ∂Cα
Cα = q̇ + = 0.
dt ∂q i ∂t
For example in the spherical pendulum we have
xẋ + y ẏ + z ż = 0.
There are two ways to deal with such constraints. Firstly, one can simply solve the
constraints, i.e., find an independent set of generalized coordinates. We have been doing
1
Constraints and Lagrange Multipliers.
this all along in our examples with constraints. For the spherical pendulum, we solve the
constraint by
x = l sin θ cos φ, y = l sin θ sin φ, z = l cos θ,
and express everything in terms of θ and φ, in particular the Lagrangian and EL equations.
In principle this can always be done, but in practice this might be difficult. There is another
method in which one can find the equations of motion without having to explicitly solve
the constraints. This is known as the method of Lagrange multipliers. This method is
not just popular in mechanics, but also features in “constrained optimization” problems,
e.g., in economics. As we shall see, the Lagrange multiplier method is more than just an
alternative approach to constraints – it provides additional physical information about the
forces which maintain the constraints.
Lagrange Multipliers
Evidently, at the critical point the gradient of f is orthogonal to the curve C(x, y) = 0.
Now, any vector orthogonal to the curve – orthogonal to ~t at (x0 , y0 ) – will be proportional
to the gradient of C at that point.* Thus the condition for a critical point (x0 , y0 ) of f
* This follows from the basic calculus result that the gradient of a function is orthogonal to
the locus of points where the function takes a constant value.
2
Constraints and Lagrange Multipliers.
(where C(x0 , y0 ) = 0) is that the gradient of f and the gradient of C are proportional at
(x0 , y0 ). We write
∇f + λ∇C = 0, where C(x0 , y0 ) = 0
for some λ. This requirement is meant to hold only on the curve C = 0, so without loss of
generality we can take as the critical point condition
∇(f + λC) = 0, C = 0.
This constitutes three conditions on 3 unknowns; the unknowns being (x, y) and λ. The
function λ is known as a Lagrange multiplier. In fact, if we artificially enlarge our x-y plane
to a 3-d space parametrized by (x, y, λ) we can replace the above critical point condition
with
˜ + λC) = 0,
∇(f
˜ is the gradient in (x, y, λ) space. You should prove this as an exercise.
where ∇
To summarize: the critical points (x0 , y0 ) of a function f (x, y) constrained to a curve
C(x, y) = 0 can be obtained by finding unconstrained critical points (x0 , y0 , λ0 ) of a
function in the space of variables (x, y, λ):
We can do the same thing with our variational principle. Suppose we have an action
for n degrees of freedom q i , i = 1, 2, . . . , n:
Z t2
S[q] = dt L(q(t), q̇(t), t)
t1
Cα (q, t) = 0, α = 1, 2, . . . , m.
to study the effect of a point transformation on the EL equations (principally the chain
rule), it is not hard to see that the correct equations of motion are then
∂F i ∂L
d ∂L
− = 0.
∂sA ∂q i dt ∂ q̇ i q=F (s,t)
i
∂F have the geometric meaning of (a basis of) (n − m) tangent
We note that the functions ∂sA
vectors to the (n − m)-dimensional surface Cα = 0. Thus the equations of motion are the
statement that the projections of the EL equations along the surface must vanish.
Now we introduce the Lagrange multiplier method. We consider a modified action,
Z t2 Z t2
S̃[q, λ] = dt L̃ = S[q] + dt λα (t)Cα (q(t), t),
t1 t1
Cα = 0,
One physically interprets this “force term” as the force required to keep the motion on
this surface.
It is easy to verify that these modified equations, (n + m) in number, are equivalent
the correct (n − m) equations obtained for sA earlier. Indeed, we have the m equations
of constraint. And, given this constraint, to say the EL expression coming from L is
orthogonal to Cα = 0 is the same as saying its projection tangent to the surface vanishes,
i.e.,
∂F i ∂L ∂F i
d ∂L α ∂Cα
− = −λ = 0,
∂sA ∂q i dt ∂ q̇ i q=F (s,t) ∂sA ∂q i
which is precisely the content of the equations for the sA we obtained above.
C = x2 + y 2 − l2 = 0.
x2 + y 2 − l2 = 0,
The typical analysis of EL equations involving Lagrange multipliers can now be nicely
demonstrated. First, the three EL equations can be solved for λ (exercise)
m
λ= (xẍ + y ÿ + gy) .
2l2
Next, differentiation of the constraint twice reveals:
C̈ = 0 =⇒ xẍ + y ÿ = −(ẋ2 + ẏ 2 ),
so that the multiplier λ can be solved for in terms of the original velocity phase space
variables:
m
λ = − 2 (ẋ2 + ẏ 2 − gy).
2l
Substituting this result back into the EL equations for x and y we get the equations of
motion for x and y with the effect of the constraint — physically, the tension in the rod
— taken into account:
m m
mẍ = − 2 (ẋ2 + ẏ 2 − gy)x, mÿ = − 2 (ẋ2 + ẏ 2 − gy)y − mg.
l l
Note we never had to solve the constraint! Still, as a nice exercise you can check that,
after solving the constraint with x = l cos φ, y = −l sin φ, these remaining 2 equations are
equivalent the familiar equation of motion for a plane pendulum, namely,
g
φ̈ = − sin φ,
l
where φ is the angular displacement from equilibrium.
Using Lagrangian multipliers, the equations of motion for x and y tell us that the
pendulum moves according to a superposition of forces consisting of (i) gravity, (ii) the
force of constraint F~constraint needed to keep the mass moving in a circle of radius l.
This latter force is supplied by the Lagrange multiplier terms in the equation of motion.
Indeed, thanks to these Lagrange multiplier terms, the radial component of the net force
is (exercise)
~r ~ m
· F = − (ẋ2 + ẏ 2 ),
l l
which is the centripetal force, as it should be.
To summarize: Given a dynamical system with coordinates q i and Lagrangian L, we
can impose constraints Cα (q, t) = 0 by the following recipe.
(i) Add variables λα – the Lagrange multipliers – to the configuration space,
(ii) Define a Lagrangian on the augmented velocity phase space L̃ = L + λα Cα ,
(iii) Compute the usual EL equations from L̃ for the q i and λα degrees of freedom.
6
Constraints and Lagrange Multipliers.
The resulting equations will include the constraints themselves as equations of motion
coming from variations of λα . The equations coming from the variations of the q i will
have extra terms involving the multipliers. For Newtonian systems these terms represent
the forces in the system which are necessary to enforce the constraints. If desired, one can
use the equations of motion, the constraints, and the time derivatives of the constraints
to solve for the multipliers in terms of the velocity phase space. One can then reduce the
original equations to only be built from the original degrees of freedom.
Thus the Lagrange multiplier method has distinct advantages over our previous ap-
proach in which we just solve the constraints at the beginning.. Namely, you do not have
to explcitly solve the constraints in order to compute the equations of motion, and the
equations of motion have additional physical information: the forces of constraint.