Skip to article frontmatterSkip to article content
Site not loading correctly?

This may be due to an incorrect BASE_URL configuration. See the MyST Documentation for reference.

Constrained Lagrangian Systems

I advertised that one advantage of Lagrangian dynamics is the ability to easily incorporate constraints. I should have said, some constraints; there are some that Lagrangian mechanics cannot handle (just as the framework does poorly with friction). Still when it works, it is very powerful.

So we will open by describing different types of constraints, and then move to redoing the mead-on-a-wire example we covered in the first week of the course.

Holonomic and non-holonomic constraints

Holonomic constraints

These are constraints that can be simply written as a set of algebraic equations on the space of coordinates, that us,

fα(qI)=0f_{\alpha}(q^I) = 0

where I=1,DI = 1, \ldots D and α=1,,m<D\alpha = 1,\ldots, m < D. This is mm equations in DD unknowns. The solutions form an n=Dmn = D - m-dimensional space of solutions (under the right conditions, called a submanifold); while mm is called the codimension of this space. Roughly the codimension is the number of directions that is in some sense perpendicular to this space.

single constraint
  1. As a simple example, for a single particle in d=3d = 3, f1=z=0f_1 = z = 0 defines the xyx-y plane. Similarly, y=0y = 0 defines the xzx-z plane. In each case the dimension of the space of solutions is 2 and the codimension 1.

  2. In this language the xx axis satisfies f1=z=0f_1 = z = 0, f2=y=0f_2 = y = 0. The xx axis is the intersection of the xyxy and xzxz planes. The dimension of the space of solutions is 1 and the codimension 2.

  3. A more complicated example is x2+y2R2=0x^2 + y^2 - R^2 = 0 in D=2D = 2, which describes a circle with radius RR.

  4. The constraint equation f1=x2+y2+z2R2=0f_1 = x^2 + y^2 + z^2 - R^2 = 0 defines a sphere of radius RR in 3 dimensions. If we also include f2=zf_2 = z- \ell with <<R\ell < < R, then we have a circle parallel to the xyx-y plane, with radius R22\sqrt{R^2 - \ell^2}

We can parameterize the space of solutions with a set of coordinates sis^i, so that qI(si)q^I(s^i) lies on the space of solutions. This paramaterization may have to be local; for example, in the case of a sphere, we often use different parameters to describe the northern and southern hemisphere.

These will turn out to be simply dealth with in Lagrangian mechanics via Lagrange multiplier techniques.

Non-holonomic constraints.

This is basically every other type of constraint and there are several ways a constraint can fail to be holonomic.

  1. Non-integrable constraint. Let is impose constraints locally on each point of the ambient space. That is, locally at each point q0Iq^I_0, we demand that motion away from this point by an infinitestimal δqI\delta q^I must satisfy

FI,α(q0I)δqI=0F_{I,\alpha}(q^I_0) \delta q^I = 0

Locally, in some infinitesimal neighborhood, this will look like a holonomic constraint. But there is no guarantee that this will hold globally. This failure will hold if FIF_I is not integrable, that is if we cannot write it as

FI,α(q0I)=fαqIqI=q0IF_{I,\alpha}(q^I_0) = \frac{\del f_{\alpha}}{\del q^I} \Big|_{q^I = q^I_0}

wuth fα(q0I)=0f_{\alpha}(q^I_0) = 0 (which we can achieve by a constant shift of ff). If we can satisfy (3), then (2) becomes

fα(q0I)+δqI)=fα(q0I)+fαqI(q0I)δqI+O(δq2)fα(q0I)FI,αδqI+O(δq2)\begin{align} f_{\alpha}(q^I_0) + \delta q^I) & = f_{\alpha}(q^I_0) + \frac{\del f_{\alpha}}{\del q^I}(q^I_0) \delta q^I + {\cal O}(\delta q^2) \\ & \equiv f_{\alpha}(q^I_0) F_{I,\alpha} \delta q^I + {\cal O}(\delta q^2) \end{align}

An example of a non-holonomic constraint can be found in pp 15-16 of Goldstein et al., 2002. Consider a vertical disc with radius aa, rolling without slipping or tilting on the xyx-y plane, as shown in this figure:

nonholonomic example

The configuration space is described by the contact point of the disc on the x-y plane with coordinates (x,y)(x,y); the angle θ\theta between the xx-axis direction and perpendicular to the disc; and the angle ϕ\phi describing the angle of a reference point on the disk’s edge, with respect to the zz axis.

The total speed of the disc is aϕ˙a {\dot \phi}; this translates to velocities:

x˙=aϕ˙sinθy˙=aϕ˙cosθ\begin{align} \dot{x} & = a \dot{\phi} \sin\theta\\ \dot{y} & = - a \dot{\phi} \cos\theta \end{align}

For an infinitesimal motion in time, δqI=q˙Iδt\delta q^I = \dot{q}^I \delta t, where here qI=1,,4=(x,y,ϕ,θ)q^{I = 1,\ldots,4} = (x,y,\phi,\theta). This we can write the above as

FI,1δqI=δxasinθδϕFI,2δqI=δy+acosθδϕ\begin{align*} F_{I,1} \delta q^I & = \delta x - a \sin\theta \delta \phi\\ F_{I, 2} \delta q^I & = \delta y + a\cos\theta \delta \phi \end{align*}

Starting with the first equation, there are no functions f1f_1 for which f1x=1\frac{\del f_1}{\del x} = 1, f1y=0\frac{\del f_1}{\del y}= 0, f1ϕ=asinθ\frac{\del f_1}{\del \phi} = - a \sin\theta, f1θ=0\frac{\del f_1}{\del \theta} = 0. The last two equations make thisclear, as you can’t have the ϕ\phi derivative of f1f_1 be θ\theta-dependent while f1f_1 has a vanishing θ\theta-derivative. A more algorithic wat to detect whether Fi=fqIF_i = \frac{\del f}{\del q^I} for some ff is to note that if it is, then

2fqIqJ2fqJqI=0\frac{\del^2 f}{\del q^I \del q^J} - \frac{\del^2 f}{\del q^J \del q^I} = 0

since it doesn’t matter in which order you take partial derivatives. Thus, for the constraint (2) to be integrable, we must have

qIFJ,alphaqJFI,α=0\frac{\del}{\del q^I} F_{J,alpha} - \frac{\del}{\del q^J} F_{I,\alpha} = 0

In fact, one can argue that in a finite “star-shaped” region, Poincare’s theorem states that if this condition is met, a function fαf_{\alpha} must exist so that FI,α=IfαF_{I,\alpha} = \nabla_I f_{\alpha}.

  1. Inequality constraints. Consider, in two dimensions, a particle on a rectangular billiard table with length LxL_x in the xx direction and LyL_y in teh yy direction. The particle is subject to the constraints

0xLx ;  0yLy0 \leq x \leq L_x\ ; \ \ 0 \leq y \leq L_y
Billiard table

Clearly these cannot be written as the derivative of a function.

  1. Velocity-dependent constraints. If the constraints are of the form

FI,α(q)q˙I=0F_{I,\alpha}(q) \dot{q}^I = 0

and FI,αqJFJ,αqI=0\frac{\del F_{I,\alpha}}{\del q^J} - \frac{\del F_{J,\alpha}}{\del q^I} = 0 then (under appropriate conditions on the region of interest) by writing detqI=δqIδt\det q^I = \delta q^I \delta t, we can convert these to holonomic constraints. If FI,αF_{I,\alpha} is not so constrained, the constraints are nonholonomic. Also, if FI,αF_{I,\alpha} containsadditional dependence on q˙I\dot{q}^I, the constraints are nonholonomic.

Imposing holonomic constraints

Example: a bead on a wire

We can impose the constraints in teh Lagrangian by adding an additional set of coordinates λα\lambda_{\alpha} known as Lagrange multiplies, so that the full Lagrangian is:

L=L0(qI,q˙I)+λαfα(qI)L = L_0(q^I,\dot{q}^I) + \lambda_{\alpha} f^{\alpha}(q^I)

where L0L_0 captures the kinetic energy of the particle, and any forces that do not follow from the constraint. In the bead on a wire example, this would be the gravitational force.

Applying the Euler-Langrange equations, the fact that λ˙\dot{\lambda} dies not appear means that the associated generalized momentum pαLλ˙α=0p_{\alpha} \frac{\del L }{\del \dot{\lambda}_{\alpha}} = 0. The Euler-Lagrange equation then states:

Lλα=falpha=0\frac{\del L}{\del \lambda^{\alpha}} = f^{alpha} = 0

which is the constraint equation. Note however that we have a new generalized force:

QIconstraint=λαfαqIQ_I^{constraint} = - \lambda_{\alpha} \frac{\del f_{\alpha}}{\del q^I}

This force is a gradient of ff which means it acts perpendicularly to the level sets of ff which include the constraint surface. Thus, it is the normal force. λ\lambda can be set to ensure that Newton’s laws for qIq^I do not take one off of the constraint surface.

We can solve for this constraint by finding coordinates sis^i living on the constraint surface, so that fα(qI(si))=0f_{\alpha}(q^I(s^i)) = 0 for all sis^i. We can then write

L=L0(q(s),ddtq(s))λαfα(q(s))=L0(q(s),s˙iqIsi)L~(s,s˙)\begin{align} L & = L_0(q(s), \frac{d}{dt} q(s)) - \lambda_{\alpha}f_{\alpha}(q(s)) \\ & = L_0(q(s), \dot{s}^i \frac{\del q^I}{\del s^i})\\ & \equiv {\tilde L}(s, \dot{s}) \end{align}

We then simply apply the Euler-Lagrange equations to L~{\tilde L}. In fact a little work is needed to show that this gives the correct equations, equivalent to the full equations of motion for qIq^I with the normal force included. I will start this in the section below and assign the rest in the next problem set.

To see how this works, let us go back to the bead on the wire example, for which f=zz(x)f = z - z(x). In this case. We just need a single coordinate along the wire, which we take to be s=xs = x. Then

L~(x,x˙)=12mx˙2](1+(z(x))2)+mgz(x){\tilde L}(x, \dot{x}) = \half m \dot{x}^2]\left(1 + (z'(x))^2\right) + m g z(x)

The generalized momentum is

p=Lx˙=m(1+(z(x))2)x˙p = \frac{\del L}{\del \dot{x}} = m (1 + (z'(x))^2) \dot{x}

and the generalized force is:

F=L~x=mx˙2zz+mgz{\cal F} = \frac{\del {\tilde L}}{\del x} = m \dot{x}^2 z' z'' + m g z'

So p˙=F\dot{p} = {\cal F} gives us

m(1+(z(x))2)x¨+2mzzx˙2=mx˙2zz+mgzm (1 + (z'(x))^2) \ddot{x} + 2 m z'z'' \dot{x}^2 = m\dot{x}^2 z'z'' + m g z'

or

mx¨=mgzmzzx˙21+(z(x))2m \ddot{x} = \frac{m g z' - m z'z''\dot{x}^2}{1 + (z'(x))^2}

This is exactly what we got in our treatment via Newton’s laws where we computed the explicit normal force, only it was much easier! NB -- an earlier draft of that section had the wrong sign in front of the second term on the RHS of (19). I found that sign mistake by doing the correct calculation above.

Consistency of our solution for the constraints

The full equations of motion for a system constrained to a subspace of the configuration space by a normal force is:

p˙IFINI=0\dot{p}_I - F_I - N_I = 0

Iw we consider δqI\delta q^I to be an infinitesimal motion along the constraint surface. This means that NIδqI=0N_I \delta q^I = 0 since the normal force is always perpendicular to this surface, and we can write

(p˙IFI)δqI=0(\dot{p}_I - F_I) \delta q^I = 0

Now if sis^i parameterize the constraint surface, then for δqI\delta q^I along this surface we can write

(p˙IFI)qIsiδsi=0(\dot{p}_I - F_I) \frac{\del q^I}{\del s^i} \delta s^i = 0

Thuis should hold for any δsi\delta s^i along the constraining surface. The goal is to show that (p˙IFI)qIsi=0(\dot{p}_I - F_I) \frac{\del q^I}{\del s^i} = 0 is equivalent to the equations of motion derived from L~{\tilde L}. We will leave this as a problem for the student to solve.

References
  1. Goldstein, H., Poole, C. P., & Safko, J. L. (2002). Classical Mechanics. Addison Wesley.