IFEM Ch10
IFEM Ch10
IFEM Ch10
MultiFreedom
Constraints II
101
102
TABLE OF CONTENTS
Page
102
.
. .
.
. .
.
. .
. .
.
. .
.
. .
.
. .
. .
. .
.
.
. .
.
. .
.
. .
. .
. .
. .
. .
. .
.
. .
.
. .
.
. .
. .
. .
. .
. . . . . .
. . . . . .
103
103
104
105
105
106
107
108
108
109
1010
1010
1011
1011
1012
1012
1013
103
10.1
In this Chapter we continue the discussion of methods to treat multifreedom constraints (MFCs). The
master-slave method described in the previous Chapter was found to exhibit serious shortcomings
for treating arbitrary constraints, although the method has important applications to model reduction.
We now pass to the study of two other methods: penalty augmentation and Lagrange multiplier
adjunction. Both of these techniques are better suited to general implementations of the Finite
Element Method.
10.1. THE PENALTY METHOD
u 1, f1
u 2, f2
u 3, f3
u 4, f4
u 5, f5
u 6, f6
(1)
(2)
(3)
(4)
(5)
(6)
u 7, f7
x
7
u 2, f2
u 3, f3
u 4, f4
u 5, f5
u 6, f6
(1)
(2)
(3)
(4)
(5)
(6)
u 7, f7
x
7
(7)
Such an element, albeit ctitious, can be treated exactly like another bar element insofar as continuing the assembly of the master stiffness equations. The penalty element stiffness equations,
K(7) u(7) = f(7) , are1
(7)
f2
1 1
u2
w
=
(10.1)
u6
1
1
f 6(7)
1
103
104
Because there is one freedom per node, the two local element freedoms map into global freedoms 2
and 6, respectively. Using the assembly rules of Chapter 3 we obtain the following modied master
u = f, which shown in detail are
stiffness equations: K
K 11
K 12
0
0
K 12
K 22 + w
K 23
0
0
w
0
0
K 23
K 33
K 34
0
0
0
0
0
K 34
K 44
K 45
0
0
0
0
0
K 45
K 55
K 56
0
0
w
0
0
K 56
K 66 + w
K 67
0
u1
0 u2
0 u3
0 u4 =
0 u5
K 67
u6
K 77
u7
f1
f2
f3
f4 ,
f5
f6
f7
(10.2)
This system can now be submitted to the equation solver. Note that u u, and only K has changed.
10.1.2. Choosing the Penalty Weight
What happens when (10.2) is solved numerically? If a nite weight w is chosen the constraint
u 2 = u 6 is approximately satised in the sense that one gets u 2 u 6 = eg , where eg = 0. The gap
error eg is called the constraint violation. The magnitude |eg | of this violation depends on w: the
larger w, the smaller the violation. More precisely, it can be shown that |eg | becomes proportional
to 1/w as w gets to be sufciently large (see Exercises). For example, raising w from, say, 106
to 107 can be expected to cut the constraint violation by roughly 10 if the physical stiffnesses are
small compared to w.
Consequently, it seems as if the proper strategy should be: try to make w as large as possible while
respecting computer overow limits. However, this is misleading. As the penalty weight w tends
to the modied stiffness matrix in (10.2) becomes more and more ill-conditioned with respect
to inversion.
To make this point clear, suppose for deniteness that the rigidities E (e) A(e) /L (e) of the actual bars
e = 1, . . . 6 are unity, that w >> 1, and that the computer solving the stiffness equations has a
oating-point precision of 16 decimal places. Numerical analysts characterize such precision by
saying that f = O(1016 ), where | f | is the smallest power of 10 that perceptibly adds to 1 in
oating-point arithmetic.2 The modied stiffness matrix of (10.2) becomes
1
1
K= 0
0
0
1
2+w
1
0
0
w
0
0
1
2
1
0
0
0
0
0
1
2
1
0
0
0
0
0
1
2
1
0
0
w
0
0
1
2+w
1
0
0
1
1
(10.3)
Clearly as w rows 2 and 6, as well as columns 2 and 6, tend to become linearly dependent;
approaches
in fact the negative of each other. Linear dependence means singularity; hence K
2
Such denitions are more rigurously done by working with binary numbers and base-2 arithmetic but for the present
conceptual discussion the use of decimal powers is sufcient.
104
105
10.1
The name square root arises because the recommended w is in fact 10k 10 p . It is seen that
the choice of penalty weight by this rule involves knowledge of both stiffness magnitudes and
oating-point hardware properties of the computer used as well as the precision selected by the
program.
10.1.4. Penalty Elements for General MFCs
For the constraint u 2 = u 6 the physical interpretation of the penalty element is clear. Points 2 and
6 must move in lockstep along x, which can be approximately enforced by the heavy bar device
shown in Figure 10.2. But how about 3u 3 + u 5 4u 6 = 1? Or just u 2 = u 6 ?
The treatment of more general constraints is linked to the theory of Courant penalty functions, which
in turn is a topic in variational calculus. Because the necessary theory has not yet been introduced
(it is given in the next subsection), the procedure used for constructing a penalty element is stated
here as a recipe. Consider the homogeneous constraint
3u 3 + u 5 4u 6 = 0.
(10.4)
[3
u3
4 ] u 5
u6
= 0,
(10.5)
Such order-of-magnitude estimates can be readily found by scanning the diagonal of K because the largest stiffness
coefcient of the actual structure is usually a diagonal entry.
If overows occurs, the master stiffness should be scaled throughout or a better choice of physical units made.
105
106
3
1 [3
4
u3
4 ] u 5
u6
9
3
12
3
1
4
12
4
16
u3
u5
u6
(e) u(e) = 0.
=K
(10.6)
(e)
is the unscaled stiffness matrix of the penalty element. This is now multiplied by the
Here K
penalty weight w and assembled into the master stiffness matrix following the usual rules. For the
example problem, augmenting (10.1) with the scaled penalty element (10.6) yields
K 11
K 12
0
0
K 12
K 22
K 23
0
0
0
0
0
K 23
K 33 + 9w
K 34
3w
12w
0
0
0
K 34
K 44
K 45
0
0
0
0
3w
K 45
K 55 + w
K 56 4w
0
0
0
12w
0
K 56 4w
K 66 + 16w
K 67
0
u1
0 u2
0 u3
0 u4 =
0 u5
K 67
u6
K 77
u7
f1
f2
f3
f4 ,
f5
f6
f7
(10.7)
If the constraint is nonhomogeneous the force vector is also modied. To illustrate this effect,
consider the MFC: 3u 3 + u 5 4u 6 = 1. Rewrite in matrix form as
[3
u3
4 ] u 5
u6
= 1.
(10.8)
9
3
12
3
1
4
12
4
16
u3
u5
u6
3
1 .
4
(10.9)
K 11
K 12
0
0
K 12
K 22
K 23
0
0
0
0
0
K 23
K 33 + 9w
K 34
3w
12w
0
0
0
K 34
K 44
K 45
0
0
0
0
3w
K 45
K 55 + w
K 56 4w
0
0
0
12w
0
K 56 4w
K 66 + 16w
K 67
u1
f1
0
0 u2
f2
0 u 3 f 3 + 3w
f4 ,
0 u4 =
0 u 5 f5 + w
f 6 4w
K 67
u6
K 77
u7
f7
(10.10)
p = 1, . . . m
106
(10.11)
107
10.1
where u contains all degrees of freedom and each a p is a row vector with same length as u. To incorporate
the MFCs into the FEM model one selects a weight w p > 0 for each constraints and constructs the so-called
Courant quadratic penalty function or penalty energy
P=
Pp ,
with
Pp = u T
1
aT a u w p aTp b p = 12 uT K( p) u uT f( p) ,
2 p p
(10.12)
p=1
where we have called K( p) = w p aTp a p and f( p) = w p aT bi . P is added to the potential energy function
= 12 uT Ku uT f to form the augmented potential energy a = + P. Minimization of a with respect
to u yields
Ku +
K( p) u = f +
p=1
f( p) .
(10.13)
p=1
Each term of the sum on p, which derives from term Pp in (10.12), may be viewed as contributed by a penalty
element with globalized stiffness matrix K( p) = w p aTp a p and globalized added force term f( p) = w p aTp b p .
To use a even more compact form we may write the set of multifreedom constraints as Au = b. Then the
penalty augmented system can be written compactly as
(K + AT WA) u = f + WAT b,
(10.14)
where W is a diagonal matrix of penalty weights. This compact form, however, conceals the structure of the
penalty elements.
108
u 1, f1
u 2, f2
u 3, f3
u 4, f4
u 5, f5
u 6, f6
(1)
(2)
(3)
(4)
(5)
(6)
u 7, f7
x
7
experimentation, wasting the user time with numerical games that have no bearing on the actual
objective, which is getting a solution.
The deterioration of the condition number of the penalty-augmented stiffness matrix can have a
serious side effects in some solution procedures such as eigenvalue extraction or iterative solvers.
Finally, even if optimal weights are selected, the combined solution error cannot be lowered beyond
a threshold value.
From these comments it is evident that penalty augmentation, although superior to the master-slave
method from the standpoint of generality and ease of implementation, is no panacea.
10.2. LAGRANGE MULTIPLIER ADJUNCTION
10.2.1. Physical Interpretation
As in the case of the penalty function method, the method of Lagrange multipliers can be given a
rigorous justication within the framework of variational calculus. But in the same spirit it will be
introduced for the example structure from a physical standpoint that is particularly illuminating.
Consider again the constraint u 2 = u 6 . Borrowing some ideas from the penalty method, imagine
that nodes 2 and 6 are connected now by a rigid link rather than a exible one. Thus the constraint
is imposed exactly. But of course the penalty method with an innite weight would blow up.
We may remove the link if it is replaced by an appropriate reaction force pair (, +), as illustrated
in Figure 10.3. These are called the constraint forces. Incorporating these forces into the original
stiffness equations (9.10) we get
K 11
K 12
0
0
K 12
K 22
K 23
0
0
0
0
0
K 23
K 33
K 34
0
0
0
0
0
K 34
K 44
K 45
0
0
0
0
0
K 45
K 55
K 56
0
0
0
0
0
K 56
K 66
K 67
0
u1
f1
0 u 2 f2
0 u 3 f3
0 u 4 = f4 .
0 u 5 f5
K 67
u6
f6 +
K 77
u7
f7
(10.15)
This is called a Lagrange multiplier. Because is an unknown, let us transfer it to the left hand
108
109
10.2
K 11
K 12
0
0
K 12
K 22
K 23
0
0
0
0
0
K 23
K 33
K 34
0
0
0
0
0
K 34
K 44
K 45
0
0
0
0
0
K 45
K 55
K 56
0
0
0
0
0
K 56
K 66
K 67
0
0
0
0
0
K 67
K 77
u1
0
u
1 2
u3
0
u
0 4 =
u
0 5
u
1 6
u7
0
f1
f2
f3
f4 .
f5
f6
f7
(10.16)
But now we have 7 equations in 8 unknowns. To render the system determinate, the constraint
condition u 2 u 6 = 0 is appended as eighth equation:
K 11
K 12
0
0
K 12
K 22
K 23
0
0
0
0
1
0
K 23
K 33
K 34
0
0
0
0
0
0
K 34
K 44
K 45
0
0
0
0
0
0
K 45
K 55
K 56
0
0
0
0
0
0
K 56
K 66
K 67
1
0
0
0
0
0
K 67
K 77
0
u1
0
1 u2
0 u3
0 u4
=
0 u5
1 u 6
0
u7
f1
f2
f3
f4
,
f5
f6
f7
0
(10.17)
This is called the multiplier-augmented system. Its coefcient matrix, which is symmetric, is
called the bordered stiffness matrix. The process by which is appended to the vector of original
unknowns is called adjunction. Solving this system provides the desired solution for the degrees
of freedom while also characterizing the constraint forces through .
10.2.2. Lagrange Multipliers for General MFCs
The general procedure will be stated rst as a recipe. Suppose that we want to solve the example
structure subjected to three MFCs
u 2 u 6 = 0,
5u 2 8u 7 = 3,
3u 3 + u 5 4u 6 = 1,
(10.18)
K 11
K 12
0
0
K 12
K 22
K 23
0
0
0
0
1
5
0
0
K 23
K 33
K 34
0
0
0
0
0
3
0
0
K 34
K 44
K 45
0
0
0
0
0
0
0
0
K 45
K 55
K 56
0
0
0
1
0
0
0
0
K 56
K 66
K 67
1
0
4
109
0
0
u1
u2
0
u
0 3
u =
K 67 4
u
K 77 5
u6
u7
8
0
f1
f2
f3
f4
f5
,
f6
f7
3
1
(10.19)
1010
Three Lagrange multipliers: 1 , 2 and 3 , are required to take care of three MFCs. Adjoin those
unknowns to the nodal displacement vector. Symmetrize the coefcient matrix by adjoining 3
columns that are the transpose of the 3 last rows, and lling the bottom right-hand corner with
zeros:
K 11
K 12
0
0
K 12
K 22
K 23
0
0
0
0
1
5
0
0
K 23
K 33
K 34
0
0
0
0
0
3
0
0
K 34
K 44
K 45
0
0
0
0
0
0
0
0
K 45
K 55
K 56
0
0
0
1
0
0
0
0
K 56
K 66
K 67
1
0
4
0
0
0
0
0
K 67
K 77
0
8
0
0
1
0
0
0
1
0
0
0
0
u1
0
0
5
0 u2
0
3 u3
0
0 u4
0
1 u5
=
0 4 u 6
8 0 u 7
0
0 1
2
0
0
3
0
0
f1
f2
f3
f4
f5
,
f6
f7
3
1
(10.20)
(10.21)
K
A
AT
0
u
f
=
(10.22)
The master stiffness matrix K in (10.22) is said to be bordered with A and AT . Solving this system provides
u and . The latter can be interpreted as forces of constraint in the following sense: a removed constraint can
be replaced by a system of forces characterized by multiplied by the constraint coefcients. More precisely,
the constraint forces are AT .
1011
10.4
SUMMARY
On the whole the Lagrangian multiplier method appear to be the most elegant for a general-purpose
nite element program that is supposed to work as a black box by minimizing guesses and
choices from its users. Its implementation, however, is not simple. Special care must be exercised
to detect singularities due to constraint dependency and to account for the effect of loss of positive
deniteness of the bordered stiffness on equation solvers.
10.3. *THE AUGMENTED LAGRANGIAN METHOD
The general matrix forms of the penalty function and Lagrangian multiplier methods are given by expressions
(10.14) and (10.22), respectively. A useful connection between these methods can be established as follows.
Because the lower diagonal block of the bordered stiffness matrix in (10.22) is null, it is not possible to directly
eliminate . To allow that, replace this block by S1 , where S is a constraint-scaling diagonal matrix of
appropriate order and is a small number. The reciprocal of is a large number called w = 1/ . To maintain
exactness of the second equation, S1 is added to the right-hand side:
K AT
A S1
u
f
=
S1 P
(10.23)
Here superscript P (for predicted value) is attached to the on the right-hand side as a tracer. We can
now formally solve for and subsequently for u. The results may be presented as
(K + wAT SA) u = f + wAT Sb AT P ,
= P + wS(b Au),
(10.24)
(10.25)
On taking W = wS, the general matrix equation (10.14) of the penalty method is recovered.
This relation suggests the construction of iterative procedures in which one tries to improve the accuracy
of the penalty function method while w is kept constant.5 This strategy circumvents the aforementioned illconditioning problems when the weight w is gradually increased. One such method is easily constructed by
inspecting (10.24). Using superscript k as an iteration index and keeping w xed, solve equations (10.24) in
tandem as follows:
(K + AT WA) uk = f + AT Wb AT k ,
(10.26)
k+1 = k + W(b Auk ),
for k = 0, 1, . . . , beginning with 0 = 0.6 Then u0 is the penalty solution. If the process converges one
recovers the exact Lagrangian solution without having to solve the Lagrangian system (10.23) directly.
The family of iterative procedures that may be precipitated from (10.24) collectively pertains to the class
of augmented Lagrangian methods. These have received much attention since the late 1960s, when they
originated in the eld of constrained optimization.
5
C. A. Felippa, Iterative procedures for improving penalty function solutions of algebraic systems, Int. J. Numer. Meth.
Engrg., 12, 821836, 1978.
This form of the stiffness equations is discussed in C. A. Felippa, Iterative procedures for improving penalty function
solutions of algebraic systems, Int. J. Numer. Meth. Engrg., 12, 821836, 1978.
1011
1012
Master-Slave
Elimination
Penalty
Function
Lagrange
Multiplier
fair
excellent
excellent
poor to fair
good
fair
high
high
Generality
Ease of implementation
Sensitivity to user decisions
Accuracy
small to none
variable
mediocre
excellent
Sensitivity as regards
constraint dependence
high
none
high
yes
yes
no
yes
no
yes
10.4. SUMMARY
The treatment of linear MFCs in nite element systems can be carried out by several methods. Three
of these: the master-slave elimination, penalty augmentation and Lagrange multiplier adjunction,
have been discussed. It is emphasized that no method is uniformly satisfactory in terms of generality,
numerical behavior and simplicity of implementation. See Figure 10.4 for a summary.
For a general purpose program that tries to approach black box behavior (that is, minimal decisions
on the part of users) the method of Lagrange multipliers has the edge. This edge is unfortunately
blunted by a fairly complex computer implementation and by the loss of positive deniteness in the
bordered stiffness matrix.
10.5. NOTES AND BIBLIOGRAPHY
References
1012
1013
Exercises
[C+N:20] This is identical to Exercise 9.1, except that the MFC u 2 u 6 = 1/5 is to be treated by the penalty
function method. Take the weight w to be 10k , in which k varies as k = 3, 4, 5, . . . 16. For each sample
w compute the Euclidean-norm solution error e(w) = ||u p (w) uex ||2 , where u p is the computed solution
and uex is the exact solution listed in (E9.1). Plot k = log10 w versus log10 e and report for which weight e
attains a minimum. (See Slide #5 for a check). Does it roughly agree with the square root rule (10.1.3) if the
computations carry 16 digits of precision?
As in Exercise 9.1, use Mathematica, Matlab (or similar) to do the algebra. For example, the following
Mathematica script solves this Exercise:
(* Exercise 10.1 - Penalty Method *)
(* MFC: u2-u6=1/5 variable w *)
K=MasterStiffnessOfSixElementBar[100];
Print["Stiffness K=",K//MatrixForm];
f={1,2,3,4,5,6,7}; Print["Applied forces=",f];
uexact= {0,0.27,0.275,0.25,0.185,0.07,0.14}; ew={};
For [w=100, w<=10^16, w=10*w; (* increase w by 10 every pass *)
Khat=K; fhat=f;
Khat[[2,2]]+=w; Khat[[6,6]]+=w; Khat[[6,2]]=Khat[[2,6]]-=w;
fhat[[2]]+=(1/5)*w; fhat[[6]]-=(1/5)*w; (*insert penalty *)
{Kmod,fmod}=FixLeftEndOfSixElementBar[Khat,fhat];
u=LinearSolve[N[Kmod],N[fmod]];
Print["Weight w=",N[w]//ScientificForm," u=",u//InputForm];
e=Sqrt[(u-uexact).(u-uexact)];
(*Print["L2 solution error=",e//ScientificForm]; *)
AppendTo[ew,{Log[10,w],Log[10,e]}];
];
ListPlot[ew,AxesOrigin->{5,-8},Frame->True, PlotStyle->
{AbsolutePointSize[4],AbsoluteThickness[2],RGBColor[1,0,0]},
PlotJoined->True,AxesLabel->{"Log10(w)","Log10(u error)"}];
Here MasterStiffnessOfSixElementBar and FixLeftEndOfSixElementBar are the same modules listed
in Exercise 9.1.
Note: If you run the above program, you may get several beeps from Mathematica as it is processing some of
the systems with very large weights. Dont be alarmed: those are only warnings. The LinearSolve function
for weights of order 1012 or bigger are ill-conditioned.
is alerting you that the coefcient matrices K
EXERCISE 10.2
[C+N:15] Again identical to Exercise 9.1, except that the MFC u 2 u 6 = 1/5 is to be treated by the Lagrange
multiplier method. The results for the computed u and the recovered force vector Ku should agree with (E9.1).
Use Mathematica, Matlab (or similar) to do the algebra. For example, the following Mathematica script solves
this Exercise:
(* Exercise 10.2 - Lagrange Multiplier Method *)
(* MFC: u2-u6=1/5 *)
K=MasterStiffnessOfSixElementBar[100];
Khat=Table[0,{8},{8}]; f={1,2,3,4,5,6,7}; fhat=AppendTo[f,0];
1013
1014
[A:10] For the example structure, show which penalty elements would implement the following MFCs:
(a)
u 2 + u 6 = 0,
(b)
u 2 3u 6 = 1/3.
(E10.1)
As answer, show the stiffness equations of those two elements in a manner similar to (10.1).
EXERCISE 10.4
[A/C+N:15+15+10] Suppose that the assembled stiffness equations for a one-dimensional nite element
model before imposing constraints are
2
1
0
1
2
1
0
1
2
u1
u2
u3
1
0
2
(E10.2)
(E10.3)
(a)
Impose the constraint (E10.3) by the master-slave method taking u 1 as master, and solve the resulting
2 2 system of equations by hand.
(b)
Impose the constraint (E10.3) by the penalty function method, leaving the weight w as a free parameter.
Solve the equations by hand or CAS (Cramers rule is recommended) and verify analytically that as
w the solution approaches that found in (a). Tabulate the values of u 1 , u 2 , u 3 for w = 0, 1, 10, 100.
Hint 1: the value of u 2 should not change. Hint 2: the solution for u 1 should be (6w + 5)/(4w + 4).
(c)
Impose the constraint (E10.3) by the Lagrange multiplier method. Show the 4 4 multiplier-augmented
system of equations analogous to (10.13) and solve it by computer or calculator.
1014
1015
beam-column
member
;
;
;
E, A, Izz
;
;;;
45
Exercises
EXERCISE 10.5
[A/C:10+15+10] The beam-column member shown in Figure E10.1 rests on a skew roller that forms a 45
angle with the horizontal axis x, and is loaded axially by a force P.
The nite element equations upon removing the xed right end, but before imposing the skew-roller MFC, are
E A/L
0
0
0
12E Izz /L 3
6E Izz /L 2
0
6E Izz /L 2
4E Izz /L
u x1
u y1
1
P
0
0
(E10.4)
where E, A, and Izz are member properties, 1 is the left end rotation, and L is the member length.7 To simplify
the calculations set P = E A, and Izz = AL 2 , in which and are dimensionless parameters, and express
the following solutions in terms of and .
(a)
Apply the skew roller constraint by the master-slave method (make u y1 slave) and solve for u x1 and 1
in terms of L, and . This may be done by hand or a CAS. Partial solution: u x1 = L/(1 + 3).
(b)
Apply the skew roller constraint by the penalty method by adjoining a penalty truss member of axial
stiffness k = wE A normal to the roller, and compute u x1 (Cramers rule is recommended if solved
by hand). Verify that as w the answer obtained in (a) is recovered. Partial solution: u x1 =
L(3 + wL)/(3 + wL(1 + 3)).
(c)
Apply the skew roller constraint by Lagrangian multiplier adjunction, and solve the resulting 44 system
of equations using a CAS. Verify that you get the same solution as in (a).
EXERCISE 10.6
[A:30] Show that the master-slave transformation method u = Tu can be written down as a special form of
the method of Lagrange multipliers. Start from the augmented functional
M S = 12 uT Ku uT f + T (u Tu)
(E10.5)
and write down the stationarity conditions of M S with respect to u, and u in matrix form.
EXERCISE 10.7
[A:35] Check the matrix equations (10.23) through (10.26) quoted for the Augmented Lagrangian method.
The stiffness equations for a beam column are derived in Part III of this book. For now consider (E10.4) as a recipe.
1015
1016
EXERCISE 10.8
[A:40] (Advanced, close to a research problem) Show that the master-slave transformation method u = Tu can
be expressed as a limit of the penalty function method as the weights go to innity. Start from the augmented
functional
T (u Tu)
(E10.6)
P = 12 uT Ku uT f + 12 w(u Tu)
Write down the matrix stationarity conditions with respect to to u u and eliminate u. Hint: using Woodburys
formula (Appendix C, C.5.2)
show that
(K + wTT ST)1 = K1 K1 TT (K + w1 S1 )1 T K1 .
(E10.7)
K = TK1 TT
(E10.8)
1016