Variational Methods in PDEs (Yan, 2008)
Variational Methods in PDEs (Yan, 2008)
by
Baisheng Yan
Department of Mathematics
Michigan State University
[email protected]
Contents
Course Description i
A. Motivating Examples i
B. Application to Physical Problems iv
C. Tentative Plans v
Chapter 1. Preliminaries 1
§1.1. Banach Spaces 1
§1.2. Bounded Linear Operators 5
§1.3. Weak Convergence and Compact Operators 7
§1.4. Spectral Theory for Compact Linear Operators 9
§1.5. Nonlinear Functional Analysis 11
Chapter 2. Sobolev Spaces 23
§2.1. Weak Derivatives and Sobolev Spaces 23
§2.2. Approximations and Extensions 27
§2.3. Sobolev Imbedding Theorems 39
§2.4. Equivalent Norms 47
Chapter 3. Existence Theory for Linear Problems 51
§3.1. Differential Equations in Divergence Form 51
§3.2. The Lax-Milgram Theorem 54
§3.3. Gårding’s Estimates and Existence Theory 55
§3.4. Symmetric Elliptic Operators 60
Chapter 4. Variational Methods for PDEs 65
§4.1. Variational Problems 65
§4.2. Multiple Integrals in the Calculus of Variations 66
§4.3. Direct Method for Minimization 70
§4.4. Minimization with Constraints 72
§4.5. Mountain Pass Theorem 86
iii
iv Contents
This course is intended to give an introduction to some important variational methods for
certain problems in partial differential equations (PDE) and applications. The course is
suitable for graduate students with some knowledge of partial differential equations. Since
this is an introduction course meant to convey important ideas of the methods, the rigorous
proof will be kept minimum; some details may be left to students as homework or class
presentation. Attendance, class participation and some extra reading of the references may
be necessary to truly learn the material.
A. Motivating Examples
Variational methods provide a solid basis for the existence theory of PDE and other applied
problems. We use some examples to introduce the main content of the course.
∆u = 0, u|∂Ω = f,
the infimum here is taken over all u in the given class. If this infimum is finite, then we
know that each component of {∇uj } is a bounded sequence in L2 (Ω). The weak convergence
i
ii Course Description
theorem implies that there exist subsequence {∇ujk } and G = (g1 , · · · , gn ) ∈ L2 (Ω) such
that
∇ujk * G weakly in L2 (Ω) and hence Ω |G|2 dx ≤ inf I(u).
R
Now the question is whether G renders a function in the given class; that is, does there
exist a function u in the given class such that G = ∇u? Since, in principle, G is only in
L2 (Ω), it is not clear whether such a u should exist or not. For this problem, the smoothness
of uj or even the fact that ∇uj converges strongly in L2 (Ω) to G (why?) would not help
much. The class in which we seek the minimizers plays an important role in guaranteeing
the existence of a minimizer. We need to have a larger class where I(u) is defined and a
minimizer can be found by this direct method. A minimizer in this larger class is only a
weak solution to the original Laplace equation. Is it smooth and a classical solution of the
Laplace equation? This is the regularity problem; we will not focus on such a problem in
this course. The extension of the admissible class leads use to the class of functions with
gradient in L2 (Ω) and such a class should be also complete (i.e., containing the limits of all
its Cauchy sequences). This motivates the study of Sobolev space H 1 (Ω) = W 1,2 (Ω) or the
general W m,p (Ω) spaces.
Note that f can also be assumed in the dual space H ∗ = H −1 (Ω); in this case, (f, v)L2 (Ω)
above is replaced by the pairing hf, vi between H ∗ and H. Note also that B[u, v] satisfies
the Gårding’s estimates (energy estimates):
|B[u, v]| ≤ αkukkvk, B[u, u] ≥ βkuk2 − γkuk2L2 (Ω) ,
where α > 0, β > 0 and γ ≥ 0 are constants (assuming aij , bi , c are L∞ (Ω) functions). The
Lax-Milgram theorem says that if γ = 0 then for any f ∈ H −1 (Ω) there exists a unique
u ∈ H such that B[u, v] = hf, vi for all v ∈ H. If B is also symmetric then the Lax-Milgram
theorem is simply the Riesz representation theorem. We will not consider the case
when γ > 0; the problem in this case involves the Fredholm alternative theorem.
A. Motivating Examples iii
Example 3 – Mountain Pass Method. Our second example is the use of critical point
theory to find a nontrivial solution to the semilinear elliptic equations of the following type:
∆u + |u|p−1 u = 0 in Ω, u = 0 on ∂Ω,
where 1 < p < n+2
n−2 (we will see why this special exponent appears here later). If we define
a functional on H = H01 (Ω) by
Z
1 2 1 p+1
I(u) = |∇u| − |u| dx,
Ω 2 p+1
then any critical point of I on H will be a weak solution of the above problem. It can be seen
that I does not have finite infimum or superimum on H. Although there is a method solving
this problem based on minimizing functional I on a manifold of H (i.e., via minimization
with constraints, to be discussed later in the class), to study the critical points of I, we use
the mountain pass method, which is a major contribution in the nonlinear functional
analysis.
The functional I can be proved a C 1 function on H with derivative I 0 : H → H being a
locally Lipschitz function. A critical property of I is that it satisfies the so-called Palais-
Smale condition: every sequence {uj } in H with {I(uj )} bounded and I 0 (uj ) → 0 in H
is precompact in H. Furthermore, there exist positive constants r, a such that
I(u) ≥ a if kuk = r,
and there exists an element v ∈ H with kvk > r and I(v) ≤ 0. Note that I(0) = 0.
Therefore, both inside and outside the mountain range kuk = r there are points where I
takes a smaller value than it takes on the mountain range. Let Γ be the set of all continuous
passes connecting the two lower points 0 and v. Let
c = inf max I(u).
g∈Γ u∈g
Then the mountain pass theorem says that c is a critical point of I; that is, there exists
u ∈ H such that I(u) = c, I 0 (u) = 0. Note that c ≥ a and hence u 6= 0.
Example 4 – Weak Lower Semicontinuity. The direct method also works for the
minimization problem for many other integral functionals of the type
Z
I(u) = F (x, u(x), Du(x)) dx,
Ω
where u may be even a vector valued function. An important question in this regard is when
the functional I is lower semi-continuous with respect to weak convergence of W 1,p (Ω); that
is, when we have
I(u) ≤ lim inf I(uj ) whenever uj * u in W 1,p (Ω).
j→∞
This will motivate the study of various notion of convexity conditions. In the scalar case,
the weak lower semicontinuity of I is equivalent to the convexity of function F (x, u, ξ)
on the variable ξ ∈ Rn . In the vectorial case (where, say, u : Ω → RN for some N ≥ 2), the
weak lower semicontinuity problem is a difficult problem and involves Morrey’s notion of
quasiconvexity, which will also be discussed in the course. But, a sufficient condition for the
semicontinuity will be given in terms of null-Lagrangians; this will be the polyconvexity.
The vectorial case is closely related to the problems in nonlinear elasticity, harmonic
maps, liquid crystals, and other physical problems.
iv Course Description
the constraint |n(x)| = 1 is lower-order in H 1 (Ω; S 2 ) and thus presents no problem when
using the direct method, but this nonconvex condition is the main obstacle for uniqueness
and regularity.
C. Tentative Plans
1. Preliminaries. I plan to start with some preliminary results in functional analysis
using some materials of a lecture note by Professor Dunninger; most of the materials can
be found in any textbook on real and functional analysis (also in Appendix D of Evans’
book, Partial Differential Equations, AMS Press).
2. Sobolev Spaces. I will do the Sobolev spaces using Evans’ book, Chapter 5, and some-
time refer to the new book Sobolev Spaces, Academic Press, by Adams and Fournier.
3. Variational Methods for PDEs. I will follow mainly Chapters 6 and 8 of Evans’
book on the variational methods discussed above, but will add some materials deemed to
be useful in understanding and developing the methods. If time permits, the maximum
principles and Sections 9.4.2 and 9.5.2 of Evans’ book may also be included.
4. Applications. I will briefly discuss the physical applied problems mentioned above
when a method can be applied to the problem. Some of the materials will come from sev-
eral research articles, including my own.
If time permits, I will discuss the nonconvex variational problems and relaxation prin-
ciple and the Young measure solutions to such problems; explicit example will be the
2-D micromagnetics problem.
Chapter 1
Preliminaries
1
2 1. Preliminaries
Then X is a Banach space with norm k · km,∞ . To prove, for example, the completeness
when m = 0, we let {fn } be a Cauchy sequence in X, i.e., assume for any ε > 0 there is a
number N (ε) such that for all x ∈ Ω
But this means that {fn (x)} is a uniformly Cauchy sequence of bounded continuous func-
tions, and thus converges uniformly to a bounded continuous function f (x). Letting m → ∞
in the above inequality shows that kfn − f km,∞ → 0.
Note that the same proof is valid for the set of bounded continuous scalar-valued func-
tions defined on a nonempty subset of a normed space X.
Example 1.2. Let Ω be a nonempty Lebesgue measurable set in Rn . For p ∈ [1, ∞), we
denote by Lp (Ω) the set of equivalence classes of Lebesgue measurable functions on Ω for
which
Z 1
p
p
kf kp ≡ |f (x)| dx < ∞.
Ω
(Two functions belong to the same equivalence class, i.e., are equivalent, if they differ
only on a set of measure 0.) Let L∞ (Ω) denote the set of equivalence classes of Lebesgue
measurable functions on Ω for which
Then Lp (Ω), 1 ≤ p ≤ ∞, are Banach spaces with norms k · kp . For p ∈ [1, ∞] we write
f ∈ Lploc (Ω) iff f ∈ Lp (K) for each compact set K ⊂ Ω.
For the sake of convenience, we will also consider Lp (Ω) as a set of functions. With this
convention in mind, we can assert that C0 (Ω) ⊂ Lp (Ω). In fact, if p ∈ [1, ∞), then as we shall
show later, C0 (Ω) is dense in Lp (Ω). The space Lp (Ω) is also separable if p ∈ [1, ∞). This
follows easily, when Ω is compact, from the last remark and the Weierstrass approximation
theorem.
Finally we recall that if p, q, r ∈ [1, ∞] with p−1 + q −1 = r−1 , then Hölder’s inequality
implies that if f ∈ Lp (Ω) and g ∈ Lq (Ω), then f g ∈ Lr (Ω) and
kf gkr ≤ kf kp kgkq .
Example 1.3. The Cartesian product X × Y , of two vector spaces X and Y , is itself a
vector space under the following operations of addition and scalar multiplication:
Moreover, under this norm, X × Y becomes a Banach space provided X and Y are Banach
spaces.
4 1. Preliminaries
1.1.4. Hilbert Spaces. Let H be a real vector space. H is said to be an inner product
space if to every pair of vectors x and y in H there corresponds a real-valued function
(x, y), called the inner product of x and y, such that
(a) (x, y) = (y, x) for all x, y ∈ H
(b) (x + y, z) = (x, z) + (y, z) for all x, y, z ∈ H
(c) (λx, y) = λ(x, y) for all x, y ∈ H, λ ∈ R
(d) (x, x) ≥ 0 for all x ∈ H, and (x, x) = 0 if and only if x = 0.
For x ∈ H we set
(1.1) kxk = (x, x)1/2 .
Theorem 1.4. If H is an inner product space, then for all x and y in H, it follows that
(a) |(x, y)| ≤ kxk kyk (Cauchy-Schwarz inequality);
(b) kx + yk ≤ kxk + kyk (Triangle inequality);
(c) kx + yk2 + kx − yk2 = 2(kxk2 + kyk2 ) (Parallelogram law).
Theorem 1.6. Every nonempty closed convex subset S of a Hilbert space H contains a
unique element of minimal norm.
Proof. Let S = {x − y : y ∈ M }. It is easy to see that S is convex and closed. Theorem 1.6
implies that there exists a y ∈ M such that kx − yk ≤ kx − wk for all w ∈ M . Let z = x − y.
For an arbitrary w ∈ M , w 6= 0, let α = (z, w)/kwk2 and note that
kzk2 ≤ kz − αwk2 = kzk2 − |(z, w)/kwk|2
which implies (z, w) = 0. Therefore z ∈ M ⊥ . If x = y 0 + z 0 for some y 0 ∈ M , z 0 ∈ M ⊥ , then
y 0 − y = z − z 0 ∈ M ∩ M ⊥ = {0}, which implies uniqueness.
Proof. It is easy to see that B(X, Y ) is a normed space. To prove completeness, assume
that {Tn } is a Cauchy sequence in B(X, Y ). Since
(1.5) kTn x − Tm xk ≤ kTn − Tm kkxk
we see that, for fixed x ∈ X, {Tn x} is a Cauchy sequence in Y and therefore we can define
a linear operator T by
T x = lim Tn x for all x ∈ X.
n→∞
If ε > 0, then the right side of (1.5) is smaller than εkxk provided that m and n are large
enough. Thus, (letting n → ∞)
kT x − Tm xk ≤ εkxk for all large enough m.
Hence, kT xk ≤ (kTm k + ε)kxk, which shows that T ∈ B(X, Y ). Moreover, kT − Tm k < ε
for all large enough m. Hence, limn→∞ Tn = T .
The following theorems are important in linear functional analysis; see, e.g., [3].
Theorem 1.9. (Banach-Steinhaus) Let X be a Banach space and Y a normed space. If
A ⊂ B(X, Y ) is such that supT ∈A kT xk < ∞ for each fixed x ∈ X, then supT ∈A kT k < ∞.
6 1. Preliminaries
Theorem 1.10. (Bounded Inverse) If X and Y are Banach spaces and if T ∈ B(X, Y )
is one-to-one and onto, then T −1 ∈ B(Y, X).
1.2.2. Dual Spaces and Reflexivity. When X is a (real) normed space, the Banach
space B(X, R) will be called the (normed) dual space of X and will be denoted by X*. El-
ements of X* are called bounded linear functionals or continuous linear functionals
on X. Frequently, we shall use the notation hf, xi to denote the value of f ∈ X* at x ∈ X.
Using this notation we note that |hf, xi| ≤ kf k kxk for all f ∈ X*, x ∈ X.
Example 1.11. Suppose 1 < p, q < ∞ satisfy 1/p + 1/q = 1 and let Ω be a nonempty
Lebesgue measurable set in Rn . Then Lp (Ω)∗ = Lq (Ω). The case of p = ∞ is different. The
dual of L∞ is much larger then L1 .
The dual space X ∗∗ of X ∗ is called the second dual space of X and is again a Banach
space. Note that to each x ∈ X we can associate a unique Fx ∈ X ∗∗ by Fx (f ) = hf, xi for
all f ∈ X ∗ . From Corollary 1.13, one can also show that kFx k = kxk. Thus, the (canonical)
mapping J : X → X**, given by Jx = Fx , is a linear isometry of X onto the subspace
J(X) of X ∗∗ . Since J is one-to-one, we can identify X with J(X).
A Banach space X is called reflexive if its canonical map J is onto X ∗∗ . For example,
all Lp spaces with 1 < p < ∞ are reflexive.
where y = f (z)z. To prove uniqueness, suppose (x, y) = (x, y 0 ) for all x ∈ H. Then in
particular, (y − y 0 , y − y 0 ) = 0, which implies y = y 0 . From the Cauchy-Schwarz inequality
we get |f (x)| ≤ kxkkyk, which yields kf k ≤ kyk. The reverse inequality follows by choosing
x = y in the representation.
Corollary 1.16. H is reflexive.
Proof. For any y ∈ H and all x ∈ H, set f (x) = (T x, y). Then it is easily seen that
f ∈ H ∗ . Hence by the Riesz representation theorem, there exists a unique z ∈ H such that
(T x, y) = (x, z) for all x ∈ H, i.e., D(T ∗ ) = H. Moreover, kT ∗ yk = kzk = kf k ≤ kT kkyk,
i.e., T ∗ ∈ B(H) and kT ∗ k ≤ kT k. The reverse inequality follows easily from kT xk2 =
(T x, T x) = (x, T ∗ T x) ≤ kT xkkT ∗ kkxk.
Proof. To prove (a), suppose that x and y are both weak limits of the sequence {xn } and
set z = x − y. Then hf, zi = 0 for every f ∈ X ∗ and by Corollary 1.13, z = 0. To prove (b),
let f ∈ X ∗ and note that xn → x implies hf, xn i → hf, xi since f is continuous. To prove (c),
assume xn * x and consider the sequence {Jxn } of elements of X ∗∗ , where J : X → X ∗∗
is the bounded operator defined above. For each f ∈ X ∗ , sup |Jxn (f )| = sup |hf, xn i| < ∞
(since hf, xn i converges). By the Banach-Steinhaus Theorem, there exists a constant c such
that kxn k = kJxn k ≤ c which implies {xn } is bounded. Finally, for f ∈ X ∗
|hf, xi| = lim |hf, xn i| ≤ lim inf kf kkxn k = kf k lim inf kxn k
which implies the desired inequality since kxk = supkf k=1 |hf, xi|.
We note that in a Hilbert space H, the Riesz representation theorem implies that xn * x
means (xn , y) → (x, y) for all y ∈ H. Moreover, we have
(xn , yn ) → (x, y) if xn * x, yn → y.
This follows from the estimate
|(x, y) − (xn , yn )| = |(x − xn , y) − (xn , yn − y)| ≤ |(x − xn , y)| + kxn kky − yn k
and the fact that kxn k is bounded.
8 1. Preliminaries
Remark. In other words, every bounded equicontinuous sequence of functions has a uni-
formly convergent subsequence.
Theorem 1.21. Let X and Y be Banach spaces. If Tn : X → Y are linear and compact
for n ≥ 1 and if limn→∞ kTn − T k = 0, then T is compact. Thus, compact operators form
a closed, but not a dense, subspace of B(X, Y ).
Proof. Let {xn } be a sequence in X with M = supn kxn k < ∞. Let A1 denote an infinite
set of integers such the sequence {T1 xn }n∈A1 converges. For k ≥ 2 let Ak ⊂ Ak−1 denote
an infinite set of integers such that the sequence {Tk xn }n∈Ak converges. Choose n1 ∈ A1
and nk ∈ Ak , nk > nk−1 for k ≥ 2. Choose ε > 0. Let k be such that kT − Tk kM < ε/4
and note that
kT xni − T xnj k ≤ k(T − Tk )(xni − xnj )k + kTk xni − Tk xnj k < ε/2 + kTk xni − Tk xnj k.
Since {Tk xni }∞ ∞
i=1 converges, {T xni }i=1 is a Cauchy sequence.
Theorem 1.22. Let X and Y be normed spaces.
(a) If T ∈ B(X, Y ), then T is weakly continuous, i.e.,
xn * x implies T xn * T x.
(b) If T : X → Y is weakly continuous and X is a reflexive Banach space, then T
is bounded.
(c) If T ∈ B(X, Y ) is compact, then T is strongly continuous, i.e.,
xn * x implies T xn → T x.
1.4. Spectral Theory for Compact Linear Operators 9
Proof. Let {xn } be a sequence in H satisfying kxn k ≤ m. The sequence {T *xn } is therefore
bounded, since T * is bounded. Since T is compact, by passing to a subsequence if necessary,
we may assume that the sequence {T T *xn } converges. But then
kT *(xn − xm )k2 = (xn − xm , T T *(xn − xm ))
≤ 2mkT T *(xn − xm )k → 0 as m, n → ∞.
Since H is complete, the sequence {T *xn } is convergent and hence T * is compact.
Let T : D(T ) ⊂ H → H be a linear operator on the real Hilbert space H. The set ρ(T )
of all scalars λ ∈ R for which (T − λI)−1 ∈ B(H) is called the resolvent set of T . The
operator R(λ) = (T − λI)−1 is known as the resolvent of T . σ(T ) = R \ ρ(T ) is called the
spectrum of T . It can be shown that ρ(T ) is an open set and σ(T ) is a closed set. The
set of λ ∈ R for which there exists a nonzero x ∈ N (T − λI) is called the point spectrum
of T and is denoted by σp (T ). The elements of σp (T ) are called the eigenvalues of T and
the nonzero members of N (T − λI) are called the eigenvectors (or eigenfunctions if X
is a function space) of T .
If T is compact and λ 6= 0, then by the Fredholm alternative, either λ ∈ σp (T ) or
λ ∈ ρ(T ). Moreover, if H is infinite-dimensional, then 0 6∈ ρ(T ); otherwise, T −1 ∈ B(H)
and T −1 T = I is compact. As a consequence, σ(T ) consists of the nonzero eigenvalues of T
together with the point 0. The next result shows that σp (T ) is either finite or a countably
infinite sequence tending to zero.
Theorem 1.25. Let T : X → X be a compact linear operator on the normed space X.
Then for each r > 0 there exist at most finitely many λ ∈ σp (T ) for which |λ| > r.
1.4.3. Symmetric Compact Operators. The next result implies that a symmetric com-
pact operator on a Hilbert space has at least one eigenvalue. On the other hand, an arbi-
trary bounded, linear, symmetric operator need not have any eigenvalues. As an example,
let T : L2 (0, 1) → L2 (0, 1) be defined by T u(x) = xu(x).
Theorem 1.26. Suppose T ∈ B(H) is symmetric, i.e., (T x, y) = (x, T y) for all x, y ∈ H.
Then
kT k = sup |(T x, x)|.
kxk=1
Moreover, if H 6= {0}, then there exists a real number λ ∈ σ(T ) such that |λ| = kT k. If
λ ∈ σp (T ), then in absolute value λ is the largest eigenvalue of T .
Proof. Clearly m ≡ supkxk=1 |(T x, x)| ≤ kT k. To show kT k ≤ m, observe that for all
x, y ∈ H
2(T x, y) + 2(T y, x) = (T (x + y), x + y) − (T (x − y), x − y)
≤ m(kx + yk2 + kx − yk2 )
= 2m(kxk2 + kyk2 )
where the last step follows from the paralleogram law. Hence, if T x 6= 0 and y =
(kxk/kT xk)T x, then
2kxkkT xk = (T x, y) + (y, T x) ≤ m(kxk2 + kyk2 ) = 2mkxk2
which implies kT xk ≤ mkxk. Since this is also valid when T x = 0, we have kT k ≤ m. To
prove the ‘moreover’ part, choose xn ∈ H such that kxn k = 1 and kT k = limn→∞ |(T xn , xn )|.
By renaming a subsequence of {xn }, we may assume that (T xn , xn ) converge to some real
number λ with |λ| = kT k. Observe that
k(T − λ)xn k2 = kT xn k2 − 2λ(T xn , xn ) + λ2 kxn k2
≤ 2λ2 − 2λ(T xn , xn ) → 0.
We now claim that λ ∈ σ(T ). Otherwise, we arrive at the contradiction
1 = kxn k = k(T − λ)−1 (T − λ)xn k ≤ k(T − λ)−1 k k(T − λ)xn k → 0.
1.5. Nonlinear Functional Analysis 11
Finally, we note that if T φ = µφ, with kφk = 1, then |µ| = |(T φ, φ)| ≤ kT k which implies
the last assertion of the theorem.
and thus x is the unique fixed point. Note that the fixed point x is independent of x0 since
x is a fixed point and fixed points are unique.
Our main existence result will be based upon the following so-called method of con-
tinuity or continuation method.
12 1. Preliminaries
(1.10) x = Ts−1 (f + τ T0 x − τ T1 x) ≡ Ax
and for A : X → X we have for all x, y ∈ X
kAx − Ayk ≤ τ c(kT1 k + kT0 k)kx − yk.
By the contraction mapping theorem, (1.10) has a solution and this completes the proof.
Here h·, ·i denotes the duality pairing between Lp (Ω) and Lq (Ω).
1.5. Nonlinear Functional Analysis 13
Proof. Since u ∈ Lp (Ω), the function u(x) is measurable on Ω and thus, Pn by (i) and P
(ii), the
function f (x, u(x)) is also measurable on Ω. From the inequality ( i=1 ξi ) ≤ c ni=1 ξir
r
Remarks. (a) The following remarkable statement can be proved: If f satisfies (i) and (ii)
above and if the corresponding Nemytskii operator is such that N : Lp (Ω) → Lq (Ω), then
N is continuous, bounded and (iii) holds.
(b) If (iii) is replaced by
(iii)0 for all (x, ξ) ∈ Ω × R
|f (x, ξ)| ≤ a(x) + b|ξ|p/r
where b is a fixed nonnegative number, a ∈ Lr (Ω) is nonnegative and 1 < p, r < ∞, then
the above results are valid with q replaced by r. (i.e., N : Lp (Ω) → Lr (Ω).)
(c) We say that f satisfies the Caratheodory property, written f ∈ Car, if (i) and
(ii) above are met. If in addition (iii) is met, then we write f ∈ Car(p).
1.5.3. Differentiability. Let S be an open subset of the Banach space X. The functional
f : S ⊂ X → R is said to be Gateaux differentiable (G-diff) at a point u ∈ S if there
exists a functional g ∈ X* (often denoted by f 0 (u)) such that
d f (u + tv) − f (u)
= [f 0 (u)]v for all v ∈ X.
f (u + tv) = lim
dt t=0 t→0 t
The functional f 0 (u)
is called the Gateaux derivative of f at the point u ∈ S. If f is
G-diff at each point of S, the map f 0 : S ⊂ X → X* is called the Gateaux derivative of
f on S. In addition, if f 0 is continuous at u (in the operator norm), then we say that f
is C 1 at u. Note that in the case of a real-valued function of several real variables, the
Gateaux derivative is nothing more than the directional derivative of the function at u in
the direction v.
The operator B, often denoted by A0 (u), is called the Frechet derivative of A at u. Note
that if A is Frechet differentiable on S, then A0 : S → B(X, Y ). In addition, if A0 is
continuous at u (in the operator norm), we say that A is C 1 at u.
Remark. If the functional f is F-diff at u ∈ S, then it is also G-diff at u, and the two
derivatives are equal. This follows easily from the definition of the Frechet derivative. The
converse is not always true as may be easily verified by simple examples from several variable
calculus. However, if the Gateaux derivative exists in a neighborhood of u and if f ∈ C 1 at
u, then the Frechet derivative exists at u, and the two derivatives are equal.
Example 1.31. (a) Let f (ξ) ∈ C(R). Then for k ≥ 0, the corresponding Nemytskii operator
N : C k (Ω̄) → C(Ω̄) is bounded and continuous. If in addition f (ξ) ∈ C 1 (R), then N ∈ C 1
and the Frechet derivative N 0 (u) is given by
[N 0 (u)v](x) = f 0 (u(x))v(x).
Note that for u, v ∈ C k (Ω̄), |N 0 (u)v|0 ≤ |f 0 (u)|0 |v|k and so N 0 (u) ∈ B(C k (Ω̄), C ( Ω̄)) with
kN 0 (u)k ≤ |f 0 (u)|0 . Clearly N 0 (u) is continuous at each point u ∈ C k (Ω̄). Moreover,
Z 1
0 d
|N (u + v) − N u − N (u)v|0 = sup | [ f (u(x) + tv(x)) − f 0 (u(x))v(x)]dt|
x 0 dt
Z 1
≤ |v|0 sup |f 0 (u(x) + tv(x)) − f 0 (u(x))|dt.
x 0
The last integral tends to zero since f0 is uniformly continuous on compact subsets of R.
More generally, let f (ξ) ∈ C k (R). Then the corresponding Nemytskii operator N :
C k (Ω̄)
→ C k (Ω̄) is bounded and continuous. If in addition f (ξ) ∈ C k+1 (R), then N ∈ C 1
with Frechet derivative given by [N 0 (u)v](x) = f 0 (u(x))v(x). Note that |uv|k ≤ |u|k |v|k for
u, v ∈ C k (Ω̄), and since C k (Ω̄) ⊂ C(Ω̄), the Frechet derivative must be of the stated form.
(b) Let f (ξ) ∈ C k+1 (R), where k > n/2. Then we claim that the corresponding Ne-
mytskii operator N : H k (Ω) → H k (Ω) is of class C 1 with Frechet derivative given by
[N 0 (u)v](x) = f 0 (u(x))v(x).
First, suppose u ∈ C k (Ω̄). Then N (u) ∈ C k (Ω̄) by the usual chain rule. If u ∈ H k (Ω),
let um ∈ C k (Ω̄) with kum − ukk,2 → 0. Since the imbedding H k (Ω) ⊂ C(Ω̄) is continuous,
um → u uniformly, and thus f (um ) → f (u) and f 0 (um ) → f 0 (u) uniformly and hence in L2 .
Furthermore, Di f (um ) = f 0 (um )Di um → f 0 (u)Di u in L1 . Consequently, by Theorem 2.10,
we have
Di f (u) = f 0 (u)Di u.
In a similar fashion we find
Dij f (u) = f 00 (u)Di uDj u + f 0 (u)Dij u
with corresponding formulas for higher derivatives.
1.5.4. Implicit Function Theorem. The following lemmas are needed in the proof of
the implicit function theorem.
1.5. Nonlinear Functional Analysis 15
Lemma 1.32. Let S be a closed nonempty subset of the Banach space X and let M be a
metric space. Suppose A(x, λ) : S × M → S is continuous and there is a constant k < 1
such that, uniformly for all λ ∈ M
kA(x, λ) − A(y, λ)k ≤ kkx − yk for all x, y ∈ S.
Then for each λ ∈ M, A(x, λ) has a unique fixed point x(λ) ∈ S and moreover, x(λ)
depends continuously on λ.
Proof. The existence and uniqueness of the fixed point x(λ) is of course a consequence of
the contraction mapping theorem. To prove continuity, suppose λn → λ. Then
kx(λn ) − x(λ)k = kA(x(λn ), λn ) − A(x(λ), λ)k
≤ kA(x(λn ), λn ) − A(x(λ), λn )k + kA(x(λ), λn ) − A(x(λ), λ)k
≤ kkx(λn ) − x(λ)k + kA(x(λ), λn ) − A(x(λ), λ)k.
Therefore
1
kx(λn ) − x(λ)k ≤ kA(x(λ), λn ) − A(x(λ), λ)k.
1−k
By the assumed continuity of A, the right side tends to zero as n → ∞, and therefore
x(λn ) → x(λ).
Lemma 1.33. Suppose X, Y are Banach spaces. Let S ⊂ X be convex and assume A :
S → Y is Frechet differentiable at every point of S. Then
kAu − Avk ≤ ku − vk sup kA0 (w)k.
w∈S
In other words, A satisfies a Lipschitz condition with constant q = supw∈S kA0 (w)k.
Proof. For fixed u, v ∈ S, set g(t) = A(u + t(v − u)), where t ∈ [0, 1]. Using the definition
of Frechet derivative, we have
0 A(u + (t + h)(v − u)) − A(u + t(v − u))
g (t) = lim
h→0 h
0
hA (u + t(v − u))(v − u) + kh(v − u)kE
= lim
h→0 h
0
= A (u + t(v − u))(v − u).
Hence
kg(0) − g(1)k = kAu − Avk ≤ sup kg 0 (t)k
t∈[0,1]
which implies the desired result.
Lemma 1.34. Let X be a Banach space. Suppose A : B(u0 , r) ⊂ X → X is a contraction,
with Lipschitz constant q < 1, where
r ≥ (1 − q)−1 kAu0 − u0 k.
Then A has a unique fixed point u ∈ B(u0 , r).
We now consider operator equations of the form A(u, v) = 0, where A maps a subset
of X × Y into Z. For a given [u0 , v0 ] ∈ X × Y we denote the Frechet derivative of A (at
[u0 , v0 ]) with respect to the first (second) argument by Au (u0 , v0 ) (Av (u0 , v0 )).
Theorem 1.35. (Implicit Function) Let X, Y, Z be Banach spaces. For a given [u0 , v0 ] ∈
X × Y and a, b > 0, let S = {[u, v] : ku − u0 k ≤ a, kv − v0 k ≤ b}. Suppose A : S → Z
satisfies the following:
(i) A is continuous.
(ii) Av (·, ·) exists and is continuous in S (in the operator norm)
(iii) A(u0 , v0 ) = 0.
(iv) [Av (u0 , v0 )]−1 exists and belongs to B(Z, Y ).
Then there are neighborhoods U of u0 and V of v0 such that the equation A(u, v) = 0 has
exactly one solution v ∈ V for every u ∈ U . The solution v depends continuously on u.
Proof. If in S we define
B(u, v) = v − [Av (u0 , v0 )]−1 A(u, v)
it is clear that the solutions of A(u, v) = 0 and v = B(u, v) are identical. The theorem will
be proved by applying the contraction mapping theorem to B. Since
Bv (u, v) = I − [Av (u0 , v0 )]−1 Av (u, v)
Bv (·, ·) is continuous in the operator norm. Now Bv (u0 , v0 ) = 0, so for some δ > 0 there is
a q < 1 such that
kBv (u, v)k ≤ q
for ku − u0 k ≤ δ, kv − v0 k ≤ δ. By virtue of Lemma 1.33, B(u, ·) is a contraction. Since
A is continuous, B is also continuous. Therefore, since B(u0 , v0 ) = v0 , there is an ε with
0 < ε ≤ δ such that
kB(u, v0 ) − v0 k ≤ (1 − q)δ
for ku − u0 k ≤ ε. The existence of a unique fixed point in the closed ball B(v0 , δ) follows
from Lemma 1.34 and the continuity from Lemma 1.32.
Example 1.36. Let f (ξ) ∈ C 1,α (R), f (0) = f 0 (0) = 0, g(x) ∈ C α (Ω̄) and consider the
boundary value problem
(1.13) ∆u + f (u) = g(x) in Ω, u|∂Ω = 0.
Set X = Z = C α (Ω̄), Y = {u ∈ C 2,α (Ω̄) : u|∂Ω = 0} and
A(g, u) = ∆u + N (u) − g
where N is the Nemytskii operator corresponding to f . The operator A maps X × Y into
the space Z. Clearly A(0, 0) = 0 (A is C 1 by earlier examples) and
Au (0, 0)v = ∆v, v ∈ Y.
It is easily checked that all the conditions of the implicit function theorem are met. In
particular, condition (iv) is a consequence of the bounded inverse theorem. Thus, for a
function g ∈ C α (Ω̄) of sufficiently small norm (in the space C α (Ω̄)) there exists a unique
solution of (1.13) which lies near the zero function. There may, of course, be other solutions
which are not close to the zero function. (Note that the condition f 0 (0) = 0 rules out linear
functions.)
1.5. Nonlinear Functional Analysis 17
Remark. Note that the choice of X = Z = C(Ω̄), Y = {u ∈ C 2 (Ω̄) : u|∂Ω = 0} would fail
above since the corresponding linear problem is not onto. An alternate approach would be
to use Sobolev spaces. In fact, if we take X = Z = W k−2 (Ω), Y = W k (Ω) ∩ H01 (Ω) with k
sufficiently large, and if f (ξ) ∈ C k+1 (R), then as above, we can conclude the existence of a
unique solution u ∈ W k (Ω) provided kgkk−2,2 is sufficiently small. Hence, we get existence
for more general functions g; however, the solution u ∈ W k (Ω) is not a classical (i.e., C 2 )
solution in general.
1.5.5. Generalized Weierstrass Theorem. In its simplest form, the classical Weier-
strass theorem can be stated as follows: Every continuous function defined on a closed ball
in Rn is bounded and attains both its maximum and minimum on this ball. The proof
makes essential use of the fact that the closed ball is compact.
The first difficulty in trying to extend this result to an arbitrary Banach space X is
that the closed ball in X is not compact if X is infinite dimensional. However, as we shall
show, a generalized Weierstrass theorem is possible if we require a stronger property for the
functional.
A set S ⊂ X is said to be weakly closed if {un } ⊂ S, un * u implies u ∈ S, i.e.,
S contains all its weak limits. A weakly closed set is clearly closed, but not conversely.
Indeed, the set {sin nx}∞ 2
1 in L (0, π) has no limit point (because it cannot be Cauchy) so
it is closed, but zero is a weak limit that does not belong to the set. It can be shown that
every convex, closed set in a Banach space is weakly closed.
A functional f : S ⊂ X → R is weakly continuous at u0 ∈ S if for every sequence
{un } ⊂ S with un * u0 it follows that f (un ) → f (u0 ). Clearly, every functional f ∈ X*
is weakly continuous. A functional f : S ⊂ X → R is weakly lower semicontinu-
ous(w.l.s.c.) at u0 ∈ S if for every sequence {un } ⊂ S for which un * u0 it follows that
f (u0 ) ≤ lim inf n→∞ f (un ). According to Theorem 1.18, the norm on a Banach space is
w.l.s.c.. A functional f : S ⊂ X → R is weakly coercive on S if f (u) → ∞ as kuk → ∞
on S.
Theorem 1.37. Let X be a reflexive Banach space and f : C ⊂ X → R be w.l.s.c. and
assume
(i) C is a nonempty bounded weakly closed set in X or
(ii) C is a nonempty weakly closed set in X and f is weakly coercive on C.
Then
(a) inf u∈C f (u) > −∞;
(b) there is at least one u0 ∈ C such that f (u0 ) = inf u∈C f (u).
Moreover, if u0 is an interior point of C and f is G-diff at u0 , then f 0 (u0 ) = 0.
Proof. Assume (i) and let {un } ⊂ C be a minimizing sequence, i.e., limn→∞ f (un ) =
inf u∈C f (u). The existence of such a sequence follows from the definition of inf. Since X is
reflexive and C is bounded and weakly closed, there is a subsequence {un0 } and a u0 ∈ C
such that un0 * u0 . But f is w.l.s.c. and so f (u0 ) ≤ lim inf n→∞ f (un0 ) = inf u∈C f (u),
which proves (a). Since by definition, f (u0 ) ≥ inf u∈C f (u), we get (b).
Assume (ii) and fix u0 ∈ C. Since f is weakly coercive, there is a closed ball B(0, R) ⊂ X
such that u0 ∈ B ∩ C and f (u) ≥ f (u0 ) outside B ∩ C. Since B ∩ C satisfies the conditions
of (i), there is a u1 ∈ B ∩ C such that f (u) ≥ f (u1 ) for all u ∈ B ∩ C and in particular for
u0 . Thus, f (u) ≥ f (u1 ) on all of C.
18 1. Preliminaries
To prove the last statement we set ϕv (t) = f (u0 + tv). For fixed v ∈ X, ϕv (t) has a
local minimum at t = 0, and therefore hf 0 (u0 ), vi = 0 for all v ∈ X.
Remark. Even though weakly continuous functionals on closed balls attain both their inf
and sup (which follows from the above theorem), the usual functionals that we encounter
are not weakly continuous, but are w.l.s.c.. Hence this explains why we seek the inf and
not the sup in variational problems.
1.5.5.1. Convex sets. A set C in the real normed space X is called convex if (1−t)u+tv ∈ C
for all t ∈ [0, 1], u, v ∈ C. The following result is needed later (see, e.g., [32]).
Theorem 1.38. A closed convex set in a Banach space is weakly closed.
A is monotone if
hAu − Av, u − vi ≥ 0 for all u, v ∈ X.
A is strongly monotone if
hAu − Av, u − vi ≥ cku − vkpX for all u, v ∈ X
where c > 0 and p > 1.
A is coercive if
hAu, ui
lim = +∞.
kuk→∞ kuk
Remark. A strongly monotone operator is coercive. This follows immediately from hAu, ui =
hAu − A0, ui + hA0, ui ≥ ckukpX − kA0kkukX .
Proof. Set
Cr = {u ∈ C : f (u) ≤ r}.
It follows from (a) that Cr is closed and convex for all r, and thus is weakly closed (cf.
Theorem 1.38). If f is not w.l.s.c., then there is a sequence {un } ⊂ C with un * u and
f (u) > lim inf f (un ). Hence, there is an r and a subsequence {un0 } such that f (u) > r and
f (un0 ) ≤ r (i.e., un0 ∈ Cr ) for all n0 large enough. Since Cr is weakly closed, u ∈ Cr , which
is a contradiction.
Assume (b) holds and set ϕ(t) = f (u + t(v − u)). Then by Lemma 1.39, ϕ : [0, 1] → R
is convex and ϕ0 is monotone. By the classical mean value theorem,
ϕ(1) − ϕ(0) = ϕ0 (θ) ≥ ϕ0 (0), 0<θ<1
i.e.,
f (v) ≥ f (u) + hf 0 (u), v − ui for all u, v ∈ C.
If un * u, then hf 0 (u), un − ui → 0 as n → ∞. Hence, f is w.l.s.c.
In the first case we say that f is (local) maximal at u0 with respect to Mc , while in the
second case f is (local) minimal at u0 with respect to Mc . A point u0 ∈ Mc is called an
ordinary point of the manifold Mc if its F-derivative g 0 (u0 ) 6= 0.
Let u0 be an ordinary point of Mc . Then u0 is called a critical point of f with respect
to Mc if there exists a real number λ, called a Lagrange multiplier, such that
f 0 (u0 ) = λg 0 (u0 ).
As we shall see, if u0 is an extremum of f with respect to Mc , and if u0 is an ordinary point,
then u0 is a critical point of f with respect to Mc . Note that if u0 is an extremum of f with
respect to X, then we can choose λ = 0, which implies the usual result.
Lemma 1.41. Let X be a Banach space. Suppose the following hold:
(i) f, g : X → R are of class C 1
(ii) For u0 ∈ X, we can find v, w ∈ X such that
Proof. If (a) does not hold, then fix w ∈ X with g 0 (u0 )w 6= 0. By hypothesis and the above
lemma, we must have
f 0 (u0 )v · g 0 (u0 )w = f 0 (u0 )w · g 0 (u0 )v for all v ∈ X.
If we define λ = (f 0 (u0 )w)/(g 0 (u0 )w), then we obtain (b).
Remark. In applying this theorem one should be careful and not choose g(u) = kuk, since
this g is not weakly continuous.
Assume that the imbedding X ⊂ Y is compact and that the imbedding Y ⊂ Z is continuous.
Then for each ε > 0, there is a constant c(ε) such that
(1.16) kukY ≤ εkukX + c(ε)kukZ for all u ∈ X.
Proof. If for a fixed ε > 0 the inequality is false, then there exists a sequence {un } such
that
(1.17) kun kY > εkun kX + nkun kZ for all n.
Without loss of generality we can assume kun kX = 1. Since the imbedding X ⊂ Y is
compact, there is a subsequence, again denoted by {un }, with un → u in Y . This implies
un → u in Z. By (1.17), kun kY > ε and so u 6= 0. Again by (1.17), un → 0 in Z, i.e., u = 0,
which is a contradiction.
Chapter 2
Sobolev Spaces
This chapter is devoted to a discussion of the necessary Sobolev function spaces which
permit a modern approach to the study of differential equations.
where v = Dα u. Motivated by (2.1), we now enlarge the class of functions for which the
notion of derivative can introduced.
Let u ∈ L1loc (Ω). A function v ∈ L1loc (Ω) is called the αth weak derivative of u if it
satisfies
Z Z
α |α| |α|
(2.2) uD ϕdx = (−1) vϕdx for all ϕ ∈ C0 (Ω).
Ω Ω
It can be easily shown that weak derivatives are unique. Thus we write v = Dα u to indicate
that v is the αth weak derivative of u. If a function u has an ordinary αth derivative lying
in L1loc (Ω), then it is clearly the αth weak derivative.
23
24 2. Sobolev Spaces
Example 2.1. (a) The function u(x) = |x1 | has in the ball Ω = B(0, 1) weak derivatives
ux1 = sgn x1 , uxi = 0, i = 2, . . . , n. In fact, we apply formula (2.2) as follows: For any
ϕ ∈ C01 (Ω)
Z Z Z
|x1 |ϕx1 dx = x1 ϕx1 dx − x1 ϕx1 dx
Ω Ω+ Ω−
where Ω+ = Ω ∩ (x1 > 0), Ω− = Ω ∩ (x1 < 0). Since x1 ϕ = 0 on ∂Ω and also for x1 = 0,
an application of the divergence theorem yields
Z Z Z Z
|x1 |ϕx1 dx = − ϕdx + ϕdx = − (sgn x1 )ϕdx.
Ω Ω+ Ω− Ω
|x1 |xi = 0 for i = 2, . . . , n. Note that the function |x1 | has no classical derivative with
respect to x1 in Ω.
(b) By the above computation, the function u(x) = |x| has a weak derivative u0 (x) =
sgn x on the interval Ω = (−1, 1). On the other hand, sgn x does not have a weak derivative
on Ω due to the discontinuity at x = 0.
(c) Let Ω = B(0, 1/2) ⊂ R2 and define u(x) = ln(ln(2/r)), x ∈ Ω, where r = |x| =
+ x22 )1/2 . Then u 6∈ L∞ (Ω) because of the singularity at the origin. However, we will
(x21
show that u has weak first partial derivatives.
First of all u ∈ L2 (Ω), for
Z Z 2π Z 1/2
2
|u| dx = r[ln(ln(2/r))]2 dr dθ
Ω 0 0
and a simple application of L’hopitals rule shows that the integrand is bounded and thus
the integral is finite. Similarly, it is easy to check that the classical partial derivative
− cos θ
ux1 = , where x1 = r cos θ
r ln(2/r)
also belongs to L2 (Ω). Now we show that the defining equation for the weak derivative is
met.
Let Ωε = {x : ε < r < 1/2} and choose ϕ ∈ C01 (Ω). Then by the divergence theorem
and the absolute continuity of integrals
Z Z Z Z
uϕx1 dx = lim uϕx1 dx = lim − ux1 ϕdx + uϕn1 ds
Ω ε→0 Ωε ε→0 Ωε r=ε
The same analysis applies to ux2 . Thus u has weak first partial derivatives given by the
classical derivatives which are defined on Ω\{0}.
2.1. Weak Derivatives and Sobolev Spaces 25
Since we shall be dealing mostly with these spaces in the sequel, we introduce the special
notation:
H k (Ω) = W k,2 (Ω), H0k (Ω) = W0k,2 (Ω).
Theorem 2.2. W k,p (Ω) is a Banach space under the norm (2.3). If 1 < p < ∞, it is
reflexive and if 1 ≤ p < ∞, it is separable.
Proof. We first prove that W k,p (Ω) is complete with respect to the norm (2.3).
Let {un } be a Cauchy sequence of elements in W k,p (Ω), i.e.,
p
X Z
kun − um kk,p = |Dα un − Dα um |p dx → 0 as m, n → ∞.
|α|≤k Ω
(Lp (Ω))n+1 ,
for v = (v1 , . . . , vn+1 ) ∈ then T is a (linear) isometry. Now (Lp (Ω))n+1 is
reflexive for 1 < p < ∞ and separable for 1 ≤ p < ∞. Since W 1,p (Ω) is complete, its image
under the isometry T is a closed subspace of (Lp (Ω))n+1 which inherits the corresponding
properties as does W 1,p (Ω) (see Theorem 1.14). Similarly, we can handle the case k ≥ 2.
Example 2.3. Let Ω be a bounded open connected set in Rn . Divide Ω into N open disjoint
subsets Ω1 , Ω2 , . . . , ΩN . Suppose the function u : Ω → R has the following properties:
(i) u is continuous on Ω̄.
(ii) For some i, Di u is continuous on Ω1 , Ω2 , . . . , ΩN , and can be extended contin-
uously to Ω̄1 , Ω̄2 , . . . , Ω̄N , respectively.
(iii) The surfaces of discontinuity are such that the divergence theorem applies.
Define wi (x) = Di u(x) if x ∈ ∪N
i=1 Ωi . Otherwise, wi can be arbitrary. We now claim
that wi ∈ Lp (Ω) is a weak partial derivative of u on Ω.
Indeed, for all ϕ ∈ C01 (Ω), the divergence theorem yields
Z XZ
uDi ϕdx = uDi ϕdx
Ω j Ωj
!
X Z Z
= uϕni dS − ϕDi udx
j ∂Ωj Ωj
Z
= − ϕDi udx.
Ω
Note that the boundary terms either vanish, since ϕ has compact support, or cancel out
along the common boundaries, since u is continuous and the outer normals have opposite
directions. Similarly, if u ∈ C k (Ω̄) and has piecewise continuous derivatives in Ω of order
k + 1, then u ∈ W k+1,p (Ω).
Remark. More generally, by using a partition of unity argument, we can show the following:
If O is a collection of nonempty open sets whose union is Ω and if u ∈ L1loc (Ω) is such that
for some multi-index α, the αth weak derivative of u exists on each member of O, then the
αth weak derivative of u exists on Ω.
Exercise 1. Consider the function u(x) = sgnx1 + sgnx2 in the ball B(0, 1) ⊂ R2 .
Show that the weak derivative ux1 does not exist, yet the weak derivative ux1 x2 does exist.
Exercise 2. (a) Let Ω be the hemisphere of radius R < 1 in Rn :
n
X
r2 ≡ x2i ≤ R2 , xn ≥ 0 (n ≥ 3).
i=1
Show that
u = (r(n/2−1) `nr)−1 ∈ H 1 (Ω).
(b) Let B = B(0, 1) be the open unit ball in Rn , and let
u(x) = |x|−α , x ∈ B.
2.2. Approximations and Extensions 27
the latter equality being valid since ωh vanishes outside the (open) ball B(x, h). Thus
the values of uh (x) depend only on the values of u on the ball B(x, h). In particular, if
dist(x, supp(u)) ≥ h, then uh (x) = 0.
Theorem 2.4. Let Ω be a nonempty open set in Rn . Then
(a) uh ∈ C ∞ (Rn ).
(b) If supp(u) is a compact subset of Ω, then uh ∈ C0∞ (Ω) for all h sufficiently
small.
With respect to a bounded set Ω we construct another set Ω(h) as follows: with each
point x ∈ Ω as center, draw a ball of radius h; the union of these balls is then Ω(h) . Clearly
Ω(h) ⊃ Ω. Moreover, uh can be different from zero only in Ω(h) .
Corollary 2.5. Let Ω be a nonempty bounded open set in Rn and let h > 0 be any number.
Then there exists a function η ∈ C ∞ (Rn ) such that
0 ≤ η(x) ≤ 1; η(x) = 1, x ∈ Ω(h) ; η(x) = 0, x ∈ (Ω(3h) )c .
Such a function is called a cut-off function for Ω.
28 2. Sobolev Spaces
Proof. Let χ(x) be the characteristic function of the set Ω(2h) : χ(x) = 1 for x ∈ Ω(2h) , χ(x) =
0 for x 6∈ Ω(2h) and set
Z
η(x) ≡ χh (x) = ωh (x − y)χ(y)dy.
Rn
Then Z
η(x) = ωh (x − y)dy ∈ C ∞ (Rn ),
Ω(2h)
Z
0 ≤ η(x) ≤ ωh (x − y)dy = 1,
Rn
and
(R
B(x,h) ωh (x − y)dy = 1, x ∈ Ω(h) ,
Z
η(x) = ωh (x − y)χ(y)dy =
B(x,h) 0, x ∈ (Ω(3h) )c .
In particular, we note that if Ω0 ⊂⊂ Ω, there is a function η ∈ C0∞ (Ω) such that η(x) = 1
for x ∈ Ω0 , and 0 ≤ η(x) ≤ 1 in Ω.
Remark. Henceforth, the notation Ω0 ⊂⊂ Ω means that Ω0 , Ω are open sets and that Ω0 ⊂ Ω.
Proof. Choose a > 0 large enough so that Ω is strictly contained in the ball B(0, a). Then
the function
u(x) if x ∈ Ω,
U (x) =
0 if x ∈ B(0, 2a) \ Ω
belongs to Lp (B(0, 2a)). For ε > 0, there is a function Ū ∈ C(B̄(0, 2a)) which satisfies the
inequality kU − Ū kLp (B(0,2a)) < ε/3. By multiplying Ū by an appropriate cut-off function,
it can be assumed that Ū (x) = 0 for x ∈ B(0, 2a)/B(0, a).
Therefore for |z| ≤ a, kU (x + z) − Ū (x + z)kLp (B(0,2a)) = kU (x) − Ū (x)kLp (B(0,a)) ≤ ε/3.
Since the function Ū is uniformly continuous in B(0, 2a), there is a δ > 0(δ < a) such that
kŪ (x + z) − Ū (x)kLp (B(0,2a)) ≤ ε/3 whenever |z| < δ. Hence for |z| < δ we easily see that
ku(x + z) − u(x)kLp (Ω) = kU (x + z) − U (x)kLp (B(0,2a)) ≤ ε.
Theorem 2.8. Let Ω be a nonempty open set in Rn . If u ∈ Lp (Ω) (1 ≤ p < ∞), then
(a) kuh kp ≤ kukp
(b) kuh − ukp → 0 as h → 0.
2.2. Approximations and Extensions 29
The right-hand side goes to zero as h → 0 since every u ∈ Lp (Ω) is p-mean continuous.
We now prove (c) for k = 0. Let Ω0 , Ω00 be such that Ω0 ⊂⊂ Ω00 ⊂⊂ Ω. Let h0 be the
shortest distance between ∂Ω0 and ∂Ω00 . Take h < h0 . Then
Z
uh (x) − u(x) = [u(y) − u(x)]ωh (x − y)dy.
B(x,h)
If x ∈ Ω̄0 , then in the above integral y ∈ Ω̄00 . Now u is uniformly continuous in Ω̄00 and
ωh ≥ 0, and therefore for an arbitrary ε > 0 we have
Z
|uh (x) − u(x)| ≤ ε ωh (x − y)dy = ε
B(x,h)
Remark. The following example shows that in (c) we cannot replace Ω0 by Ω. Let u ≡ 1 for
R1 Rh
x ∈ [0, 1] and consider uh (x) = 0 ωh (x − y)dy, where ωh (y) = ωh (−y). Now −h ωh (y)dy =
1 and so uh (0) = 1/2 for all h < 1. Thus uh (0) → 1/2 6= 1 = u(0). Moreover, for x ∈ (0, 1)
R x+h
and h sufficiently small, (x − h, x + h) ⊂ (0, 1) and so uh (x) = x−h ωh (x − y)dy = 1 which
implies uh (x) → 1 for all x ∈ (0, 1).
Corollary 2.9. Let Ω be a nonempty open set in Rn . Then C0∞ (Ω) is dense in Lp (Ω) for
all 1 ≤ p < ∞.
Proof. Suppose first that Ω is bounded and let Ω0 ⊂⊂ Ω. For a given u ∈ Lp (Ω) set
u(x), x ∈ Ω0
v(x) =
0, x ∈ Ω\Ω0 .
Then Z Z
p
|u − v| dx = |u|p dx.
Ω Ω\Ω0
By the absolute continuity of integrals, we can choose Ω0 so that the integral on the right
is arbitrarily small, i.e., ku − vkp < ε/2.
Since supp(v) is a compact subset of Ω, Theorems 2.4(b) and 2.8(b) imply that for h
sufficiently small, vh (x) ∈ C0∞ (Ω) with kv − vh kp < ε/2, and therefore ku − vh kp < ε.
If Ω is unbounded, choose a ball B large enough so that
Z
|u|p dx < ε/2
Ω\Ω0
where Ω0 = Ω ∩ B, and repeat the proof just given.
Proof. (Necessity) Suppose v = Dα u. Let S ⊂ Ω, and choose d > 0 small enough so that
the sets Ω0 ≡ S (d/2) , Ω00 ≡ S (d) satisfy Ω0 ⊂⊂ Ω00 ⊂⊂ Ω. For x ∈ Rn define
Z Z
uh (x) = ωh (x − y)u(y)dy, vh (x) = ωh (x − y)v(y)dy.
Ω00 Ω00
Clearly, uh , vh ∈ ∞ n
C (R ) for h > 0. Moreover, from Theorem 2.8 we have kuh − ukL1 (S) ≤
kuh − ukL1 (Ω00 ) → 0.
Now we note that if x ∈ Ω0 and 0 < h < d/2, then ωh (x − y) ∈ C0∞ (Ω00 ). Thus by
Theorem 2.4 and the definition of weak derivative,
Z Z
α α |α|
D uh (x) = u(y)Dx ωh (x − y)dy = (−1) u(y)Dyα ωh (x − y)dy
00 Ω00
ZΩ
= ωh (x − y) · v(y)dy = vh (x).
Ω00
Thus, kDα uh − vkL1 (S) → 0.
|α|
(Sufficiency) Choose ϕ ∈ C0 (Ω) and consider a compact set S ⊃ supp(ϕ). Then as
h→∞
Z Z Z Z
uDα ϕdx ← uh Dα ϕdx = (−1)|α| ϕDα uh dx → (−1)|α| vϕdx
S S S S
2.2. Approximations and Extensions 31
We now note some properties of W k,p (Ω) which follow easily from the results of this
and the previous section.
(a) If Ω0 ⊂ Ω and if u ∈ W k,p (Ω), then u ∈ W k,p (Ω0 ).
(b) If u ∈ W k,p (Ω) and |a(x)|k,∞ < ∞, then au ∈ W k,p (Ω). In this case any weak
derivative Dα (au) is computed according to the usual rule of differentiating the
product of functions.
(c) If u ∈ W k,p (Ω) and uh is its mollified function, then for any compact set S ⊂
Ω, kuh − ukW k,p (S) → 0 as h → 0. If in addition, u has compact support in Ω, then
kuh − ukk,p → 0 as h → 0.
More generally, we have the following global approximation theorems. (See Meyers and
Serrin H = W . The proofs make use of a partition of unity argument.)
Theorem 2.12. Assume Ω is bounded and let u ∈ W k,p (Ω), 1 ≤ p < ∞. Then there exist
functions um ∈ C ∞ (Ω) ∩ W k,p (Ω) such that
um → u in W k,p (Ω).
In other words, C ∞ (Ω) ∩ W k,p (Ω) is dense in W k,p (Ω).
Theorem 2.13. Assume Ω is bounded and ∂Ω ∈ C 1 . Let u ∈ W k,p (Ω), 1 ≤ p < ∞. Then
there exist functions um ∈ C ∞ (Ω̄) such that
um → u in W k,p (Ω).
In other words, C ∞ (Ω̄) is dense in W k,p (Ω).
Proof. According to Theorem 2.10, there exists a sequence {uh } ⊂ C 1 (Ω) such that kuh −
ukL1 (Ω0 ) → 0, kDα uh − Dα ukL1 (Ω0 ) → 0 as h → 0, where Ω0 ⊂⊂ Ω. Thus
Z Z
0
|f (uh ) − f (u)|dx ≤ sup |f | |uh − u|dx → 0 as h → 0
Ω0 Ω0
Z Z
0 α 0 α 0
|f (uh )D uh − f (u)D u|dx ≤ sup |f | |Dα uh − Dα u|dx
Ω0 Ω0
Z
+ |f 0 (uh ) − f 0 (u)||Dα u|dx.
Ω0
Since kuh − ukL1 (Ω0 ) → 0, there exists a subsequence of {uh }, which we call {uh } again,
which converges a.e. in Ω0 to u. Moreover, since f 0 is continuous, {f 0 (uh )} converges to f 0 (u)
a.e. in Ω0 . Hence the last integral tends to zero by the dominated convergence theorem.
Consequently, the sequences {f (uh )}, {f 0 (uh )Dα uh } tend to f (u), f 0 (u)Dα u respectively,
and the first conclusion follows by an application of Theorem 2.10 again.
Since f (0) = 0, the mean value theorem implies |f (s)| ≤ M |s| for all s ∈ R. Thus,
|f (u(x))| ≤ M |u(x)| for all x ∈ Ω and so f ◦ u ∈ Lp (Ω) if u ∈ Lp (Ω). Similarly,
f 0 (u(x))Dα u ∈ Lp (Ω) if u ∈ W 1,p (Ω), which shows that f ◦ u ∈ W 1,p (Ω).
Corollary 2.15. Let Ω be a bounded open set in Rn . If u has an αth weak derivative
Dα u, |α| = 1, then so does |u| and
Dα u
if u > 0
α
D |u| = 0 if u = 0
−Dα u if u < 0
i.e., Dα |u| = (sgn u)Dα u for u 6= 0. In particular, if u ∈ W 1,p (Ω), then |u| ∈ W 1,p (Ω).
Proof. Given u ∈ W01,p (Ω), let un ∈ C01 (Ω) with kun − uk1,p → 0 and define vn = f ◦ un .
Since un has compact support and f (0) = 0, vn has compact support. Also vn is Lipschitz
continuous, for
|vn (x) − vn (y)| = |f (un (x)) − f (un (y))|
≤ c|un (x) − un (y)|
≤ cn |x − y|.
Hence vn ∈ Lp (Ω).
Since vn is absolutely continuous on any line segment in Ω, its partial derivatives (which
exist almost everywhere) coincide almost everywhere with the weak derivatives. Moreover,
we see from above that |∂vn /∂xi | ≤ cn for 1 ≤ i ≤ n, and as Ω is bounded, ∂vn /∂xi ∈ Lp (Ω).
Thus vn ∈ W 1,p (Ω) and has compact support, which implies vn ∈ W01,p (Ω). From the
relation
|vn (x) − f (u(x))| ≤ c|un (x) − u(x)|
it follows that kvn − f ◦ ukp → 0. Furthermore, if ei is the standard ith basis vector in Rn ,
we have
|vn (x + hei ) − vn (x)| |un (x + hei ) − un (x)|
≤c
|h| |h|
and so
∂vn ∂un
lim sup k kp ≤ c lim sup k kp .
n→∞ ∂xi n→∞ ∂xi
But, {∂un /∂xi } is a convergent sequence in Lp (Ω) and therefore {∂vn /∂xi } is bounded in
Lp (Ω) for each 1 ≤ i ≤ n. Since kvn k1,p is bounded and W01,p (Ω) is reflexive, a subsequence
of {vn } converges weakly in W 1,p (Ω), and thus weakly in Lp (Ω) to some element of W01,p (Ω).
Thus, f ◦ u ∈ W01,p (Ω).
Proof. We apply the preceding theorem with f (t) = |t|. Thus |u| ∈ W01,p (Ω). Now
u+ = (|u| + u)/2 and u− = (u − |u|)/2. Thus u+ , u− ∈ W01,p (Ω).
34 2. Sobolev Spaces
2.2.4. Extensions. If Ω ⊂ Ω0 , then any function u(x) ∈ C0k (Ω) has an obvious extension
U (x) ∈ C0k (Ω0 ). From the definition of W0k,p (Ω) it follows that the function u(x) ∈ W0k,p (Ω)
and extended as being equal to zero in Ω0 \Ω belongs to W0k,p (Ω0 ). In general, a function
u ∈ W k,p (Ω) and extended by zero to Ω0 will not belong to W k,p (Ω0 ). (Consider the function
u(x) ≡ 1 in Ω.) However, if u ∈ W k,p (Ω) has compact support in Ω, then u ∈ W0k,p (Ω) and
thus the obvious extension belongs to W0k,p (Ω0 ).
Proof. Suppose first that u ∈ C k (Ω̄). Let y = ψ(x) define a C k diffeomorphism that
straightens the boundary near x0 = (x01 , . . . , x0n ) ∈ ∂Ω. In particular, we assume there is a
ball B = B(x0 ) such that ψ(B ∩ Ω) ⊂ Rn+ (i.e., yn > 0), ψ(B ∩ ∂Ω) ⊂ ∂Rn+ . (e.g., we could
choose yi = xi − x0i for i = 1, . . . , n − 1 and yn = xn − ϕ(x1 , . . . , xn−1 ), where ϕ is of class
C k . Moreover, without loss of generality, we can assume yn > 0 if x ∈ B ∩ Ω.)
Let G and G+ = G ∩ Rn+ be respectively, a ball and half-ball in the image of ψ such
0
that ψ(x0 ) ∈ G. Setting ū(y) = u ◦ ψ −1 (y) and y = (y1 , . . . , yn−1 , yn ) = (y , yn ), we define
an extension Ū (y) of ū(y) into yn < 0 by
k+1
0 0
X
Ū (y , yn ) = ci ū(y , −yn /i), yn < 0
i=1
where the ci are constants determined by the system of equations
k+1
X
(2.5) ci (−1/i)m = 1, m = 0, 1, . . . , k.
i=1
Note that the determinant of the system (2.5) is nonzero since it is the Vandemonde deter-
minant. One verifies readily that the extended function Ū is continuous with all derivatives
up to order k in G. For example,
k+1
X
lim Ū (y) = ci ū(y 0 , 0) = ū(y 0 , 0)
y→(y 0 ,0)
i=1
2.2. Approximations and Extensions 35
Finally
k+1
X
lim Ūyn (y) = ci (−1/i)ūyn (y 0 , 0) = ūyn (y 0 , 0)
y→(y 0 ,0)
i=1
by virtue of (2.5) with m = 1. Similarly we can handle the higher derivatives.
Thus w = Ū ◦ ψ ∈ C k (B 0 ) for some ball B 0 = B 0 (x0 ) and w = u in B 0 ∩ Ω, (If x ∈ B 0 ∩ Ω,
then ψ(x) ∈ G+ and w(x) = Ū (ψ(x)) = ū(ψ(x)) = u(ψ −1 ψ(x)) = u(x)) so that w provides
a C k extension of u into Ω ∪ B 0 . Moreover,
sup |ū(y)| = sup |u(ψ −1 (y))| ≤ sup |u(x)|
G+ G+ Ω
Since a similar computation for the derivatives holds, it follows that there is a constant
c > 0, independent of u, such that
kwkC k (Ω̄∪B 0 ) ≤ ckukC k (Ω̄) .
2.2.5. Trace Theorem. Unless otherwise stated, Ω will denote a bounded open connected
set in Rn , i.e., a bounded domain.
Proof. For simplicity, let n = 2. The more general case is handled similarly. In a neigh-
borhood of a boundary point x ∈ ∂Ω, we choose a local (ξ, η)-coordinate system, where the
boundary has the local representation
η = ϕ(ξ), −α ≤ ξ ≤ α
with the C 0,1 function ϕ. Then there exists a β > 0 such that all the points (ξ, η) with
−α ≤ ξ ≤ α, ϕ(ξ) − β ≤ η ≤ ϕ(ξ)
belong to Ω̄. Let u ∈ C 1 (Ω̄). Then
Z ϕ(ξ)
u(ξ, ϕ(ξ)) = uη (ξ, η)dη + u(ξ, t)
t
where ϕ(ξ) − β ≤ t ≤ ϕ(ξ). Applying the inequality (a + b)p ≤ 2p−1 (ap + bp ) together with
Hölder’s inequality we have
Z ϕ(ξ)
p p−1 p−1
|u(ξ, ϕ(ξ))| ≤ 2 β |uη (ξ, η)|p dη + 2p−1 |u(ξ, t)|p .
ϕ(ξ)−β
Since C 1 (Ω̄) = W 1,p (Ω), the bounded linear operator γ0 : C 1 (Ω̄) ⊂ W 1,p (Ω) → Lp (∂Ω)
can be uniquely extended to a bounded linear operator γ0 : W 1,p (Ω) → Lp (∂Ω) such that
(2.6) remains true for all u ∈ W 1,p (Ω). More precisely, we obtain γ0 u in the following
way: Let u ∈ W 1,p (Ω). We choose a sequence {un } ⊂ C 1 (Ω̄) with kun − uk1,p → 0. Then
kγ0 un − γ0 ukLp (∂Ω) → 0.
The function γ0 u (as an element of Lp (∂Ω)) will be called the trace of the function
u ∈ W 1,p (Ω) on the boundary ∂Ω. (kγ0 ukLp (∂Ω) will be denoted by kukLp (∂Ω) .) Thus the
trace of a function is defined for any element u ∈ W 1,p (Ω).
Theorem 2.21. (Trace) Suppose ∂Ω ∈ C 1 . Then there is a unique bounded linear operator
γ0 : W 1,p (Ω) → Lp (∂Ω) such that γ0 u = u|∂Ω for u ∈ C(Ω̄)∩W 1,p (Ω), and γ0 (au) = γ0 a·γ0 u
for a(x) ∈ C 1 (Ω̄), u ∈ W 1,p (Ω). Moreover, N (γ0 ) = W01,p (Ω) and R(γ0 ) = Lp (∂Ω).
Proof. Suppose u ∈ C(Ω̄) ∩ W 1,p (Ω). Then by Theorem 2.19, u can be extended into
Ω0 (Ω ⊂⊂ Ω0 ) such that its extension U ∈ C(Ω̄0 ) ∩ W 1,p (Ω0 ). Let Uh (x) be the mollified
function for U . Since Uh → U as h → 0 in both the norms k · kC(Ω̄) , k · kW 1,p (Ω) , we find that
as h → 0, Uh |∂Ω → u|∂Ω uniformly and Uh |∂Ω → γ0 u in Lp (∂Ω). Consequently, γ0 u = u|∂Ω .
Now au ∈ W 1,p (Ω) if a ∈ C 1 (Ω̄), u ∈ W 1,p (Ω) and consequently, γ0 (au) is defined. Let
{un } ⊂ C 1 (Ω̄) with kun − uk1,p → 0. Then
γ0 (aun ) = γ0 a · γ0 un
Remark. We note that the function u ≡ 1 belongs to W 1,p (Ω) ∩ C(Ω̄) and its trace on ∂Ω is
1. Hence this function does not belong to W01,p (Ω), which establishes the earlier assertion
that W01,p (Ω) 6= W 1,p (Ω).
Let u ∈ W k,p (Ω), k > 1. Since any weak derivative Dα u of order |α| < k belongs to
W 1,p (Ω),this derivative has a trace γ0 Dα u belonging to Lp (∂Ω). Moreover
kDα ukLp (∂Ω) ≤ ckDα uk1,p ≤ ckukk,p
for constant c > 0 independent of u.
Assuming the boundary ∂Ω ∈ C 1 , the unit outward normal vector n to ∂Ω exists and
is bounded. Thus, the concept of traces makes it possible to introduce, for k ≥ 2, ∂u/∂n
for u ∈ W k,p (Ω). More precisely, for k ≥ 2, there exist traces of the functions u, Di u so
that, if ni are the direction cosines of the normal, we may define
n
X
γ1 u = (γ0 (Di u))ni , u ∈ W k,p (Ω), k ≥ 2.
i=1
The trace operator γ1 : W k,p (Ω) → Lp (∂Ω) is continuous and γ1 u = (∂u/∂n)|∂Ω for u ∈
C 1 (Ω̄) ∩ W k,p (Ω).
For a function u ∈ C k (Ω̄) we define the various traces of normal derivatives given by
∂j u
γj u = |∂Ω , 0 ≤ j ≤ k − 1.
∂nj
Each γj can be extended by continuity to all of W k,p (Ω) and we obtain the following:
Theorem 2.22. (Trace) Suppose ∂Ω ∈ C k . Then there is a unique continuous linear
operator γ = (γ0 , γ1 , . . . , γk−1 ) : W k,p (Ω) → k−1 k−1−j,p (∂Ω) such that for u ∈ C k (Ω̄)
Q
j=0 W
∂j u
|∂Ω , j = 1, . . . , k − 1.
γ0 u = u|∂Ω , γj u =
∂nj
Moreover, N (γ) = W0k,p (Ω) and R(γ) = k−1 k−1−j,p (∂Ω).
Q
j=0 W
The Sobolev spaces W k−1−j,p (∂Ω), which are defined over ∂Ω, can be defined locally.
Proof. Let {un } and {vn } be sequences of functions in C 1 (Ω̄) with kun − ukH 1 (Ω) →
0, kvn − vkH 1 (Ω) → 0 as n → ∞. Formula (2.8) holds for un , vn
Z Z Z
vn Di un dx = un vn ni dS − un Di vn dx
Ω ∂Ω Ω
and upon letting n → ∞ relation (2.8) follows.
Corollary 2.24. Let ∂Ω ∈ C 1 .
2.3. Sobolev Imbedding Theorems 39
Proof. If in (2.8) we replace u by Di u and sum from 1 to n, then Green’s 1st identity is
obtained. Interchanging the roles of u, v in Green’s 1st identity and subtracting the two
identities yields Green’s 2nd identity.
Remark. It is easy to check that the imbedding W 1,p (Ω) ⊂ Lp (Ω) is compact for all p ≥ 1
and all n.
A series of special results will be needed to prove the above theorem. Only selected
proofs will be given to illustrate some of the important techniques.
and
λp
Z Z Z
p p p
|∇uλ | dx = λ |∇u(λx)| dx = n |∇u(y)|p dy.
IRn IRn λ IRn
Inserting these inequalities into (2.12) we find
1 λ
kukLq (Rn ) ≤ C k∇ukLp (Rn )
λn/q λn/p
and so
(2.13) kukLq (Rn ) ≤ Cλ1−n/p+n/q k∇ukLp (Rn ) .
But then if 1 − n/p + n/q > 0(< 0), we can upon sending λ to 0 (∞) in (2.13) obtain a
contradiction (u = 0). Thus we must have q = p* where
np
(2.14) p* =
n−p
is called the Sobolev conjugate of p. Note that then
1 1 1
(2.15) ∗
= − , p∗ > p.
p p n
Next we prove that the inequality (2.12) is in fact correct.
2.3. Sobolev Imbedding Theorems 41
Proof. First assume p = 1. Since u has compact support, for each i = 1, . . . , n we have
Z xi
u(x) = uxi (x1 , . . . , xi−1 , yi , xi+1 , . . . , xn )dyi
−∞
and so
Z ∞
|u(x)| ≤ |∇u(x1 , . . . , yi , . . . , xn )|dyi (i = 1, . . . , n).
−∞
Consequently
n Z 1
n Y ∞
n−1
(2.17) |u(x)| n−1 ≤ |∇u(x1 , . . . , yi , . . . , xn )|dyi .
i=1 −∞
the last inequality resulting from the extended Hölder inequality in the appendix.
We continue by integrating with respect to x2 , . . . , xn and applying the extended Hölder
inequality to eventually find (pull out an integral at each step)
n Z 1
Z
n Y ∞ Z ∞
n−1
|u(x)| n−1 dx ≤ ··· |∇u|dx1 . . . dyi . . . dxn
IRn i=1 −∞ −∞
Z n
n−1
= |∇u|dx
IRn
(γuγ−1 Di u)2
γ 2 if u ≥ 0
(Di |u| ) = γ−1 2 = (γ|u|γ−1 Di u)2 .
(−γ(−u) Di u) if u ≤ 0
Z n−1 Z
γn n
|u(x)| n−1 dx ≤ |∇|u|γ |dx
IRn IRn
Z
= γ |u|γ−1 |∇u|dx
IRn
Z p−1 Z 1
p(γ−1) p p
p
≤ γ |u| p−1 dx |∇u| dx .
IRn IRn
We set
p(n − 1)
γ= >1
n−p
in which case
γn p(γ − 1) np
= = = p∗ .
n−1 p−1 n−p
Thus, the above estimate becomes
Z 1∗ Z 1
p p
p∗ p
|u| dx ≤C |∇u| dx .
IRn IRn
Theorem 2.27. Let Ω ⊂ Rn be bounded and open, with ∂Ω ∈ C 1 . Assume 1 ≤ p < n, and
∗
u ∈ W 1,p (Ω). Then u ∈ Lp (Ω) and
(2.18) kukLp∗ (Ω) ≤ CkukW 1,p (Ω)
where the constant C depends only on p,n and Ω.
Theorem 2.28. Let Ω ⊂ Rn be bounded and open. Assume 1 ≤ p < n, and u ∈ W01,p (Ω).
Then u ∈ Lq (Ω) and
kukLq (Ω) ≤ Ck∇ukLp (Ω)
for each q ∈ [1, p∗ ], the constant C depending only on p,q,n and Ω.
2.3. Sobolev Imbedding Theorems 43
Proof. Since u ∈ W01,p (Ω), there are functions um ∈ C0∞ (Ω) such that um → u in W 1,p (Ω).
We extend each function um to be 0 in Rn \Ω̄ and apply Lemma 2.26 to discover (as above)
kukLp∗ (Ω) ≤ Ck∇ukLp (Ω) .
2.3.2. Morrey’s Inequality. We now turn to the case n < p < ∞. The next result shows
that if u ∈ W 1,p (Ω), then u is in fact Hölder continuous, after possibly being redefined on
a set of measure zero.
Theorem 2.29. (Morrey’s Inequality) Assume n < p < ∞. Then there exists a constant
C, depending only on p and n, such that
(2.20) kuk 0,1− n ≤ CkukW 1,p (Rn ) , ∀ u ∈ C 1 (Rn ).
C p (Rn )
Proof. We first prove the following inequality: for all x ∈ Rn , r > 0 and all u ∈ C 1 (Rn ),
rn |Du(y)|
Z Z
(2.21) |u(y) − u(x)| dy ≤ dy.
B(x,r) n B(x,r) |x − y|n−1
To prove this, note that, for any w with |w| = 1 and 0 < s < r,
Z s
d
|u(x + sw) − u(x)| = u(x + tw)dt
dt
Z0 s
= ∇u(x + tw) · wdt
Z 0s
≤ |∇u(x + sw)| dt.
0
Multiply both sides by sn−1 and integrate over s ∈ (0, r) and we obtain (2.21).
To establish the bound on kukC 0 (Rn ) , we observe that, by (2.21), for x ∈ Rn ,
Z Z
1 1
|u(x)| ≤ |u(y) − u(x)| dy + |u(y)| dy
|B(x, 1)| B(x,1) |B(x, 1)| B(x,1)
! p−1
Z Z 1/p (1−n)p
p
To establish the bound on the semi-norm [u]γ , γ = 1 − np , take any two points x, y ∈ Rn .
Let r = |x − y| and W = B(x, r) ∩ B(y, r). Then
Z Z
1 1
(2.22) |u(x) − u(y)| ≤ |u(x) − u(z)| dz + |u(y) − u(z)| dz.
|W | W |W | W
Note that |W | = βrn , r = |x − y| and W ≤ min{ B(x,r) , B(y,r) }. Hence, using (2.21), by
R R R
Hence, by (2.22),
|u(x) − u(y)| ≤ C |x − y|γ k∇ukLp (Rn ) .
This inequality and the bound on kukC 0 above complete the proof.
Theorem 2.30. (Estimates for W 1,p , n < p ≤ ∞) Let Ω ⊂ Rn be bounded and open,
0,1− n
with ∂Ω ∈ C 1 . Assume n < p < ∞, and u ∈ W 1,p (Ω). Then u ∈ C p (Ω̄) a.e. and
kuk 0,1− n
p
≤ CkukW 1,p (Ω)
C (Ω̄)
2.3.3. General Cases. We can now concatenate the above estimates to obtain more com-
plicated inequalities.
Assume kp < n and u ∈ W k,p (Ω). Since Dα u ∈ Lp (Ω) for all |α| ≤ k, the Sobolev-
Nirenberg-Gagliardo inequality implies
kDβ ukLp∗ (Ω) ≤ CkukW k,p (Ω)
∗
if |β| ≤ k − 1, and so u ∈ W k−1,p (Ω). Moreover, kukk−1,p∗ ≤ ckukk,p . Similarly, we find
∗∗
u ∈ W k−2,p (Ω), where
1 1 1 1 2
∗∗
= ∗− = − .
p p n p n
Moreover, kukk−2,p∗∗ ≤ ckukk−1,p∗ . Continuing, we find after k steps that u ∈ W 0,q (Ω) =
Lq (Ω) for
1 1 k
= − .
q p n
The stated estimate (2.9) follows from combining the relevant estimates at each stage of
the above argument. In a similar manner the other estimates can be established.
Proof. We shall just give the proof for p = q = 2. The other cases are proved similarly.
(a) Since C01 (Ω) is dense in H01 (Ω), it suffices to show that the imbedding C01 (Ω) ⊂ L2 (Ω) is
compact. Thus, let S = {u ∈ C01 (Ω) : kuk1,2 ≤ 1}. We now show that S is totally bounded
in L2 (Ω).
For h > 0, let Sh = {uh : u ∈ S}, where uh is the mollified function for u. We claim
that Sh is totally bounded in L2 (Ω). Indeed, for u ∈ S, we have
Z
|uh (x)| ≤ ωh (z)|u(x − z)|dz ≤ (sup ωh )kuk1 ≤ c1 (sup ωh )kuk1,2
B(0,h)
and
|Di uh (x)| ≤ c2 sup |Di ωh |kuk1,2 , i = 1, . . . , n
so that Sh is a bounded and equicontinuous subset of C(Ω̄). Thus by the Ascoli Theorem,
Sh is relatively compact (and thus totally bounded) in C(Ω̄) and consequently also in L2 (Ω).
46 2. Sobolev Spaces
and
Z Z 1 2
du(x − tz)
Z
2
|u(x − z) − u(x)| dx = dt dx
Ω Ω
0 dt
Z Z 1 2
= (−∇u(x − tz) · z)dt dx
Ω 0
Z Z 1
2
≤ |z| |∇u(x − tz)| dt dx ≤ |z|2 kuk21,2 .
2
Ω 0
Consequently, kuh − uk2 ≤ h. Since we have shown above that Sh is totally bounded in
L2 (Ω) for all h > 0, it follows that S is also totally bounded in L2 (Ω) and hence relatively
compact.
(b) Suppose now that S is a bounded set in H 1 (Ω). Each u ∈ S has an extension
U ∈ H01 (Ω0 ) where Ω ⊂⊂ Ω0 . Denote by S 0 the set of all such extensions of the functions
u ∈ S. Since kU kH 1 (Ω0 ) ≤ ckuk1,2 , the set S 0 is bounded in H01 (Ω0 ). By (a) S 0 is relatively
compact in L2 (Ω0 ) and therefore S is relatively compact in L2 (Ω).
(c) Let S be a bounded set in H 1 (Ω). For any u(x) ∈ C 1 (Ω̄), the inequality (2.7) with
p = 2 yields
c1
(2.24) kuk2L2 (∂Ω) ≤ kuk22 + c2 βkuk21,2
β
where the constants c1 , c2 do not depend on u or β. By completion, this inequality is
valid for any u ∈ H 1 (Ω). By (b), any infinite sequence of elements of the set S has a
subsequence {un } which is Cauchy in L2 (Ω): given ε > 0, an N can be found such that for
all m, n ≥ N, kum − un k2 < ε. Now we choose β = ε. Applying the inequality (2.24) to
um − un , it follows that the sequence of traces {γ0 un } converges in L2 (∂Ω).
(d) By Morrey’s inequality, the imbedding is continuous if α = 1 − (n/p). Now use the
fact that C β is compact in C α if α < β.
Remarks. (a) When p = n, we can easily show that the imbedding in (a) is compact for all
1 ≤ q < ∞. Hence, it follows that the imbedding W01,p (Ω) ⊂ Lp (Ω) is compact for all p ≥ 1.
(b) The boundedness of Ω is essential in the above theorem. For example, let I = (0, 1)
and Ij = (j, j + 1). Let f ∈ C01 (I) and define fj to be the same function defined on Ij by
translation. We can normalize f so that kf kW 1,p (I) = 1. The same is then true for each fj
and thus {fj } is a bounded sequence in W 1,p (R). Clearly f ∈ Lq (R) for every 1 ≤ q ≤ ∞.
Further, if
kf kLq (R) = kf kLq (I) = a > 0
then for any j 6= k we have
Z j+1 Z k+1
kfj − fk kqLq (R) = |fj |q + |fk |q = 2aq
j k
and so fi cannot have a convergent subsequence in Lq (R). Thus none of the imbeddings
W 1,p (R) ⊂ Lq (R) can be compact. This example generalizes to n dimensional space and to
open sets like a half-space.
2.4. Equivalent Norms 47
2.4.1. Equivalent Norms of W 1,p (Ω). The next result plays a crucial part in the next
chapter.
Theorem 2.32. Let ∂Ω ∈ C 1 and let 1 ≤ p < ∞. Set
Z X n
!1/p
kuk = |Di u|p dx + (q(u))p
Ω i=1
Proof. First of all, it is easy to check that k · k defines a norm. Now by (i), it suffices to
prove that there is a positive constant c such that
(2.25) kuk1,p ≤ ckuk for all u ∈ W 1,p (Ω).
Suppose (2.25) is false. Then there exist un (x) ∈ W 1,p (Ω) such that (kun k1,p > nkun k. Set
vn = un /kun k1,p .)
(2.26) kun k1,p = 1 and 1 > nkun k.
According to Theorem 2.31, there is a subsequence, call it again {un }, which converges to u
in Lp (Ω). From (2.26) we have kun k → 0 and therefore un → u in W 1,p (Ω) and q(un ) → 0.
Hence, kuk1,p = 1 and by continuity, q(u) = 0. However, since Di u = 0 a.e. in Ω, u =
const., which must be zero by (ii) and this contradicts kuk1,p = 1.
Example 2.33. Let ∂Ω ∈ C 1 . Assume a(x) ∈ C(Ω), σ(x) ∈ C(∂Ω) with a ≥ 0 (6≡ 0), σ ≥
0 (6≡ 0). Then the following norms are equivalent to k · k1,p on W 1,p (Ω):
Z X n Z p !1/p
|Di u|p dx + udx
R
(2.27) kuk = with q(u) = Ω udx .
Ω i=1 Ω
n
Z X Z p !1/p
p
R
(2.28) kuk = |Di u| dx + γ0 udS with q(u) = ∂Ω γ0 udS .
Ω i=1 ∂Ω
n
Z X Z !1/p
p p
1/p
σ|γ0 u|p dS
R
(2.29) kuk = |Di u| dx + σ|γ0 u| dS with q(u) = .
Ω i=1 ∂Ω
48 2. Sobolev Spaces
n
Z X Z !1/p
p p
R p
1/p
(2.30) kuk = |Di u| dx + a|u| dx with q(u) = Ω a|u| dx .
Ω i=1 Ω
In order to verify condition (i) of Theorem 2.32, one uses the trace theorem in (2.28) and
(2.29).
R
2.4.2. Poincaré’s Inequalities. If u has zero mean value, i.e., Ω udx = 0, then from
(2.27) it follows that
Z n
Z X
(2.31) |u(x)|p dx ≤ c |Di u|p dx, u ∈ W 1,p (Ω)
Ω Ω i=1
2.4.3. Fourier Transform Methods. For a function u ∈ L1 (Rn ), we define the Fourier
transform of u by
Z
1
û(y) = e−ix·y u(x) dx, ∀ y ∈ Rn ,
(2π)n/2 Rn
and the inverse Fourier tranform by
Z
1
ǔ(y) = eix·y u(x) dx, ∀ y ∈ Rn .
(2π)n/2 Rn
Theorem 2.34. (Plancherel’s Theorem) Assume u ∈ L1 (Rn ) ∩ L2 (Rn ). Then û, ǔ ∈
L2 (Rn ) and
kûkL2 (Rn ) = kǔkL2 (Rn ) = kukL2 (Rn ) .
Since L1 (Rn ) ∩ L2 (Rn ) is dense in L2 (Rn ), we can use this result to extend the Fourier
transforms on to L2 (Rn ). We still use the same notations for them. Then we have
Theorem 2.35. (Property of Fourier Tranforms) Assume u, v ∈ L2 (Rn ). Then
(i) Rn uv̄ dx = Rn ûv̂¯ dy,
R R
(ii) Ddα u(y) = (iy)α û(y) for each multiindex α such that D α u ∈ L2 (Rn ),
ˇ
(iii) u = û.
Using the Fourier transform, we can also define fractional Sobolev spaces H s (Rn ) for
any 0 < s < ∞ as follows
H s (Rn ) = {u ∈ L2 (Rn ) | (1 + |y|s ) û ∈ L2 (Rn )},
and define the norm by
kukH s (Rn ) = k(1 + |y|s ) ûkL2 (Rn ) .
From this we easily get the estimate
kukL∞ (Rn ) ≤ kûkL1 (Rn )
= k(1 + |y|s )û (1 + |y|s )−1 kL1 (Rn )
≤ k(1 + |y|s )ûkL2 (Rn ) k(1 + |y|s )−2 k2L1 (Rn )
≤ C kukH s (Rn ) ,
where C = k(1 + |y|s )−2 k2L1 (Rn ) < ∞ if and only if s > n2 . Therefore we have an easy
embedding, which is known valid for integers s by the previous Sobolev embedding theorem,
H s (Rn ) ⊂ L∞ (Rn ) if s > n2 .
Chapter 3
3.1.1. Linear Elliptic Equations. Consider the (Dirichlet) boundary value problem
(BVP)
(3.1) Lu = f in Ω, u = 0 on ∂Ω.
Here f is a given function in L2 (Ω) (or more generally, an element in the dual space of
H01 (Ω)) and L is a formal second-order differential operator in divergence form
given by
n
X n
X
Lu ≡ − Di (aij (x)Dj u) + bi (x)Di u + c(x)u
i,j=1 i=1
with real coefficients aij (x), bi (x) and c(x) only in L∞ (Ω) and aij (x) = aji (x) (i, j =
1, . . . , n). Moreover, L is assumed to be uniformly elliptic in Ω, i.e., there exists a
number θ > 0 such that for every x ∈ Ω and every real vector ξ = (ξ1 , . . . , ξn ) ∈ Rn
n
X n
X
(3.2) aij (x)ξi ξj ≥ θ |ξi |2 .
i,j=1 i=1
A function u ∈ H01 (Ω) is called a weak solution of (3.1) if the following holds
Z X n n
X Z
(3.3) B1 (u, v) ≡ aij Dj uDi v + ( bi Di u + cu)v dx = f vdx, ∀ v ∈ H01 (Ω).
Ω i,j=1 i=1 Ω
Much of the material in this chapter is devoted to proving the existence of weak solutions;
the regularity problem will not be studied. Other problems can also be formulated in the
weak sense as above.
51
52 3. Existence Theory for Linear Problems
X N
X
k
b (x, u, Du) = bkl
j (x) Dj u
l
+ ckl (x) ul .
1≤j≤n, 1≤l≤N l=1
In this case, as above for single equations, we work in the Hilbert space H01 (Ω; RN ), which
has the inner product defined by
X Z
(u, v) ≡ Di uk Di v k dx.
1≤i≤n, 1≤k≤N Ω
Then a weak solution of linear system (3.4) is a function u ∈ H01 (Ω; RN ) such that
Z
(3.7) B2 (u, v) ≡ akl D
ij j ul
Di v k
+ dkl l
i u D i v k
+ bkl
j D j ul k
v + ckl l k
u v dx = hF, vi
Ω
3.1. Differential Equations in Divergence Form 53
holds for all v ∈ H01 (Ω; RN ). Here the conventional summation notation is used.
There are some ellipticity conditions for the system (3.5) in terms of the leading coeffi-
cients A(x, u, ξ). Assume A is smooth on ξ and define
∂Aki (x, u, ξ)
Akl
ij (x, u, ξ) = , ξ = (ξjl ).
∂ξjl
The system (3.5) is said to satisfy the (uniform, strict) Legendre ellipticity condition
if there exists a ν > 0 such that, for all (x, s, ξ), it holds
n X
X N
(3.8) Akl k l
ij (x, s, ξ) ηi ηj ≥ ν |η|
2
for all N × n matrix η = (ηik ).
i,j=1 k,l=1
Note that for systems with linear leading terms A given by (3.6), the Legendre condition
and Legendre-Hadamard condition become, respectively,
n X
X N
(3.10) akl k l
ij (x) ηi ηj ≥ ν |η|
2
∀ η;
i,j=1 k,l=1
n X
X N
(3.11) akl k l 2
ij (x) q q pi pj ≥ ν |p| |q|
2
∀ p, q.
i,j=1 k,l=1
Remark. The Legendre-Hadamard condition does not imply the Legendre ellipticity condi-
tion. For example, let n = N = 2 and define constants akl
ij by
2
X
akl k l 2
ij ξi ξj ≡ det ξ + |ξ| .
i,j,k,l=1
Since
2
X
akl k l 2 2 2
ij pi pj q q = det(q ⊗ p) + |q ⊗ p| = |p| |q| ,
i,j,k,=1
the Legendre-Hadamard condition holds for all > 0. But, if 0 < < 1/2, then the Legendre
ellipticity condition fails.
Exercise 1. Prove that the Legendre condition holds for this system if and only if
> 1/2.
54 3. Existence Theory for Linear Problems
Remark. Let u = (v, w) and (x1 , x2 ) = (x, y). Then the system of differential equations
defined by akl
ij given above is
Proof. For each fixed u ∈ H, the functional v 7→ B(u, v) is in H ∗ , and hence by the Riesz
Representation Theorem, there exists a unique element w = Au ∈ H such that
B(u, v) = (w, v) ∀ v ∈ H.
It can be easily shown that A : H → H is linear. From (i), kAuk2 = B(u, Au) ≤ αkukkAuk,
and hence kAuk ≤ αkuk for all u ∈ H; that is, A is bounded. Furthermore, by (ii),
βkuk2 ≤ B(u, u) = (Au, u) ≤ kAukkuk and hence kAuk ≥ βkuk for all u ∈ H. By the
Riesz Representation Theorem again, we have a unique w0 ∈ H such that hf, vi = (w0 , v)
for all v ∈ H and kf k = kw0 k. We will show that the equation Au = w0 has a (unique)
solution. There are many different proofs for this, and here we use the Contraction Mapping
Theorem. Note that the solution u to equation Au = w0 is equivalent to the fixed-point
of the map T : H → H defined by T (v) = v − tAv + tw0 (v ∈ H) for any fixed t > 0. We
will show for t > 0 small enough T is a contraction. Note that for all v, w ∈ H we have
kT (v) − T (w)k = k(I − tA)(v − w)k. We compute that for all u ∈ H
k(I − tA)uk2 = kuk2 + t2 kAuk2 − 2t(Au, u)
≤ kuk2 (1 + t2 α2 − 2βt).
3.3. Gårding’s Estimates and Existence Theory 55
We now choose t such that 0 < t < α2β2 . Then the expression in parentheses is positive and
less than 1. Thus the map T : H → H is a contraction on H and therefore has a fixed point.
This fixed point u solves Au = w0 and thus is the unique solution of (3.12); moreover, we
have kf k = kw0 k = kAuk ≥ βkuk and hence kuk ≤ β1 kf k. The proof is complete.
For the bilinear forms B1 , B2 defined above, one can easily show the boundedness:
|Bj (u, v)| ≤ αkukkvk
for all u, v in the respective Hilbert spaces H = H01 (Ω) or H = H01 (Ω; RN ) for j = 1, 2.
The strong positivity (also called coercivity) for both B1 and B2 is not always guaranteed
and involves estimating on the quadratic form Bj (u, u); this is usually called Gårding’s
estimates. We will derive these estimates for both of them and state the corresponding
existence theorems below.
Proof. Take γ from (3.13), let λ ≥ γ and define the bilinear form
B λ (u, v) ≡ B1 (u, v) + λ(u, v)2 for all u, v ∈ H
which corresponds to the operator Lu + λu. Then B λ (u, v) satisfies the hypotheses of the
Lax-Milgram Theorem.
Example 3.4. Consider the Neumann boundary value problem
∂u
(3.14) −∆u(x) = f (x) in Ω, = 0 on ∂Ω.
∂ν
A function u ∈ H 1 (Ω) is said to be a weak solution to (3.14 if
Z Z
(3.15) ∇u · ∇v dx = f v dx, ∀ v ∈ H 1 (Ω).
Ω Ω
Obviously,
R taking v ≡ 1 ∈ H 1 (Ω), a necessary condition to have a weak solution is
Ω f (x) dx = 0. We show this is also a sufficient condition for existence of the weak so-
lutions. Note that, if u is a weak solution, then u + c, for all constants c, is also a weak
solution. Therefore, to fix the constants, we consider the vector space
Z
H = u ∈ H 1 (Ω)
u(x) dx = 0
Ω
equipped with inner product
Z
(u, v)H = ∇u · ∇v dx.
Ω
By the theorem on equivalent norms, it follows that H with this inner product, is indeed a
Hilbert space, and (f, u)L2 (Ω) is a bounded linear functional on H:
|(f, u)L2 (Ω) | ≤ kf kL2 (Ω) kvkL2 (Ω) ≤ kf kL2 (Ω) kvkH .
Hence the Riesz Representation Theorem implies that there exists a unique u ∈ H such
that
(3.16) (u, w)H = (f, w)L2 (Ω) , ∀ w ∈ H.
It follows that u is a weak solutionR to the Neumann problem since for any Rv ∈ H 1 (Ω) we
1
take w = v − c ∈ H, where c = |Ω| Ω vdx, in (3.16) and obtain (3.15) using Ω f dx = 0.
Example 3.5. Let us consider the nonhomogeneous Dirichlet boundary value problem
(3.17) −∆u = f in Ω, u|∂Ω = ϕ
where f ∈ L2 (Ω) and ϕ is the trace of a function w ∈ H 1 (Ω). Note that it is not sufficient
to just require that ϕ ∈ L2 (∂Ω) since the trace operator is not onto. If, for example,
ϕ ∈ C 1 (∂Ω), then ϕ has a C 1 extension to Ω̄, which is the desired w.
The function u ∈ H 1 (Ω) is called a weak solution of (3.17) if u − w ∈ H01 (Ω) and if
Z Z
∇u · ∇vdx = f vdx for all v ∈ H01 (Ω).
Ω Ω
Let u be a weak solution of (3.17) and set u = z + w. Then z ∈ H01 (Ω) satisfies
Z Z
(3.18) ∇z · ∇vdx = (f v − ∇v · ∇w)dx for all v ∈ H01 (Ω).
Ω Ω
Since the right hand side belongs to the dual space H01 (Ω)∗ , the Lax-Milgram theorem yields
the existence of a unique z ∈ H01 (Ω) which satisfies (3.18). Hence (3.17) has a unique weak
solution u.
3.3. Gårding’s Estimates and Existence Theory 57
Example 3.6. Now let us consider the boundary value (also called Dirichlet) problem for
the fourth order biharmonic operator:
∂u
∆2 u = f in Ω, u|∂Ω = |∂Ω = 0.
∂n
We take H = H02 (Ω). By the general trace theorem, H = H02 (Ω) = {v ∈ H 2 (Ω) : γ0 v =
γ1 v = 0}. Therefore, this space H is the right space for the boundary conditions.
Accordingly, for f ∈ L2 (Ω), a function u ∈ H = H02 (Ω) is a weak solution of the
Dirichlet problem for the biharmonic operator provided
Z Z
∆u∆vdx = f vdx ∀ v ∈ H.
Ω Ω
where f ∈ C(Ω̄).
(c) If u ∈ C 2 (Ω̄) satisfies the equation in (b), find the underlying BVP.
Hence, deduce that k∆uk2 defines a norm on H02 (Ω) which is equivalent to the usual norm
on H02 (Ω).
Exercise 3. Let Ω = (1, +∞). Show that the BVP −u00 = f ∈ L2 (Ω), u ∈ H01 (Ω) does
not have a weak solution.
58 3. Existence Theory for Linear Problems
3.3.2. Theory for B2 . We will derive the Gårding estimates for B2 (u, u). For simplicity,
let H = H01 (Ω; RN ) and let (u, v)H and kukH be the inner product and norm defined above
on H. Define the bilinear form of the leading terms by
X n X N Z
A(u, v) = akl l k
ij (x) Dj u Di v dx.
i,j=1 k,l=1 Ω
Proof. In the first case, the conclusion follows easily from the Legendre condition. We
prove the second case when Aklij are constants satisfying the Legendre-Hadamard condition
n X
X N
akl k l 2 2
ij q q pi pj ≥ ν |p| |q| , ∀p ∈ Rn , q ∈ RN .
i,j=1 k,l=1
We prove
n X
X N Z Z
A(u, u) = akl l k
ij Dj u Di u dx ≥ ν |Du|2 dx
i,j=1 k,l=1 Ω Ω
for all u ∈ C0∞ (Ω; RN ). For these test functions u, we extend them onto Rn by zero outside
Ω and thus consider them as functions in C0∞ (Rn ; RN ). Define the Fourier transforms
for such functions u by
Z
−n/2
û(y) = (2π) e−i y·x u(x) dx; y ∈ Rn .
Rn
Then, for any u, v ∈ C0∞ (Rn ; RN ),
Z Z
u(x) · v(x) dx = û(y) · v̂(y) dy,
Rn Rn
D[ k ck
j u (y) = i yj u (y);
the last identity can also be written as Du(y) c = i û(y) ⊗ y. Now, using these identities, we
have Z Z
akl
ij D i uk
(x) D j ul
(x) dx = akl [k [l
ij Di u (y) Dj u (y) dy
n n
Z R RZ
kl k l kl k l
= aij yi yj u (y) u (y) dy = Re
c b aij yi yj u (y) u (y) dy .
c b
Rn Rn
Write û(y) = η + iξ with η, ξ ∈ RN . Then
Re u (y) u (y) = η k η l + ξ k ξ l .
ck bl
i,j=1 k,l=1
Hence,
n X
X N Z
A(u, u) = akl k l
ij Di u (x) Dj u (x) dx
n
i,j=1 k,l=1 R
3.3. Gårding’s Estimates and Existence Theory 59
n X
X N Z
= Re akl
ij yi yj
ck (y) ubl (y) dy
u
i,j=1 k,l=1 Rn
Z Z
2 2
≥ν |y| |û(y)| dy = ν |iû(y) ⊗ y|2 dy
Rn n
Z ZR
2
=ν |Du(y)|
c dy = ν |Du(x)|2 dx.
Rn Rn
The proof is complete.
Theorem 3.8. Let B2 (u, v) be defined by (3.7). Assume
1) akl
ij ∈ C(Ω̄),
2) the Legendre-Hadamard condition holds for all x ∈ Ω; that is,
akl k l 2 2
ij (x) q q pi pj ≥ ν |p| |q| , ∀p ∈ Rn , q ∈ RN .
3) bkl kl kl ∞
i , c , di ∈ L (Ω).
Then, there exist constants λ0 > 0 and λ1 ≥ 0 such that
B2 (u, u) ≥ λ0 kuk2H − λ1 kuk2L2 , ∀ u ∈ H01 (Ω; RN ).
m=1
and each term (no summation on m)
akl 2 k l kl k l
ij (x) ϕm Di u Dj u = aij (x) Di (ϕm u ) Dj (ϕm u )
60 3. Existence Theory for Linear Problems
−akl
ij (x) ϕ m Di ϕm ul
D i uk
+ ϕ m D i ϕm uk
Dj ul
+ D i ϕ m D j ϕ m uk l
u .
Since ϕm u ∈ C0∞ (Ω ∩ B/2 (xm ); RN ) and diam(Ω ∩ B/2 (xm )) ≤ , we have by (3.19)
Z Z
ν
akl
ij (x) D (ϕ
i m uk
) D j (ϕ m ul
) dx ≥ |D(ϕm u)|2 dx.
Ω 2 Ω
Note also that
|D(ϕm u)|2 = ϕ2m |Du|2 + |Dϕm |2 |u|2 + 2 ϕm Di ϕm uk Di uk .
Therefore, we have by (3.20) and the fact that M 2
P
m=1 ϕm = 1 on Ω,
Z
akl k l
ij (x) Di u Dj u dx
Ω
Z
ν
≥ |Du|2 dx − C1 kukL2 kDukL2 − C2 kuk2L2 .
2 Ω
The terms in B2 (u, u) involving b, c and d can be estimated by kukL2 kDukL2 and kuk2L2 .
Finally, by all of these estimates and the inequality
1 2
ab ≤ a2 + b
4
we have B2 (u, u) ≥ λ0 kuk2H −λ1 kuk2L2 for all u ∈ H01 (Ω; RN ). This completes the proof.
Note that the bilinear form B λ (u, v) = B2 (u, v) + λ (u, v)L2 satisfies the condition of
the Lax-Milgram theorem on H = H01 (Ω; RN ) for all λ ≥ λ1 ; thus, by the Lax-Milgram
theorem, we easily obtain the following existence result.
Theorem 3.9. Under the hypotheses of the previous theorem, for λ ≥ λ1 , the Dirichlet
problem for the linear system
(3.21) − div(A(x, u, Du)) + b(x, u, Du) + λu = F, u|∂Ω = 0
has a unique weak solution u in H01 (Ω; RN )
for any bounded linear functional F on H.
Moreover, the solution u satisfies kukH ≤ C kF k with a constant C depending on λ and the
L∞ -norms of the coefficients of linear terms A(x, s, ξ) and b(x, s, ξ) given above.
Corollary 3.10. Given λ ≥ λ1 as in the theorem, the operator K : L2 (Ω; RN ) → L2 (Ω; RN ),
where, for each F ∈ L2 (Ω; RN ), u = KF is the unique weak solution to the BVP (3.21)
above, is a compact linear operator.
Proof. By the theorem, kukH01 (Ω;RN ) ≤ CkF kL2 (Ω;RN ) . Hence K is a bounded linear opera-
tor from L2 (Ω; RN ) to H01 (Ω; RN ), which, by the compact embedding theorem, is compactly
embedded in L2 (Ω; RN ). Hence, as a linear operator from L2 (Ω; RN ) to L2 (Ω; RN ), K is
compact.
Here akl ∞
ij (x) and c(x) are given functions in L (Ω).
The bilinear form associated to L is
Z XN X n
(3.22) B(u, v) = akl l k
ij (x)Dj u Di v + c(x)u · v
dx, u, v ∈ H01 (Ω; RN ).
Ω k,l=1 i,j=1
In order for B to be symmetric on H01 (Ω; RN ), that is, B(u, v) = B(v, u) for all u, v ∈
H01 (Ω; RN ), we need the following symmetry condition:
(3.23) akl lk
ij (x) = aji (x), ∀ i, j = 1, 2, · · · , n; k, l = 1, 2, · · · , N.
We also assume the Gårding inequality holds (see Theorem 3.8 for sufficient conditions):
(3.24) B(u, u) ≥ σkuk2H 1 − µkuk2L2 , ∀ u ∈ H01 (Ω; RN ),
0
3.4.2. The Compact Inverse. For each F ∈ L2 (Ω; RN ), define u = KF to be the unique
weak solution in H01 (Ω; RN ) of the BVP
Lu + µu = F in Ω, u|∂Ω = 0.
By Theorem 3.9 and Corollary 3.10, this K is well defined and is a compact linear operator
on L2 (Ω; RN ). Sometime, we write K = (L+µI)−1 . Here I denotes the identity on L2 (Ω; RN )
and also the identity embedding of H01 (Ω; RN ) into L2 (Ω; RN ).
Theorem 3.11. K : L2 (Ω; RN ) → L2 (Ω; RN ) is symmetric and positive; that is,
(KF, G)L2 = (KG, F )L2 , (KF, F )L2 ≥ 0, ∀ F, G ∈ L2 (Ω; RN ).
Furthermore, given λ ∈ R and F ∈ L2 (Ω; RN ), u ∈ H01 (Ω; RN ) is a weak solution of
Lu − λu = F if and only if [I − (λ + µ)K]u = KF.
where Gi = Kvi ; that is, Gi is the unique weak solution to the BVP: LGi + µGi = vi , in
H01 (Ω; RN ).
Lu − λu = 0, u|∂Ω = 0
has nontrivial weak solutions in H01 (Ω; RN ); these nontrivial solutions are called the eigen-
functions corresponding to eigenvalue λ.
From Theorem 3.11, we see that λ is an eigenvalue of L if and only if equation (I − (λ +
1
µ)K)u = 0 has nontrivial solutions u ∈ L2 (Ω; RN ); this exactly says that λ 6= −µ and λ+µ
is an eigenvalue of operator K. Since, by (3.25), K is strictly positive, all eigenvalues of K
consist of a countable set of positive numbers tending to zero and hence the eigenvalues of
L consist of a set of numbers {λj }∞j=1 with −µ < λ1 ≤ λ2 ≤ · · · ≤ λj → ∞.
Theorem 3.12. (Eigenvalue Theorem) Assume (3.23) and (3.24) with µ = 0. Then the
eigenvalues of L consist of a countable set Σ = {λk }∞
k=1 , where
0 < λ1 ≤ λ2 ≤ λ3 ≤ · · ·
and
lim λk = ∞.
k→∞
Note that
Proof. The first part of the theorem relies on the characterization (3.27) and a maximum
principle which we do not study here. We prove only the second part. Let w be another
eigenfunction. Then, either w(x) > 0 for all x ∈ Ω or w(x) < 0 for all x ∈ Ω. Let t ∈ R be
such that
Z Z
w(x) dx = t w1 (x)dx.
Ω Ω
Remark. Why is the eigenvalue problem important? In one specific case, one can use
eigenvalues and eigenfunctions to study some evolution (i.e. time-dependent) problems. For
example, to solve the initial boundary value problem (IBVP) for the time-dependent
parabolic equation:
(3.28) ut + Lu = 0, u|∂Ω = 0, u(x, 0) = ϕ(x),
one can try to find the special solutions of the form: u(x, t) = e−λt w(x); this reduces to the
eigen-problem: Lw = λw. Therefore, for each pair (λi , wi ) of eigenvalue and eigenfunction,
one obtains a special solution to the evolution equation given by ui (x, t) = e−λi t wi (x). Then
one proceeds to solve the IBVP (3.28) by finding the solution of the form
X
u(x, t) = ai e−λi t wi (x),
i
where ai is determined by the eigen-expansion of the initial data ϕ:
X
ϕ(x) = ai wi (x).
i
However, this course does not study the parabolic or other time-dependent problems.
Chapter 4
Variational Methods
for PDEs
where A[u] denotes a given PDE for unknown u and B[u] is a given boundary value condition.
There is, of course, no general theory for solving such problems.
The Calculus of Variations identifies an important class of problems which can be solved
using relatively simple techniques from nonlinear functional analysis. This is the class of
variational problems, where the operator A[u] can be formulated as the first variation
(“derivative”) of an appropriate “energy” functional I(u) on a Banach space X; that is,
A[u] = I 0 (u). In this way, A : X → X ∗ and the equation A[u] = 0 can be formulated weakly
as
hI 0 (u), vi = 0, ∀ v ∈ X.
The advantage of this new formulation is that solving problem (4.1) (at least weakly) is
equivalent to finding the critical points of I on X. The minimization method for a
variational problem is to solve the problem by finding the minimizers of the related energy
functional. In this chapter and the next, we shall only study the variational problems on
the Sobolev space X = W 1,p (Ω; RN ).
We should also mention that many of the physical laws in applications arise directly as
variational principles. However, although powerful, not all PDE problems can be formulated
as variational problems; there are lots of other (nonvariational) important methods for
studying PDEs.
65
66 4. Variational Methods for PDEs
The coupled differential system in divergence form for u is called the system of Euler-
Lagrange equations for the functional I(u).
where ζ ∈ C0∞ (Ω) is a given scalar test function. Note that Di ϕk (x) = ρ0 ( x·p k
)pi q ζ + O()
as → 0+ . Substitute this ϕ into (4.6) and let → 0+ and we obtain
Z Xn X N
Fξk ξl (x, u, Du) pi pj q k q l ζ 2 dx ≥ 0.
i j
Ω i,j=1 k,l=1
4.2.3. Ellipticity Conditions and Convexities. We now consider the ellipticity of the
Euler-Lagrange equation (4.4), where A(x, s, ξ), b(x, s, ξ) are given by (4.5) and F (x, s, ξ) is
C 2 in ξ. In this case, the Legendre ellipticity condition (3.8) and the Legendre-Hadamard
condition (3.9) defined in the previous chapter reduce to, respectively:
Obviously, (4.10) implies (4.11). We also have the following equivalent conditions.
Lemma 4.1. Let F be C 2 in ξ. Then, the conditions (4.10) and (4.11) are equivalent to
the following conditions, respectively:
ν
(4.12) F (x, s, η) ≥ F (x, s, ξ) + Fξk (x, s, ξ) (ηik − ξik ) + |η − ξ|2 ;
i 2
ν 2 2
(4.13) F (x, s, ξ + q ⊗ p) ≥ F (x, s, ξ) + Fξk (x, s, ξ) pi q k + |p| |q|
i 2
for all x ∈ Ω, s, q ∈ RN , ξ, η ∈ MN ×n and p ∈ Rn .
From this and the Taylor formula, inequalities (4.12) and (4.13) are equivalent to (4.10)
and (4.11), respectively.
Remark. Rank-one convexity does not imply convexity. For example, take n = N ≥ 2, and
F (ξ) = det ξ. Then F (ξ) is rank-one convex but not convex in ξ. (Exercise!) Later on, we
will study other convexity conditions related to the energy functionals given by (4.2).
Proof. Given u, v ∈ X = W 1,p (Ω; RN ), let h(t) = I(u + tv). Then, by (4.15), h is finite
valued, and for t 6= 0 we have
h(t) − h(0) F (x, u + tv, Du + tDv) − F (x, u, Du)
Z Z
= dx ≡ F t (x) dx,
t Ω t Ω
where for almost every x ∈ Ω,
Z t
t 1 1 d
F (x) = [F (x, u + tv, Du + tDv) − F (x, u, Du)] = F (x, u + sv, Du + sDv) ds.
t t 0 ds
Clearly,
lim F t (x) = Fξk (x, u, Du) Di v k (x) + Fsk (x, u, Du) v k (x) a.e.
t→0 i
We also write
1 th
Z i
F t (x) = Fξk (x, u + sv, Du + sDv) Di v k (x) + Fsk (x, u + sv, Du + sDv) v k (x) ds.
t 0 i
ap bq 1 1
Using conditions (4.16), (4.17), and Young’s inequality: ab ≤ p + q , where p + q = 1, we
obtain that, for all 0 < |t| ≤ 1,
|F t (x)| ≤ C1 (|Du|p + |Dv|p + |u|p + |v|p ) + C2 (x), C2 ∈ L1 (Ω).
4.2. Multiple Integrals in the Calculus of Variations 69
Dirichlet Classes. Given ϕ ∈ W 1,p (Ω; RN ), we define the Dirichlet class of ϕ to be the set
Dϕ = {u ∈ W 1,p (Ω; RN ) | u − ϕ ∈ W01,p (Ω; RN )}.
4.2.6. Minimality and Uniqueness of Weak Solutions. We study the weak solutions
of Euler-Lagrange equations under the hypotheses of convexity and certain growth condi-
tions. For this purpose, we consider a simple case where F (x, s, ξ) = F (x, ξ) satisfies, for
some 1 < p < ∞,
Theorem 4.5. Let F (x, ξ) be C2and convex in ξ. Let u ∈ W 1,p (Ω; RN ) be a weak solution
of the Euler-Lagrange equation of I and I(u) < ∞. Then u must be a minimizer of I in
the Dirichlet class Du of W 1,p (Ω; RN ). Furthermore, if F satisfies the Legendre condition
(4.10) (with ν > 0), then u is the unique minimizer of I in Du .
where ν = 0 if F is only convex in ξ and ν > 0 if F satisfies the Legendre condition. This
implies
Z Z Z
F (x, Dv) dx ≥ F (x, Du) dx + Fξk (x, Du) Di (v k − uk ) dx
i
Ω Ω Ω
Z
ν
+ |Du − Dv|2 dx.
2 Ω
Since u is a weak solution, we have
Z
Fξk (x, Du) Di (v k − uk ) dx = 0.
i
Ω
Therefore, it follows that
Z
ν
(4.23) I(v) ≥ I(u) + |Du − Dv|2 dx ≥ I(u)
2 Ω
Proof. This theorem is a special case of Theorem 1.37 under assumption (ii) there; the proof
involves the basic ideas of what is known as the direct method of calculus of variations. We
explain this method by giving a proof of this theorem. First of all, by the weak coercivity,
one can easily show that inf u∈C I(u) is finite; so take a sequence {uν }, called a minimizing
sequence, such that
lim I(uν ) = inf I(u).
ν→∞ u∈C
Then the weak coercivity condition implies that {uν } must be bounded in X. Since 1 <
p < ∞ and thus X = W 1,p (Ω; RN ) is reflexive, there exists a subsequence of {uν }, denoted
4.3. Direct Method for Minimization 71
by {uνj }, and u0 ∈ X such that uνj * u0 weakly in X. The weak closedness of C implies
u0 ∈ C. Now the w.l.s.c. of I implies
I(u0 ) ≤ lim inf I(uνj ) = inf I(u).
j→∞ u∈C
Exercise 1. Let n ≥ 3 and 2 ≤ p < n. Show the theorem is valid if (4.27) above is
replaced by
|f (x, s)| ≤ a(x) + b |s|q ,
72 4. Variational Methods for PDEs
q+1
where b ≥ 0 is a constant, a ∈ L q (Ω) and 1 ≤ q < p∗ − 1.
4.4.1. Nonlinear Eigenvalue Problems. The following theorem was proved in the Pre-
liminaries.
Theorem 4.8. Let X be a Banach space. Let f, g : X → R be of class C 1 and g(u0 ) = c.
Assume u0 is a local extremum of f with respect to the constraint g(u) = c. Then either
g 0 (u0 )v = 0 for all v ∈ X, or there exists λ ∈ R such that f 0 (u0 )v = λg 0 (u0 )v for all v ∈ X;
that is, u0 is a critical point of f − λg.
where α = inf l(x). Hence kuk1 is uniformly bounded if g(u) ≤ const. Furthermore, by
Ehrling’s inequality, for each ε > 0, there is an absolute constant c(ε) > 0 such that
Thus
1 1
f (u) ≥ (1 − ε max |k|)kuk21,2,0 − c(ε) max |k|kuk21 .
2 2
If we choose ε small enough so that 1 − ε max |k| > 0, then f (u) → ∞ as kuk1,2,0 → ∞ with
g(u) ≤ const.
It easily follows that g 0 (u) = 0 iff u = 0. Moreover, since the range of the functional
g(u) is [0, ∞), we can apply Theorem 4.8 to conclude that for each R ∈ (0, ∞) there exists
a nontrivial weak solution (uR , λR ) of (4.31) with g(uR ) = R. Moreover, if βR = f (uR ) =
min f (u) subject to g(u) = R, then it is easily seen that λR = (τ2β R
+1)R .
Corollary 4.10. For 1 < τ < (n + 2)/(n − 2) there exists a nontrivial weak solution of
∆v + |v|τ −1 v = 0 in Ω, v|∂Ω = 0
Remarks. (i) Another method to show the existence of nontrivial weak solution to (4.32) is
given later by the Mountain Pass Theorem.
n+2
(ii) The nonexistence of nontrivial classical solutions to problem (4.32) when τ ≥ n−2
will be studied later for certain domains with simple topological property.
(iii) For domains like annulus, problem (4.32) always has nontrivial solutions for all
τ > 1. See the following exercise.
Exercise 1. Let Ω be the annulus 0 < a < r < b, r = |x| and suppose τ > 1. Prove
that the BVP
∆u + |u|τ −1 u = 0 in Ω, u|∂Ω = 0
has a nontrivial solution.
Hint: Minimize the functional
Z b
f (u) = (u0 )2 rn−1 dr, r = |x|
a
4.4.2. Harmonic Maps and Liquid Crystals. We now consider the Dirichlet energy
Z
1
I(u) = |Du|2 dx
2 Ω
for u ∈ H 1 (Ω; RN ) with point-wise constraint |u(x)| = 1 for almost every x ∈ Ω. Let
C = {u ∈ Dϕ | |u(x)| = 1 a.e.},
where Dϕ is a Dirichlet class. We assume C is non-empty. Then C is weakly closed in
H 1 (Ω; RN ). We have the following result.
Theorem 4.11. There exists u ∈ C satisfying
I(u) = min I(v).
v∈C
Moreover, u satisfies
Z Z
(4.33) Du · Dv dx = |Du|2 u · v dx
Ω Ω
for each v ∈ H01 (Ω; RN ) ∩ ∞
L (Ω; RN ).
Remark. In this case, we see u is a weak solution to the harmonic map equation:
−∆u = |Du|2 u in Ω.
The Lagrange multiplier in this case is the function λ = |Du|2 , instead of a constant.
Proof. The existence of minimizers follows by the direct method as above. Given any
v ∈ H01 (Ω; RN ) ∩ L∞ (Ω; RN ), let be such that ||kvkL∞ (Ω) ≤ 12 . Define
u(x) + v(x)
w (x) = , h() = I(w ).
|u(x) + v(x)|
Note that w ∈ C and h(0) = I(u) = minC I ≤ h() for sufficiently small ; hence, h0 (0) = 0.
Computing this h0 (0) and using the fact (Du)T u = 0 (which follows from |u| = 1) yield the
weak form of harmonic equation (4.33).
Liquid Crystals. Let n = N = 3. Then the harmonic map Dirichlet energy can be considered
as a special case of the Oseen-Frank energy for liquid crystals defined earlier:
Z Z
I(u) = F (u, Du) dx = (κ1 (div u)2 + κ2 (u · curl u)2 + κ3 (u × curl u)2 ) dx
Ω
ZΩ
+ κ4 (tr((Du)2 ) − (div u)2 ) dx,
Ω
where Ω is a bounded domain in R3 ,
u : Ω → R3 , div u = trDu is the divergence of u and
curl u = ∇ × u denotes the curl vector of u in R3 . In the first part of the total energy
I(u), the κ1 -term represents the splay energy, κ2 -term represents the twist energy and
κ3 -term represents the bending energy, corresponding to the various deformations of the
nematic director u with |u(x)| = 1; the κ4 -term is a null-Lagrangian, depending only
on the boundary data of u (see Chapter 5).
From the algebraic relation |Du − (Du)T |2 = 2| curl u|2 one easily sees that
|Du|2 = | curl u|2 + tr((Du)2 ).
Furthermore, since |u(x)| = 1, it easily follows that
| curl u|2 = (u · curl u)2 + |u × curl u|2 .
4.4. Minimization with Constraints 75
If κ > 0, then it can be shown that the first part of F (u, ξ) is convex and quadratic in ξ, and
hence the first part of the energy I is w.l.s.c. on H 1 (Ω; R3 ) by Tonelli’s theorem (Theorem
5.1 of Chapter 5). By (4.34), one can easily obtain that
Z
|Du(x)|2 dx ≤ c0 I(u) + Cϕ , ∀ u ∈ Dϕ ⊂ H 1 (Ω; R3 ).
Ω
Therefore, by the direct method, we have established the following existence result for
minimizers.
1
Remark. If all four κi are
R equal2 to 2 , then I(u) reduces to the Dirichlet integral for har-
1
monic maps: I(u) = 2 Ω |Du| dx.
4.4.3. Stokes’ Problem. Let Ω ⊂ R3 be open, bounded and simply connected. Given
f ∈ L2 (Ω; R3 ), the Stokes’ problem
det Du(x) = 1 ∀ x ∈ Ω.
The suitable space to work in this case is W 1,p (Ω; R3 ) with p ≥ 3. For further details, see
next chapter.
76 4. Variational Methods for PDEs
4.4.5. A Nonlocal Problem: Ferromagnetism. (See James & Kinderlehrer [23], and
Pedregal & Yan [34] for more results.)
We study a minimization problem in ferromagnetism which involves a constraint and
a nonlocal term. After some simplifications, the total energy I(m) in ferromagnetism is
given by
Z Z
1
(4.35) I(m) = ϕ(m(x)) dx + |∇u(x)|2 dx,
Ω 2 Rn
Proof. Let
Z
1
X = {u ∈ Hloc (Rn ) | 2 n
∇u ∈ L (R ; R ),n
u(x) dx = 0}.
B
Given m ∈ L2 (Ω; Rn ), define a functional J : X → R by
Z Z
1
J(u) = |∇u|2 dx − m · ∇u dx, u ∈ X.
2 Rn Ω
We solve the minimization problem: inf u∈X J(u). Note that X is simply a nonempty set
of functions and has no topology defined. But we can always take a minimizing sequence
uk ∈ X such that
lim J(uk ) = inf J(u) < ∞.
k→∞ u∈X
By Cauchy’s inequality with , it follows that
Z Z Z
1 2 2
J(u) ≥ |∇u| dx − |∇u| dx − C |m|2 dx.
2 Rn Rn Ω
1
Taking = 4 yields
Z
1
J(u) ≥ |∇u|2 dx − C, ∀ u ∈ X.
4 Rn
Therefore {∇uk } is bounded in L2 (Rn ; Rn ) and hence, via a subsequence, converges weakly
to some F ∈ L2 (Rn ; Rn ); this F can be written as F = ∇ū for a unique u ∈ X. (The
last fact needs a little more analysis; see the Hodge decompositions below!) Using this
4.4. Minimization with Constraints 77
where ν is the unit outward normal on the boundary and the formula is valid on each piece
Ωj of Ω̄ where mk is constant m0 or −m0 ; hence div mk = 0 on each Ωj . Moreover, mk ·ν = 0
on ∂Ωj \ ∂Ω. Hence we have
Z Z
|uk (x)| dS ≤ |∂Ω|1/2 kuk kL2 (∂Ω) → 0 as k → ∞.
mk · ∇uk dx ≤
Ω ∂Ω
Proof. Suppose m̄ is a minimizer. Then I(m̄) = inf I = 0 and hence ϕ(m̄(x)) = 0 and
T m̄ = 0. Therefore,
m̄(x) = χE (x)m0 + (1 − χE (x))(−m0 ), div(m̄χΩ ) = 0,
where E = {x ∈ Ω | m̄(x) = m0 } is a measurable subset of Ω. Therefore
Z
(4.39) f (x)∇ζ(x) · m0 dx = 0 ∀ ζ ∈ C0∞ (Rn ),
Rn
where f (x) = χΩ (x)[2χE (x) − 1] = 2χE (x) − χΩ (x). Let x = x0 + tm0 where x0 ⊥ m0 and
write f (x) = g(x0 , t). In (4.39), by change of variables, we have
Z
g(x0 , t)ζt (x0 , t) dx0 dt = 0 ∀ ζ ∈ C0∞ (Rn ).
Rn
This implies the weak derivative gt (x0 , t) ≡ 0 on Rn , and hence g(x0 , t) = h(x0 ) is indepen-
dent of t. But h(x0 ) = f (x0 + tm0 ) vanishes for large t and hence h ≡ 0. So f ≡ 0 on Rn .
However f (x) ∈ {±1} for x ∈ Ω. This is a contradiction.
4.4.6. The Hodge Decompositions on Rn . We now present a short theory for Hodge
decompositions on Rn with elementary proofs. See [38] for some related results.
Let X = L2 (Rn ; Rn ) denote the Hilbert space with the inner product and norm defined
by Z Z
1 1 n n
(u, v) = (u v + · · · + u v ) dx = u · v dx; kuk = (u, u)1/2 .
Rn Rn
For u ∈ L2 (Rn ; Rn ), we define div u, curl u = (curl u)ij as distributions as follows:
Z Z
1 n
hdiv u, ϕi = − (u ϕx1 + · · · + u ϕxn ) dx = − u · ∇ϕ dx;
Rn Rn
Z
h(curl u)ij , ϕi = − (ui ϕxj − uj ϕxi ) dx, ∀ ϕ ∈ C0∞ (Rn ).
Rn
1,1
Note that, if u ∈ Wloc (Rn ; Rn ), then div u = tr Du and curl u = Du − (Du)T . In the case
n = 2 or n = 3, the operator curl u can be identified as follows:
curl u ≈ ∇⊥ · u = ∇ · u⊥ = div(u⊥ ) = u1x2 − u2x1 (n = 2);
curl u ≈ ∇ × u = (u3x2 − u2x3 , u1x3 − u3x1 , u2x1 − u1x2 ) (n = 3).
4.4. Minimization with Constraints 79
Note that in the sense of distribution, the Laplacian of m ∈ X (for each component of m)
can be written as
(4.40) ∆m = ∇(div m) + div(curl m),
which means
n
X
i
∆m = ∂xi (div m) + ∂xj (curl m)ij ∀ i = 1, 2, · · · , n.
j=1
Proof. Let m ∈ Xdiv ∩ Xcurl . Then curl m = div m = 0 in the sense of distributions. Hence,
by (4.40) above, ∆m = 0 holds also in the sense of distributions. Hence m ∈ C ∞ (Rn ; Rn )
and each of its components mi is a harmonic function in Rn . Sine each mi also belongs to
L2 (Rn ), the mean value property and Hölder’s inequality imply that
Z
1
|mi (x)| ≤ |mi | dy ≤ c R−n/2 kmk
|BR (x)| BR (x)
for any x ∈ Rn and R > 0, where BR (x) = {y ∈ Rn | |y − x| < R} denotes the ball of radius
R and center x. Letting R → ∞ shows mi = 0 and hence m = 0.
To characterize the spaces Xdiv and Xcurl and prove the Hodge decomposition theorem,
we introduce the linear function spaces:
Y = {f ∈ L2loc (Rn ) | ∇f ∈ X}
and, for n = 3,
M = {m ∈ L2loc (R3 ; R3 ) | ∇ × m ∈ L2 (R3 ; R3 )}.
It is easy to see that ∇f ∈ Xcurl for all f ∈ Y and, for n = 3, ∇ × m ∈ Xdiv for all m ∈ M.
The converse is also true; we have the following results.
Theorem 4.18. There exists a uniform constant Cn > 0 such that, for each v ∈ Xcurl ,
there exists a f ∈ Y satisfying that
Z Z
1 2
(4.41) (a) v = ∇f ; (b) sup n+2 |f | dx ≤ Cn |v|2 dx,
R≥1 R BR (0) Rn
where QR (y) = {x ∈ R3 | |xi −yi | < R, i = 1, 2, 3} denotes the cubes centered y of side-length
2R.
80 4. Variational Methods for PDEs
The estimate (4.41b) in Theorem 4.18 above is not sharp as we will obtain some better
estimates later. However, the estimate (b) in both (4.41) and (4.42) does provide a way to
represent a curl-free v or divergence-free field w by some local function f or m, with f being
viewed as the potential function of v and m the velocity of w; initially, these local functions
have only been defined as the Schwartz distributions. The estimate (4.41b) also suggests
that we equip the space Y with the norm kf k∗ defined by
Z Z
2 2 1
(4.43) kf k∗ = |∇f (x)| dx + sup n+2 |f (x)|2 dx.
Rn R≥1 R BR
Let Y1 be the subspace of Y defined by
1
Y1 = {f ∈ Hloc (Rn ) | kf k∗ < ∞}.
It can be easily shown that Y1 is indeed a Banach space with the norm defined. We shall
try to find the minimal subspace Z of Y1 for which the gradient operator ∇ : Z → Xcurl is
bijective. In fact such a Z can be completely determined when n ≥ 3.
Theorem 4.20. Let Y2 be the closure of C0∞ (Rn ) in Y1 . Then, for n ≥ 3,
2n
Y2 = {f ∈ L n−2 (Rn ) | ∇f ∈ L2 (Rn ; Rn )}.
Furthermore, Y2 has the equivalent norms
kf k∗ ≈ k∇f kL2 (Rn ) ∀ f ∈ Y2 ,
and the gradient operator ∇ : Y2 → Xcurl is bijective.
The proof of this theorem relies on the Sobolev-Gagliardo-Nirenberg inequality for
H 1 (Rn ) functions when n ≥ 3 (see [2, 16, 21]); note that in this case the finite num-
ber 2∗ = n−22n
is the Sobolev conjugate of 2. In the case n = 2, there is no such a Sobolev-
Gagliardo-Nirenberg inequality; instead, there is a John-Nirenberg-Trudinger type of BMO-
estimates for functions with gradient in L2 (R2 ) (see [21, 35]. However, we shall try to avoid
the BMO-estimates. One of the minimal subspaces of Y1 on which the gradient operator is
bijective can be characterized as follows.
Theorem 4.21. Let n = 2 and Z be the closure in Y1 of the subspace
S = {ϕ − hϕiρ | ϕ ∈ H 1 (R2 )},
R
where hϕiρ = R2 ϕ(x)ρ(x) dx and ρ(x) is the weight function defined by
1 1
(4.44) ρ(x) = χ{|x|≤1} + 4 χ{|x|>1} .
2π |x|
Then Z has the equivalent norms kf k∗ ≈ k∇f kL2 (R2 ) ∀ f ∈ Z, and the gradient operator
∇ : Z → Xcurl is bijective.
Given any u ∈ X = L2 (Rn ; Rn ), for each R > 0, let BR = BR (0) and consider the
following minimization problem:
Z
(4.45) inf |∇ϕ − u|2 dx.
ϕ∈H01 (BR ) BR
By the direct method, this problem has a unique solution, which we denote by ϕR , also
extended by zero to all Rn . This sequence {ϕR } is of course uniquely determined by u ∈ X.
It also satisfies the following properties:
1
R
(4.46) Rn (∇ϕR − u) · ∇ζ dx = 0 ∀ ζ ∈ H0 (Ω), Ω ⊆ BR ,
(4.47) k∇ϕR kL2 (Rn ) ≤ kuk.
4.4. Minimization with Constraints 81
therefore, v = u if u ∈ Xcurl .
Given any ∈ Xcurl , choose the sequence ϕ̃R corresponding to v 0 − v. Since div(v − u) = 0,
v0
it easily follows that
(v 0 − v, v − u) = lim (∇ϕ̃R , v − u) = 0.
R→∞
Hence kv 0 − uk2 = kv − v 0 k2 + 2(v 0 − v, v − u) + kv − uk2 ≥ kv − uk2 ; this proves (4.50).
w ∈ Xdiv . Then 0 = (u, w) = (v, w) + |w|2 = |w|2 ; hence w = 0 and u = v ∈ Xcurl . This
⊥ . Similarly, it follows X
proves Xcurl = Xdiv ⊥
div = Xcurl .
Theorem 4.24. Let Y1 be the space with the norm k·k∗ defined above. Then Y1 is a Banach
space. Moreover, the gradient operator ∇ : Y1 → Xcurl is surjective; more precisely, for any
v ∈ Xcurl , there exists a f ∈ Y1 such that
v = ∇f, kf k∗ ≤ Cn kvk.
Proof. The proof that Y1 is a Banach space follows directly by the definition and will not
be given here. We prove the rest of the theorem.
Given v ∈ Xcurl , let v = v ∗ ρ be the smooth approximation of v. Then v ∈ Xcurl ∩
C ∞ (Rn ; Rn ). Define
Z 1
f (x) = v (tx) · x dt.
0
Then one can easily compute that
Z 1 Z 1
j
∂xj f (x) = v (tx)dt + (∂xj v )(tx) · tx dt
0 0
Z 1 Z 1
j
= v (tx)dt + ∇vj (tx) · tx dt
0 0
Z 1 Z 1
j d j
= v (tx)dt + [v (tx)]t dt
0 0 dt
Z 1 Z 1
j j
1 j
= v (tx)dt + v (tx)t0 − v (tx)dt
0 0
= vj (x),
where we have used the condition ∂xj v = ∇vj since curl v = 0. This proves ∇f (x) = v (x)
for all x ∈ Rn . Therefore, for all x, y ∈ Rn ,
Z 1
f (x + y) − f (y) = v (y + tx) · x dt.
0
Hence
Z 1
2
2
|f (x + y) − f (y)| = v (y + tx) · x dt
0
Z 1
2
≤ |x| |v (y + tx)|2 dt.
0
where M(h) is the maximal function of h (see Stein [35]). Since |v |2 ∈ L1 (Rn ), it follows
that
5n 5n
Z Z
n 2 2
m{y ∈ R | M(|v | )(y) > α} ≤ |v | dx ≤ |v|2 dx.
α Rn α Rn
Let
E = {y ∈ B1 | M(|v |2 )(y) ≤ α0 },
where we choose
2 · 5n
Z
α0 = |v|2 dx.
|B1 | Rn
Then it follows that |E | ≥ 12 |B1 | for all . Therefore, it is a simple exercise to show that
there exists a sequence k → 0 and a point y0 ∈ B1 such that y0 ∈ ∩∞ k=1 Ek ; that is,
2 · 5n
M(|vk |2 )(y0 ) ≤ α0 = kvk2 , ∀ k = 1, 2, · · · .
|B1 |
Using this y0 we define a new sequence
gk (z) = fk (z) − fk (y0 ), z ∈ Rn .
Then, for all R ≥ 1, we have
2 · 5n
Z Z
2
|gk (z)| dz ≤ |gk (z)|2 dz ≤ (2R)n+2 kvk2 .
BR B2R (y0 ) |B1 |
By using diagonal subsequences, there exists a subsequence gkj and a function f ∈ L2loc (Rn )
such that gkj * f weakly as kj → ∞ on all balls BR (0), R > 0. This function f must
satisfy ∇f = v ∈ L2 (Rn ; Rn ) and
Z Z
1 2
sup n+2 |f (x)| dx ≤ Cn |v(x)|2 dx;
R≥1 R BR Rn
Integrating this inequality over the cube QR (0) = {x ∈ R3 | |xi | < R, i = 1, 2, 3}, we obtain
Z Z
2 2
(4.53) |q (x, c)| dx ≤ 2R(R + 1) |w1 (x)|2 dx.
QR (0) R3
84 4. Variational Methods for PDEs
|E | ≥ 1 for all . Therefore, as above, there exists a sequence k → 0 and a point c0 ∈ [−1, 1]
such that c0 ∈ ∩∞ k=1 Ek ; that is,
Z
Hk (c0 ) ≤ |w3 (x)|2 dx ∀ k = 1, 2, · · · .
R3
Hence by (4.56)
Z Z
2 3
(4.57) |f (x, c0 )| dx ≤ 4R |w3 (x)|2 dx.
QR (0) R3
Using this c0 we define a new sequence uk (x) = mk (x, c0 ). Then we have wk = curl uk
and, for all R ≥ 1, by (4.53)-(4.56),
Z
(4.58) |uk (x)|2 dx ≤ CR3 kwk2 .
QR (0)
By using diagonal subsequences, there exists a subsequence ukj and a function m ∈ L2loc (R3 )
such that ukj * m weakly as kj → ∞ on all cubes |xi | < R, R > 0. This field m must
satisfy w = curl m ∈ L2 (R3 ; R3 ); hence m ∈ M. Moreover, by (4.58),
Z Z
1
sup 3 |m(x)|2 dx ≤ C |w(x)|2 dx.
R≥1 R QR (0) R3
Proof of Theorem 4.20. Let n ≥ 3 and Y1 , Y2 be the spaces defined above, and define
2n
W = {f ∈ L n−2 (Rn ) | ∇f ∈ L2 (Rn ; Rn )}.
We proceed with several lemmas to prove Theorem 4.20.
Proof. Let f ∈ Y2 . Then there exists a sequence fj ∈ C0∞ (Rn ) such that kfj − f k∗ → 0 as
j → ∞. Therefore k∇fj kL2 → k∇f kL2 . By Sobolev-Galiardo-Nirenberg inequality,
kfj kL2∗ (Rn ) ≤ C k∇fj kL2 (Rn ) ∀ j.
∗
Hence fj * g ∈ L2 (Rn ). Since fj → f in L2 (BR ) for all R > 0. We have f = g. Hence
f ∈ W. Furthermore, by Hölder’s inequality,
kfj kL2 (BR ) ≤ cn R2 kfj kL2∗ (BR ) ≤ C R2 k∇fj kL2 (Rn ) .
Hence, by taking limits as j → ∞, it follows that
Z Z
1
sup n+2 |f |2 dx ≤ C |∇f (x)|2 dx,
R≥1 R BR Rn
Proof. Given any v ∈ Xcurl , let ϕR ∈ H01 (BR ) be the function determined as in the
minimization problem (4.45) above with u = v. Then ∇ϕR → v in X = L2 (R2 ; R2 ) as
R → ∞. Lemma 4.25 implies that {ϕR } is a Cauchy sequence in Y2 and hence its limit f
belongs to Y2 and satisfies ∇f = v. This completes the proof.
for all ψ ∈ H 1 (B1 ), where (ψ)0 is the average value of ψ on B1 ; that is,
Z Z
1
(ψ)0 = ψ(x) dx = 2 ψ(x)ρ(x) dx.
π B1 {|x|<1}
Given ϕ ∈ H 1 (R2 ), let ψ(x) = ϕ( |x|x2 ). Then ψ ∈ H 1 (B1 ). Using the above Poincaré
inequality for this ψ, after change of variable, one obtains that
Z Z
2 −4
(4.62) |ϕ(y) − (ψ)0 | |y| dy ≤ C |∇ϕ(y)|2 dy,
{|y|>1} {|y|>1}
where
Z Z Z
1 1
(ψ)0 = ψ(x) dx = ϕ(y)|y|−4 dy = 2 ϕ(x)ρ(x) dx.
π B1 π {|y|>1} {|x|>1}
Combining (4.61) for ψ(x) = ϕ(x) with (4.62) we obtain (4.60).
86 4. Variational Methods for PDEs
Let
S = {ϕ − hϕiρ | ϕ ∈ H 1 (R2 )}.
Then the previous lemma and (4.63) imply
Z Z Z
1 2 2
sup 4 |f | dx ≤ 2π |f | ρ dx ≤ C |∇f |2 dx ∀ f ∈ S.
R≥1 R BR R2 R2
Corollary 4.29. Let Z be the closure of S in Y1 . Then kf k∗ ≈ k∇f kL2 for all f ∈ Z.
Furthermore, for all f ∈ Z,
Z Z Z
2
f (x)ρ(x) dx = 0, |f (x)| ρ(x) dx ≤ C |∇f |2 dx,
R2 R2 R2
Finally, we prove the following result to complete the proof of Theorem 4.21.
Proof. The proof is similar to that of Lemma 4.27 above. Given any v ∈ Xcurl , let ϕR ∈
H01 (BR ) be the function determined as in the minimization problem (4.45) above with u = v.
Let fR = ϕR − hϕR iρ . Then fR ∈ S and ∇fR = ∇ϕR → v in X = L2 (R2 ; R2 ) as R → ∞.
Corollary 4.29 implies that {fR } is a Cauchy sequence in Y1 and hence its limit f belongs
to the closure Z of S and satisfies ∇f = v. This completes the proof.
This lemma shows that if c is not a critical level, then we can nicely deform the set Ac+ε
into Ac−ε for some ε > 0.
4.5.3. The Mountain Pass Theorem. We now prove the main theorem of this section.
Theorem 4.32. (Ambrosetti-Rabinowitz) Let E : X → R be a C 1 functional satisfying
(PS) on the real Banach space X. Let u0 , u1 ∈ X, c0 ∈ R and R > 0 be such that
(i) ku1 − u0 k > R
(ii) E(u0 ), E(u1 ) < c0 ≤ E(v) for all v such that kv − u0 k = R.
Then E has a critical point u with E(u) = c, c ≥ c0 ; the critical value c is defined by
(4.64) c = inf sup E(p(t))
p∈K t∈[0,1]
where K denotes the set of all continuous maps p : [0, 1] → X with p(0) = u0 and p(1) = u1 .
Proof. Let c be as defined in (4.64). If it is not a critical value, then Kc = ∅. Let η and
ε̄ be as in the deformation lemma. Now by condition (ii) of the theorem, we can choose ε
small enough so that 0 < ε < ε̄ and E(u0 ), E(u1 ) 6∈ [c − ε, c + ε] (since c ≥ c0 ). Let p ∈ K
and define the path ξ : [0, 1] → X by
ξ(t) = η(1, p(t)).
Since p(0) = u0 and p(1) = u1 , it follows, by the choice of ε, that
ξ(0) = η(1, u0 ) = u0 , ξ(1) = η(1, u1 ) = u1
using condition (ii) of the lemma. Thus ξ ∈ K. Now, we can choose p ∈ K such that
max E(p(t)) < c + ε.
t∈[0,1]
88 4. Variational Methods for PDEs
4.5.4. Saddle Point Solutions. In order to illustrate these ideas we apply the MPT to
the problem
(4.65) ∆u + f (x, u) = 0 in Ω, u|∂Ω = 0
where f ∈ Car satisfies the growth condition
|f (x, z)| ≤ c(1 + |z|τ ), |fz (x, z)| ≤ c(1 + |z|τ −1 )
with 1 < τ < (n + 2)/(n − 2). Moreover, we assume f (x, 0) = 0 and
0 ≤ F (x, z) ≤ γf (x, z)z
Rz
for some constant γ < 1/2, where F (x, z) = 0 f (x, s)ds. Finally we assume
a|z|τ +1 ≤ |F (x, z)| ≤ A|z|τ +1
for constants 0 < a ≤ A. Associated with (4.65) is the functional I defined by
Z
1
I(u) = |∇u|2 dx − J(u),
2 Ω
where Z
J(u) = F (x, u)dx.
Ω
The following lemma simplifies some of the technical details that follow.
Lemma 4.33. Let I and J be defined on H = H01 (Ω) as above. Then both I and J are of
C 1 on H and
(a) J 0 : H → H ∗ is compact.
(b) If {un } is a bounded sequence in H such that I 0 (un ) → 0 as n → ∞, then {un }
has a convergent subsequence.
Since hI 0 (u), vi = hAu, vi − hJ 0 (u), vi, and A−1 exists and is bounded on H ∗ (by the Lax-
Milgram theorem), it follows that
A−1 I 0 (u) = u − A−1 J 0 (u).
Let {un } be a bounded sequence in H with I 0 (un ) → 0 as n → ∞. Since A−1 is continuous
and J 0 is compact, by passing to a subsequence if necessary, we have un = A−1 I 0 (un ) +
A−1 J 0 (un ) converges.
Remark. From the above proof we see that the compactness of the embedding H = H01 (Ω) ⊂
Lτ +1 (Ω) is necessary for proving (PS).
Theorem 4.34. The boundary value problem (4.65) has at least one nontrivial solution
u ∈ H01 (Ω).
We now establish a nonexistence result which shows that the boundary value problem (4.31)
(with k = 0, l = 1) does not have a positive solution for τ > (n + 2)/(n − 2).
Proof. Given x0 ∈ ∂Ω, since ∂Ω is C 1 , there exists a ball B = Bε (x0 ) and a C 1 function φ
on B such that
Ω ∩ B = {x ∈ B | φ(x) < 0}, ∂Ω ∩ B = {x ∈ B | φ(x) = 0}.
∇φ(x0 )
Note that the outward unit normal at x0 is given by ν(x0 ) = |∇φ(x 0 )|
. Let δ > 0 be
sufficiently small so that λx0 ∈ B and hence, from Ω being star-shaped, λx0 ∈ Ω̄ ∩ B for all
λ ∈ [1 − δ, 1]. Consider h(λ) = φ(λx0 ) defined on λ ∈ [1 − δ, 1]. Then h has the maximum 0
at right-end point 1 and hence
h0 (1− ) = ∇φ(x0 ) · x0 ≥ 0.
This proves x0 · ν(x0 ) ≥ 0 for all x0 ∈ ∂Ω.
Proof. Suppose u is a nontrivial C 2 (Ω̄) solution of (4.68). Applying formula (4.67) with
f (u) = |u|τ −1 u (and thus F (u) = τ +1
1
|u|τ +1 ), and using Lemma 4.37, we obtain
|u|τ +1 2−n
Z Z Z
1 2
(4.70) u (x · ν)dS = n dx + |u|τ +1 dx
2 ∂Ω n τ + 1 2
Ω Z Ω
n n−2 τ +1
(4.71) = − |u| dx ≥ 0,
τ +1 2 Ω
n n−2 n+2
which yields τ +1 ≥2 and hence a contradiction: τ ≤ n−2 .
If u is a positive C 2 (Ω̄) solution of (4.69) with τ = n+2
n−2 . Since
u = 0 on ∂Ω, by a sharp
maximum principle (see Serrin’s Maximum Principle below), we have un ≤ −σ on ∂Ω for a
positive number σ. By (4.71), we obtain
Z Z
σ2 x · ν dS ≤ u2n (x · ν) dS = 0,
∂Ω ∂Ω
from which we have a desired contradiction:
|x|2
Z Z
0= x · ν dS = ∆( ) dx = n|Ω|.
∂Ω Ω 2
This completes the proof.
92 4. Variational Methods for PDEs
4.6.4. Radial Symmetry of Solutions on a Ball. Let Ω = B be the open unit ball in
Rn . We shall study the nonlinear problem
(4.72) ∆u + f (u) = 0 in Ω, u|∂Ω = 0.
We are interested in positive solutions: u > 0 in Ω. We assume that f : R → R is Lipschitz,
but is otherwise arbitrary. Our intention is to prove that u is necessarily radial, i.e., u(x)
depends only on r = |x|. This is a remarkable conclusion, since we are making essentially
no assumptions on the nonlinearity. The technique we use is due to Gidas, Ni & Nirenberg
[20], and is based upon an extension of the maximum principle and the method of moving
planes.
4.6.6. Moving Plane Method. Let Ω+ denote the open upper half ball Ω ∩ {xn > 0},
and Ω− the open lower half ball Ω ∩ {xn < 0}.
Lemma 4.40. Let u ∈ C 2 (Ω̄) be a positive solution of (4.72). Then
uxn < 0 in Ω near ∂Ω+ .
Proof. Fix any point x0 ∈ ∂Ω+ and let ν = (ν1 , ..., νn ) denote the outer unit normal to
∂Ω+ at x0 . Note that νn > 0. We claim that uxn < 0 in Ω near x0 .
We shall give the proof under the assumption f (0) ≥ 0. Then
0 = −∆u − f (u) + f (0) − f (0)
Z 1
∂
≤ −∆u − f (su(x))ds
0 ∂s
≤ −∆u + cu
R1
for c(x) = − 0 f 0 (su(x))ds. According to Lemma 4.39, uν (x0 ) = ∇u(x0 ) · ν < 0. Since
∇u is parallel to ν on ∂Ω and νn > 0, we conclude that uxn (x0 ) < 0, and thus uxn <
0 in Ω near x0 .
Theorem 4.41. Let u ∈ C 2 (B̄) be a positive solution of (4.72). Then u is radial, i.e.,
u(x) = v(r) (r = |x|)
for some strictly decreasing function v : [0, 1] → [0, ∞).
Weak Lower
Semicontinuity on
Sobolev Spaces
Theorem 5.1. Let F (x, s, ξ) ≥ 0 be smooth and convex in ξ. Assume F, Fξ are both
continuous in (x, s, ξ). Then the functional I(u) defined above is sequentially weakly (weakly
star if p = ∞) lower semicontinuous on W 1,p (Ω; RN ) for all 1 ≤ p ≤ ∞.
Proof. We need only to prove I(u) is w.l.s.c. on W 1,1 (Ω; RN ). To this end, assume {uν }
is a sequence weakly convergent to u in W 1,1 (Ω; RN ). We need to show
I(u) ≤ lim inf I(uν ).
ν→∞
95
96 5. Weak Lower Semicontinuity on Sobolev Spaces
Therefore, since F ≥ 0,
Z
I(uν ) ≥ F (x, uν , Duν ) dx
K
Z h i
≥ F (x, uν , Du) + Fξk (x, uν , Du) (Di ukν − Di uk )
i
ZK Z
= F (x, uν , Du) + Fξk (x, u, Du) (Di ukν − Di uk )
i
K K
Z
+ [Fξk (x, uν , Du) − Fξk (x, ukν , Du)] (Di ukν − Di uk ).
i i
K
Since F (x, s, ξ) is uniformly continuous on bounded sets and uν (x) → u(x) uniformly on K
we have
Z Z
lim F (x, uν , Du) dx = F (x, u, Du) dx,
ν→∞ K K
Now since Fξk (x, u, Du) is bounded on K and Di ukν converges to Di uk weakly in L1 (Ω) as
i
ν → ∞, we thus have
Z
lim Fξk (x, u, Du) (Di ukν − Di uk ) dx = 0.
ν→∞ K i
From these estimates, noting that, for any two sequences {aν }, {bν },
we have
Z
lim inf I(uν ) ≥ F (x, u, Du).
ν→∞ K
If F (x, u, Du) ∈ L1 (Ω), i.e., I(u) < ∞, then for any given > 0, we use Lebesgue’s absolute
continuity theorem to determine δ > 0 so that
Z Z
F (x, u, Du) ≥ F (x, u, Du) − , ∀E ⊂ Ω, |Ω \ E| < δ.
E Ω
On the other hand, if I(u) = ∞ then for any given large number M > 0 we choose δ > 0
so that
Z
F (x, u, Du) dx > M, ∀E ⊂ Ω, |Ω \ E| < δ.
E
5.1.2. Existence in the Convex Case. Using the theorem, we obtain the following
existence result for convex functionals.
Theorem 5.2. In addition to the hypotheses of the previous theorem, assume there exists
1 < p < ∞ such that
F (x, s, ξ) ≥ c |ξ|p − C(x),
where c > 0, C ∈ L1 (Ω) are given. If for some ϕ ∈ W 1,p (Ω; RN ), I(ϕ) < ∞, then mini-
mization problem inf u∈Dϕ I(u) has a minimizer in the Dirichlet class Dϕ .
Proof. This follows from the abstract existence Theorem 4.6 in the previous chapter.
Remark. Both theorems in this section hold for more general functions F (x, s, ξ). For ex-
ample, we can replace the continuity condition by the Carathéodory condition.
5.2.1. Lipschitz Convergence. Note that W 1,∞ (Ω; RN ) can be identified with the space
of all Lipschitz maps from Ω to RN . A sequence {uν } converges to u in the weak star
topology of W 1,∞ (Ω; RN ) if and only if {uν } converges to u in the sense of Lipschitz
convergence; that is,
1) uν → u uniformly in C(Ω̄; RN );
2) the Lipschitz norms of uν and u are bounded.
Proof. Let Q be a fixed open cube containing Ω̄ with center x̄ and side-length 2L. We
prove this theorem by several lemmas.
Lemma 5.4. Suppose
Z
(5.4) F (x0 , s0 , ξ0 ) ≤ − F (x0 , s0 , ξ0 + Dφ(x)) dx
Q
Proof. For any φ ∈ W01,∞ (Ω; RN ), we extend φ by zero onto Q; then φ ∈ W01,∞ (Q; RN ).
Inserting it into (5.4) yields (5.3).
From the uniform continuity of F (x, s, ξ) on bounded sets and the fact that uν → ũ uni-
formly on Ω we have
Z Z
(5.6) lim inf F (x, uν , Duν ) dx = lim inf F (x, ũ, Duν ) dx.
ν→∞ Q0 ν→∞ Q0
We now compute
2nν Z
2ν L
Z
X
F (x, ũ, Duν ) dx = F x, ũ, ξ0 + Dφ x̄ + (x − x̄νj ) dx
Q0 Qνj l
j=1
nν
2
2ν L
Z
X
= F x̄νj , ũ(x̄νj ), ξ0 + Dφ x̄ + (x − x̄νj ) dx + o(1)
Qνj l
j=1
2 nν n Z
X l
F x̄νj , ũ(x̄νj ), ξ0 + Dφ(y) dy + o(1)
= ν
2 L Q
j=1
2 nν
X
(5.7) = F̃ (x̄νj ) |Qνj | + o(1),
j=1
5.2. Morrey’s Quasiconvexity 99
for every ξ ∈ MN ×n .
Define φk = ζk zνk . Then φk ∈ W01,∞ (Q; RN ) and we can use them as test functions in the
definition of quasiconvexity to obtain
Z
|Q| F (ξ) ≤ F (ξ + Dφk (x)) dx
Q
Z Z
= F (ξ + Dzνk ) + F (ξ + ζk Dzνk + zνk ⊗ Dζk )
Qk Q\Qk
Z
= F (ξ + Dzνk (x)) dx + k ,
Q
where Z
k = [F (ξ + ζk Dzνk + zνk ⊗ Dζk u) − F (ξ + Dzνk (x))] dx.
Q\Qk
Since F (ξ) is bounded on bounded sets and |Q \ Qk | → 0 as k → ∞, we easily have k → 0
as k → ∞. Therefore,
Z
|Q| F (ξ) ≤ lim inf F (ξ + Dzν (x)) dx.
ν→∞ Q
This completes the proof.
We now prove the sufficiency of quasiconvexity for the lower semicontinuity of the func-
tional Z
I(u) = F (x, u(x), Du(x)) dx
Ω
under the Lipschitz convergence on Ω.
Theorem 5.6. Assume F (x, s, ξ) is continuous on Ω̄ × RN × MN ×n and is quasiconvex in
ξ. Then the functional I defined above is s.l.s.c. with respect to Lipschitz convergence on
Ω.
Proof. Let {zk } be any sequence converging to 0 in the sense of Lipschitz convergence on
Ω, and let u ∈ W 1,∞ (Ω; RN ) be any given function. We need to show
Z Z
(5.9) F (x, u, Du) ≤ lim inf F (x, u + zk , Du + Dzk ).
Ω k→∞ Ω
For any given > 0, we choose finitely many disjoint cubes Qj contained in Ω such that
Z
I(u) ≤ F (x, u, Du) dx +
∪Qj
and Z
I(u + zk ) ≥ F (x, u + zk , Du + Dzk ) dx − ,
∪Qj
for all k = 1, 2, · · · . In what follows, we prove for each cube Q = Qj
Z
IQ (u) ≡ F (x, u, Du) dx ≤ lim inf IQ (u + zk ).
Q k→∞
This, by (5.2), will certainly imply the conclusion of the theorem. To this end, for each
positive integer ν, we divide Q into small cubes {Qνj } with center x̄νj as in the proof of
Theorem 5.3: nν
2[
Q= Qνj ∪ E, |E| = 0.
j=1
5.2. Morrey’s Quasiconvexity 101
Define Z Z
(u)νj = − u(x) dx, (Du)νj = − Du(x) dx,
Qνj Qνj
and nν nν
2
X 2
X
ν
U (x) = (u)νj · χQνj (x), ν
M = (Du)νj · χQνj (x).
j=1 j=1
Note that
kU ν kL∞ + kM ν kL∞ ≤ kukW 1,∞
and that the sequences {U ν } and {M ν } converge almost everywhere to u and Du on Q as
ν → ∞, respectively. We now estimate IQ (u + zk ).
Z
IQ (u + zk ) = F (x, u + zk , Du + Dzk ) = ak + bνk + cνk + dν + IQ (u),
Q
where
Z
ak = [F (x, u + zk , Du + Dzk ) − F (x, u, Du + Dzk )] dx,
Q
2nν
Z
X
bνk = [F (x, u, Du + Dzk ) − F (x̄νj , (u)νj , (Du)νj + Dzk )] dx,
j=1 Qνj
2nν
Z
X
cνk = [F (x̄νj , (u)νj , (Du)νj + Dzk ) − F (x̄νj , (u)νj , (Du)νj )] dx,
j=1 Qνj
2nν
Z
X
ν
d = [F (x̄νj , (u)νj , (Du)νj ) − F (x, u, Du)] dx.
j=1 Qνj
By the uniform continuity of F (x, s, ξ) on bounded sets and the pointwise convergence of
{U ν } and {M ν } we have
lim ak = 0, lim dν = 0
k→∞ ν→∞
and limν→∞ bνk = 0 uniformly with respect to k. We apply Theorem 5.5 to each Qνj to
obtain, by (5.2),
lim inf cνk ≥ 0
k→∞
for all ν = 1, 2, · · · . Therefore, again by (5.2),
lim inf IQ (u + zk ) ≥ IQ (u),
k→∞
as desired. The proof is complete.
Proof. Let G ⊂ Rn be any bounded open set with |∂G| = 0, and ψ ∈ W01,∞ (G; RN ).
Assume ȳ ∈ G. For any x ∈ Ω and > 0 let
G(x, ) = {z ∈ Rn | z = x + (y − ȳ) for some y ∈ G}.
Then there exists an x > 0 such that x ∈ G(x, ) ⊂ Ω for all x ∈ Ω and 0 < < x . This
means the family
{G(x, ) x ∈ Ω, 0 < < x }
covers Ω in the sense of Vitali covering. Therefore, there exists a countable disjoint
subfamily {G(xj , j )} and a set E of measure zero such that
∞
[
(5.11) Ω= G(xj , j ) ∪ E.
j=1
∞
X Z
= nj F (ξ + Dψ(y)) dy
j=1 G
|Ω|
Z
= F (ξ + Dψ(y)) dy,
|G| G
where the last equality follows since, by (5.11), ∞ n
P
j=1 j = |Ω|/|G|. We have thus proved
(5.10).
Proof. Since any function ψ ∈ W01,∞ (Σ; RN ) can be extended as a Σ-periodic function on
Rn , we easily see that (5.12) implies (5.10) for G = Σ thus the quasiconvexity of F . We
have only to prove (5.12) holds if F is quasiconvex. Let φ ∈ W 1,∞ (Rn ; RN ) be a Σ-periodic
function. Define
1
φj (x) = φ(jx)
j
for all j = 1, 2, · · · . It is easily seen that φj → 0 in the sense of Lipschitz convergence on
W 1,∞ (Σ; RN ). Therefore the theorem of Meyers, Theorem 5.5, and the quasiconvexity of F
implies Z
F (ξ) ≤ lim inf F (ξ + Dφj (x)) dx.
j→∞ Σ
Note that Z Z
F (ξ + Dφj (x)) dx = F (ξ + Dφ(jx)) dx
Σ Σ
Z
= j −n F (ξ + Dφ(y)) dy
jΣ
and that, besides a set of measure zero,
j n
[
jΣ = (x̄ν + Σ),
ν=1
where x̄ν are the left-lower corner points of the subcubes obtained by dividing the sides of
jΣ into j-equal subintervals. Since Dφ(x) is Σ-periodic, we thus have
Z Xjn Z Z
n
F (ξ + Dφ(y))dy = F (ξ + Dφ(y))dy = j F (ξ + Dφ(x))dx,
jΣ ν=1 x̄ν +Σ Σ
and therefore Z
F (ξ) ≤ F (ξ + Dφ(x)) dx
Σ
as needed; the proof is complete.
104 5. Weak Lower Semicontinuity on Sobolev Spaces
Proof. Let F = E h(x) dµ. Since G : RL → R is convex, there exists lF ∈ RL (note that
R
Proof. This follows easily from Jensen’s inequality and the divergence theorem given above.
Hence, by (5.8) and noting that Duk (x) and Dφk (x) take only on finitely many values,
Z
˜
F (ξ) ≤ lim F (ξ˜ + Dφk (x)) dx
k→∞ G
Z
= lim F (ξ˜ + Duk (x)) dx
k→∞ G
x + te ∈ Σ̄; hence, choosing t such that x + te ∈ ∂Σ, we have φ(x) = 0 for all x ∈ Σ. Hence,
ξ ∈ {A, B}, as claimed.
Theorem 5.15. Every finite-valued quasiconvex function is locally Lipschitz continuous.
Hadamard condition, where > 0 and δij , δ αβ are the usual delta notation. Using the
Fourier transform as before, we can show that
Z
Akl k l
ij Di φ (x)Dj φ (x) dx ≥ 0
Rn
Proof. Let F be a rank-one convex third degree polynomial. Then the polynomial f (t) =
F (ξ + tq ⊗ p) is convex and of degree ≤ 3 in t, and hence the degree of f (t) cannot be 3.
Note that the coefficient of t2 term in f is half of
(5.17) Fξk ξl (ξ) pi pj q k q l ≥ 0,
i j
which holds for all ξ, p, q. Since Fξk ξl (ξ) is linear in ξ, condition (5.17) implies
i j
Fξk ξl (ξ) pi pj q k q l ≡ 0.
i j
Therefore f (t) = F (ξ +tq⊗p) is affine in t and hence F is rank-one affine. Consequently, the
result follows from the fact that a rank-one affine function must be a null-Lagrangian.
5.3. Properties of Quasiconvex Functions 107
5.3.4. Šverák’s Example. The following example of Šverák settles a long-standing open
problem raised by Morrey.
Theorem 5.18 (Šverák [36]). If n ≥ 2, N ≥ 3 then there exists a rank-one convex function
F : MN ×n → R which is not quasiconvex.
Proof. We only prove the theorem for n = 2, N = 3. Consider the periodic function
u : R2 → R3 by
1
u(x) = (sin 2πx1 , sin 2πx2 , sin 2π(x1 + x2 )).
2π
Then
cos 2πx1 0
Du(x) = 0 cos 2πx2 .
cos 2π(x1 + x2 ) cos 2π(x1 + x2 )
Let L be the linear subspace of M = M3×2 ; that is,
r 0
L = [r, s, t] ≡ 0 s r, s, t ∈ R .
t t
Note that a matrix ξ = [r, s, t] ∈ L is of rank ≤ 1 if and only if at most one of {r, s, t} is
nonzero. Define g : L → R by g([r, s, t]) = −rst. Using formula 2 cos α · cos β = cos(α + β) +
cos(α − β), we easily obtain
1 1
g(Du(x)) = − − sin 4π(x1 + x2 ) + cos 4πx1 + cos 4πx2
4 4
and hence by a direct computation that
Z Z 1Z 1
1
(5.18) g(Du(x)) dx = g(Du(x)) dx1 dx2 = − .
Σ 0 0 4
We now extend g to the whole M. Let P : M → L be the orthogonal projection onto L and
consider the fourth degree polynomials:
(5.19) F,k (ξ) = g(P ξ) + (|ξ|2 + |ξ|4 ) + k |ξ − P ξ|2 .
Lemma 5.19. For each > 0 there exists a k = k > 0 such that F,k is rank-one convex.
Proof. We use contradiction method. Suppose otherwise; then there exists an 0 > 0 such
that F0 ,k is not rank-one for each k > 0. Hence there exist sequences ξk ∈ M, pk ∈ R2 , qk ∈
R3 with |pk | = |qk | = 1 such that
∂ 2 F0 ,k (ξk )
(5.20) qkα qkβ pki pkj ≡ D2 F0 ,k (ξk ) [qk ⊗ pk , qk ⊗ pk ] ≤ 0, ∀ k = 1, 2, · · · .
∂ξiα ∂ξjβ
Computing f 00 (0) for f (t) = F,k (ξ + tη) yields
D2 F,k (ξ) [η, η] = f 00 (0)
= D2 g(P ξ) [P η, P η] + 2 |η|2 + (4|ξ|2 |η|2 + 8(ξ · η)2 ) + 2k |η − P η|2 .
The term D2 g(P ξ) is linear in ξ; the third term is quadratic and positive definite in ξ if
η 6= 0 (this is the reason the |ξ|4 -term is needed for F,k ). Using this formula with ξ = ξk
and η = qk ⊗ pk and from (5.20), we deduce {ξk } is bounded as k → ∞. Assume, via
subsequence,
ξk → ξ,¯ qk → q̄, pk → p̄ as k → ∞.
108 5. Weak Lower Semicontinuity on Sobolev Spaces
Since D2 F,j (ξ) [η, η] ≤ D2 F,k (ξ) [η, η] for all k ≥ j, we deduce
We now complete the proof of Šverák’s theorem. Let u be the periodic function above.
We choose > 0 small enough such that
Z
1
(|Du(x)|2 + |Du(x)|4 ) dx < .
Σ 4
Let F (ξ) = F,k (ξ) be a rank-one function determined by the previous lemma. Since
Du(x) ∈ L, it follows by (5.18) that
Z Z Z
F (Du(x)) dx = g(Du) + (|Du|2 + |Du|4 ) < 0 = F (0).
Σ Σ Σ
This shows that F is not quasiconvex by Theorem 5.10 above. The theorem is now proved.
5.4.1. Determinant and Adjugate Matrix. Let ξ be a n×n square matrix. We denote
by det ξ and adj ξ the determinant and adjugate matrix of ξ, respectively, which satisfy
the following relation:
ξ (adj ξ) = (adj ξ) ξ = (det ξ) I,
where I is the n × n identity matrix. From this relation, we have
∂ det ξ
(5.22) = (adj ξ)T ,
∂ξ
where η T is the transpose matrix of η.
Lemma 5.20. For all u ∈ C 2 (Ω̄; Rn ) it follows that div(adj Du(x))T = 0; that is,
n
X
Di [(adj Du(x))ik ] = 0 (k = 1, 2, · · · , n).
i=1
5.4. Polyconvex Functions and Null-Lagrangians 109
Proof. This result follows from a general result on null-Lagrangians later, but we present
a direct proof here. Note that by the identity above, we have (det ξ)I = ξ T (adj ξ)T . For
ξ = Du(x) this implies
n
Di (uk (x)) (adj Du(x))jk ,
X
(det Du(x)) δij = i, j = 1, 2, · · · , n.
k=1
Differentiating this identity with respect to xj and summing over j = 1, 2, · · · , n, we have
n n
(Dj Di uk ) (adj Du)jk + (Di uk ) Dj [(adj Du)jk ]
X X
δij (adj Du)m k
k Dj (Dm u ) =
j,k,m=1 k,j=1
5.4.2. Subdeterminants. Let σ = min{n, N }. Given an integer k ∈ [1, σ], for any two
ordered sequences of integers
1 ≤ i1 < i2 < · · · < ik ≤ N, 1 ≤ α1 < α2 < · · · < αk ≤ n,
i
letJαi11iα2 2···ibe the determinant of the k × k matrix whose (q, p) position element is ξαqp
k
···αk (ξ)
for all 1 ≤ p, q ≤ k. Note that, by the usual notation,
∂(ui1 , ui2 , · · · , uik )
iq
i1 i2 ···ik ∂u
Jα1 α2 ···αk (Du(x)) = = det .
∂(xα1 , xα2 , · · · , xαk ) ∂xαp
Let J (ξ) be the collection of all Jαi11iα2 2···i
···αk (ξ) for all k ∈ [1, σ] with the same ordered integral
k
sequences {iq }, {αp } for all ξ. In this way, J defines a function from MN ×n to RL , where
σ
X N n
L = L(n, N ) = .
k k
k=1
Theorem 5.21. Let J (ξ) be defined as above, and let Σ be the unit cube in ⊂ Rn . Then it
follows that Z
J ξ + Dφ(x) dx = J (ξ)
Σ
for all φ ∈ C0∞ (Σ; RN ) and ξ ∈ MN ×n .
Let Σ0 , Σ00 be the unit cubes in x0 , x00 variables, respectively. Fix x00 ∈ Σ00 , for t ≥ 0,
consider maps Vt , Ut : Σ0 → Rk such that
We can choose t > 0 sufficiently large so that Vt , Ut are both diffeomorphisms on Σ0 , and
therefore
Z Z Z Z
0 0 0 0
det(DUt (x )) dx = dy = dy = det(DVt (x0 )) dx0 .
Σ0 Ut (Σ0 ) Vt (Σ0 ) Σ0
Since both sides are polynomials of t of degree k, it follows that this equality holds for all
t. When t = 0 this implies
Z Z
0 00 0
(5.24) Jk (ξ + Dφ(x , x )) dx = Jk (ξ) dx0 .
Σ0 Σ0
where
G1 (ξ, t) = |ξ|2 − t, G2 (ξ, t) = (ξ11 − ξ22 )2 + (ξ12 + ξ21 )2 + t
are both convex functions of (ξ, t).
By the theorem above, the left-hand side is G(J (ξ)) = F (ξ) and therefore
Z Z Z
F (ξ) ≤ G(h(x)) dx = G(J (ξ + Dφ(x))) dx = F (ξ + Dφ(x)) dx,
Σ Σ Σ
Proof. Define h(t) = I(U (t, ·)) for t ∈ [0, 1], where U (t, x) = tu(x) + (1 − t)v(x). Then
Z
h0 (t) = Fξk (x, U (t, x), DU (t, x))Di (uk − v k ) + Fsk (x, U (t, x), DU (t, x))(uk − v k ) dx.
i
Ω
Since u − v, U (t, ·) ∈ C 2 (Ω̄; RN ) and u − v = 0 on ∂Ω, using the divergence theorem and
(5.26) with u = U (t, ·) we have h0 (t) = 0 for all t ∈ (0, 1) and hence h(0) = h(1) and the first
part of the result follows. The second part follows by the continuity of I on W 1,p (Ω; RN )
under the given assumption.
Proof. We will not show the equivalence of (5) to other conditions, but it is important to
know that null-Lagrangians can only be the linear combination of subdeterminants. We
prove other equivalent conditions.
It is easy to see that (3) implies (2). Note that (2) is equivalent to that both F and −F
are quasiconvex; the latter is equivalent to that both IΩ and −IΩ are lower semicontinuous
with respect to the Lipschitz convergence on W 1,∞ (Ω; RN ). Therefore, (4) is equivalent to
(2). It remains to show (2) implies (1) implies (3). Let us first show that (1) implies (3).
To this end, given u ∈ C 2 (Σ̄; RN ) and φ ∈ C0∞ (Σ; RN ), let
Z
f (t) = F (Du(x) + tDφ(x)) dx.
Σ
Then, by (1), Z
f 0 (t) = Fξαi (Du(x) + tDφ(x)) Dα φi (x) dx = 0.
Σ
Hence f is a constant function; hence f (0) = f (1), which proves (3). The proof of that (2)
implies (1) will follow from several lemmas proved below.
Lemma 5.26. If F is of C 2 , then (2) implies (1).
Since F is of C 2 , this implies that, for any φ, ψ ∈ C0∞ (Ω; RN ), the function
Z
f (t) = F (t Dφ(x) + Dψ(x)) dx
Ω
Now given u ∈ C 2 (Ω̄; RN ), we can select a sequence {ψν } in C0∞ (Ω; RN ) such that ψν → u
in C 2 (supp φ; RN ). Using the identity above with ψ = ψν and letting ν → ∞ yield
Z
Fξk (Du(x)) Di φk (x) dx = 0,
i
Ω
which, by the divergence theorem, shows the Euler-Lagrange equation for I is satisfied by
u ∈ C 2 (Ω̄; RN ), and hence F is a null-lagrangian.
Proof. Since (2) is equivalent to that F and −F are rank-one convex, which is equivalent
to that F is rank-one affine.
We need some notation. Let µiα = ei ⊗ eα , where {ei } and {eα } are the standard bases
of RN and Rn , respectively. For each 1 ≤ k ≤ σ = min{n, N } and 1 ≤ i1 , · · · , ik ≤ N, 1 ≤
α1 , · · · , αk ≤ n, we define, inductively,
Fαi11 (ξ) = F (ξ + µiα11 ) − F (ξ),
i ···i i ···i
Fαi11···i 1 k−1 ik 1 k−1
···αk (ξ) = Fα1 ···αk−1 (ξ + µαk ) − Fα1 ···αk−1 (ξ).
k
Proof. We use induction on k. Assume this is true for all k ≤ s − 1. Let {10 , 20 , · · · , s0 } be
a permutation of {1, 2, · · · , s}. We need to show
i 0 ···i
Fαi11···i s 1 s 0
(5.27) ···αs (ξ) = Fα10 ···αs0 (ξ).
By definition
i ···i i ···i
Fαi11···i 1 s−1 is 1 s−1
···αs (ξ) = Fα1 ···αs−1 (ξ + µαs ) − Fα1 ···αs−1 (ξ).
s
Remark. The function F (ξ) = (tr ξ)2 − tr ξ 2 is a null-Lagrangian on Mn×n . To see this, note
n n n n
that F (ξ) = ( ξii )( ξkk ) − ξik ξki = Pik (ξ), where Pik (ξ) = ξii ξkk − ξki ξik . For each
P P P P
i=1 k=1 i,k=1 i,k=1
pair (i, k), i 6= k, Pik (ξ) is a 2 × 2 subdeterminant of ξ. Hence F (ξ) is a null-Lagrangian.
Remark. The following example shows that the weak convergence uj * ū in W 1,k (Ω; RN )
does not imply the weak convergence Jk (Duj ) * Jk (Dū) in L1 (Ω).
Example 5.33. (Ball and Murat [7]). Let B be the unit open ball in Rn . Consider, for
j = 1, 2, · · · , the radial mappings
jr if 0 ≤ r ≤ 1/j,
Uj (|x|)
uj (x) = x, Uj (r) = 2 − jr if 1/j ≤ r ≤ 2/j,
|x|
0 if 2/j ≤ r < 1.
is a constant independent of j. This shows that {det Duj } is not equi-integrable in B, and
therefore it does not converges weakly in L1 (B), or in L1loc (B). Therefore, although uj * ū
in W 1,k (Ω; RN ), Jk (Duj ) → Jk (Dū) in the sense of distribution, and {Jk (Duj )} is bounded
in L1 (Ω), but it is not true that Jk (Duj ) * Jk (Dū) weakly in L1 (Ω).
However, if we have det Duj (x) ≥ 0 a.e. in Ω and uj → ū weakly in W 1,n (Ω; Rn ) then
det Duj * det Dū weakly in L1loc (Ω). This is the well-known theorem of S. Müller [29]. We
will not prove this result here, but will present an easier result which can be proved without
using Müller’s theorem and Theorem 5.31.
Theorem 5.34. Let uj ∈ Dϕ , the Dirichlet class of ϕ ∈ W 1,n (Ω; Rn ). Assume uj * ū in
W 1,n (Ω; Rn ) and det Duj (x) ≥ 0 a.e. in Ω. Then det Duj * det Dū weakly in L1 (Ω).
Proof. Note that, since det Duj is bounded in L1 (Ω), the weak convergence is equivalent
to
Z Z
(5.31) lim det Duj (x) dx = det Dū(x) dx
j→∞ E E
for all measurable sets E ⊆ Ω. By Theorem 5.24, we have
Z Z Z
(5.32) det Duj (x) dx = det Dū(x) dx = det Dϕ(x) dx.
Ω Ω Ω
Hence (5.31) holds when E = Ω. Let E ⊂ Ω. Let
F (x, ξ) = χE (x) max{0, det ξ},
116 5. Weak Lower Semicontinuity on Sobolev Spaces
The first is due to the orientation-preserving assumption and the second is due to the
fact that infinite energy is needed to press a solid material into zero-volume.
Therefore, for nonlinear elasticity problems, we often need to consider extended-
valued energy functionals.
5.5. Existence in Nonlinear Elasticity 117
5.5.2. Polyconvex Energy Density Functions. We assume the energy density function
F (x, s, ξ) satisfies the following assumptions:
(1) F (x, s, ξ) = +∞ if and only if det ξ ≤ 0;
(2) F (x, s, ξ) is continuous in (x, s) ∈ Ω̄×R3 and polyconvex convex in ξ ∈ M3×3 , det ξ >
0; that is, there exists a continuous function W (x, s, ξ, η, τ ) which is convex in
(ξ, η, τ ) ∈ M3×3 × M3×3 × R+ such that, for all x, s,
F (x, s, ξ) = W (x, s, ξ, adj ξ, det ξ) for all ξ ∈ M3×3 with det ξ > 0;
(3) F (x, s, ξ) ≥ c0 |ξ|3 − C(x) for all ξ ∈ M3×3 , where c0 > 0 and C ∈ L1 (Ω) are given.
Note that these assumptions are compatible with the assumption (5.35) above.
where F satisfies the previous assumptions. Let ϕ ∈ W 1,3 (Ω; R3 ) be a given function such
that I(ϕ) < ∞. Then we have the following existence result.
Theorem 5.35. (Existence in Nonlinear Elasticity) There exists at least one mini-
mizer ū ∈ Dϕ with det Dū(x) > 0 for almost every x ∈ Ω such that
I(ū) = min I(u).
u∈Dϕ
Proof. Let {uν } be a minimizing sequence in Dϕ . Since lim I(uν ) = inf Dϕ I ≤ I(ϕ) < ∞,
from Assumption (1), it follows that
det Duν (x) > 0 a.e.
Now Assumption (3) implies that {|Duν |} is bounded in L3 (Ω) and hence from the fixed
Dirichlet boundary condition, {uν } is bounded in W 1,3 (Ω; R3 ). Therefore, via a subsequence,
we assume uν * ū in W 1,3 (Ω; R3 ) as ν → ∞. Hence, by Corollary 5.32 and Theorem 5.34,
(5.36) adj Duν * adj Dū in L3/2 (Ω), det Duν * det Dū in L1 (Ω).
It is easy to see that det Dū(x) ≥ 0 for almost every x ∈ Ω. We now show det Dū(x) > 0
a.e. in Ω. (We follow Pedregal.) Let
h(τ ) = inf{F (x, s, ξ) | (x, s, ξ) ∈ Ω̄ × R3 × M3×3 , det ξ = τ }.
Then h(τ ) is continuous and h(τ ) = ∞ for τ ≤ 0. Note that
Z Z
sup h(det Duν (x)) dx ≤ F (x, uν (x), Duν (x)) dx < ∞.
ν Ω Ω
By Fatou’s lemma,
Z Z Z
0≤ w(x)dx ≤ lim inf det Duν (x)dx = det Dū(x)dx = 0.
E\{w=0} ν→∞ E\{w=0} E\{w=0}
118 5. Weak Lower Semicontinuity on Sobolev Spaces
Remark. For more general cases, see Müller, Tang & Yan [31].
Example 5.37. Let f (ξ) = (ξ 2 − 1)2 for ξ ∈ R and consider the functional
Z 1 Z 1
I(u) = f (u0 (x)) dx = ((u0 )2 − 1)2 dx
0 0
and minimize it over all Lipschitz functions u satisfying boundary conditions u(0) = 0, u(1) =
a. We show that this nonconvex problem has minimizers for all values of a; moreover, the
minimizer is unique if |a| ≥ 1 and there exist infinitely many minimizers if |a| < 1.
Proof. Let first of all |a| < 1. Then it can be easily seen that there are infinitely many
Lipschitz functions u such that u0 (x) ∈ {−1, 1} with u(0) = 0 and u(1) = a; any of such
functions is a minimizer of I with minimum value zero. Now let |a| ≥ 1. Note that the
strict inequality
f (ξ) − [f (a) + f 0 (a)(ξ − a)] > 0
holds for all ξ 6= a if |a| > 1, or holds for all ξ ∈ / {−1, 1} if |a| = 1. Therefore, for all
Lipschitz functions u with u(0) = 0, u(1) = a it follows that
Z 1 Z 1
I(u) = f (u0 (x)) dx ≥ f (a) + f 0 (a)(u0 (x) − a) dx = f (a);
0 0
hence the minimum value of I for such u’s is f (a) and, certainly, ū(x) = ax is a minimizer.
We show this is the unique minimizer. Suppose v is any minimizer of I over all Lipschitz
functions u with u(0) = 0, u(1) = a; that is,
Z 1
I(v) = f (v 0 (x)) dx = f (a)
0
with v(0) = 0 and v(1) = a. If a = 1 then since f (1) = 0 it must follow that v 0 (x) ∈ {−1, 1}
R1
a.e. x ∈ (0, 1) and 0 (1−v 0 (x)) dx = 0 and hence v 0 (x) = 1 on (0, 1). This implies v(x) = x.
Similarly if a = −1 then v(x) = −x. Now assume |a| > 1. Since f (ξ)−f (a)−f 0 (a)(ξ −a) > 0
for all ξ 6= a, from
Z 1
[f (v 0 (x)) − f (a) − f 0 (a)(v 0 (x) − a)] dx = 0,
0
it follows that v 0 (x) = a for a.e. x ∈ (0, 1). Hence v(x) = ax.
for any open set Ω ⊂ Rn with |∂Ω| = 0. This can be proved by using the Vitali covering ar-
gument as before. We now claim that, for all piecewise affine Lipschitz continuous functions
φ ∈ W01,∞ (Σ; RN ), it follows that
Z
qc
(5.39) F (ξ) ≤ F qc (ξ + Dφ(x)) dx.
Σ
To see this, let Σ = ∪i Ωi ∪ E be such that |E| = 0 and each Ωi is an open set with |∂Ωi | = 0
and such that φ ∈ W01,∞ (Σ; RN ) takes constant gradients Dφ = Mi on each Ωi . Let > 0
be given. By the above remark on the definition of F qc , there exists ψi ∈ W01,∞ (Ωi ; RN )
such that Z
F qc (ξ + Mi ) ≥ − F (ξ + Mi + Dψi (x)) dx − .
Ωi
With each ψi being extended to Σ, we set ψ = φ + i ψi . Then ψ ∈ W01,∞ (Σ; RN ) and we
P
have Z X
F qc (ξ + Dφ(x)) dx = |Ωi |F qc (ξ + Mi )
Σ i
Z
≥ F (ξ + Dψ(x)) dx − ≥ F qc (ξ) − ,
Σ
and the (5.39) follows. From this we can show F qc is rank-one convex (using “sawtooth”
like piecewise affine functions) and therefore is locally Lipschitz continuous (since it is
convex in each coordinate direction). Since piece-wise affine Lipschitz functions are dense
in W01,p (Σ; RN ), (5.39) holds for all φ ∈ W01,p (Σ; RN ), and hence F qc is quasiconvex. The
proof of the theorem is complete.
5.6.4. Relaxation Principle. We have the following theorem; the proof of the theorem
is also omitted; see, e.g., Acerbi & Fusco [1], Buttazzo [9], or Dacorogna [13].
Theorem 5.40 (Relaxation Principle). Assume
c |ξ|p ≤ F (x, s, ξ) ≤ C (|ξ|p + |s|p + 1)
holds for some constants c > 0, C > 0, p > 1. Then
Z Z
inf F (x, u, Du) dx = min F qc (x, u, Du) dx
u∈Dϕ Ω u∈Dϕ Ω
Remark. The passage from F to F qc is called relaxation. The relaxation principle above
seems too nice for the nonconvex variational problems since it replaces a nonconvex prob-
lem which may not have any solution by a quasiconvex problem that has solutions. But,
there are costs for this: we may lose some information about the minimizing sequences; a
minimizer so obtained for the relaxed problem may not characterize what seems more in-
teresting in applications the finer and finer patterns of minimizing sequences. In the phase
transition problems for certain materials, it accounts for the loss of information about the
microstructures by a macroscopic effective processing (relaxation). For more information,
we refer to Ball & James [6], Müller [30] and the references therein.
The following theorem is due to A. Cellina [10, 11]. We do not give a proof of this
theorem, but simply remark that even without convexity assumption the minimization may
still have solutions. However, the existence can not follow from the direct method, but has
to rely on different methods.
Theorem 5.41. Given ξ ∈ Rn , the minimization problem
Z
inf f (∇u)dx
u∈W 1,1 (Ω), u|∂Ω=ξx Ω
has a minimizer if and only if either ∂f (ξ) 6= ∅, or there exist ξ1 , ξ2 , · · · , ξq such that
ξ ∈ int conv{ξ1 , · · · , ξq } and ∩qi=1 ∂f (ξi ) 6= ∅.
Remark. By the relaxation principle, any minimizer u of the above problem must be a
minimizer for the relaxed problem and satisfy
(5.41) ∇u(x) ∈ K = {ξ ∈ Rn | f (ξ) = f # (ξ)}, a.e. x ∈ Ω,
where f#
is the convexification of f . The proof of Cellina’s theorem relies on constructing
minimizers u of the relaxed problem satisfying the first-order partial differential relation
(5.41).
Bibliography
[1] E. Acerbi and N. Fusco, Semicontinuity problems in the calculus of variations, Arch. Rational Mech.
Anal., 86 (1984), 125–145.
[2] R. A. Adams and J. Fournier, “Sobolev Spaces,” Second Edition. Academic Press, New York, 2003.
[3] A. Bruckner, J. Bruckner and B. Thomson, “Real Analysis,” Prentice-Hall, 1997.
[4] J. M. Ball, Convexity conditions and existence theorems in nonlinear elasticity, Arch. Rational Mech.
Anal., 63 (1977), 337–403.
[5] J. M. Ball, J. C. Currie and P. J. Olver, Null Lagrangians, weak continuity, and variational problems
of arbitrary order, J. Funct. Anal., 41 (1981), 135–174.
[6] J. M. Ball and R. D. James, Proposed experimental tests of a theory of fine microstructures and the
two well problem, Phil. Trans. Roy. Soc. London, A 338 (1992), 389–450.
[7] J. M. Ball and F. Murat, W 1,p -Quasiconvexity and variational problems for multiple integrals, J.
Funct. Anal., 58, (1984), 225–253.
[8] V. Barbu and Th. Precupanu, “Convexity and Optimization in Banach Spaces,” Reidel, Boston,
1986.
[9] G. Buttazzo, “Semicontinuity, Relaxation and Integral Representation in the Calculus of Variations,”
Pitman Res. Notes Math. Ser. 207, Harlow, 1989.
[10] A. Cellina, On minima of a functional of the gradient: necessary conditions, Nonlinear Anal., 20
(1993), 337–341.
[11] A. Cellina, On minima of a functional of the gradient: sufficient conditions, Nonlinear Anal., 20
(1993), 343–347.
[12] R. Coifman, P.-L. Lions, Y. Meyer and S. Semmes, Compensated compactness and Hardy spaces, J.
Math. Pures Appl., 72(9) (1993), 247–286.
[13] B. Dacorogna, “Direct Methods in the Calculus of Variations,” Springer, New York, 1989.
[14] D. Dunninger, Lecture Notes on Applied Analysis, (unpublished) Michigan State University
[15] L. C. Evans, Quasiconvexity and partial regularity in the calculus of variations, Arch. Rational Mech.
Anal., 95 (1986), 227–252.
[16] L. C. Evans, “Partial Differential Equations,” AMS-GSM, Vol. 19, 1998.
[17] M. Giaquinta, “Multiple Integrals in the Calculus of Variations and Nonlinear Elliptic Systems,”
Princeton University Press, 1983.
[18] M. Giaquinta, “Introduction to Regularity Theory for Nonlinear Elliptic Systems,” Birkhäuser, 1993.
[19] M. Giaquinta and E. Giusti, On the regularity of the minima of certain variational integrals, Acta
Math., 148 (1982), 31–46.
[20] B. Gidas, W.-M. Ni and L. Nirenberg, Symmetry and related properties via the maximum principle,
Comm. Math. Physics 68 (1979), 209–243.
123
124 Bibliography
[21] D. Gilbarg and N. S. Trudinger, “Elliptic Partial Differential Equations of Second Order,” 2nd ed.,
Springer-Verlag, Berlin, 1984.
[22] R. Hart, D. Kinderlehrer and F.H. Lin, Existence and partial regularity of static liquid crystal con-
figurations, Commun. Math. Phys., 105 (1986), 547–570.
[23] R. D. James and D. Kinderlehrer, Frustration in ferromagnetic materials, Cont. Mech. Thermodyn.,
2 (1990), 215–239.
[24] F.-H. Lin, On nematic liquid crystals with variable degree of orientation, Comm. Pure Appl. Math.,
44(3) (1991), 453–486.
[25] N. Meyers, Quasi-convexity and lower semicontinuity of multiple variational integrals of any order,
Trans. Amer. Math. Soc., 119 (1965), 125–149.
[26] V. P. Mikhailov, “Partial Differential Equations,” Mir, 1978.
[27] C. B. Morrey, Jr., Quasiconvexity and the lower semicontinuity of multiple integrals, Pacific J. Math.,
2 (1952), 25–53.
[28] C. B. Morrey, Jr., “Multiple Integrals in the Calculus of Variations,” Springer, New York, 1966.
[29] S. Müller, Higher integrability of determinants and weak convergence in L1 , J. Reine Angew. Math.,
412 (1990), 20–34.
[30] S. Müller, Variational models for microstructure and phase transitions, in “Calculus of Variations
and Geometric Evolution Problems,” (Cetraro, 1996), pp. 85–210, Lecture Notes in Math. 1713,
Springer, 1999.
[31] S. Müller, Q. Tang and B. Yan, On a new class of elastic deformations not allowing for cavitation,
Ann. Inst. Henri Poincaré, Analyse non linéaire, 11(2) (1994), 217–243.
[32] L. Nirenberg, “Functional Analysis.” Courant Institute Lecture Notes, New York University, 1960.
[33] P. Pedregal, “Parametrized Measure and Variational Principles.” Birkhauser, Basel, 1997.
[34] P. Pedregal and B. Yan, On two-dimensional ferromagnetism, Preprint.
[35] E. Stein, “Harmonic Analysis.” Princeton University Press, Princeton, 1993.
[36] V. Šverák, Rank-one convexity does not imply quasiconvexity, Proc. Roy. Soc. Edinburgh Sect. A,
120 (1992), 185–189.
[37] L. Tartar, The compensated compactness method applied to systems of conservation laws, in “Systems
of Nonlinear Partial Differential Equations”, NATO ASI Series, Vol. 103, D. Reidel, Dordrecht, 1983.
[38] R. Temam, “Navier-Stokes Equations: Theory and Numerical Analysis.” Revised Edition. Elsevier
Science Publishers, Amsterdam, 1984.
[39] K. Yosida, “Functional Analysis,” 6th edition, Springer, New York, 1980.
[40] L. C. Young, “Lectures on the Calculus of Variations and Optimal Control Theory,” Saunders, 1969
(reprint by Chelsea, 1980).
[41] E. Zeidler, “Nonlinear Functional Analysis and its Applications,” Vol.1-4, Springer.
Index
125
126 Index