Asymptotic Behaviour Methods For The Heat Equation. Convergence To The Gaussian
Asymptotic Behaviour Methods For The Heat Equation. Convergence To The Gaussian
2018
Abstract
In this expository work we discuss the asymptotic behaviour of the solutions of the classical
heat equation posed in the whole Euclidean space. After an introductory review of the main
facts on the existence and properties of solutions, we proceed with the proofs of convergence
to the Gaussian fundamental solution, a result that holds for all integrable solutions, and
represents in the PDE setting the Central Limit Theorem of probability. We present several
methods of proof: first, the scaling method. Then several versions of the representation
method. This is followed by the functional analysis approach that leads to the famous related
equations, Fokker-Planck and Ornstein-Uhlenbeck. The analysis of this connection is also
given in rather complete form here. Finally, we present the Boltzmann entropy method,
coming from kinetic equations.
The different methods are interesting because of the possible extension to prove the asymptotic
behaviour or stabilization analysis for more general equations, linear or nonlinear. It all
depends a lot on the particular features, and only one or some of the methods work in each
case. Other settings of the Heat Equation are briefly discussed in Section 9 and a longer
mention of results for different equations is done in Section 10.
1
Contents
1 The Cauchy Problem in RN for the Heat Equation 3
11 Historical comments 32
2
1 The Cauchy Problem in RN for the Heat Equation
The classical heat equation (HE) ∂t u = ∆u, is one of the most important objects in the theory of
partial differential equations, and it has great relevance in the applied sciences. It has been devel-
oped since the seminal work of J. Fourier, 1822, [32], to describe phenomena of heat transport and
diffusion in many contexts. The great progress of the mathematical theory in these two centuries
has had a strong influence not only on PDEs, but also on Probability, Functional Analysis, as
well as Numerics. It now has well-established connections with other subjects like viscous fluids,
differential geometry, image processing or finance. See more on motivation in standard textbooks,
like [31, 58] or in the survey paper [69].
The heat equation enjoys a well developed theory that has many distinctive features. It can be
solved in many settings, suitable for different interests, that lead to quite different results and use
different tools. In this paper we will study the most typical setting, the Cauchy Problem posed
in RN , N ≥ 1:
(
∂t u = ∆u in RN × (0, ∞)
(CPHE)
u(x, 0) = u0 (x) in RN ,
with initial data u0 ∈ L1 (RN ). The equation appears in the theory of Probability and Stochastic
Processes as the PDE description of Brownian motion, and in that application the function u(·, t)
R probability density of the process at time t, hence it must be nonnegative and the total
denotes the
integral u(x, t) dx, often called the mass, must be one. Such restrictions on sign and integral
are not needed in the analytical study, and the extra generality is convenient both for the theory
and for a number of other applications.
The first question to be addressed by mathematicians is to find appropriate functional spaces
to solve this problem and to prove that it is a well-posed problem in the sense of Hadamard’s
definition (existence, uniqueness and stability). In tackling such a problem we will be lucky
enough to find a very explicit representation of the solution by means of the so-called “fundamental
solution” procedure.
Definition. We call fundamental solution of the HE in RN the solution of the equation with
initial data a “Dirac delta”:
(
∂t U = ∆U en RN × (0, ∞)
(FS)
U (x, 0) = δ0 (x) en RN .
In probabilistic terms, we start from a unit point mass distribution concentrated at the origin
x0 = 0, and we look for the way it expands with time according to the heat equation.
Exercise 1. (i) Prove by the method of Fourier Transform that the fundamental solution is given
by the formula
N |x|2
(GF) U (x, t) = (4πt)− 2 e− 4t .
e = −|ξ|2 U
Sketch. The Fourier transform of U satisfies ∂t U e with U
e (ξ, 0) = 1, hence U
e (ξ, t) =
−|ξ| 2t
e . Then, invert the Fourier transform.
(ii) Prove that U (x, t) → δ0 (x) when t → 0 in the sense of distributions, i. e.,
Z
U (x, t)φ(x)dx → < δ0 , φ >:= φ(0), for all φ ∈ Cc∞ (RN ), as t → 0.
RN
3
(iii) Prove that for every sequence tn → 0 the functional sequence U (x, tn ) is a C ∞ approximation
of the identity.
Notations and remarks. This function is known as the Gaussian Function or Gaussian Kernel
and we use the notation G(x, t) or Gt (x) instead of U . We will also use the Gaussian measure
dµ = Gt dx; it is a Radon measure in RN with parameter t. The Gaussian kernel is named after
the famous German mathematician Carl Friedrich Gauss, the Prince of Mathematics.
In statistics and probability theory the Gaussian function appears as the density function of the
normal distribution. Then the standard deviation of the distribution is taken as the parameter in
the formula, and we have 2t = σ 2 . This relation : time proportional to square space deviation, is
called the Brownian scale and appears in many calculations of evolution processes related to heat
propagation or Brownian motion. Finally, the same function appears in Statistical Mechanics
with the name of Maxwellian distribution (after James Clerk Maxwell, 1860). See other historical
comments at the end of the paper.
A main result in those disciplines is the fact that G appears as the limiting probability distri-
bution of many discrete processes, according to the famous Central Limit Theorem. The limiting
behaviour of the solutions of the Heat Equation is our main concern in this paper, and the
Gaussian kernel will play a main role.
Exercise 2. Calculate the Gaussian function by the method of self-similar solutions, U (x, t) =
t−α F (x t−β ), plus the condition of mass conservation.
Hint. Substitute the self-similar form into the PDE and check that time disappears explicitly
from the resulting equation for F = F (ξ) (with ξ = x t−β ) if we put β = 1/2. Check that the
choice α = N/2 means conservation of mass. This selects both similarity exponents. Then write
the resulting elliptic equation for F (ξ) assuming that it is a function of r = |ξ| (it is a radial
function) to get the ODE
1
(rN −1 F 0 (r))0 + (rN F )0 = 0.
2
2 /4
Calculate that log F (r) = a − r2 /4, with a ∈ R, hence F = Ce−|x| .
4
and check that it is dimensionally correct. Dimensional analysis is a powerful tester for correct
formulas (if applied well).
is a solution of the heat equation with initial data u0 . Check that it is a C ∞ function of x and t
for every x ∈ RN and t > 0. Prove that u(x, t) → u0 (x) pointwise when t → 0.
(ii) Prove that the initial data is taken uniformly if u0 ∈ BU C(RN ), the space of bounded and
uniformly continuous functions in RN .
(iii*) Consider whether the boundedness condition of point (i) can be relaxed or eliminated.
Exercise 5. Prove that formula (GRF) is still valid when the initial data belong to Lp (RN ),
1 ≤ p ≤ ∞.
(ii) What is the regularity of the solution?
(iii) In which sense are the initial data taken?
Exercise 6. (i) Prove that (GRF) generates a continuous contraction semigroup in the following
Banach spaces
X = BU C(RN ), L2 (RN ), Lp (RN ), 1 < p < ∞ ,
by means of the definition St : X → X, St u0 = u(·, t), where u is the solution of the heat equation
with initial data u0 .
Sketch. What you have to check is that, as a function of time t f (t) := St u0 belongs to
C([0, ∞ : X), that St ∗ Ss = St+s , and also St u0 → u0 as t → 0 in X. Moreover, contraction
means that kSt u0 kX ≤ ku0 kX for every t > 0 and every u0 ∈ X.
(ii) Study in which sense the semigroups agree (explain if they are the same or not). You may
define the core of the semigroup by restricting it to a nice and dense set of functions, and then
explain how the concept of density is used to recover the complete semigroup in each of the above
spaces.
Notation. We will often write u(t) instead of u(·, t) for brevity in the hope that no confusion
will arise. Hence, u(t) is a function of x for every fixed t.
Exercise 7. (i) Prove that for p = 1 we have the stronger property of mass conservation
Z Z
(MC) u(x, t) dx = u0 (x) dx ,
RN RN
5
(iv*) The curious reader may want to learn more about that dissipation and how to measure it
accurately. Any ideas?
Exercise 8. Conservation of moments. (i) We define the first (signed) moment as the vector
quantity
Z
(1.1) N1 (u(t)) = x u(x, t) dx ,
RN
which in principle evolves with time. Prove that thisR quantity is actually conserved in time for
any solution of the HE with initial data such that (1 + |x|)| u0 (x)| dx is finite.
(ii) For data such that |x|2 |u0 (x)| dx is finite, we define the second moment as the scalar
R
time-dependent quantity
Z
(1.2) N2 (u(t)) = |x|2 u(x, t) dx .
RN
Prove that this quantity is not conserved in time for solutions of the HE, and in fact
(Recall here that N is the space dimension). These results will be important to understand the
finer versions of the asymptotic convergence to the Gaussian. Deeper analysis proves that the
only integral quantities conserved by the solutions of the Heat Equation are the mass and the
first moment.
Exercise 9. (i) Prove the so-called ultra-contractive estimates, also called smoothing effects.
They allow to pass from data in Lp to solutions u(t) in L∞ for every t > 0. More precisely, prove
that there exists a constant C = C(p, N ) such that for every u0 ∈ Lp (RN )
ku0 kp
ku(t)k∞ ≤ C .
tN/2p
We may say that the constant is universal since it does not depend of the particular solution we
take.
(ii) Calculate the exponent N/2p using only dimensional calculus.
Exercise 10. Regularity. (i) Prove that there exists a universal constant C = C(p, N ) > 0
such that for every p ≥ 1 and every t > 0
du(t) ku0 kp
k kp ≤ C .
dt t
(ii) Prove that for nonnegative solutions we have a stronger pointwise inequality
∂u u
≥ −C1 with C1 = N/2 .
∂t t
Prove that this constant is optimal. Hint. Check the properties first for the fundamental solution,
then use convolution.
(iii) Prove that for every t > 0 the x-derivatives of u satisfy
≤ C ||u0 ||1 L∞ .
∂
∂xi u(x, t)
t2
6
Obtain a similar estimate for data in Lp RN .
(iv) Check that the dimensions are all correct in these formulas.
Exercise 11. Gaussian Representation Formula with measure data. (i) Prove that the
representation formula can be used with any bounded Radon measure as initial data, in the form
Z
|x−y|2
−N
(GRFm) u(x, t) = (Gt ∗ µ)(x, t) := (4πt) 2 e− 4t dµ(y) .
RN
(ii) Prove that it produces a classical solution of the HE for all t > 0 and that the aRpriori
estimates hold with the L1 norm of u0 replaced by the total mass of the measure, |µ(RN )| = |dµ|.
(iii) Prove that the initial data are taken in the weak sense of measures.
(iv) Observe that the fundamental solution U = Gt falls into this class.
Exercise 12. (i) Find explicit solutions of polynomial type of heat equation. Suggestion: try the
solution types
u = A0 (x), u = A0 (x) + A1 (x)t, u = A0 (x) + A1 (x)t + A2 (x)t2 , . . .
Find all the solutions of the first two types. Maybe the best known solutions in this class are
u = 1 and u = |x|2 + 2N t.
(ii) Prove that they also satisfy the representation formula, even though they are not integrable
or bounded.
The general theory says that the (GRF) holds for all initial data which are locally bounded
measures µ with a weighted integrability condition
Z
2
e−c|x| |dµ(x)| < ∞ for some c > 0 ,
RN
see [70], but we will not use that sharp result in these notes. We will only point out the following
example of blow-up solution with very large growth as |x| → ∞
2 /4(T −t)
(1.4) Ub (x, t) = C (T − t)−N/2 e|x| .
Exercise 13. Check that Ub is a classical solution of the heat equation for −∞ < t < T and
blows up everywhere in x as t → T .
7
2 Asymptotic convergence to the Gaussian
The main result on the asymptotic behaviour of general integrable solutions of the heat equation
consists in proving that they look increasingly like the fundamental solution. Since this solution
goes to zero uniformly with time, the estimate of the convergence has to take into account that
fact and compensate for it. This happens by considering a renormalized error that divides the
standard error in some norm by the size of the Gaussian solution U (t) = Gt in the same norm.
For instance, in the case of the sup norm we know that
Theorem 2.1 Let u0 ∈ L1 (RN ) and let u0 (x)dx = M be its mass. Then the solution u(t) =
R
u(·, t) of the HE in the whole space ends up by looking like M times the fundamental solution
U (t) = Gt in the sense that
for all 1 ≤ p ≤ ∞.
8
what the solution looks like, we have to search further to identify the terms that may give us the
size and shape of such a solution. This question will be addressed below. For the moment let us
point out that differentiation in xi of the Heat Kernel produces a new solution with zero integral
2 /4t
ui (x, t) = ∂xi Gt (x) = C t−(N +2)/2 xi e−|x| ,
defined for real constants C, c 6= 0. A more extreme case is the blow-up solution Ub (x, t) of formula
(1.4) that not only does not stay bounded with time, it even blows up in finite time.
• About the three convergence results of the Theorem, it is clear that (2.3) follows from (2.1)
and (2.2). Now, what is interesting is that (2.2) follows from (2.1) and the smoothing effect
(Exercise 9). We ask the reader to prove this fact. Hint: use (2.1) between 0 and t/2 and then
the smoothing effect between t/2 and t.
• It is interesting to note that the convergence in L1 norm of formula (2.1) can be formulated
without mention to the existence of a fundamental solution in the following form:
Alternative Theorem. Let u(t) and v(t) be any two solutions of the RCauchy problem
R for the
HE in the whole space, and let us assume that their initial data satisfy u0 dx = v0 dx. Then
we have
We could use a similar approach for the L∞ estimate but some Gaussian information appears
in the weight tN/2 . The alternative approach has been first remarked by specialists in stochastic
processes of Brownian type, and it is known as a mixing property. It has been generalized to
many variants of the heat equation.
9
3 Proof of convergence by scaling
There are many approaches to the proof of the main result, and we will show some of the best
known below. They are interesting for their possible extension to similar asymptotic results for
other equations, both linear and nonlinear, see Sections 9 and 10. The first proof we give is based
on scaling arguments. In the proposed method the proof is divided into 5 steps.
Step 1. Scaling transformation. It is easy to check that the HE is invariant under the following
one-parameter family of transformations Tk defined on space-time functions by the formula
see also Exercise 14 (iii). This transformation maps solutions u into new solutions uk = Tk u,
called rescalings of the original solution. It also conserves the mass of the solutions
Z Z
Tk u(x, t) dx = u(y, k 2 t) dy = constant.
RN RN
Step 2. Uniform estimates. The estimates proved on the general solutions in Section 1 imply
that the whole family uk is uniformly bounded in space and time if time is not so small: x ∈ RN
and t ≥ t1 > 0 (cf. Exercise 9). They also have uniform estimates on all derivatives under the
same restriction on time (cf. Exercise 10).
Step 3. Limit problem. We can now use functional compactness and pass to the limit k →
∞ along suitable subsequences to obtain a function Û (x, t) that satisfies the same estimates
mentioned above and is a weak solution of the HE in Q = RN × (0, ∞). By the estimates on
derivatives the solution is classical. The convergence takes place locally in Q in the sup norm for
the functions and their space and time derivatives.
Step 4. Identifying the limit. We now have to check that the limit solution Û (x, t) has the same
mass M as the sequence uk and that is takes the Dirac delta as initial data. By uniqueness for
solutions with measure data (which we accept as part of the theory) we will then conclude that
Û (x, t) is just a fundamental solution, Û (x, t) = M Gt .
(i) The proof of this step is best done when u0 is nonnegative, compactly supported and bounded,
since in that case the solution is bounded above by constant times the fundamental solution,
|u(x, t)| ≤ C G(x, t + 1). Applying the transformation we get
This means a uniform control from above of the mass of all the tail mass of all the solutions uk ,
by which we mean the mass lying in exterior regions of space. Such control allows to avoid the
loss of mass that could occur in the limit by Fatou’s Theorem. Hence, the mass of Û is the same,
i.e., M .
The convergence of Û to M δ(x) as t → 0 happens because uk (x, 0) → M δ(x), and the previous
tail analysis shows that Û takes zero initial values for x 6= 0.
(ii) To recover the same result for a general u0 ≥ we use approximation by data as above and
then the L1 contraction of the heat semigroup. For signed solutions, separate the positive and
negative parts of the data and solve separately.
10
Step 5. Recovering the result. (i) We now use the convergence of the rescaled solutions at a
fixed time, say t = 1,
lim |uk (x, 1) − M G1 (x)| → 0.
k→∞
This convergence takes place locally in RN in all Lp norms. By the tail analysis, it also happens
in L1 (RN ). But since the derivatives are also uniformly bounded we have also an L∞ estimate.
1/2
(ii) We now use the meaning to the transformation Tk and write k = t1 to get
N/2 1/2
lim |t1 u(xt1 , t1 ) − M G1 (x)| → 0.
t1 →∞
−1/2
But this is just the result we wanted to prove after writing x = y t1 and observing that
−N/2
Gt1 (y) = t1 G1 (x). Similarly for the L1 norm.
This 5-step proof is taken from paper [48], where it was applied to the p-Laplacian equation. See
whole details for the porous medium equation in the book [66]. It has had further applicability.
Theorem 4.1 Under the assumptions that u0 ∈ L1 (RN ) and that the first absolute moment is
finite
Z
(4.1) N1 = |u0 (y)y| dy < ∞ ,
as well as
Proof. (i) We may perform the proof under the further restriction that u0 ≥ 0. For a signed
solution we must only separate the positive and negative parts of the data and apply the results
to both partial solutions.
11
(ii) Let us do first the sup convergence. We have
Z Z
u(x, t) − M Gt (x) = u0 (y)Gt (x − y) dy − Gt (x) u0 (y) dy
Z
= u0 (y)(Gt (x − y) − Gt (x)) dy
Z Z 1
= u0 (y) ∂s (Gt (x − sy)) ds dy
Z 10
x − sy −|x−sy|2 /4t
Z
= Ct−N/2 dy ds u0 (y)hy, ie
0 2t
Taking into account that Gt is of order t−N/2 in sup norm, we write the result as (4.2), and
C = C(N ) is a universal constant.
(iii) For the L1 convergence we start in the same way and arrive at
Z 1
x − sy −(x−sy)2 /4t
Z
−N/2
u(x, t) − M Gt (x) = C t dy ds u0 (y)hy, ie .
0 2t
Now we integrate to get
1
|x − sy| −|x−sy|2 /4t
Z Z Z
−N/2
ku(x, t) − M Gt (x)k1 ≤ C t dx dy ds |u0 (y)y| e
Z 1 RN ZRN 0 2t
|x − sy|
Z
2
=C dy ds |u0 (y)y|t−1/2 ( t−N/2 1/2 e−|x−sy| /4t dx)
R N 0 RN t
With the change of variables x − sy = t1/2 ξ we already know that the last integral is a constant
independent of u, hence the formula for the L1 error. Note that now we are speaking of masses
and we do not need any renormalization time factor.
(iv) We ask the reader to prove the optimality as an exercise.
Exercise 16. Take as solution the Gaussian after a space displacement, u(x, t) = G(x + h, t), and
find the convergence rate to be exactly O(t−1/2 ). This is just a calculus exercise but attention to
details is needed. Hint: Write
h2 2
G(x + h, t) = Gt (x) + h∂x Gt (x) + D Gt (ξ)
2 x
(where ξ = x + sh, 0 < s < 1), and check that
2 /4
∂x Gt (x) = Ct−(N +1)/2 ξe−ξ = O(t−(N +1)/2 ) ,
12
uniformly in x, and Dx2 Gt (ξ) = O(t−(N +2)/2 ) uniformly in x. This exercise shows that the term
h∂x Gt (x) is the precise corrector with relative error O(t−1/2 ). We could continue the analysis by
expanding in Taylor series with further terms, see below. Exact correctors and longer expansions
for general solutions will be done later in Section 7 by the methods of Functional Analysis.
Remark. The factor tN/2 is the appropriate weight to consider relative error. We point out that
the more precise relative error formula
|u(x, t) − Gt (x)|
(4.4) rel (t) =
Gt (x)
does not admit a sup bound, as can be observed by choosing u(x, t) = Gt (x−h) for some constant
h since then
2
rel (t) = |exh/2t e−h /4t − 1| ,
which is not even bounded. It is then quite good that our weaker form does admit a good
estimate. Same happens for the L1 norm. This comment wants to show that error calculations
with Gaussians are delicate because of the tail (i.e., the behaviour for large |x|.
• Proof of Theorem 2.1 in this approach. Given an initial function u0 ∈ L1 (RN ) without any
assumption on the first moment, we argue by approximation plus the triangular inequality. In
the end we get a convergence result, but less precise. To quote, if u0 is integrable with integral
M and let us fix an error δ > 0. First, we find an approximation u01 with compact support and
such that
ku0 − u01 k1 < δ.
Due to the already mentioned effect L1 → L∞ , and applied to the solution u − u1 , we know that
for all t > 0
ku(t) − u1 (t)k∞ < C δ t−N/2 .
On the other hand, we have just proved that for data with finite moment:
13
4.1 No explicit rates for general data. Counterexample
Let us explain how the lack of a rate for the whole class of L1 functions is shown. Given any
decreasing and positive rate function φ such that φ(t) → 0 as t → ∞ we construct a modification
of the Gaussian kernel that produces a solution with the same mass M = 1 and such that it
satisfies a lower bound for the error of the form
Put mn = 2−n (any other summable series will do). Choose iteratively tn and xn as follows.
Given choices for the steps 1, 2, . . . , n − 1, pick tn to be much larger than tn−1 and such that
φ(tn ) ≤ mn /2n. This is where we use the fact that φ(t) tends to zero, even if it may decrease in
2
a very slow way. Choose now |xn | = rn so large that e−rn /4tn < 1/2. Essentially, the mass has
1/2
to be displaced at distance equal or larger than O(tn ). Then,
2
mn (1 − e−xn /4tn ) ≥ nφ(tn ) .
Let us make some further practical calculations (with no precise scope in mind): Let for example
φ(t) = t− . Choose mn = 2−n . Then tn ≥ 2n ,
tn ∼ 2n/ , rn ∼ 2n/2
and the mass in the outer region {|x| ≥ rn } is approximately 2−n ; hence such a mass is M (r) ∼
r−2 , which is not so small if → 0.
• Spatial tails for positive solutions. Let us examine the precise form of the tail in the
simplest case where u0 has compact support. For simplicity and w.l.o.g. we assume that u0 is
supported in the ball BR of radius R > 0 centered at x = 0. Take x1 ∈ RN such that |x1 | = r1 > 1,
14
say x1 = r1 e1 . Then the representation formula implies that a bound from above is obtained by
displacing all the mass to the nearest point to x1 inside BR , which is x00 = Re1 , and we get the
upper bound
M −(|x1 |−R)2 /4t
u(x1 , t) ≤ M Gt (x1 − x00 ) = e .
(4πt)N/2
In the opposite direction, moving all the mass to x000 = −Re1 we get the lower bound
M 2
u(x1 , t) ≥ M Gt (x1 − x000 ) = e−(|x1 |+R) /4t .
(4πt)N/2
Both estimates are clearly optimal in this context. Since the equation is invariant under rotations
the result holds for all x such that |x| > R instead of our restricted choice of x1 . Moreover, we see
that the ratio of both estimates tends to infinity as |x| → ∞. It is therefore convenient to take
logarithms, and then we easily get a general formula where BR is centered at xc with |xc | = Rc .
R
Proposition 4.2 Let u0 be supported in the ball BR (xc ) and let M = u0 (x) dx. Then for every
t > 0 and every x 6∈ BR (xc ) we have
N 1 N 1
(4.6) log(4πt) + (|x − xc | − R)2 ≤ − log(u(x, t)/M ) ≤ log(4πt) + (|x − xc | + R)2 .
2 4t 2 4t
It follows that
log(u(x, t)/M ) 1
(4.7) lim =− .
|x|→∞ |x|2 4t
We conclude that in first approximation the tail at infinity of all solutions with compactly
supported initial data is universal in shape and depends only on the mass of the data and time.
Of course, the second term in the expansion depends also on the radii Rc and R.
Another observation is that the asymptotic space behaviour allows to calculate the time elapsed
since the solution had compact support (if we already know the mass M ).
Let us also remark that solutions with more general nonnegative data can have other type of
tails and we invite the reader to calculate some of them, both for integrable and non-integrable
data. Here is an example that decays like a simple exponential.
Exercise 17. Consider the heat equation in 1D. Show that when u0 is integrable, positive and
bounded and u0 (x) = e−x for x ≥ 0, then for all times
lim ex u(x, t) = et .
x→∞
Sketch: Putting u0 (x) = 0 for x < 0 for simplicity, use the representation formula to write
Z x
1 2
x−t
e u(x, t) = 1/2
e−(y−2t) /4t dy
(4πt) −∞
Let then x → ∞. You may also use the explicit solution U (x, t) = et−x+c .
In any case the behaviour described in Proposition 4.2 is the minimal one for nonnegative
solutions.
15
• Signed solutions do not become everywhere positive.
The square exponential tail behaviour of the fundamental solution has more curious consequences.
R + if a signed solution u(x, t) has initial data such that theRmass
Thus, of the positive part M+ =
u0 (x)dx is larger than the mass of the negative part M− = u− 0 (x)dx, then we know that it
converges as t → ∞ to the positive Gaussian M Gt with M = M+ − M− > 0 so that
for every x ∈ RN . It would be natural to expect that when M+ is much larger than M− then the
solution u is indeed positive everywhere for large enough, finite times. Now, this is true in most
of the space because of the previous convergence, but it is not true in all the space.
Exercise 18. (i) Take N = 1. Show that for the choice u0 (x) = δ(x) − ε δ(x − 1) with 0 < ε < 1
(a combination of delta functions) the solution u is positive on the left of a line x = r(t) and
negative for x > r(t). Show that for large times r(t) ∼ 2 log(1/ε)t.
Remark. We see that the problem arises at the far away tail. Of course, u(x, t) becomes positive
for large times at all points located at or less than the typical distance, |x| ≤ Ct1/2 .
(ii) Show that a similar result is true for integrable data if u0 is positive for x < 0 and negative
for x > 1, and zero in the middle. Show in particular that u(x, t) < 0 if x > 2t log(1/ε) + 1/2.
(iii) State similar results in several dimensions.
Exercise 19. Consider a time displacement of the fundamental solution and show that u(x, t) =
G(x, t + t0 ) has the precise convergence rate O(1/t) towards G(x, t).
The result we prove is as follows.
Theorem 5.1 Under the assumptions that u0 ∈ L1 (RN ) and that the signed first moment
Z
(5.1) N1,i = yi u0 (y) dy
RN
16
and
X
(5.4) ku(x, t) − M Gt (x) + N1,i ∂xi Gt (x)kL1 (RN ) ≤ C N2 t−1 .
i
Remark. The signed first moment is called in Mechanics the center of mass, for M 6= 1, M 6= 0,
we use the formula
Z
1
(5.5) xc = x u0 (x)dx .
M
In probability it is the average location of the sample and M = 1. When xc is finite and M 6= 0,
the center of mass can be reduced to zero by just a displacement of the spatial axis. This very
much simplifies formulas (5.3), (5.4), see below.
We have
1 2 1 2 1 2
f 0 (s) = hy, x − syie−|x−sy| /4t , f 00 (s) = (− |y| + 2 |hy, x − syi|2 )e−|x−sy| /4t .
2t 2t 4t
Using these results, we get D = D1 + D2 , where
Z
x X Z
−(N +1)/2 −|x2 |2 /4t
D1 = (4πt) u0 (y)hy, 1/2 ie dy = − yi u0 (y) dy ∂i Gt (x) .
2t i
√
On the other hand, putting ξ = (x − sy)/2 t and yb = y/|y|, we also have
Z Z 1
−(N +2)/2 1 2
D2 = (4πt) 2
|y| u0 (y) − + (hb y , ξi) e−ξ dyds.
2
RN 0 2
Since the factor dependent on ξ is uniformly bounded for all ξ we have
Z
|D2 | ≤ C(N ) u0 (y) |y| dy) t−(N +2)/2 .
2
RN
Exercise 20. Give examples of well-known probability distributions for which the first moment
is finite or infinite. Same for the second moment. Try with examples of the form
17
For a = 1 we get the well-known Cauchy distribution that is integrable only in 1D.
Answers. Integrable 2a > N , 1st moment 2a > N + 1, 2nd moment 2a > N + 2.
Reformulation of the result. It is well-known that the first moment can be eliminated to
moving the origin of coordinates to the center of mass xc defined in formula (5.5) when M 6= 0.
If we do that and then apply Theorem 5.1 we get the following result
Higher development result. A continuation of this method into stricter convergence rates
using higher moments can be done by using further terms in the Taylor series development. We
will not do it but only quote the statement that can be found in [30].
Theorem 4 [30] Let G(x, t) be the heat kernel. For any 1 ≤ p ≤ N/(N − 1) and k ≥ 0 an integer
the solution of initial value problem for the heat equation satisfies:
X (−1)|α| Z
ku(x, t) − f (x)x dx Dα G(x, t)kp ≤ Ck t−(k+1)/2 k|x|k+1 f (x)kp ,
α
α!
|α|≤k−j
for any initial data f ∈ L1 (RN , 1 + |x|k ) such that |x|k+1 f ∈ Lp (RN ).
Let us note that their proof is different from the previous ones and interesting. As a conclusion,
we also have a result about convergence in the first moment norm.
Theorem 5.2 Under the assumptions that u0 ∈ L1 (RN ) and that the first, second and third
moments are finite, we get the convergence
X
(5.7) ku(x, t) − M Gt (x) + N1,i ∂xi Gt (x)kL1 (|x|dx) ≤ C t−1/2 .
i
The constant C > 0 depends on u0 through the second and third moments.
Proof. (i) We start with formulas from Theorem (5.1) where it is proved that
X
|u(x, t) − M Gt (x) + N1,i ∂xi Gt (x)| = D2
i
with Z Z 1
1 2
D2 = (4πt)−(N +2)/2 y , ξi)2 e−ξ dyds,
|y|2 u0 (y) − + (hb
RN 0 2
√
and ξ = (x − sy)/2 t. Multiplying by |x| and integrating we have
Z ZZZ
−(N +2)/2 1 2
D2 |x|dx = (4πt) 2
y , ξi) e−ξ |x|dx dyds.
|y| u0 (y) − + (hb 2
RN 2
18
√
Writing now |x| ≤ 2|ξ| t + s|y|, we split the upper estimate of this integral into I1 + I2 , where
ZZZ
−(N +1)/2 1 2
I1 = Ct 2
|y| u0 (y) − + (hb y , ξi) e−ξ 2|ξ|dx dyds ,
2
2
and ZZZ
−(N +2)/2 1 2
I2 = Ct 2
y , ξi) e−ξ s|y|dx dyds .
|y| u0 (y) − + (hb 2
2
As for the first integral, the separate integrals are bounded and only the ξ integral gets a time
factor tN/2 from dx = tN/2 dξ, so that
I1 ≤ C t−1/2 .
The second integral easily gives I2 ≤ C t−1 by the assumption on the third moment of u0 . The
proof is complete.
General conclusion. All integrable solutions of HE in the whole space converge to Gaussian (in
the renormalized forms we have written) if the initial mass is finite, u0 ∈ L1 (RN ). But the speed
with which they do depends on how much initial mass is located far away, in other colloquial words
for probabilists, on “how populated the tails are”. The quantitative versions we have established
use mainly the moments of order 1 and 2. The moment of order 2 is called in Probability the
(square of) the standard deviation. We remind the reader that not all probability distributions
have a finite standard deviation (see Exercise 20).
Take the classical Heat Equation posed in the whole space RN for τ > 0:
1
uτ = ∆y u
2
19
with notation u = u(y, τ ) that is useful since we want to save the standard notation (x, t) for later
use. We know the (self-similar) fundamental solution, also called the evolution Gaussian profile
2 /2τ
U (y, τ ) = C τ −N/2 e−y .
It was proved in previous sections that this Gaussian is an attractor for all solutions in its basin
of attraction, consisting on all solutions with initial data that belong to L1 (RN ) with integral
M = 1. See Sections 3, 4, and 5.
Remark. We have inserted a harmless factor 1/2 in front of the left-hand side following the prob-
abilistic convention in order to get a Gaussian with clean exponent −|y|2 /2t which has standard
deviation t1/2 with no extra factors. Eliminating the prefactor leads to the exponential expres-
sion with exponent −|y|2 /4t, usual in PDE books. Our convention leads to some other simpler
constants.
• Fokker-Planck equation. It is the first step in this approach to the asymptotic study. The
scaling on the variables u and y to factor out the expected size of both which must mimic the
Gaussian sizes, and then take logarithmic scale for the new time
After some simple computations this leads to the well-known Fokker-Plank equation for v(x, t):
wt = L2 (w) = G−1 ∇ · G ∇w = ∆w − x · ∇w ,
(6.2)
a symmetrically weighted heat equation with Gaussian weight. Note that the corresponding
stationary solution is now W = 1, much easier.
(ii) The two-term form of the r.h.s. looks easier, with a diffusion and a convection term. Indeed,
the weighted form of the Ornstein-Uhlenbeck operator L2 = LOU is very convenient for our
20
calculations. To begin with, it allows to prove the symmetry of the operator in the weighted
space X = L2 (Gdx): for every two convenient functions w1 and w2 we have
Z Z Z
(6.3) (L2 w1 ) w2 Gdx = w1 (L2 w2 ) Gdx = − h∇w1 , ∇w2 iGdx.
RN RN RN
It seems natural to introduce the Gaussian measure dµ = Gdx as a reference measure in the
calculations. The important consequence of this computation is that A = −L2 is a positive and
self-adjoint operator in the Hilbert space X = L2 (dµ). This is a rather large space that includes
all functions with polynomial growth. We will keep the notation w(x, t) for the solutions of the
Ornstein-Uhlenbeck equation throughout this section.
(iii) We may also observe that L1 (v) = G L2 (v/G) and that L2 is the adjoint to L1 in the sense
that for conveniently smooth and decaying functions
Z Z
(L1 v)w dx = v (L2 w)dx.
RN RN
More formally, we can consider the duality between the spaces X = L2 (Gdx) and X 0 = L2 (G−1 dx)
given precisely by the integral of the product and the operators are adjoint.
(iv) Finally, to complete the comparison we can write the Fokker-Planck equation as
∂t v = ∇ · (G∇(v/G)) .
The analogy says that this operator is negative and self-adjoint in the stranger space X1 =
L2 (G−1 dx), that is much smaller than L2 (RN ). For a detailed mathematical presentation of the
Ornstein-Uhlenbeck semigroup we refer to [2].
• The Hamiltonian connection. Start from the Fokker-Planck equation and use now the
change of variables v = zG1/2 . Then,
We have
that we may write as a real Schrödinger Equation zt = L3 (z) = −H(z) with Hamiltonian operator
1 N
H(z) = −∆z + V (x)z, V = |x|2 − ,
4 2
2
In calculating the Schrödinger potential we have used G1/2 = e−|x| /4 . Operator H is directly
symmetric in L2 with no weight. The fact that it is positive is not clear from the formulas but it
will follow from the equivalence with the Ornstein-Uhlenbeck operator.
• The equivalence of this equation with the former ones comes from the transformation formulas
21
and also that if vi = G1/2 zi = Gwi , i = 1, 2, we get
Z Z Z
v1 v2 G−1 dx = z1 z2 dx = w1 w2 Gdx,
RN RN RN
which allows to show that all three operators Li are self-adjoint and dissipative since we have
already proved it for L2 . The interesting remark for the Schrödinger representation is that it does
not need any weighted space, X3 = L2 (RN ).
The rich equivalence among the three equations and also with the heat equation is a favorite
topic in Linear Diffusion and Semigroup Theory.
(6.4) vt = ∆v + ∇ · (∇S v) .
Standard assumption is that S must be a positive and convex function in RN , called the potential.
The stationary state is now G(x)
e = C e−S(x) , and the equation reads then vt = L1 (v) with
L1 (v) = ∇ · (G
e ∇(v/G)).
e
The other equations, General Ornstein-Uhlenbeck and General Hamiltonian Equation, follow in
the same way as before using only the expressions in terms of G that is replaced by G. e The
weighted scalar products have no difference and the relation of norms still holds. In the Hamilto-
nian representation we get a potential (put G = e−S )
1 1
V (x) = −G−1/2 ∆(G1/2 ) = |∇S|2 − ∆S .
4 2
On the other hand, the analysis of the complete spectrum is not possible unless we have very
particular cases of potentials S. Moreover, the connection with a renormalization of the heat
equation is completely lost.
The Ornstein-Uhlenbeck formulation allows for a very clear and simple treatment of the problem
of convergence with rate to the Gaussian profile. We may assume without lack of generality that
Z Z Z
w dµ = v dx = u dy = 1 ,
with the above notations for u, v, and w. We now make a simple but crucial calculation on the
time decay of the energy for the OUE:
Z Z
dF(w(t))
(6.5) F(w(t)) = |w − 1|2 G dx, = −2 |∇w|2 G dx = −D(w(t)).
RN dt R N
1
The standard notation is U (x) but we will change the notation here to S(x) to avoid confusion with other uses
of U .
22
We can now use a result from abstract functional analysis: the Gaussian Poincaré inequality
with measure dµ = G(x) dx:
Z Z 2 Z
2
w dµ − w dµ ≤ CGP |∇w|2 dµ
RN RN RN
The sharp constant in this inequality is precisely CGP = 1, with no dependence on dimension.
Moreover the functions that realize the optimal constant are w(x) = xi for any i = 1, 2, . . . , N .
2 We will give below a proof based on the analysis of the spectrum of the Ornstein-Uhlenbeck
operator.
Then, the left-hand side is just F and the inequality implies
d
− F(w(t)) ≥ 2F(w(t)),
dt
which after integration gives F(w(t)) ≤ F(w0 )e−2t , i.e.:
Z Z
|w − 1|2 dµ ≤ e−2t |w0 − 1|2 dµ ∀ t ≥ 0
RN RN
This is theR first of the well known Functional Estimates for the solutions to the HE. The renor-
malization w0 dµ = 1 is no restriction.
R
In terms of v the hypotheses are v0 dx = 1 and kv0 − GkL2 (G−1 dx) < ∞. The weight is now
K = G−1 and the measure dν = K dx, which behaves inversely in infinity. The result is
Z Z
2 −2t
|v − G| dν ≤ e |v0 − G|2 dν ∀ t ≥ 0.
RN Rd
Recall that we have a new logarithmic time t = log(1 + τ ). The rate of convergence in real time
in both variables is then O((1 + τ )−1/2 ) = O(τ −1/2 ) as τ → ∞.
• Then by regularity theory (regularizing effect from L1 to L∞ ) for the heat equation and the
other equations, we get estimates in the sup norm with similar relative rates at least locally in
space. But note that since w = xe−t is a solution of the OUE, we cannot get a uniform estimate
in L∞ of the Ornstein-Uhlenbeck variable.
2
This is an old inequality in the folklore of Hermite polynomials, and probably was known in one dimension to
both mathematicians and physicists in the 1930’s in relation to eigenvalue problems, as mentioned in [9].
23
7 Calculation of spectrum. Refined asymptotics
We will proceed with a further step in the analysis to get more precise asymptotics. Indeed, the
knowledge of the spectrum of the equivalent operators allows to obtain a complete description
of the long-time behaviour in weighted spaces. This is a well-known fact in the study of the HE
posed in bounded domains, that has a parallel here.
7.1 Spectrum
We will make all computations on the Ornstein-Uhlenbeck operator. The Gaussian Poincaré
inequality is a simple consequence of the following analysis of the spectrum of the Ornstein-
Uhlenbeck Operator in L2 (dµ):
(a) Since the FP and the OU operators have a compact inverse, we conclude that they have a
discrete spectrum, [50]. The ground state of the OUE is formed by the constant function w = 1
(which comes from the Gaussian function v = G for the FPE) with eigenvalue λ0 = 0. The next
eigenfunctions are the coordinate functions φi = xi corresponding to the eigenvalue λ1 = 1 with
multiplicity N .
(b) In 1D we find the rest of the eigenfunctions and eigenvectors as the family of Hermite
polynomials, given by the compact formula
that tells much about how we will see them. Indeed, the formula can be derived from the fact
that the derivatives in y of the Gaussian evolution solution U (y, τ ) are still solutions of the heat
equation with different decay rate. Passing to the FPE we conclude that the derivatives Dk G
are eigenfunctions of L1 (here D = d/dx). It is easy to see that these solutions have the form
Hk (x)G(x) where Hk is a polynomial of degree k (proof by induction). Passing to the OUE we
get the formula above. More precisely, we have the recursion formula:
d
GHk+1 = −d/dx(Hk G) = −(Hk0 − xHk )G, Hk+1 = (x − )Hk .
dx
The first members of the family Hk are 1, x, x2 − 1, . . .
The corresponding eigenvalue to Hk for L2 = LOU is λk = k. This can be seen from the heat
equation formula since differentiating in x adds a factor τ −1/2 to the decay, which goes over as
e−t for every derivative we take. Induction proof at the FP level: if we assume that
then
L1 (Dk+1 G) = (Dk G)000 + x(Dk G)00 = −λk Dk G0 − (x(Dk G)0 )0 + x(Dk G)00 = −(λk + 1)Dk+1 G.
24
This produces new eigenfunctions in higher dimensions, and it gives for the product function the
sum of the eigenvalues: λ(w) = λ(w1 ) + λ(w2 ). The set of combinations generates a base of
eigenfunctions, this is essentially due to Fubini’s theorem, cf. [10]. Our account is very short but
we consider this part an extension, and it is well documented in the corresponding literature.
From the spectrum we can get a very precise description of the convergence in the weighted
spaces by using the equivalent of the Fourier analysis on bounded domains. The meaning of the
coefficients for the original equation has to be understood.
We get X X
w(x, t) = cα Hα (x)e−kt , v(x, t) = cα ∂α G(x)e−kt
α α
hw0 , Hα iL2µ
R
v0 (x)Hα (x) dµ(x)
cα = = R ,
hHα Hα iL2µ Hα2 (x) dµ(x)
R
so that c0 is the mass and for k = 1, and ci are the first coordinate moments v0 (x)xi dx after
normalization, and so on. The convergence of the w series holds in L2 (dµ), with errors of the
order of the first term that is left out. The convergence of the v series holds in L2 (G−1 dx), with
errors of the order of the first term that is left out.
and we assume that the data are such that the initial entropy is finite. We recall that no decay
is possible without some restriction on the data.
Differentiating along the flow (i.e., for a solution) leads to
2
∇v
Z Z
dE(v)
v |∇ log(v/G)|2 dx .
= −I(v), I(u) = v + x dx =
dt RN v RN
For some reasons the dissipation I(v) is called Fisher information. Let us continue the proof.
Putting now v = Gf 2 we find that
Z Z
2
E(v) = 2 f log(f ) dµ, I(v) = 4 |∇f |2 dµ.
RN RN
25
The famous logarithmic Sobolev inequality proved by Gross in 1975, [42], says than that (for all
suitable functions, not only solutions)
1
E ≤ I,
2
−2t
and we obtain the decay E(t) ≤ E(0) e . This means a precise decay for the entropy functional.
The calculations are justified for smooth solutions, and then we can pass to the limit for general
solutions with finite entropy.
In order to obtain decay in standard norms, there are formulas connecting the entropy with
other norms, like the Cziszar-Kullback inequality that implies that that
for any f, G ∈ L1 (RN ) positive with equal mass, see [8]. This is paper is a very good early
reference to the subject of entropies and the central limit theorem.
There are many works dealing with the use of functionals and functional inequalities to arrive at
asymptotic behaviour results plus a rate of convergence for this kind of equations. Let us mention
here [3, 4, 25, 53, 63] and the references to be mentioned in Section 10.
∂t f + v · ∇x f = Q(f, f ) .
It describes the evolution of a gas of particles having binary collisions at the kinetic level; f (t, x, v)
is a time dependent distribution function (probability density) defined on the phase space (x, v) ∈
N N
RR
R ×R . The Boltzmann entropy: H[f ] := f log(f )dxdv measures irreversibility. The famous
H-Theorem (1872) says that
ZZ
d
H[f ] = Q(f, f )log(f )dxdv ≤ 0 .
dt
Other approaches to thermodynamic entropy are due to Carathéodory (1908), Lieb-Yngvason
(1997),... see [52].
An important version of entropy appears in Information Theory. In 1948, while working at
Bell Telephone Laboratories Claude Shannon, an electrical engineer, set out to mathematically
quantify the statistical nature of “lost information” in phone-line signals (cf. Wikipedia article).
He arrived at an analog to thermodynamic entropy for use in information entropy.
There is also a concept of entropy in probability theory (with reference to an arbitrary measure).
26
mentioned some other methods like the transport method of Jordan-Kinderlehrer-Otto [44], 1998,
where the Fokker-Planck equation is interpreted as the steepest descent for a free energy related
to Boltzmann-Gibbs entropy, taken with respect to the Wasserstein metric. This novel technique,
based on mass transportation [71], has played an increasing role since then
A modification that still keeps the flavor of this presentation consists of considering a forcing term
(9.1) ∂t u = ∆u + f ,
Exercise 21. (i) Prove that the L1 estimate of Theorem 2.1, formula (2.1), holds if f ∈ L1 (Q∞ ),
and we take as M the accumulated mass defined as
Z ZZ
(9.2) M= u0 (x) dx + f (x, t) dxdt.
RN Q∞
(ii) The L∞ statement (2.2) needs some further decay condition on f as t → ∞. Prove that it
holds e.g. if kf (·, t)k∞ ≤ Ct−γ with γ ≥ 1 + N/2.
The results of Section 4 can be repeated under conditions that we invite the reader to provide.
• Let us continue with a variation of the heat equation with forcing. In fact, there are many studies
where the forcing term takes the form f = f (u), and they fall into what is called reaction-diffusion,
[60]. The simplest case correspond to linear forcing, f (u) = κu with κ 6= 0. The modifications in
the analysis are minimal since the change of variables v(x, t) = u(x, t) e−κt transforms equation
(9.3) ∂t u = ∆u + κu ,
into the classical form vt = ∆v, to which our results apply. We thus conclude that for very large
t we have the asymptotic behaviour
which preserves the Gaussian profile as asymptotic shape but not the decay rates in time. The
reader is asked to write the precise theorems using the results of Sections 3, 4, and 5.
27
of course under the condition that this vector v does not vanish. The ei are the canonical basis
and Dv denotes directional derivative. Therefore, u(x, t) = O(t−(n+1)/2 ).
• There is a very interesting application of this result in N = 1. Indeed, we may solve the problem
of asymptotic behaviour of the solutions of the heat equation posed in a half line I = (0, ∞) ⊂ R
for t > 0 with lateral Dirichlet data u(0, t) = 0, let us call it (DP-HE-HL). We assume that
u0 ∈ L1 (I) and u0 ≥ 0 (the last assumption is made for simplicity). The idea is tho extend the
initial data to the whole line by putting
u
b0 (x) = u0 (x) if x > 0, b0 (x) = −u0 (−x) if x < 0
u
(called anti-symmetric reflection). Note that u b ∈ L1 (R), its total mass is zero, and the first
moment 2N1 is not zero. Solving the heat equation with the usual representation formula we
obtain a solution ub(x, t) defined for x ∈ R and t > 0. This solution must be antisymmetric,
b(x, t) = −b
u u(−x, t) by the form of the data and an elementary symmetry property of the heat
equation. Restricting u
b to x > 0 we find a unique solution of Problem (DP-HE-HL). We can now
copy our asymptotic results to u b and translate them to u. Let us write
x 2 /4t
(9.6) D(x, t) = −∂x Gt (x) = e−|x| .
t3/2
Theorems 5.1 and 5.2 imply that D(x, t) is the asymptotic attractor of the evolution in the half
line.
Theorem 9.1 Let us assume that u0 ∈ L1 (I) and the first moment in I
Z ∞
(9.7) N1 = y u0 (y) dy
0
R∞
is finite, as well as the second moment : N2 = 0 |y 2 u0 |(y) dy < ∞. Then, we get the following
convergence formulas with rate for the solutions of (DP-HE-HL)
The rate O(t−1 ) is optimal under such assumptions. If the third moment is finite we also have
Function D(x, t) given by (9.6) is called the dipole solution because it takes the derivative of
the unit Dirac delta as initial data. It hasR a constant-in-time first moment that characterizes its
∞
strength. Not that the mass in I, M+ = 0 u(x, t) dx, is not conserved in time, but decays like
O(t−1/2 ). 3
We can derive a more general convergence result
Theorem 9.2 Let us assume that u0 ∈ L1 (I) and the first moment in I, N1 , is finite. Then,
as t → ∞.
3
Dipole solutions appear often in Physics, specially in Electromagnetism.
28
Proof. (i) Assume that u ≥ 0. Approximate u0 by a compactly supported function u e0 ≤ u0 , so
that the error in the first moment is ε. Then,
Z ∞
ku(x, t) − u
e(x, t)kL1 (I;|x|dx) = (u(x, t) − u
e(x, t))xdx ≤ ε
0
because this expression is conserved with time for solutions of Problem (DP-HE-HL) in I (check
this!). The previous theorem for the solution u
e gives
• Similar formulas hold for the heat equation posed in a half space Ω in N dimensions. After
rotation and translation we may take Ω = {x ∈ RN : x1 > 0}. We consider zero lateral Dirichlet
data: u(x, t) = 0 for x = (0, x2 , . . . , xN ). n this case a multi-dimensional dipole solution appears,
D = −∂x1 Gt . Problems in half spaces can be solved too. The equivalent of Theorem 9.1 holds.
We leave the easy details to the reader.
Finally, solutions of the type D12 = ∂x1 ∂x2 Gt are attractors for problems posed in quadrant
domains {x ∈ RN : x1 > 0, x2 > 0}. And so on.
• By using the symmetric extension instead of the anti-symmetric one we can solve the heat
equation posed in a half line I = (0, ∞) ⊂ R for t > 0 with lateral Neumann data ux (0, t) = 0,
let us call it (NP-HE-HL). Let the reader fill in the details if needed.
• A related problem in several dimensions occurs when the domain is an exterior domain with
one or several holes and appropriate boundary conditions. Thus, with zero Dirichlet boundary
conditions and integrable data convergence to the Gaussian holds, while in 1D we fall back into
the dipole problem.
There are a number of problems involving the heat equation that have been studied in great
detail, like the heat equation posed in a bounded domain with boundary conditions of different
types (Dirichlet, Neumann, mixed, or other), but these settings lead to quite different results that
depart too much from the picture presented here, so we will not comment on them.
There are also equations with coefficients or weights; they form a large topic that leads also very
far from the present presentation.
The construction of the heat equation has been carefully studied when the equation is posed on
a Riemannian manifold (M N , g). The equation then takes the form
N
X
(9.10) ∂t u(x, t) = ∆g u(x, t) = |g(x)|−N/2 ∂i g(x)ij |g(x)|N/2 ∂j u(x, t) ,
i,j=1
29
where gij is the metric tensor, g ij its inverse, |g| its determinant, so that ∆g is the Laplace-
Beltrami operator, [40]. Two particular manifolds are specially relevant because their internal
symmetries and homogeneity make the theory specially strong and mathematically appealing:
the N -dimensional sphere SN and the hyperbolic space HN . The heat flow on the former is easily
shown to stabilize to a constant (much like a Neumann problem in RN ). The flow on HN is more
interesting, and typical solutions with finite mass converge to a modified Gaussian function, the
hyperbolic fundamental solution, that is described in detail in [41], see also [39].
• A prominent example that has been much studied is the Porous Medium Equation, ∂t u = ∆um ,
or better ∂t u = ∆(|u|m−1 u), m > 1. The asymptotic study depends crucially on the existence and
properties of a distinguished family of solutions, the Barenblatt solutions, [6], that are compactly
supported and self-similar, one for every mass M . Explicit formulas exist for them (1950, 1952):
1/(m−1)
(10.1) B(x, t; M ) = t−α F(x/tβ ), F(ξ) = C − kξ 2 +
where C is a free constant (to be determined by the mass M > 0) and k > 0 is a function of
m, N (k = (m − 1)/2m(N (m − 1) + 2)). They replace the Gaussian fundamental solutions in the
statement of the asymptotic theorems. Convergence of finite mass solutions to the Barenblatt
solution with the same mass is proved by the scaling method in [65]. A much earlier proof used
a method of optimal upper bounds, [33], 1980.
The faster convergence of solutions of the porous medium equation with changing sign is studied
in [49]. Here dipole solutions appear when the total mass is zero. Dipole solutions for nonlinear
parabolic problems were also studied in [7, 34, 43]. Convergence to the Barenblatt kernel for the
PME posed in RN minus one or several holes was studied in [19, 37].
These methods did not produce rates of convergence, but such rates were established using the
entropy method by Carrillo-Toscani in [22], 2000, via Bakry-Emery inequalities [4, 5], and then in
Del Pino-Dolbeault [27], 2002, using Gagliardo-Nirenberg inequalities. Convergence in the sense
of Wasserstein distances was introduced by F. Otto in 2001, [54]. Entropy methods suitable for
weighted porous media equations are used in [28].
• The Fast diffusion equation (i. e., the PME for m < 1) behaves much like the porous medium
equation for m close to 1, even if the shape of the selfsimilar profile is different,
−1/(1−m)
(10.2) F(ξ) = C + k(m, N )ξ 2
with fat tails as |x| → ∞. This is not the case for m < (N − 2)/N due to the phenomenon of
extinction in finite time. A detailed analysis of convergence to so-called pseudo-Barenblatt profiles
is done in [14, 17] by an entropy method which relies on some Hardy-Poincaré inequalities. This
is anyway a quite different scenario.
• The asymptotic convergence for the p-Laplacian equation ∂t u = ∆p u has been treated in [48] by
the scaling method, after settling the uniqueness of the fundamental solution of Barenblatt type.
30
This was done for p > 2, and it extends to some p < 2, to be precise to 2N/(N + 1) < p < 1.
The entropy method is used in [26, 29] using the analogy with the porous medium equation. The
doubly nonlinear equation ∂t u = ∆p (um ) was studied in [1] and subsequent works.
• There many variants of heat equation with lower-order terms, either first order or zero order. If
these terms are strong enough they will destroy the convergence towards some attractive solution
with a Gaussian shape. This is a huge research field and we will give only some ideas. Maybe
the best known models correspond to the case that can be written as ∂t u = ∆u + f (u) with
f (u) = −up , u ≥ 0; we then have so-called diffusion-absorption equations. Let us take and
p > 1, since the case p = 1 was explained before. Even if conservation of mass does not hold,
convergence of finite-mass solutions to a Gaussian profile with some positive mass M∞ is proved
when p > pc = (2 + N )/N . The limit case pc is very interesting and was studied in [38]. For
p < pc we enter into completely new asymptotic profiles. See also [45, 64].
There are many extensions of these ideas. The reaction cases, f (u) = +up , lead to the existence
of blow-up in finite time, a huge topic that falls complete out of the scope of these notes, see
[35, 61, 59]. Another reaction case that has attracted the attention of researchers is f (u) =
u(1 − u), called the Fisher-KPP model, of interest in biology and chemistry; here the long term
behaviour takes the form of expanding travelling waves and no trace of a Gaussian is seen, [51].
• Let us now examine some of the recent work on diffusion with fractional Laplace operators. The
linear fractional heat equation ∂t u + (−∆)s u = 0 has a rather complete theory in the paper [18].
It is well known that the self-similar fundamental solution exists for every N ≥ 1, 0 < s < 1 and
has the form P (x, t; s) = t−N/2s Fs (|x|t−1/2s ) where Fs is a smooth and positive profile function
with a fat tail as |x| → ∞
see [16]. Convergence to the self-similar fundamental solution can be proved for finite-mass
solutions by the scaling method (no rates), or by the representation analysis (with rates). See
separate notes by the author. For the entropy method see [12, 36].
• We continue with nonlinear fractional heat equations of porous medium type. The model studied
by Caffarelli and Vázquez [20] admits self-similar solutions that we may call fractional Barenblatt
solutions [13, 21, 11]. The entropy method is used in [21] to establish asymptotic convergence
without rates. Rates in 1D were obtained in [24]. Convergence with rates in several dimensions
is not known.
The alternative model of fractional porous equation, ∂t u+(−∆)s (|u|m−1 u) = 0 was studied in [55,
56]. Unique fundamental solutions of Barenblatt type were described in [67], where convergence
(without rates) was proved by the scaling method.
• There are a number of other equations that have been studied, like thin film equations [23],
the Barenblatt equation of elastoplastic filtration KPV91, inhomogeneous heat or porous medium
equations with weights, [47], or chemotaxis models, like [15]. A very important topic is the study
of the heat equation and the nonlinear diffusion models on manifolds, like the hyperbolic space,
see [41, 68].
• See [69] for a general presentation of linear and nonlinear diffusion equations including a detailed
survey of recent research work.
31
11 Historical comments
We add some historical notes on the origins and development of the Gaussian function, borrowed
from Wikipedia and other widely available sources, with no claim to be a rigorous historical
presentation. It seems that the so-called Gaussian function has its origin in Statistics. The 18th
century statistician Abraham de Moivre, a Frenchman exiled in England, seems to have been
the first person who noticed the existence of a bell-shaped curve as the limit of the probability
distributions of repeated random trials done independently. He was led by the practical problems
of calculating odds in gambling, not a every elevated motivation indeed. But he was a very fine
mathematician, appreciated by Newton. The curve he discovered is now called the “normal curve”.
One of the first applications of the normal distribution was to the analysis of errors of measure-
ments made in astronomical observations. A century later than De Moivre, the mathematicians
Adrain in 1808 and Gauss in 1809 developed independently the formula for the normal distribu-
tion and showed that errors were fit well by this distribution. The brilliant Gauss received much
credit, Adrain’s work was not known for many years.
This same distribution had been discovered by Laplace in 1778 when he derived the extremely
important central limit theorem, a main topic of this paper. Laplace showed that even if a
distribution is not normally distributed, the means of repeated samples from the distribution
would be very nearly normally distributed, and that the larger the sample size, the closer the
distribution of means would be to a normal distribution.
The distribution appeared later in another disguise in Statistical Mechanics as the Maxwell-
Boltzmann distribution, shortly called the Maxwellian. The original derivation in 1860 by James
Clerk Maxwell was an argument based on molecular collisions of the kinetic theory of gases as well
as certain symmetries in the speed distribution function; Maxwell also gave an early argument
that these molecular collisions entail a tendency towards equilibrium. After Maxwell, Ludwig
Boltzmann in 1872 also derived the distribution on mechanical grounds and argued that gases
should over time tend toward this distribution, due to collisions (see H-theorem).
The normal distribution has a wide implication to social issues. Thus, Quételet seems to have
been the first to apply the normal distribution to human characteristics. He noted that charac-
teristics such as height, weight, and strength were normally distributed.
Evidence for the Gaussian function as the fundamental solution of the heat equation came
after the work of probabilists in the 20th century to establish the link between heat equation and
Brownian diffusion, which is turn is the limit of discrete processes based on iterated random trials.
The close connection between stochastic differential equations and parabolic partial differential
equations is very much influenced by the role of the Gaussian function in both theories.
32
References
[1] M. Agueh. Asymptotic behavior for doubly degenerate parabolic equations. C. R. Math.
Acad. Sci. Paris 337 (2003), no. 5, 33–336.
[5] D. Bakry, I. Gentil, M. Ledoux. “Analysis and geometry of Markov diffusion operators”,
Grundlehren der Mathematischen Wissenschaften [Fundamental Principles of Mathematical
Sciences], 348, Springer, Cham, 2014.
[6] G. I. Barenblatt. On self-similar motions of compressible fluids in porous media, Prikl. Math.,
16 (1952), pp. 679–698 (in Russian).
[8] A. R. Barron. Entropy and the central limit theorem, Ann. Probab., 14, (1986), 336–342.
[9] W. Beckner. A generalized Poincaré inequality for Gaussian measures, Proc. Amer. Math.
Soc. 105 (1989), 397–400.
[10] J. Bell. Gaussian measures, Hermite polynomials, and the Ornstein-Uhlenbeck semigroup,
2015; online.
[11] P. Biler, C. Imbert, G. Karch. Nonlocal porous medium equation: Barenblatt profiles and
other weak solutions. Arch. Ration. Mech. Anal. 215 (2015), no. 2, 497–529.
[12] P. Biler, G. Karch. Generalized Fokker-Planck equations and convergence to their equilibria.
In Evolution equations (Warsaw, 2001), vol. 60 of Banach Center Publ. Polish Acad. Sci.,
Warsaw, 2003, pp. 307–318.
[13] P. Biler, G. Karch, R. Monneau. Nonlinear diffusion of dislocation density and self-similar
solutions. Comm. Math. Phys. 294 (2010), no. 1, 145–168.
[15] A. Blanchet, J. Dolbeault, M. Escobedo, J. Fernández. Asymptotic behaviour for small mass
in the two-dimensional parabolic-elliptic Keller-Segel model, J. Math. Anal. Appl. 361 (2010),
no. 2, 533–542.
[16] R. M. Blumenthal, R. K. Getoor. Some Theorems on Stable Processes, Trans. Amer. Math.
Soc. 95 (1960), 263–273.
33
[17] M. Bonforte, J. Dolbeault, G. Grillo, J. L. Vázquez. Sharp rates of decay of solutions to the
nonlinear fast diffusion equation via functional inequalities, Proc. Natl. Acad. Sci. USA 107
(2010), no. 38, 16459–16464.
[18] M. Bonforte, Y. Sire, J. L. Vázquez. Optimal Existence and Uniqueness Theory for the
Fractional Heat Equation, arXiv:1606.00873v1, to appear in Nonlin. Anal.
[19] C. Brändle, F. Quirós, J. L. Vázquez. Asymptotic behaviour of the porous media equation
in domains with holes, Interfaces Free Bound. 9 (2007), no. 2, 211–232.
[20] L. A. Caffarelli, J. L. Vázquez. Nonlinear porous medium flow with fractional potential
pressure. Arch. Rational Mech. Anal. 202 (2011), 537–565.
[22] J. A. Carrillo, G. Toscani. Asymptotic L1 -decay of solutions of the porous medium equation
to self-similarity. Indiana Univ. Math. J. 49 (2000), 113–141.
[23] J. A. Carrillo, G. Toscani. Long-time asymptotics for strong solutions of the thin film equa-
tion, Comm. Math. Phys. 225 (2002), no. 3, 551–571.
[25] D. Chafaï. Entropies, convexity, and functional inequalities: on Φ-entropies and Φ-Sobolev
inequalities, J. Math. Kyoto Univ. 44 (2004), no. 2, 325–363.
[26] M. Del Pino, J. Dolbeault. Nonlinear diffusions and optimal constants in Sobolev type in-
equalities: asymptotic behaviour of equations involving the p-Laplacian. C. R. Math. Acad.
Sci. Paris 334 (2002), no. 5, 365–370.
[27] M. Del Pino, J. Dolbeault. Best constants for Gagliardo-Nirenberg inequalities and applica-
tion to nonlinear diffusions, J. Math. Pures Appl. (9) 81 (2002), 847–875.
[28] J. Dolbeault, I. Gentil, A. Guillin, F.Y. Wang. Lq -functional inequalities and weighted porous
media equations, Potential Anal. 28 (2008), no. 1, 35–59.
[29] M. Del Pino, J. Dolbeault. Asymptotic behavior of nonlinear diffusions. Math. Res. Lett. 10
(2003), no. 4, 551–557.
[31] L. C. Evans. “Partial differential equations”. Graduate Studies in Mathematics, 19. American
Mathematical Society, Providence, RI, 1998.
[32] J. Fourier. “Théorie analytique de la Chaleur”; reprint of the 1822 original: Éditions Jacques
Gabay, Paris, 1988. English version: “The Analytical Theory of Heat”, Dover, New York,
1955.
34
[33] A. Friedman, S. Kamin. The asymptotic behavior of gas in an N-dimensional porous medium,
Trans. Amer. Math. Soc. 262 (1980), 551–563.
[38] A. Gmira, L. Véron. Large time behaviour of the solutions of a semilinear parabolic equation
in RN . J. Differential Equations 53 (1984), no. 2, 258–276.
[39] A. A. Grigor’yan. On the fundamental solution of the heat equation on an arbitrary Rieman-
nian manifold. (Russian) Mat. Zametki 41 (1987), no. 5, 687–692, 765. English translation:
Math. Notes 41 (1987), no. 5–6, 386–389.
[40] A. Grigor’yan. “Heat Kernel and Analysis on Manifolds”, American Mathematical Society,
Providence, 2009.
[41] A. A. Grigor’yan, M. Noguchi. The heat kernel on hyperbolic space, Bull. Lond. Math. Soc.
30 (6) (1998) 643–650.
[43] J. Hulshof, J. L. Vázquez. The Dipole Solution for the Porous Medium Equation, Annali
Scuola Normale Sup. Pisa, Serie IV, 20 (1993), 193–217.
[44] R. Jordan, D. Kinderlehrer, F. Otto. The variational formulation of the Fokker-Planck equa-
tion, SIAM J. Math. Anal., 29 (1998), 1–17.
[47] S. Kamin, G. Reyes, J. L. Vázquez. Long time behavior for the inhomogeneous PME in a
medium with rapidly decaying density, Discrete Contin. Dyn. Syst. 26 (2010), 521–549.
[48] S. Kamin, J. L. Vázquez. Fundamental solutions and asymptotic behaviour for the p-
Laplacian equation, Rev. Mat. Iberoamericana 4:2 (1988), 339–354.
[49] S. Kamin, J. L. Vázquez. Asymptotic behaviour of solutions of the porous medium equation
with changing sign, SIAM J. Math. Anal. 22 (1991), no. 1, 34–45.
35
[50] O. Kavian. Remarks on the large time behavior of a nonlinear diflusion equation, Ann. Inst.
H. Poincare Anal. Non Linéaire 4. No. 5 (1987), 423–452.
[52] E. H. Lieb, J. Yngvason. A Guide to Entropy and the Second Law of Thermodynamics,
Notices Amer. Math. Soc. 45 (1998), no. 5, 571–581.
[53] P. A. Markowich, C. Villani. On the trend to equilibrium for the Fokker-Planck equation: an
interplay between physics and functional analysis. Mat. Contemp. 19 (2000), 1–29.
[54] F. Otto. The geometry of dissipative evolution equations: the porous medium equations,
Comm. Partial Diff. Equations, 26 (1–2) (2001), pp. 101–174.
[57] H. Risken. “The Fokker-Planck equation, methods of solutions and applications”, 2nd ed.,
Springer-Verlag, berlin, Heidelberg, (1989).
[58] S. Salsa. “Partial differential equations in action. From modelling to theory.” Third edition,
Unitext, 99. Springer, 2016.
[61] P. Quittner, P. Souplet. “Superlinear parabolic problems. Blow-up, global existence and
steady states”. Birkhäuser Advanced Texts, Birkhäuser Verlag, Basel, 2007.
[62] G. E. Uhlenbeck, L. S. Ornstein. On the theory of Brownian motion, Physical Review 36,
(1930), 823–84.
[63] G. Toscani. Entropy production and the rate of convergence to equilibrium for the Fokker-
Planck equation, Quart. Appl. Math. 57 (1999), 521–541.
[65] J. L. Vázquez. Asymptotic behaviour for the Porous Medium Equation posed in the whole
space. Journal of Evolution Equations 3 (2003), 67–118.
36
[66] J. L. Vázquez. “The Porous Medium Equation. Mathematical Theory”, vol. Oxford Math-
ematical Monographs, Oxford University Press, Oxford, 2007.
[67] J. L. Vázquez. Barenblatt solutions and asymptotic behaviour for a nonlinear fractional heat
equation of porous medium type, J. Eur. Math. Soc. (JEMS) 16 (2014), no. 4, 769–803.
[68] J. L. Vázquez. Fundamental solution and long time behaviour of the Porous Medium Equation
in hyperbolic space, J. Math. Pures Appl. (9) 104 (2015), no. 3, 454–484
[69] J. L. Vázquez. The mathematical theories of diffusion. Nonlinear and fractional diffusion, in
“Nonlocal and Nonlinear Diffusions and Interactions: New Methods and Directions”. Springer
Lecture Notes in Mathematics, 2186, C.I.M.E. Foundation Subseries, yr 2017, pg. 205-278.
[70] D. V. Widder. “The Heat Equation”, Academic Press, New York, 1975.
[71] C. Villani. “Topics in Optimal Transportation”, Graduate Studies in Math, 58, AMS, 2003.
Address:
Juan Luis Vázquez. Departamento de Matemáticas, Universidad Autónoma de Madrid,
Campus de Cantoblanco, 28049 Madrid, Spain. e-mail address: [email protected]
37