F G Derivatives
F G Derivatives
by
Francis J. Narcowich
January, 2021
There are two types of derivatives that get used in connection with non-
linear functions (and functionals), the Gâteaux (weak) derivative and the
Fréchet (strong) derivative.
Let V and W be Banach spaces, Ω an open set in V , and F a function
that maps Ω into W . Fix y in Ω.
F (y + εv) − F (y)
∆y F (v) := lim , for all v ∈ V, (0.1)
ε→0 ε
then we say that F is Gâteaux differentiable at y and that ∆y F is the Gáteaux
derivative of F at y.
The matrix on the far right is called the Jacobian matrix, and it’s usually de-
noted by J or Jy . The calculation above works for any system of equations of
the form F (x1 , . . . , xn ) = [f1 (x1 , . . . , xn ) · · · fm (x1 , . . . , xn )]T . The Jacobian
1
The main reference for these notes is Jordan Bell, Fr’echet and Gâteaux Derivatives,
https://individual.utoronto.ca/jordanbell/.
2
This is Gâteaux’s definition. Other authors use a directional derivative and even allow
nonlinear derivatives.
3
Note that ε is not required to be positive.
1
∂fi
matrix for the larger system is m × n and has entries Jij = ∂x j
. The same
calculation as above works for the more general case. Thus, ∆y F = Jy .
Another example, which differs from the one above, is the following: Let
H : R2 → R be defined by
( 3
|x| y
x4 +y 2
(x, y) 6= (0, 0)
H(x, y) = (0.3)
0 x = 0, y = 0.
satisfies
K(ε(u, v)) − G(0, 0)
= K(u, v),
ε
which is obviously nonlinear in v. Consequently it’s Gâteaux doesn’t exist.
One last remark. The existence of the Gâteaux derivative doesn’t neces-
sarily imply continuity of a function. For instance, let
( 4
x y
x6 +y 3
(x, y) 6= (0, 0)
M (x, y) :=
0 (x, y) = (0, 0).
A computation nearly identical to the one above shows that ∆(0,0) M (v) = 0.
However, M (x, x2 ) = 12 6= 0. Thus, M is discontinuous at (0, 0), even though
the Gâteaux derivative exists.
2
The Fréchet derivative is defined in a way that is somewhat different than
the Gâteaux derivative. Let V , W , Ω and F be as defined earlier. Again, fix
y ∈ Ω.
Definition 0.2 (Fréchet Derivative). If there is a bounded linear map ∆y F :
V → W that satisfies
kF (y + v) − F (y) − ∆y F (v)kW
lim = 0, (0.4)
kvk→0 kvkV
then we say that F is Fréchet differentiable at y and we call ∆y F the the
Fréchet derivative of F at y.
Like the Gâteaux, one can show that the Fréchet derivative is unique,
if it exists. However, unlike the definition of the Gâteaux derivative, where
the Gâteaux derivative is given directly via the formula in (0.1), the Fréchet
derivative is defined indirectly, without a formula, as a linear operator satisfy-
ing (0.4). This would seem to make it more difficult to compute the Fréchet
derivative. However, if the Fréchet derivative does exist, it can always be
computed via the Gâteaux derivative:
Proposition 0.3. If the Fréchet derivative exists, then the Gâteaux deriva-
tive also exists, and the two are equal.
Proof. Suppose that the Fréchet derivative exists at y. Fix v and note that
the linearity of ∆y F implies that ∆y F (εv) = ε∆y F (v), so that
By (0.4) and the fact that any approach to zero for kvk is allowed in the
Fréchet case, the limit on the right is zero. Hence, the limit on the left is
also zero. Thus, the Gâteaux derivative exists and is equal to the Fréchet
derivative, whenever the latter exists.
Consider a function F (x1 , . . . , xn ) = [f1 (x1 , . . . , xn ) · · · fm (x1 , . . . , xn )]T .
When is F guaranteed to be Fréchet diffentiable? We leave it as an exercise
to show that F is Fréchet differentiable at a given point if and only if all of
∂f
the first partials ∂kj are continuous there.
The Gâteaux derivative may exist, but the Fréchet derivative may not.
Earlier, we showed that the Gâteaux derivative for the function defined in
3
(0.3) was ∆(0,0) H(v) = 0. If the Fréchet derivative exists, it is equal to the
Gâteaux derivative. Thus, by (0.4), we have that
kH(u, v) − H(0, 0) − ∆(0,0) H(v)k |u|3 v
lim = lim 1 = 0.
kvk→0 kvk kvk→0 (u2 + v 2 ) 2 (u4 + v 2 )
∆x (G ◦ F ) = ∆y G ◦ ∆x F,
which is the chain rule, holds.
Proof. Let v be in Ω. Since F is Fréchet differentiable at x, F (x + v) =
F (x) +∆x F (v) + o(kvk). In addition,
| {z }
y