Mean Value Theorems For Vector Valued Functions
Mean Value Theorems For Vector Valued Functions
1. Introduction
The object of this paper is to give a generalisation to vector valued functions
of the classical mean value theorem of differential calculus. In that theorem
we have
for some c in the open interval ]a, 6[ w h e n / is a real valued function which is
continuous on the closed interval [a, b~\ and differentiable on the open interval.
The counterpart to (1) when / has values in an n-dimensional vector space
turns out to be
f(b)-f(a) = (b-a) £ XJ'{ck) (2)
k = 1
n
where ck e ]a, b[_, 0^Xk, and £ ^* = *•
The key to the algebraic conclusion (2) lies in geometric ideas. Convex
sets play a basic role in the statements and proofs of the mean value theorems
developed in § 3. Let us recall a few definitions here.
Definitions, (a) Let x and y be elements of a vector space F. The segment
joining x and y is the set of elements Xx + {\—X)y for all X such that O g l ^ 1.
(b) A subset A of F is convex provided the segment joining each pair of points
in A is contained in A. (c) If A is any subset of F, the convex cover HA of A
is the minimal convex set containing A, that is, the intersection of all convex
sets containing A. (d) When F is a topological vector space the closed convex
cover KA of A is the minimal closed convex set containing A.
The equation (2) is the end of a three step development. The first step
is to prove that (f(b)—f(a))/(b—a) is in the closed convex cover of the set of
values of the derivative. At this stage / may have values in any locally convex
topological linear space. Then under additional hypotheses, for example
that the space is finite dimensional, it is also true that (f(b)—f(a))l(b—a) is
in the convex cover of the set of values of the derivative. The algebraic form
of this conclusion is
Kb) -f(a) = {b-a) "Z XJ'(ck) (3)
k = 1
2. Increment theorems
It is usual to prove that a function with a positive derivative is increasing
by using the mean value theorem. Here we shall reverse the procedure. From
a sufficient condition for a function to be increasing we can obtain mean
value theorems for vector valued functions. From the same starting point
and by a very similar method it is also possible to prove increment theorems
for vector valued functions. In fact, for economy of exposition it is advantage-
ous to derive the mean value theorems from an increment theorem for vector
valued functions.
The increasing function theorem that is suited to the present purpose,
Theorem A below, is a special case of theorems given by Aumann (1, p. 222)
and Gal (6, p. 310). The principal simplification is in replacing a semi-
continuity condition on / by the assumption of continuity.
Definitions, (a) Let / be a continuous function from an interval [a, ft]
into a topological linear space F over the real field R. Let / e [a, ft[. An
S —f
(£>"/ may take on the values — oo and + oo but it will be assumed throughout
that right-hand derivative values are finite when / is real valued.)
Theorem A. Let f be a continuous function from [a, ft] into R. Let M be
a countable subset of ]a, ft[ and RDf a real valued function such that RDf(f)
is a right-hand derivative value of f for all t in ]a, b\_ — M. If there is a set N
of Lebesgue measure zero such that 0^DRf(t) for all t in ]a, b\_— N, then f is
an increasing function on [a, b~\. Moreover, if 0 <DRf(t) for at least one t in
-\a,blthenf(a)<fQ>).
Definition. A sublinear funtional on a linear space F is a function p from
F into R such that
p(Xx) = Xp(x) and p(x+y)^p(x)+p(y)
for all A^O and all x and y in F. (p is a linear functional when the first relation
holds for all A in R and equality holds for all x and y in the second relation.)
The following theorem differs in detail from those stated by Bourbaki
and Dieudonne. The proof is the one suggested in (4), p. 156, problem 6.
Theorem B. Let F be a topological linear space. Let p be a continuous
sublinear functional on F. Let f and g be continuous functions from [a, b~\ into
F and R respectively. Suppose there is a countable subset M of~\a, b\_ such that,
for each t in ]a, ft[ —M, / and g have right-hand derivative values RDf(t) and
RDg(f) associated with the same sequence decreasing to t. Suppose also there
is a set N of Lebesgue measure zero such that M^N^]a, b[ and
p{RDf(t))^RDg(f) forallt£~\a,b\_-N. (4)
Then
pU{d)-f(c))^g{d)-g(c)
for all [c, </] contained in \a, b~\. Moreover, if inequality holds in (4) for at least
one value oft, then p{f(b)-f(a))<g(b)-g(a).
Proof, (a) Given x0 e F, let Fo = {Ax0 : A e R}. On the subspace Fo
define u0 by MO0**O) = *P(xo)- Then uo(x)£p(x) for all x in Fo. The Hahn-
Banach theorem (7, p. 42) provides an extension u of w0 to all of F such that
u is a linear functional and u(x)^p(x) for all x in F. Moreover u is continuous
at 0 since p is continuous and
| u(x)\ g max {p(x\ p(- x)}
for all x in F. Therefore u is continuous on F.
is locally convex.
Theorem 1. Let F be a locally convex topological linear space. Let f be
a continuous function from [a, ft] into F. Let M be a countable subset of~\a, ft[
and RDf a right-hand derivative value function for f on ]a, b[_ — M. Let N be
a set of Lebesgue measure zero such that M^N^~]a, ft[. Set
D= {RDf(t):te-\a,bl-N}
and
Q = {(f(d)-f(c))/(d-c) : aSc<d^b}.
Let KD and KQ be the closed convex covers of D and Q. Then KD = KQ.
k = 1
m
ck e]a, b[—N, 0^Xk, and Y At = 1.
k = 1
Proof. The convex cover of any set A consists of all sums of the form
YJ Kxk where xk e A, 0^Xk, £ Xk = 1, and m ranges over all the positive
k = 1 * = 1
integers. (See (7) p. 131.) The conclusion follows. (For convenience call
such a sum a convex combination.)
More can be said about m when Fis finite dimensional. Also the hypothesis
that HD has an interior point relative to V becomes superfluous.
If A is a subset of En, each element of HA can be expressed as a convex
combination of elements of A having n + 1 terms. Moreover, if A has at
most n connected components, the convex combinations having n terms yield
all of HA. (See Eggleston (5) p. 35.) Simple examples show no further
reduction of m is possible.
Theorem 3. Replace F by En in the conditions of Theorem 1. Then Q c HD
and i.f(b)—f{dj)l{b — a) is an interior point of HD relative to the minimal linear
variety V containing HD. Moreover,
f(b) -f(a) = (b-a) " l XkRDf(ck) (5)
k= 1
n+1
where cke~]a, b[ — N, 0^1k, and £ Xk = 1. Finally, if D has at most n
k = 1
connected components the sum in (5) may be replaced by a sum having n terms.
Proof. Every convex set in En has an interior point relative to the minimal
variety containing the set (5, p. 16). Moreover, every variety in En is a closed
variety. Thus Theorem 2 can be applied to the restriction o f / t o each sub-
interval [c, d]. Then, with obvious notation,
(f{d) -f(c))/(d- c) e HD{c, d] £ HD
for all real /. Let E be the set of all real t such that l.k + tvk^0 for k = 2,
..., n + l. Then 0 e £ since every kk is positive. There is no loss in generality
in supposing some vk is negative. Then E is bounded above. Let t0 = sup E.
t £ ( * o
k= 2
n+l
Then v^A t >0 and v " 1 ^ * ^ £ V^-Mo* 1 *)** is a convex combination
fc = 2
of a proper subset of {xl5 ..., x n+1 } which is equal to zero. This contradicts
our assumption.
The linearly independent set A contains n elements. Thus any element y
n+l
in En has one and only one representation £ /****• Furthermore y is a
boundary point of C if and only if f-ik^0 for 2^Ar^«+ 1 and nk = 0 for at
least one k.
Let J = sup{f : / ' ( 0 i , t])^C}. Since /'(<a) e C and f'(c2)$C it follows
that ct^d^c2. Clearly/'([f1!, rf[)^C C is a closed convex set. From
Theorem 1, (f(t)-f(d))/(t-d)e C for all f in [c 1; </[. T h u s / V ) e C and
d<c2- On the other hand there exists a sequence fl5 ?2> '3. ••• decreasing to rf
such that / '(tk) $ C. Since / ' is right-hand continuous at d it is true that
n+l
f'(d) belongs to the closure of the complement of C. Therefore/ '{d) = X ^*x*
k = 2
with (ik^0 for all k and nk = 0 for at least one fc. This equation can be
rewritten as
* 2
/ n+l \
Multiplication by 1/1 1 + Y nk) produces the desired convex combination
V *=2 /
of n values of/' equal to 0.
The condition for reduction of the number of terms in the convex com-
bination given in Theorem 4 is not a consequence of the one given in Theorem 3
when n > 1. To establish this it suffices to give an example of a function /
satisfying the hypotheses of Theorem 4 for which f'Qa, b[_-M) has more
than n components. The function / whose construction follows is continuous
on [0, 1/48], differentiable on ]0, l/48[, and its derivative is right-hand con-
tinuous on ]0, l/48[. It has its values in E2. Since E2 is a linear subspace of
En for n>1 we can also consider/as a function having values in £„ for all
«^2. The components of/'(]0, l/48[) are the same whether this set is con-
sidered as a subset of E2 or as a subset of En for n > 2. The collection of
components is countably infinite. Thus this one example serves for every n
greater than 1.
Let a be a real constant. Set g(0, a) = (0, 0) and g(t, a) = r2(cos alt, sin ajt)
for t i=- 0. The map t-+g'{t, a) carries each interval \r, 0] onto {(0, 0)}uS
where 5 is a spiral which approaches the circle centred at (0, 0) with radius a
from the outside. Set
h{t) = g(t, a)-g(r, a) + (t-r)[y-g'(r, d
Then/ satisfies the hypotheses of Theorem 4. B u t / ' maps ]0, l/48[ onto
an infinite union of spirals and each spiral is a connected component of the
union.
In the case n = 1 Theorem 4 is a consequence of Theorem 3. Indeed,
/ '(0 exists throughout ]a, b[ hence, by the Theorem of Darboux, / ' has the
intermediate value property. Thus f'(\a, Z>[) is an interval and the number
of components is 1.
We used right-hand derivative values in Theorems 1, 2 and 3. Left-hand
derivative values can be used equally well. In fact the corresponding theorems
are corollaries of the ones given. Indeed, if/ has left-hand derivative values
LDf(t) for all t in ]a, b\_-M and
9(0 = -/(-'), -b£t£-a,
then g has right-hand derivative values such that RDg(t) = LDf(—t). More-
over (g(d)-g(c))l(d-c) = (f(-d)-f(-c))l(-d+c). Thus application of
Theorems 1, 2 and 3 to g yields corresponding theorems for / in which D,
HD and KD are formed from the values of LDf. Furthermore, if/ has both a
left-hand and a right-hand derivative value function, the two sets KD formed
from LDf and RDf are equal since both are equal to KQ.
4. Applications
Many of the theorems of elementary calculus which are usually derived
from the classical mean value theorem can just as well be got from an increment
theorem like Theorem B. Dieudonne has given an excellent account of their
extensions to Banach spaces in (4) pp. 156 ff. We will avoid duplicating them
The linear mapping u is called the derivative of/ at y and is denoted by f'(y).
Its value at any x in E is denoted by f'(y) . x.
If g is a function from [a, b~\ into G having a derivative value RDg(t) the
chain rule for the composite function fg reads
RDfg(t)=f'(g(t)).RDg(t).
Definitions, (a) Let x and y belong to the connected open set G. Let
P(x, y) denote the collection of all functions g from [0, 1] into G such that
g(0) = x, g{\) = y, g is continuous on [0, 1], and there exists a countable
set M such that g has a right-hand derivative value function RDg on ]0, 1 [ - M.
(b) Suppose/is a differentiable function from G into F. Let K(x, y; g) denote
the closed convex cover of D(x, y; g) where, for fixed g in P(x, y),
D(x, y; g) = {f'(g{t)). RDg(t) : t e]0, 1[-M}.
Also let
K(x, y)= f) x<>> y'' 9)-
geP(x.y)
Theorem 8. Let f be a differentiable function from the connected open set G
in the normed space E into the normed space F. Then
f(y)-f(x)eK(x,y)
5. Examples
There are standard examples which show that the countable set M cannot
be replaced by a set of measure zero in Theorem A and its consequences.
Let Tbe the Cantor ternary set in [0, 1]. Let k be the continuous function
on [0, 1] such that A;(0) = 1, ifc(l) = 0, k(t) = \ for \<t<\, k{t) = J for
£ < * < ! , k(t) = | for i < t <f, etc. Then k'(i) = 0 for all t in [0, 1] - T. Since
T is a set of Lebesgue measure zero this example shows that it is not possible
REFERENCES
(1) GEORG AUMANN, Reelle Funktionen (Berlin, 1954).
(2) A. K. Aziz and J. B. DIAZ, On a mean value theorem of the differential
calculus of vector-valued functions, and uniqueness theorems for ordinary differential
equations in a linear-normed space, Contributions to Differential Equations, 1 (1963),
251-269.
(3) N. BOURBAKI, Elements de Mathimatique, Livre IV, Fonctions d'une variable
reelle, Chaps. 1, 2, 3 (2nd ed., Paris, 1958).
(4) J. DIEUDONNE, Foundations of Modern Analysis (New York, 1960).
(5) H. G. EGGLESTON, Convexity (Cambridge, 1958).
(6) ISTVAN S. GAL, On the fundamental theorems of the calculus, Trans. Amer.
Math. Soc. 86 (1957), 309-320.
(7) ANGUS E. TAYLOR, Introduction to Functional Analysis (New York. 1958).
E.M.S.—P