Nonlinear Systems and Differential Geometry
Nonlinear Systems and Differential Geometry
1976
1, JANUARY
parameter systems,” Trans. ASME J. Basic Eng., pp. 67-79, [ 1451 T. K. Yu and J. H. Seinfeld, “Obsewability of a class of hyper-
Mar. 1964. bolicdistributedparametersystems,” IEEE TransAutomat.
[ 142 ] E. D. Ward, “Identification of parameters in nonlinear boundary Contr., (corresp.),vol. AC-16, pp. 495-490, Oct. 1971.
conditionsofdistributedsystemswith linear fields,”Ph.D. [ 1461 V. P. Zhivoglyadovand V. Kh. Kaipov,“Application ofthe
dissertation, Purdue Univ., Lafayette, IN,Aug. 1971. method of stochastic approximations in the problem of identifi-
[ 1431 E. D. Ward and R. E. Goodson, “Identification of nonlinear cation,” Automat. Telemek., vol. 27, pp. 54-58, Oct. 1966.
boundary conditions in distributed systems with linear fields,” [ 1471 -, “Identification of distributedparameterplants in the
Trans.ASME J. DynamicSyst. Meas. Contr., vol. 9 5 , pp. presence ofnoises,” in PreprintsIFAC Symp., June, 1967,
390-395, Dec. 1973. Paper 3-5.
[ 1 4 4 ] M. J . Wozny, W. T. Carpenter, and G. Stein, “Identification of [ 1481 V. P. Zhivoglyadov, V. Kh.Kaipov,and I. M. Tsikunova,
Green’s functionfordistributed parameter systems,” IEEE “Stochastic algorithms of identification and adaptive control of
Trans. Automat.Contr. (comesp.),vol.AC-15, pp. 155-157, distributed parameter systems,” in Preprints IFAC Symp. Con-
Feb. 1970. trol of Distributed Parameter Systems, June 1971, Paper 13-1.
Abstract-Nonlinear systems which are governed by a fmite number In spite of the rather general form of nonlinear systems we
of ordinary differential equations with controls present constituk a work with here, most of the results given here could be stated
large andimportant c k d model for proclical purposegInthe in even greater generality. For example, the restriction to one
last few years, there h x been considerable progress m our understmd-
ing of this class of model. This is an expository paper devoted to sur- input is completelyunnecessary; many resultsapply to i =
wying and explaining m e of the main results currentlyavailable. f ( x , u ) , etc.It seemed counterproductiveto move in this
@ackgoundmaterial on m d d d theory is included in ader to make direction, however,ina paperintended as an expository
&e paper more nearly seifcontained. introduction.
We introduce a few standardnotations. R n standsfor
I.INTRODUCTION
n-dimensional Cartesian space with itsordinary topology,
HE PURPOSE of this paper is to describe some of the if x E R n then IlxII = ( x : + x : + . - * x ; ) ’ ’ * . We use letters
main theoretical results onthe class of input/output such as M and N for differentiablemanifolds (see appendix
(I/O) models which take the form for definition). We always assume that f(-), g(.), and h ( . ) in
(1) are infinitelydifferentiable andoften we assume even
more, that they depend analytically on their arguments. We
call systems of this latter type linear-analytic systems.
The most significant results relate to a) controllability theory, The special case of bilinear systems
b) 1/0 theory based on Volterra series, c) isomorphism t h e e
rem and bilineaxization, and d) stochastic theory. i ( t )= A x ( t ) + u ( t ) B x ( t ) ; y ( t ) = (c, x ( t ) > : x ( 0 ) = x o
Our intention is to give an introduction to this area. We will has received a great deal of attention in the literature in the
h e a number of the most important results but we do not past few years.Although this theory is,itself, quite a suc-
give proofs unless there are special circumstances which make cessful generalization of the linear theory, it now appears that
it desirable. The material to be discussed has a great deal of it should be viewed in the context of the more general class
intuitive content but it also requires some technical develop of systems described here. The reasons for this are that many
ments. In fact, the solution of some of the central problems of the principleresults on bilinear systems are truefor this
required the development of some new pure mathematics. In wider class of systemsand the class of bilinear systems
an attempt to present a blend of intuition and solid theory, has the unfortunate propertyof not being closed under compo-
we have included in an appendix the definition of many tech- sition and feedback.This is not to say that bilinearsystems
nical terms and also precise statements of a few background are of littleinterest. On thecontrary,they are the most
theorems. interesting specialization of this theory, and, in fact, they a p
pear from the point of view of the theory of linear-analytic
ManuscriptreceivedApril 23,1975; revised July18, 1975. This systems to be even more general, and hence of greater interest,
paper waswrittenwhilethe authorheldaGuggenheim Fellowship.
This work was supported in part by the U S . Office of Naval Research than was first expected.
undertheJointServicesElectronics Program Contract N00014-75-C- With regard to the literature, there is a recent conference
0648.
The authac is with the Division of Engineering and Applied Physics, proceedings [ 11 which contains papers on most of the topics
Harvard University, Cambridge,M A 02138. to be discussed here. In particular, the paper by Lobry is rele
horized licensed use limited to: J.R.D. Tata Memorial Library Indian Institute of Science Bengaluru. Downloaded on September 18,2024 at 07:51:00 UTC from IEEE Xplore. Restrictions ap
62 PROCEEDINGS
JANUARY OF THE IEEE, 1976
with a principal axis. That is, we have f and g but not necessarily {f, g}LA. A particularly simple
special case of this kind of result is expressed by the following.
n = u(t)g Theorem 5: Let L = {f,g}LA and let LO be the smallest
for some constant vector g. Again, the motion with u = 0 is subalgebra of L which contains g and is closed under Lie
periodic (elliptic function theory!) thus mitigating the effect bracketing with f. Suppose thatfor all h in LO wehave
of the drift term. [ h ,g] = ahg for some constant a. Then
If we linearize this system about w1 = wz = w3 = 0, then R ( t )= { exp L O}G exp tfxo .
linear theory will predicta one dimensionalreachable set,
provided g is nonzero. We will see that the Lie theory, on the Proof: See [9, p. 945, Corollary 11 and make the neces-
other hand, answers the problem precisely. We begin by com- sary modifications to cover the present setup.
puting some of the vector fields belonging to the Lie algebra Example [ 10, p. 2771 : Consider the linear time invariant
of vector fields generated by f and g. Let a , b , and c denote system
the components of g. We have, for example, i ( t ) = A x ( t ) + b u (xt()0; ) E R " .
[f,gl =
[
a(bw3 + cw2 1
Kaw3 + c u l l
y(aw2 + bwl I]
[[f,gI,gl= 2
and that
L = {b,Ab,-,A"-'b,Ax}
Lo = { b , A b ; . . , A " - ' b }
[ [ f , g l , [ [ f , g l , g l=
l 2 [Lo,bI= (0).
Thus thereachable set at time t is
=/?ya4(0cZ - y b 2 ) + ~ ~ y b ~ (- 7
a ca2~)+4?c4(ab2 - pa2). 111. I N P U T ~ U T P U TDESCRIFTIONS
One of the central ideas of system theory is the relationship
We now analyze the case of the symmetric rigid body. In
between empirical "external" descriptions of systems and the
this case, we can assume 7 = 0 and since physics demands that
various detailed "internal" models one can propose to account
a + p + y = 0, we see that the determinant is simply ka3c4 * for the observed phenomena.Inthe caseof linearsystems
(b2 + a Z ) . If this vanishes, then one of three things happens-
this circle of ideas is expressed in terms of the relationship
either a is zero, c is zero, or a and b are both zero. The first between theintegral equation description
possibility implies that p is zero, thus f " 0 and the reachable
set is one dimensional. The second implies, since y = 0, that
w 3 is a constant regardless of u and the reachable set is not y ( t ) = L ' w(t,u ) u ( u ) d u + t ( t ) (6)
three dimensional. The third implies that w: + wz is acon-
stant regardless of u , thus the reachable set is not three dimen-
and the linear differential equationdescription
sional. All cases are incompatible with controllability. If
none of these occur,thenthe system is controllable and i ( t ) = A ( t ) x ( t )+ B ( t ) u ( t ) , y(t) = c(t)x(t).
ius this is a complete analysis of the symmetriccase.
If y is nonzero, then theanalysis is apparently more tedious, In the present context, the internal descriptions will be by
and it is not clear that the particular Lie brackets displayed means of nonlineardifferential equations. What substitute
here tell the whole story. Of course, if they do not, then we can be found for (6) as an external description? We can re-
must look at additional elements of {f,g}LA. gard (6) as the fust two terms in a power series expansion for
y . The general series is the Volterra series:
We concludethis section with a discussion of one further
type of result on controllability which has been investigated
by Hirschorn [ 9 ] . The idea is, that under some circumstances,
it should be possible to "factor out" the effect of the drift v ( t )= w o ( t ) +
OD 1'lul ...~''wj(t,ul,uz,...,uj)
i=l
term for
.u(u')u(u~)...u(u~)du~du~
"'dUj.
i ( t )= f [ x ( t ) l + u ( t ) g [ x ( t ) l , x ( t ) = x0 (5)
This series, as it turns out, is just the right alternative for the
so as to be able to
express the reachable set at time t as integral equation (6) whendealing with linear analytic systems.
This is due inpart to thefollowing theorem from [ 1 11.
R ( t )= {exp L}G (exp t f h
Theorem 6: Suppose that f(*,*):R'X R" +R" and g(*,.):
where L is some Lie algebra of vector fields constituted from R' X R" +R" are continuous with respect to their first argu-
horized licensed use limited to: J.R.D. Tata Memorial Library Indian Institute of Science Bengaluru. Downloaded on September 18,2024 at 07:51:00 UTC from IEEE Xplore. Restrictions ap
66 PROCEEDINGS OF THE
IEEE, JANUARY 1916
mentandanalyticwith respect to theirsecond. Given any are all of the form ip= A p x p + uBpxp + ubp, and the Vol-
interval [0, T] such that the solutionof terra series for these bilinear systems can be computed using
the Peano-Baker series.
i ( t )=f[t,x ( t ) ] , x0 = 0
As an example of Volterra series computation, we compute
exists on [0, T I , there exists an E > 0 and a Volterra series the frequency modulation Volterra series. The model is
for
X(f) + (a2 + u ( t ) ) x ( t ) = 0 , x ( 0 ) = 0, i ( 0 ) = 1,
i ( t ) = f [ t , x ( t ) l+ u ( t ) g [ t , x ( t ) l , x(O)=xo,
Y 0 ) = x (t).
Y ( t ) = h[ x ( t ) l (7) Writing this as a first-order system gives
with the Volterra series converginguniformly on [O,TI to
the solution of ( 7 ) provided lu(t)l < E . Moreover, the Vol-
terra series is unique.
We give someindicationofhow the proofgoes. For bi-
linear systems
i ( t ) = A ( t ) x ( f )+ u ( t ) B ( t ) x ( t ) , ~ ( 0= )
xO, Y ( t )= CX(~)
and
we can introduce z ( t ) via x ( t ) = @ ~ ( O
t ,) z ( t ) . Then z ( t )
satisfies
def
; ( f )= U ( t ) @ i ’ ( f , O ) B ( t ) @ A ( t O, ) Z ( f ) =
Y ( t ) = a-’ sin at
l’ a+ sin a(t - u ) sin aou(o) do
Jo” [’
U(f)G(f)Z(f).
Using the Peano-Baker formula, we have + a-3 sin a(t - u) sin n(a - p )
z ( t ) = [I+[‘g(ol)dol +IT‘’
0 0
g(ol)B”(oz)u(ol)u(02) sin a p u ( p ) u ( o ) dpdo + *
* dozdol + .
* * ] xo.
with the nth
kernel being
w , = a-” sin a(t - U l ) sin a(o1 - uz)
Then c x ( t ) is characterized by theVolterra kernels
-- sin a(onml- a,) sin ao,.
w k ( t , o1,o2 * * ’ (Tk)= &A ( t , 0 1 )B(ol ) @ ( D l , uZ)B(oZ) ’*
As stated, the existence theorem for Volterraseries is only a
@A (ok-1, ok)xO. locally valid result. Global representation of a nonlinear sys-
(seed’Alessandro et aZ. [ 121).Fortime invariantbilinear tem by means of an everywhere convergent Volterra series is
systems, we have not expected except for special cases. These special cases art
the analogsof the entirefunctions ofcomplexfunction
w,(tl o1 * 0,) = c&fe-AulB&u~ e-A%B& “3 - -
* theory-Le., functions of a complex variable which have the
whole complex plane as a domain of holomorphy. d’Alessan-
e-A ‘nB&‘nxo.
dro et aZ. [12] have pointed out that, forbilinearsystems,
An algorithmforconstructing the Volterra series inthe the Volterra series doesconverge for all locally bounded u.
general case is given in [ l l I . Itamounts to expanding all This is, perhaps, the firstlarge class of nonlinearsystems
functions in their Taylor series, forming a sequence bilinear for which global convergence of the Volterra series has been
approximations ofincreasingaccuracy,andcomputing the established.
Volterra series forthe bilinearapproximations.Theidea Anobviousnecessaryconditionforglobalconvergence of
behind the derivationcanbededuced fromthe following the Volterra series isthatthere should benofinite escape
example. times for the given initial condition of the system, regardless
Consider of the choice of u . Equivalently, one should ask that there be
nofiniteescapetimefromanypointreachablefrom XQ
i ( t ) = - x ( t ) + x Z ( t ) + u ( t ) [ I+ x Z ( t ) l , y(t)=x(t). As waspointed out before,avector field ona manifold
said t o be complete if the solution through any initial point
Now if we look at x 2 ( t ) , it satisfies a certain differential equa- canbe extendedforwardsandbackwardsintime to give a
tion which is linear in u ( t ) . More generally, x p satisfies a dif- solution on (-m, CO). Thus onemay reasonably restrict the dis-
ferential equation which is also linear in u. We have cussion to complete vector fields. Even s o , this is a problem
0
of some subtlety in that it can happen that f and g are corn
- 12 + u
plete, and yet for
=[:-: 4+2u \
-3
i ( t )= f [ x ( t ) l + u ( t ) g [ x ( t ) l ,
Suppose that {f,g}LA is a collection of complete analytic series of algebras J? d e f i e d by = L ], =[e, J?] is [ek,
vector fields on a manifold M and suppose that every point on the trivial algebra for some k . Checking for solvability involves
M can be reached from every other point on M (this assump- computing derivatives andlinear algebra. Solutions of dif-
tion will be weakened later). If ferential equationsare not required.
We say that aVolterra series is finitely generated if there
exists a finite number of separable kernels w 1 , w2,* . , w y and
an analytic function q5 such that y ( t ) = @ ( y l ( t ) , y 2 ( t ) ,* * , -
Y r ( t ) )with
u(ok) dol do2 . ' ' dok
* u((71) u ( ( J 2 )*
y i ( t )= I
,'* * luni w i ( t ,0 1 , . . , ani)
i ( t ) = f [ ~ ( t )+l u ( t > g [ x ( t ) l , y ( t ) = h [ x ( t ) l , ~ ( 0 =) x 0
* -
u(o1) . * u(o,,) do,, * * dol.
and if for some u ( * ) defined on [ 0, TI the Volterra series
diverges, then we can still determine y ( t ) from a knowledge Naturally any Volterra series which is itself finite is finitely
-4 u ( * )and the kernels wk as follows. generated, but the converse is clearly false as exp u ( o ) do Ji
From Theorem 6, given anybounded u ( * ) , there exists e, demonstrates.
depending onlyontheboundon u , suchthatthe Volterra Theorem 8: Given (7),if the Lie algebra is solvable, then the
series converges on [0,e l . Moreover, given any u ( ' ) on [0, TI Volterra series is finitelygenerated. Conversely, given any
there exists a trajectory on M generated by u. At each point finitely generated Volterra series, there exists a realization of
on this trajectory, there is a Volterra series corresponding to it with {f,g}LA solvable.
the given differential equationandoutputmap,butwith Proof (Sketch): The second statement comes from [ 11,
altered initial state. At each point along the trajectory, there Theorem 61 with minor changes. The fmt part is an immediate
is an E such that the Volterra series based on that starting state consequence of an analysis of Chen [ 14, Section 41 ; we sketch
converges on [0,e ] for u ( t ) < M . It is not difficult to show how his argument goes. Let { f , ( x ) } i = l be a basis for the Lie
(using the proof of [ 1 1, Theorem 1] ) that the infimum of all algebra generatedby f and g. (This Lie algebra is finite
these E'S is positive, say eo. Now we sum the Volterra series dimensional since it is generatedbya f i i t e number of
by splitting up the interval [0,TI into subintervals of length elements and is solvable.) We look for a solution of
eo or less. Because of causality,i.e., because y ( t ) does not
i ( t )=f [ x ( t ) l + g [ x ( t ) l, x(0) = x 0
depend on u ( 7 ) for 7 > t , we can rearrange the Volterra series
at time neo as which is of the form
x ( t ) = (exp ml ( t )f~) (exp m2 0 ) f 2 ) * * (exp m A t ) f r ) x o .
Using the BakerCampbell-Hausdorff formula,i.e., the vector
field analog of the matrix identity
* w ; ( t , ol, ,ok) u(ol + neo) u(o2 + neo)
* * *
e F ~ e - F = ~ ++ $[ [~F , B~I I~ + * * a
We say that a linearanalytic realization is bilaterallycon- it is one to one by observability. (All points are reachable by
trollable on a manifold M if, given any two pointsX I , x2 on M , 6
going forward and backward in time, so is defined on all of
there exists r E { exp {f,g}LA }G such that m1 = x ? . This is M and is onto N , no two starting states give identical 1/0maps.)
the same as asking that there be a sequence of points x' ,x z , Clearly then, $is a one to one and onto map. We have only to
* -- ,x" with x 1 = x 1and x 2 = x" and x' reachable from x 1 establish its smoothness properties. We want to show that is, 6
and x 3 , x 4 reachable from x 3 and x 5 , . That is, bilateral -- in fact, analytic. Notice, however, that it must happen that for
controllability means one can get from any point onM to any each { t l , t 2 ; . - , t n } , w e h a v e
other point on M moving forward and backward in time. We
say that alinear analytic realization on amanifold M'is ij[(exp tlf) (exp t2g) (exp t 3 f ) * *) x01
observable if, given any two distinct starting statesx l , x2 in M , =(exptla)(expr2b)(expta)(~")x~l.
there exists an input u ( * )whose choice may depend on x1 and
x2 , such that the output fromx 1 differs from the output from If y e pick coordinates in M and coordinates in N , we can lo01
x 2 . Realizationswhich have these properties will be called at @ as a map ,of R" + R". The setup is diagramed in Fig. 4 .
minimal. Notice that the idea of dimension plays no role in The analyticity follows from the analyticity of the exp map
this definitionalthoughone would hopethat all minimal and its inverse.
realizations of the same 1/0 map would be realized on Theother(moredifficult)part of this circle of ideas is
manifolds of the same dimension, and this turns out tobe the related to theexistence of minimal realizationsand is expressed
case. by the following theorem.
Theorem 9 (Specialcase of Sussmann /I 71): Suppose we Theorem IO: Given any 1 / 0 map which is realizable by an
are given two linear analytic systems initialized linear-analytic
system on manifold
a M with
{ f , g } L A being complete,there exists arealization of the
i ( t )= f [x(t)l + u ( t ) g [ x ( t ) l , Y ( t )= hIx(t)l, x0 =x(0) same 1/0 map on amanifold N suchthatthe system is
and bilaterally controllable on N , and any two distinct states can
be distinguished from the output forsome input.
i ( t )'= a [ z ( t ) l + u ( t ) b [ z ( t ) l , y ( t )= c [ z ( r ) ~ , z o = Z(O) The proof of thistheorem requiredasharpening of the
conditions of Theorem 2. Inorder to show that{exo
which are minimal realizations on manifolds M and N , respec- {f,g}LA }G is always a differentiable manifold [ 181 , [ 191 am
tively, of the same 1/0map. Suppose that all the vector fields also a generalization of the closed subgroup theorem of dif-
in { f , g } L A and { a , b)LA arecomplete.Thenthere existsa ferential geometry [ 201. Both thesegeneralizations require
diffeomorphism @ :M + N such that new mathematical technique and yield new basic mathematics
in return.
It is of interest to contrast Theorems 9 and 10 withthe
corresponding results for bilinear systems. Bilinear systems in
R"
b [@-I ( x ) ] = g ( x )
ax i ( t ) = Ax(t)+ u(t) Bx(t), y(r) = (c, x ( t ) ) , x ( 0 ) = x o
c[@-'(x)l = h(x).
have a reachable set which is a proper subset of R". One says
Proof (Sketch): Since the realizations are minimal, there that they are minimal [ 121 , [ 21 ] if there is no subspace of
exists a map $:M+ N which sends x0 into zo and which sends R" which contains the reachableset for all time and if any
any point x in M wbich,is reachable from x0 to the point z in twodistinctstartingstates, x 1 and x2, yield different 1/0
N reached by the application of the same controlsequence t o maps. The analog of Theorem 9 is that any bilinear system
the z system. Controllability insures that this map is onto N ; can bereduced to one which is minimal, and the analog of
thorized licensed use limited to: J.R.D. Tata Memorial Library Indian Institute of Science Bengaluru. Downloaded on September 18,2024 at 07:51:00 UTC from IEEE Xplore. Restrictions app
BROCKETT: NONLINEAR SYSTEMS AND DIFFERENTIAL GEOMETRY 69
Theorem 10 is thatanytwo minimal bilinear systemsare over the set of a l l u’s which steer the deterministicsystem
isomorphic in the following very strong, sense. If, A , B , c, and
x. and F , G ,h , and zo are the data which specify the systems,
i ( t ) = A ( t )x ( t ) + b(t) u(t)
then there exists an invertible matrix P such that PAP-’ = F , from x = 0 at t = 0 t o x = x 1 at t = tl is
PBP-’ = G , cP’c = h , and Pxo = z o . Thus in the case of bilinear
systems, the structure oneassumes to begin with is ratherrigid, ~ * ( ~ ~ , t ~ ) = x ~ ~ - ~ ( t ~ ) x ~ .
but the conclusion is also very precise (linearisomorphism). Comparing this with the formula for the density, we see that
With linear-analytic systems the structure is less rigid and the if we start at x = 0 at t = 0, then the value of the probability
conclusion is less precise. densityatanypoint is inversely proportional to the ex-
We note in passing that the results we have given, together ponential of the “energy” required to getthere. In this
with a result of [ 121, allow us to give an answer to a question section,we.describetheextent to whichtheseresultscarry
which has received some attention in the literature. That is, over to linear analytic systems.
given a linear analytic systems, when does there exist a change Because of theproperties of theIto calculus, stochastic
of coordinates which puts it in bilinear form? The answer is equations on manifolds take a somewhat peculiar form. The
the following. Compute the Volterra series for the stationary basic well-posedness conditionforthe class of systems of
linear analytic system; if there exists a square matrix T ( t ) with interest here is expressed by the following theorem.
a rational Laplace transform, and vectors c and x o , such that Theorem 11: Let f , g : R ” + R” be analytic. Suppose that
the kthkernel is M = { x : x E R ” , & ( x ) = 0, i = 1, 2, * . , p ) is an analytic
variety in R ” . Given the Ito equation
w k ( t , 0 1 ,0 2 , * * * 9 0k)
-
= c‘T(t - ul) T ( U ~- u2). * T ( U ~ -- ~uk)x. dx = f ( x ) d t + g ( x ) d w , x(0)E M (8)
then the Volterra series is bilinearly realizable (cf. [ 121). If we can assert that x ( t ) belongs to M with probability one, if
no such T,c, x0 exists, then the system is not bilinearizable. for all x
The work of Fleiss [ 221 is devoted to the derivation of some C(x)&(x)=O, i=1,2;*.,p
of the results on bilinearsystems mentioned here using the
Kleene-Schutzenberger representation theorem and the theory
( ~ ( x -) 3 c Z ( x ) )~ ~ (=x0, ) i = 1;2, - * ,p
of formal power series. where F and G are given by
The paper of Porter [23] contains further material develop-
ing Volterra series ideas in adifferent way.
V. STOCHASTIC
EQUATIONS
One of themost convincingapplications of thecontroll-
ability concept for linear systems occurs in the study of linear
stochastic equationsof the Ito type Proof: (see [25] and [26] .)
As application of thisresult we look at an example from
d x ( t ) = A ( t ) x ( t ) dt + b(t) dw(t). [ 281. Consider the Ito equation
The density which results from a particular initial density can
d x = A x d t + B x d w , x ( 0 ) = xI oI x, o l l = 1.
be expressed in terms of the density which results from x(0) =
0. Thisis Under what circumstances can we ,assert that Ilx(t)ll = 1 with
probability one? In this case 4 = x x - 1 and G @ ( x )= 0 if and
only if B is skew symmetric. Likewise, ( F - (1/2) G 2 ) $ ( x )=
0 if A - (1/2) B2 is skew symmetric. Thus for example,
where W is a controllability Grammian,
w(r)=~f~~(f.(I)b(D)bf(0)~~(t,O)d.
defines aprocess which stays on the circle {x: + x f = constant}
almost surely in themean-square sense. The apparent tendency
and W is invertible, as required for the expression for p to be of the drift term to pull the process inside the circle is offset
meaningful, when the system has the controllability property. by the Ito correction term.
Thus controllability is exactly the right concept for deciding Given a stochastic differential equation, the question arises
if smooth densities exist. Moreover, if A and b are constant as to whether or not smooth transition densities exist as they
and if the eigenvalues of A have negative real parts, then, under do in the linear controllable case. If they exist, then we denote
thecontrollabilityhypothesis,thereis a unique invariant by p(t, x , y ) the probabilitydensity at time t and point x given
measure which is C” (Gaussian actually) such that a l l measures that at t = 0 the state was y .
approach it as t goes to infinity. The Fokker-Planck equationforthetransition densities
Infact,there is one more connectionbetweencontrol- associated with the Ito equation(8) is easily expressed in terms
lability and the probability density which is even more striking. of the above notation. It is
It is well known and easily seen [ 241 that the minimum value
of ($ 3
+ F ( x ) - G z ( x ) p ( t , x , y ) = 0. 1
This equation has the formal appearance of a heat equation
exceptthatthe second-order part, represented bythe G 2
uthorized licensed use limited to: J.R.D. Tata Memorial Library Indian Institute of Science Bengaluru. Downloaded on September 18,2024 at 07:51:00 UTC from IEEE Xplore. Restrictions apply
70 PROCEEDINGS OF THE
IEEE, JANUARY 1976
term, is not positive definite, so the classical existence theory property that for any twodistinct points x1 and x2 in X,there
does not apply. Fortunately, Hormander [27] has worked out are elements of r , N ( x l ) and N ( x 2 )such that N(x ) fl N(x2 ) is
a theory of such equations which exactly fits the present case. empty with x 1 E N ( x l ) and x2 E N ( x 2 ) . We ask that (X, r ) be
Hormander calls an operator separable, that is, thatthere exista countable subset of X
which is dense in (X,7). As a consequence of this, there is a
L=(F+G~) countable subset of r whichgenerates all of r under union;
hypoelliptic if, forany given Schwartz distributions 4 and that is (X,r ) is second countable. So much for the topology.
J, suchthat L $ ( x ) = $ ( x ) , it follows that is C" off the
$J
Now we consider the collection of maps @ = {@ilt$i: Mi + N i } .
support of the singular part of J, after a suitable modification We assume that there exists an integer n , a collection of open
of @ on a zero measure. A necessary and sufficient condition sets Ni, and continuous one to one maps $i :Ni + M i C R"
for hypoellipticity is that { f , g}LA should span .the tangent where Mi are open subsets of R" such that @T1 :Mi + Ni are
space of the manifold at each point. The a/& term in the also continuous. Thiscollection CP is assumed to have the
Fokker-Planck equation must be combined with L to give an following additional properties.
operator which is hypoelliptic on M X [ 0, 00) where [ 0,w) is (i) The domains of the @i form an open covering of X.
the time axis. Since a/at spans the time direction, we have (ii)Each 4i maps its domain bicontinuously onto an open
hypoellipticity if the smallest Lie algebra, which contains g subset of R",
and is closed under bracketing with f , spans the tangent space (iii)If Ni nNi is not empty and if 4i : Ni n Ni + Pii and
of M at each point of the manifold. These ideas were put 4j :Ni nNi + Qii, then @j 0 4zT1 is an infinitely differ-
together by Elliott [ 261, [ 271 to get the following results. entiable map of Pii into Qij.
Theorem 11: Suppose that { f , g}LA consists of complete (iv) This collection @ is maximal with respect to properties
vector fields on a manifold M . Suppose that the smallest Lie (ii)and (iii).
algebra, which contains g and which is closed under bracketing
with f , spans the tangent space of M at each point. Then the The locally defined maps & are called charts and @ is called,
corresponding Ito equation ratherappropriately,an atlas. The last condition (iv) is not
too important in that if we have some charts that we want to
use, then we just declare Q, to be the collection of all charts
which are compatible with the given ones. If in the previous
paragraph we replace "infinitely differentiable" by "analytic",
d e f i e s C" transition densities on M . then we obtain an analytic manifold. (Incidentally, because we
Notice that,in statingthis theorem, we have takenthe assume that the manifolds are second countable, they are auto-
opposite point of view from Theorem 10 in that here f and g matically paracompact, a condition which is sometimes assumed
definevectorfields onthe manifold andthe "correction in place of second countable.)
term" (b/2) ( a g / a x )g is what is needed to insure that the give0 In addition to R" itself, a standard source of manifolds are
Ito equation evolves on the manifold. In Theorem 9,g and f - the so-called differential(analytic)varieties, in R" described
(1 /2) ( a g / a x )g were the vector fields on the manifold. by the following theorem.
Specific instances of processes meeting these conditions have Theorem A: Let @1@2,* * * ,, ,@, be C" (analytic) functions
been discussed in the literature. For example, in [ 281 the case on R" suppose that
of processes on spheres is studied in some detail.
Theorem 11 then asserts that some essential features of the Rank[(V41,V4z,"',V4nI=m
linear stochastic differential equation can betransferred to at each point on the setM = { x : & ( x ) = 0). Then this Set with
manifolds. Only the last fact citedat the beginning of the the topology r consisting of all sets of the form 0 nM 0 open
section regarding theproportionality of the density to the in R n can be given thestructure of a c" (analytic)differ-
exponential of the control energy has failed to be generalized.
entiable manifold.
In special cases (essentially Abelian Lie groups), some progress
Proof: Cee Singer and Thorpe [ 3 1, p. 1201 .)
has been made on finding a suitable extension of this result,
A submanifold of a C"-manifold M is a pair ( N , 4) where N
but a general theory seems to require new developments.
is a C"-manifold and I#J is a one to onemapping q5 : N + M sucb
There is agrowing literature on the formulation and solution
that ( d $ ) / ( d x )has a trivial kernel at each point of N . In some
of filtering and stochastic control problems on manifolds (see, cases, one defines an n-dimensionaldifferentiablemanifold
for example, [ 301 ).
M as a connected subset of the standard pdimensional Car-
APPENDIX tesian space subjecttotherequirementthat,at each point
x of R" which belongs to M , it is possible to establish a (gen-
The purpose of this appendix is to give the basic definition erally curvilinear) coordinate system for RPwith basis elements
from the subject of differentiable manifolds in a form which y , , y 2 ,* * ,y p (RP-valued C"-functions on R p ) such that in a
is most compatable with the main body of this paper. We do neighborhood N of x , the intersection of M and N is exactly
so both to provide a convenient reference for the reader and the set of points which satisfy
also because there are many slight but potentially annoying
differences in the general literature. Yn+1 = o
By a C" differentiablemanifold, we understand a triple
(X,r , a) whereby X is a set, r is topology on X,and is a set Yn+2 = 0
of continuous maps from open subsets of X into R" with its
usual topology. Both r and CP are subject to certain restrictions
which we now describe. We ask thatthe topologicalspace
(X,r ) be a Hausdorffspace, i.e., theopensets have the y p = 0.
uthorized licensed use limited to: J.R.D. Tata Memorial Library Indian Institute of Science Bengaluru. Downloaded on September 18,2024 at 07:51:00 UTC from IEEE Xplore. Restrictions appl
BROCKETT: NONLINEAR SYSTEMS AND DIFFERENTIAL GEOMETRY 71
fi-
a =0, i = 1,2;..,p.
axi
Thus we can restrict F to T(M)and get a vector field on M.
The symbols a / ( a x i ) can be thought of as unit vectors to be
Fig. 5 . illustrating submanifold of R 3. comparedwiththe symbols i, j , k, etc. in classical vector
analysis. This notation associates witheach vector field a
partial differential operator whose domain is the differentiable
functionson M. It is particularlyuseful, for example, in
discussing the Fokker-Planck equation.
By the Whitney imbedding theorem,any manifold is dif-
feomorphic to a submanifold of R ” . In R” , there are admissi-
ble coordinate systemswhich are global-e.g., thestandard
coordinate system for R n , It also serves to identify points in
Fig. 6. Tangent space of M at m . any submanifold M C R ” and thus we may describe vector
fields on M in terms of this single global coordinate system.
When we speak of a differentiable dynamical system being
The remaining coordinate functions y 1 , y 2 , . * ,Y n then serve on a manifold, what we mean is that at each point x0 of M ,
to define points in M unambiguously. Thus adifferentiable we can pick local coordinates and express the evolution in a
manifold is a pointsetand a set of coordinatecharts. This patch about x . by
definition is fully consistent with the more abstract definition
given above and, in fact, defines the differentiable structure i ( t ) = a [ x ( t ) ]+ u ( t ) b [ x ( t ) ] .
alluded to in Theorem A (see Fig. 5). However, it may happen that M is a subset of R m , and we look
Example: We may realize the n-dimensional sphere shell, as only at the redundant coordinates
asubmanifold of Rn+’ asfollows.Let 4 1 of Theorem A be
x ~ + x ~ * * * x ~ + T l -h e1 n. a t e a c h p o i n t o f M = { x : 4 ( x ) = =f [ z ( t ) l + u(t) g[z(t)l
0) the rank of V@ is one. Thus M can be given amanifold in R” space. It is in this second way that one is most likely to
structure. Since V@ has a trivial kernel, thismakes (M, 4) a encounter a practical problem.
submanifold of R “ + l .
According to the Whitney imbeddingtheorem[32] , any REFERENCES
@anifold is diffeomorphic to one which is imbeddedin a D. Q. Mayne and R. W. Brockett, Eds., Geometric Methods in
Euclidean space. System Theory, e r d r e c h t , Holland: Reidel, 1973.
G. Frobenius,“Uber die unzerlegbaren diskreten, Bewegungs-
We now set up the definition of a vector field on a manifold. gruppen,” Sitzungskrichte d . Konig. Peruss. Akd. W h . vol. 29,
Let M be amanifold and let Fo(M) be theset of all C” pp. 654-665, 1911.
(analytic) functions on M ,F’(M) = { f l F :M -P R , f~ 6” or W. L. Chow, “Uber Systeme Von Linearen Partiellen Differential-
gleichungen ester Ordinung,” Math. Ann., vol. 117, pp. 98-105,
analytic}. A tangent vector at m E M is an operator 1939.
u : F o ( M )+ R such that forall f,g , E Fo(M) R. H e m a n n , “On the accessibdity problem in control theory,”
in NonlinearDifferentialEquationsandNonlinearMechanics,
J. P. LaSalle and S. Lefschetz, E&. New York: Academic Press,
(i) o ( f + g ) = ~ ( f+)vk) 1963.
(ii) 4 f g ) = ~ ( f* d) m ) + v(g) f h ) . H. Hermes and G. W.Haynes, “On the nonlinear control problem
with control appearing linearly,” J. SIAM Contr., vol. 1, pp. 85-
This definition does not look verygeometrical, but observe 108,1963.
first of all that the space of all tangent vectors at m is a real C. Lobry, “Quelquesaspectsqualitatifs de la theories de la
commande,” L’Universite Scientifique et Medicale de Grenoble,
vectorspace, andsecond, if x 1 , x ? , * ,x, definesalocal pour obtenir le titre de Docteur es Sciences Mathematiques, May
coordinate system in the manifold, thenfor each i, u ( f ) = 19, 1972.
a f ) / ( a x i ) J mdefines a tangent vector. In fact, one can see on A. Krener, “A generalization of Chow’s theorem and the bang-
bang theorem t o nonlinear control problems,” SIAM J. Contr.,
the basis of a Taylor series argument that these vectors form a vol. 12, no. 1, pp. 43-52,1974.
basis for the set of all tangent vectors at m , i.e., the tangent H. Sussmann and V. Jurdjevic, “Controllabilityofnonlinear
space at m . Thisvector space is denoted by Tm(M)(see Fig. 6 ) . systems,” J.DifferentialEquations, vol. 12, pp. 95-116, July
1972.
By a vector field on manifold, we understand a map which R. Hirschorn, “Topological semigroups,setsofgenerators, and
assigns to each point of m an element of Tm(M) controllability,” Duke Math. J.,vol. 40, pp. 937-947, Dec. 1973.
R. W. Brockett, “System theory ongroupmanifolds and coset
spaces,”SZAMJ. Contr.,vol. 10, no. 2, pp. 265-284, 1972.
-, “Volterra series and geometric control theory,” 1975 Pre-
printsZFAC, Philadelphia, Pa.: ISA.
P. d’Alessandro, A. Isidori,and A. Ruberti,“Realization and
structure theory of bilinear systems,” SZAM 1. Contr., vol. 12,
If the ai are C” (analytic), then this is called a C” (analytic) no. 3, pp. 517-535, 1974.
vector field. A. J. Krener, “Linearization and bilinearization of control
In R ” , we may then associate with every function f : R” -+ systems,” in h o c . 1974 AllertonConf. Circuit and System
Theory, 1974.
R” a vector field on R” K. T. Chen,“Decompositionofdifferential equations,” Math.
Ann., vol. 146, pp. 263-278, 1962.
R. W. Brockett and A. S. Willsky, “Finite group homomorphic
sequentialsystems,” ZEEE Tmns. Automat.Contr., vol. 17,
pp. 483-490, Aug. 1972.
YE (M,4) is a submanifold of R n such that FJI vanishes for all J . Baras and R. W. Brockett, “H’-functions and infinite dimen-
uthorized licensed use limited to: J.R.D. Tata Memorial Library Indian Institute of Science Bengaluru. Downloaded on September 18,2024 at 07:51:00 UTC from IEEE Xplore. Restrictions appl
12 PROCEEDINGS OF THE IEEE, VOL. 64,NO. 1, JANUARY 1976
sional realization theory,” =AM J. Contr., vol. 13,pp. 221-241, J. Wiley, 1970.
Jan. 1965. [25] J. M. C. Clark, “An introduction to stochastic differential equa-
H. J. Sussmann, “Existence and uniqueness of minimal realiza- tions on manifolds,” in Geometric Methods in System Theory,
tions of nonlinear systems-Part I: Initialized systems,” J. Math. D. Q. Mayne and R. W. Brockett, Eds. Dordrecht, Holland:
Syst. Theory, to be published. Reidel, 1973.
-, “Orbits of families of vector fields and integrability of [26] D. Elliott, “Diffusions onmanifolds arising from controllable
systemswith singularities,” Bull. Amer. Math. SOC., vol. 79, systems,” in Geometric Methods in System Theory, D. Q. Mayne
pp. 197-199, 1973. and R. W. Brockett, Eds. Dordrecht, Holland: Reidel, 1973.
1191 P. Stefan, “Two proofs of Chow’s theorem,” in Geometrk I271 - , “Controllable systems driven by bite noise,” Ph.D. dis-
Methods in System Theory, D. Q. Mayne and R. W. Brockett, sertation, Univ. of Calif., Los Angeles, 1969.
Eds. Dordrecht, Holland: Reidel, 1973. I281 R. W. Brockett, “Lie theoryandcontrol systemsdefined on
H. J. Sussmann, “On quotients of manifolds: A generalization spheres,” SZAMJ. Appl. Math.,vol.25, pp. 213-225,Sept. 1973.
of the closed subgroup theorem,” J . Differential Geo., vol. 10, [ 29 1 L. Hormander,“Hypoellipticsecond-orderdifferential equa-
Mar. 1975. tions,”Acta. Math., vol. 119, pp. 147-171, 1967.
R. W. Brockett, “On the algebraic structure of bilinear systems,” I301 J. T. L o and A. Willsky, “Estimation for rotational processes with
in TheoryandApplications of VariableStructureSystems, R. one degree of freedom-Parts I-II1,”ZEEE Trans. Automat. Contr.,
Mohler and A. Ruberti, Eds. New York: Academic Press, 197:; vol. 20, pp. 10-33,Feb. 1975.
M. Fleiss, “Sur la rialization des systimes dynamiques biliniares, 131 1 I.M. Singer and J . A. Thorpe, LectureNotesonElementary
C. R . Acad. Sc. Paris, vol. A-277,pp. 923-926, 1973. Topology and Geometry. Glenview, Illinois: Scott, Foresman
W. A. Porter, “An overview of polynomic system theory,” this and Co., 1967.
issue. [32] L. Auslander and R. E. MacKenzie,Zntroduction to Differentiable.
R. W. Brockett, Finite Dimensional Linear Systems. New York: Manifolds. New York: McGraw-Hill, 1963.
Abstmct-Lineaa passivetime-varirblenetworksareinvestigated P i - [6,Sec. 3.51 and sometimes in the time domain [ 71.Many of
the use Of distniutionai kernels as applied to the the results based upondistributiontheory to date are sum-
scattering matrix treated in the time domain. Necessary and sufficient
conditions for are obtained,md the matrix is shown marized in the books of Doleial [8]
and, more recently,
to be ameasure s~tisfyingan energy formconstraint. Lossless con- ikmanian [91.
stnints pertinent to synthesis are deveioped while networks consisting For time-invariant networks, there are also available more or
of a finite number of circuit eknents are considezed in some less classical-type works [lo]-[ 151 based upon frequency do-
Examples illustrating interesting behavior are presented.
mainconcepts.These concepts have been taken over to the
I. INTRODUCTION timedomaincharacterizations, primarily throughthestate
[ 161, such that extensions to time-variable synthesis based
HE MATHEMATICAL FIELD of functional analysis is upon the state [ 171, [ 181 become straightforward, albeit with
now recognized asa rich one witha varied, though strangeresults (as instabilities of passive structures [ 19 I).
modem,history [ 11. Within functional analysis, the Likewise, there are recent applicable developments in operato!.
theory of distributions [2]-[4] plays a particularly interesting theory [ 201, especially with regard to resolution space con-
role, especially for physicalsystems. Indeed,motivationfor cepts [21]-[ 241 as well as some distributional treatments of
the theory came,in part, from the circuit theory aspects of time-variable networks [25]-[28]. However, whenoneturns
Heaviside’s work [ 51 while more recent applications have led toward synthesis of time-variable networks,thefunctional
to the development of the properties of passive networks in a analysis results are scarce [29]-[33].Thusit seems that a
distributional framework, sometimes in the frequency domain functional analysis treatment of linear passive time-variable
networks with an emphasis upon results important for synthe-
Manuscript received January 3, 1975; revisedMarch 21, 1975.This sis is in order.
work was supported in part by the U.S. Air Force Office of Scientific In this paper, some of themostimportantproperties of
Research underGrants AF-AFOSR-337-63and AFOSR 70-1910,in
part by the Australian Government Services Canteens Trust Fund, in linear passive networks aredevelopedin terms of the time-
part by the Australian Research Grants Committee, and in part by the varying scattering matrix [34]~ ( 7t ),; this distributional kernel
U.S. Educational Foundation in Australia under a Fulbright Traveling s appears to be one of the most generally useful descriptions
Grant.
B. D. 0. Anderson is with the Department of Electrical Engineering, available, especially for synthesis. The paper is structured such
University of Newcastle, Newcastle, N.S.W., Australia. that Sections IV and V contain the general results. Section I1
R. W. Newcomb is with the Department of Electrical Engineering, essentially serves as a review section where the underlyingcon-
College of Engineering, University of Maryland, College Park, MD
20742. cepts of interest are d e f i e d ; among these are a network an4
uthorized licensed use limited to: J.R.D. Tata Memorial Library Indian Institute of Science Bengaluru. Downloaded on September 18,2024 at 07:51:00 UTC from IEEE Xplore. Restrictions appl