The Complementarity Class of Hybrid Dynamical Systems: W.P.M.H. Heemels and B. Brogliato
The Complementarity Class of Hybrid Dynamical Systems: W.P.M.H. Heemels and B. Brogliato
# 2003 EUCA
This paper gives an introduction to the field of friction, and one-sided springs; dynamic versions of
dynamical complementarity systems. A summary of linear and nonlinear programming problems; and
their main applications and properties together with dynamic Walrasian economies. In many of these
connections to other hybrid model classes is provided. applications a prominent role is played by a special
Moreover, the main mathematical tools which allow combination of inequalities, which is similar to the
one to study complementarity systems are presented linear complementarity problem (LCP) [31] of math-
briefly. Many examples illustrate the developments. ematical programming. Coupling such ``com-
The available results on modeling, simulation, controll- plementarity conditions'' to differential equations
ability, observability and stabilization are presented leads to dynamical extensions of the LCP that are
and further suggestions for reading can be found in this called complementarity systems (CS) [45,92,93]. These
overview. systems have already a long history within various
application fields like unilaterally constrained
Keywords: Hybrid systems; Complementarity; Piece- mechanical systems [22,39,65,74,75,82,88]. The main
wise linear systems; Dynamics and inequalities; Well- aim of the current paper is to introduce CS to readers
posedness; Stability; Controllability; Constrained who are unaware of what they are and what physical
mechanics; Switched circuits; Convex analysis or abstract systems they are able to model, it presents
their main features and results, which have already
1. Introduction been obtained in the domain of systems and control
theory. It has to be noted that there is a considerable
In many technical and economic applications one inherent complexity in systems of differential equa-
encounters systems of differential equations and tions and inequalities, since nonsmooth trajectories
inequalities. For a quick roundup of examples, one and possibly jumps have to be taken into account.
may think of the following: motion of rigid bodies Actually, the combinations of differential or differ-
subject to unilateral constraints; switched electrical ence equations and inequalities gives rise to systems
networks; optimal control problems with inequality that switch between modes on the basis of certain
constraints in the states and/or controls; dynamical inequality constraints and that behave within each
systems with piecewise linear characteristics, such as mode as ordinary differential systems. This ``multi-
saturation functions, dead zones, relays, Coulomb modal'' way of thinking is natural in a number of
applications: in the study of Coulomb friction, one has
the transition between stick mode and slip mode; in
This work has been supported by the European project SICONOS the study of electrical networks with ideal diodes,
IST2001-37172 (http://maply.univ-lyon1.fr/siconos)
y
E-mail: [email protected]
Correspondence and offprint requests to: W.P.M.H. Heemels,
Department of Electrical Engineering, Eindhoven University of Received 21 March 2003; Accepted 17 April 2003.
Technology, P.O. Box 513, 5600 MB Eindhoven, The Netherlands. Recommended by I. Postlethwaite, J.M. Maciejowski, K. Glover and
E-mail: [email protected] P.J. Fleming.
Hybrid Dynamical Systems 323
there is the transition between the conducting and the Definition 2.1. LCP (q, M): Given an m-vector q and
blocking mode of each diode; and in the context of an m m matrix M find an m-vector such that
dynamic optimization, one has mode transitions when
an inactive constraint becomes active, or vice versa. A > 0; w : q M > 0; T w 0:
1
similar point of view may be found in the literature on
hybrid systems encompassing both continuous and The LCP(q, M) can equivalently be written as
discrete dynamics, which have recently been a popular 0 6 ? q M > 0,
2
subject of study both for computer scientists and
for control theorists. Actually, the class of com- where the notation w ? expresses the orthogonality
plementarity systems has connections to various other between w and and the inequalities should be
class of hybrid dynamical models; as shown in [50] interpreted componentwise. An interesting general-
in discrete-time strong links exist to piecewise linear ization of the LCP is the so-called ``cone com-
and affine systems [96], min-max-plus-scaling systems plementarity problem'', which has been mentioned
[34] and mixed logic dynamic systems [8] and in for instance in [31, p. 31]).
continuous-time to evolutional variational inequalities
[41], differential inclusions, piecewise smooth systems Definition 2.2. LCP (q, M): Let C be a cone and
[38], projected dynamical systems [35,84] and so on. C? fz j zT v 6 0, 8v 2 Cg its polar cone1 [51,91]. Given
For the previously mentioned classes several system an m-vector q and an m m matrix M find an
and control theoretic issues such as existence and uni- m-vector such that
queness of solutions (well-posedness), controllability,
2 C; w : q M 2 C? ; T
q M 0:
observability, stability and feedback control are dif-
ficult to settle due to the hybrid behavior. CS have the
3
great advantage of being large enough in terms of
We say that the LCPC
q, M is solvable if such a
applications, and being small enough in order to allow
exists. In this case, we also say that solves
is a
one to lead deep and complete theoretical investiga-
solution of LCPC
q, M. The set of all solutions of
tions (which is much more difficult for larger classes of
LCPC
q, M is denoted by SOLC
q, M. If C Rm then
hybrid systems without any additional assumptions).
LCPC
q, M becomes the ordinary LCP(q, M)
Given the wealth of possible applications it is of
de®ned in De®nition 2.1.
interest to overcome these difficulties. Roughly speak-
ing, what makes dynamical complementarity systems The LCP and its extensions play a key role in many
so specific is that tools from complementarity pro- economic and engineering areas [37] and an extensive
blems and convex analysis are at the core of their literature [31,83] is available on this problem. After
study. This is clearly not the case for some other introducing its dynamical generalizations in the next
classes of hybrid systems. Moreover, it is important section, we will go into more details of the application
when dealing with CS to keep the right mathematical areas.
formalisms in order not to lose these particular fea-
tures. This is the message that this paper (see also [22]) 2.2. Complementarity Systems
tries to transmit.
In this paper we focus on a class of nonsmooth
2. Complementarity Modeling dynamical systems, which can on one hand be inter-
preted as a specific class of hybrid dynamical systems
Since the word complementarity is so important in the and on the other as a dynamical extension of the LCP
context of this paper and the systems under study are or cone complementarity problem mentioned above.
the dynamical extension of it, let us start by introdu- These systems are called complementarity systems (CS)
cing some basic tools of complementarity analysis. and can be formulated in a general form by [22,92]
Later on we shall see how convex analysis, monotone
multi-valued mappings, and other tools interfere with x_ f
x, t, u, ,
4a
complementarity. y
x, u, ,
4b
C? 3 w ?
2 C,
4c
g
w, , u, x, t 0,
4d
State x re-initialization rule,
4e
discrete part of the dynamics is modeled by means of a combinations of inequalities are found in all kinds of
graph whose vertices are called locations, discrete optimization problems [61], contact problems in
states or modes, and whose edges are transitions. The mechanics [59,74], resistive switched electrical circuits
continuous state takes values in a vector space Rn. For [12,64], piecewise linear (PWL) maps [36] and so on.
each mode there is a set of trajectories, which are Actually, in [37] one states that ``the concept of com-
called the activities in [1], and which represent the plementarity is synonymous with the notion of system
continuous dynamics of the system. The ``sets of equilibrium,'' which suggest that the system itself (so
activities'' may be compared to the well-known not only the equilibrium) might be described by a
``behaviors'' advocated by J. Willems and they are dynamical model in which complementarity plays a
typically described by differential or difference alge- role as well. These are exactly the complementarity
braic equations like the ones in Eq. (7). Interaction systems that we have introduced in the previous sec-
between the discrete dynamics (mode transitions) and tions, which might even be more fascinating to study.
the continuous dynamics takes place through invar- Historically, complementarity has been introduced in
iants and transition relations in which the latter is optimization by Karush, John, Kuhn and Tucker as
sometimes split in a set of guards and reset maps for early as 19392 [56,61], in mechanics by Moreau in the
each transition. Each mode has an invariant asso- sixties [74] and in electrical circuit theory by Van
ciated to it, which describes the conditions that the Bokhoven [12]. The development of methods based on
continuous state has to satisfy at this mode. In (5) complementarity, is closely linked to the develop-
these are given by the inequalities wi(t) > 0, i 2 = I and ments of convex analysis which have been led in the
(t) > 0, i 2 I. Each transition has an associated tran- second part of the twentieth century by Rockafellar
sition relation, which describes the conditions on the [91] (motivated by optimization) and Moreau [76]
continuous state under which that particular transi- (motivated by mechanics). Before going into the
tion may take place (called the guard ) and the effect details about these theoretical developments, let us
that the transition will have on the continuous state first consider several applications of CS.
(called the reset map). Invariants and transition rela-
tions play supplementary roles: whereas invariants
3.1. Electrical Networks with Diodes
describe when a transition must take place (namely
when otherwise the motion of the continuous state as
Consider a linear electrical network consisting of
described in the set of activities would lead to viola-
resistors, capacitors, inductors, gyrators, transformers
tion of the conditions given by the invariant), the
and of k ideal diodes [48]. The RLCGT components
transition relations (in particular the guards) serve as
form a multi-port network, which can under certain
``enabling conditions'' that describe when a particular
mild conditions be described by a state space repre-
transition may take place. Note that in (5) the tran-
sentation x_ Ax B, w Cx D [2] with state
sition relations consist of the re-initialization rule and
variable x representing for instance, fluxes through
a mapping that will describe which mode has to be
the inductors and charges at the capacitors and input/
selected when the invariant will be violated.
output variables and w representing the port vari-
As we see, CS fit nicely in the description of these
ables connected to the diodes, i.e. i Vi, wi Ii or
hybrid automata (see also [22] for a discussion). On
i Ii, wi Vi, where Vi and Ii are the voltage across
one hand CS can profit from the theory in this field,
and current through the i-th diode, respectively.
but on the other can contribute to the development of
Finally, the ideal diode characteristics of the i-th diode
a general hybrid systems theory. Note that the above
are given by Vi 6 0, Ii > 0, (Vi 0 or Ii 0). By suitable
``hybrid'' point of view can be used to define trajec-
substitutions the following system description is
tories of the system, though this is not at all necessary
obtained:
(see [6,7,62,87]). However, before we go into these
details of the behavior of CS we will first indicate the _ Ax
t B
t,
x
t
8a
application domains and the relations to other w
t Cx
t D
t,
8b
(hybrid) modeling formalisms.
0 6 w
t ?
t > 0,
8c
network, (A, B, C, D) satisfies a passivity condition as One sees that the control input u and the multiplier
we will use later (see Section 7.3.2). In this framework enter in various places of the dynamics in each case.
one could also add pure switches and control inputs to This may not be without consequences on stabiliza-
regulate such switching systems [49]. Examples can be tion, controllability, and observability properties.
found in the area of power converters.
Let us illustrate these developments with the
following simple electrical circuits. 3.2. Constrained Mechanical Systems
Example 3.1. The circuit in Fig. 2(a) has the dynamics:
In mechanics, many fields of application concern
x_ 1 x2 , multi-body systems with unilateral contacts and fric-
R u 1 tion. To realize this we only have to look at our
x_ 2 x2 x1 ,
9
L L L LC everyday environment. For instance, home circuit
0 6 ? x2 > 0, breakers are constituted of complex kinematic chains
(10±20 bodies) with several (20±30) unilateral con-
where u denotes the voltage applied to the system, x1 is
tacts and friction. In such mechanisms, the uni-
the charge of the capacitor and x2 is current through
laterality and the friction are essential phenomena,
inductor. A second example is displayed in Fig. 2(b)
which cannot be disregarded for virtual prototyping.
and has the dynamics:
Nuclear plant reactors have cooling bars which are
1 1 also subject to such phenomena and may induce very
x_ 1 x2 u x1 ,
R C particular (and possibly dangerous) dynamics with
1 1 complicated bifurcation phenomena. Some compact
x_ 2 u x1 , discs players possess a stabilization system based on
L C
steel balls constrained to move in a circular rig (a so-
1 1
0 6 x2 u x1 ? > 0: called automatic balancing unit). Once again shocks
R C
between the balls and friction are the fundamental
10 physical phenomena that make this mechanical device
A final circuit is given in Fig. 2(c) and leads to the compensate for the irregularities of the disc. Watches
following dynamics: form also nice examples of everyday devices, which
1 1 contain complex mechanics inside. Also aerospace
x_ 1 x2 x1 u, (models for planes landing and taking-off, liquid slosh
RC R
1 2
11 phenomena in satellites) and automotive applications
x_ 2 x1 u , (clutches, engine dynamics or for virtual reality
LC L L
0 6 ? x2 > 0: applications like virtual mounting and assembly),
form a source of numerous impact and friction pro-
blems. Evidently robotics is a vast field of applica-
tions, as many robotic tasks involve contact and
intermittent motion (drilling, hammering, polishing,
and other machining tasks). Among them, bipedal
locomotion is certainly a well-known one, at least
from the academic side.
More will be said about mechanical systems in
Section 7.4 and on Moreau's work. To give one way
of modeling this type of systems let us take a con-
servative mechanical system in which q denotes
the generalized coordinates and p the generalized
momenta. The free motion dynamics can be expressed
in terms of the Hamiltonian H(q, p), which reflects
the total energy in the system. The equations are
@H
q_
q, p,
12a
@p
@H
p_
q, p
12b
Fig. 2. Electrical circuits. @p
Hybrid Dynamical Systems 327
with @H/@p and @H/@q denoting partial derivatives. is equivalent to LCP(q, M) with w the slack variable.
The system is subject to the geometric inequality [31±83, Section 9.3.1].
constraints given by h(q) > 0. Dynamical extensions of this result are also well-
Friction effects are not modeled here, for simplicity, known via Pontryagin's maximum principle, which
even though Coulomb's characteristic can be recast was developed in the early 1960 [89]. Before showing
into a complementarity framework (as many other this connection to CS, we will first go to dynamical
piecewise linear characteristics, in fact). To obtain a version in discrete-time that is directly connected to
complementarity formulation, we introduce (as in the classical KKT conditions.
[45,92,93]) the Lagrange multiplier generating the
constraint forces needed to satisfy the unilateral con- Remark 3.2. Some other links between com-
straints h(q) > 0. According to the rules of classical plementarity and optimization exist as well. For
mechanics, the system can then be written as follows: instance, Moreau introduced complementarity in
@H Lagrangian systems and proved with convex analysis
q_
q, p,
13a
@p tools that Gauss' principle still holds in this case
@H [74,75] (Gauss' principle of mechanics is a minimiza-
p_
q, p rh
qT ,
13b tion problem involving the acceleration as the
@p
unknown). This was rediscovered by LoÈdstedt much
w h
q,
13c later [65]. Some of the material presented in
Section 5.5 wil also use the link between optimization
together with the complementarity conditions 0 6 and complementarity.
w? > 0. Note that rh(q) (@h/@q)(q). These condi-
tions express the fact that the Lagrange multiplier i is
only nonzero, if the corresponding constraint is active 3.3.1. Model Predictive Control and
(wi 0). Vice versa, if the constraint is inactive (wi > 0), Optimal Control
the corresponding multiplier i is zero. One also has to
add the impact rules to make the dynamics complete. The relationship between quadratic programming and
Another way to write the dynamics is via the linear complementarity problems has the immediate
Lagrangian formalism, which is actually the more consequence that the well-known control methodo-
useful and used formalism in mechanics and control: logy model predictive control in a linear quadratic
M
q _ rh
qT ,
q F
q, q setting leads to closed-loop systems that can be con-
0 6 h
q ? > 0,
14 sidered as discrete-time complementarity systems.
Collision mapping, Indeed, in [11] one studies the control of the discrete-
time linear time-invariant system
_ contains Coriolis, centrifugal
where the vector F
q, q
and conservative generalized forces. The collision
mapping can be chosen in various ways, as proposed x
k 1 Ax
k Bu
k,
16
by Moreau [78,81], Pfeiffer and Glocker [88], y
k Cx
k,
FreÂmond [39], Stronge [98], etc. Some mappings will
be discussed later.
where x(k) 2 Rn, u(k) 2 Rm, and y(k) 2 Rp are the state,
3.3. Optimization and Complementarity input, and output vector, respectively. In particular,
one is interested in the problem of tracking the
It is well-known in mathematical programming that a output reference signal r(k) 2 Rp while ful®lling the
necessary and sufficient condition for the optimality constraints
of a quadratic program (QP) with a positive definite
matrix in the cost criterion is equivalent to a set of D1 x
k D2 u
k D3 u
k 6 d4
17
complementarity conditions by introducing so-called
slack variables and applying the Karush±John±
Kuhn±Tucker's (usually known as the KKT) theorem. at all time instants k > 0, where u(k):
In particular, the quadratic optimization problem u(k) u(k 1) are the increments of the input.
Assume for the moment that a full measurement of
min imize
n
qT 12 T M the state x(k) and the previously implemented control
2R
value xu(k) : u(k 1) (which might be considered as
subject to > 0
15 an additional state) are available at the current time k.
328 W.P.M.H. Heemels and B. Brogliato
Then, the optimization problem these type of controlled systems comes down to
Ny
! studying CS.
X X
N u 1
which is a LCS. Note that we have boundary condi- Eqs. (25b) and (25c) form a LCP. It can easily be seen
tions for t 0 and t T. Interestingly, it is possible that this problem has two characteristics (just as in the
that jumps occur in the adjoint variable (which physical system) given by:
corresponds to ``Dirac pulses'' in ). Also for these
Inactive spring: If q > 0, then we have 0 and
jumps additional relations are available. We do not
w kq. This means that we have the ``free motion
specify all the available conditions, but only would
dynamics'' mq u.
like to illustrate that this kind of optimal control
Active spring: If q 6 0, then it holds that kq
problems ®t in the class of CS.
and w 0. Hence, the system evolves according to
The general formulation in [42] is called an informal
mq u kq.
theorem, because the result is not rigorously estab-
lished. It might be interesting to see how the study on Hence, certain dynamical (PWL or PWA) systems can
CS can contribute to settle the difficult issues that play be recast as CS. This is in agreement with the com-
a role in this challenging field. bination of the facts that closed-loop MPC systems
For a further overview of connections between turned out to be CS (cf. Section 3.3.1) and that in [10]
optimization and complementarity, see [95]. it was shown that the closed-loop systems can also be
described by discrete-time piecewise affine systems. As
a consequence, the MPC context indicates that there
3.4. Piecewise Affine Systems
must be a strong relationship between PWA and
(linear) CS in discrete-time. In the static case the
To link the optimal control problem in the previous
connection between piecewise affine mappings and
section to the next application domain of CS being
LCP is already known for a long time [36]. Also in
piecewise affine (PWA) systems, one can see that the
circuit theory many static piecewise-linear (PL) circuit
first condition in (22) for a constrained set U Rm elements play a role (next to the ideal diode char-
and a quadratic criterion gives uopt max(0, BT).
acteristics of Section 3.1), which were suitably for-
This is a PWL relationship, that is equivalent to the
mulated as complementarity models. Indeed,
complementarity description
Kevenaars en Leenaerts (see e.g. [64]) showed that all
0 6 uopt ? uopt BT > 0:
24 the explicit piecewise linear canonical representations
proposed by Chua and Kang, GuÈzelis and GoÈknar,
In this case additional complementarity conditions and Kahlert and Chua used for PWL circuit elements
would be added to (23). Observe that we have actually are all covered by one implicit model based on the
rewritten a simple piecewise linear function as a LCP. This implicit model was developed by Van
complementarity model. Before we go to more general Bokhoven [12].
piecewise linearities, let us ®rst consider an application To give some example consider the rather arbitrary
of the max-relation in the piecewise linear mechanical piecewise affine function as given in Fig. 4, which is
system below. described by
Example 3.3. A simple mechanical example is v g r1 z
r2 r1 max
d a1 , 0
depicted in Fig. 3 with mass m, position q and spring
r3 r2 max
d a2 , 0:
26
stiffness k. This system is described by
By using the same trick (24) as for rewriting the max-
m
q u ,
25a relation, we obtain
w kq ,
25b u g r1 z
r2 r1 1
r3 r2 2
27
0 6 ? w > 0: 25c
with 0 6 i d ai ? i > 0. Having these type of non- 4.1.1. Linear Complementarity (LC) Systems
smooth functions in a dynamical setting, e.g. control
systems with saturations or dead zones, gives rise to In view of Eq. (33) discrete-time LCS are given by the
CS. Another dynamical example in this context are equations
linear relay systems given by
x
k 1 Ax
k B1 u
k B2
k,
33a
x_ Ax Bu,
28
y
k Cx
k D1 u
k D2
k,
33b
y Cx Du,
29
u sign y,
30
w
k E1 x
k E2 u
k E3
k g4 ,
33c
where x(k) [xrT(k) xbT (k)]T with xr(k) 2 Rnr and and/or boundedness of certain variables. For all the
xb(k) 2 {0, 1}nb (y(k) and u(k) have a similar structure), details on the transformations, consult [50]. To illus-
and where z(k) 2 Rrr and (k) 2 {0, 1}rb are auxiliary trate the results here, we consider an example taken
variables. Inequalities (35c) have to be interpreted from [8,50].
componentwise. Systems of the form (35) are called
MLD systems. Example 4.1.
0:8x
k u
k if x
k > 0,
4.1.4. Max-Min-Plus-Scaling Systems x
k 1
0:8x
k u
k if x
k 6 0
37
The last discrete-time class that was described in [50]
consists of discrete-event systems, which can be with m 6 x(k) 6 M. In [8] it is shown that Eq. (37) can
modeled using the operations maximization, mini- be written as
mization, addition and scalar multiplication. In this
x
k 1 0:8x
k u
k 1:6z
k,
way they extended the well-known max-plus systems.
Expressions that are built using these operations are m
k 6 x
k m, x
k 6 M
k,
called max-min-plus-scaling (MMPS) expressions, e.g. z
k 6 M
k, z
k > m
k,
5x1 3x2 7 max(min(2x1, 8x2), x2 3x3). z
k 6 x
k m
1
k,
Consider now the MMPS systems, which are
z
k > x
k M
1
k
38
described by
x
k 1 Mx
x
k, u
k, d
k,
36a and the condition (k) 2 {0, 1}.
One can verify that Eq. (37) can be rewritten as the
y
k My
x
k, u
k, d
k,
36b MMPS model
Mc
x
k, u
k, d
k 6 c,
36c x
k 1 0:8x
k 1:6 max
0, x
k u
k
39
where Mx , My and Mc are MMPS expressions in and as the LC formulation
terms of the components of x(k), the input u(k) and
the auxiliary variables d(k), which are all real valued. x
k 1 0:8x
k u
k 1:6
k,
40a
In Fig. 5 the relationships between the classes as Note that the MLD representation (38) requires
derived in [50] are indicated. An arrow going from bounds on x(k), u(k) (although such bounds can be
class A to class B means that A is a subset of B. arbitrarily large). The PWA, MMPS, and LC
Moreover, arrows with a star (?) require conditions to expressions do not require such a restriction.
establish the indicated inclusion. The conditions are
related to well-posedness (i.e. existence and unique-
ness of solution trajectories given an initial condition) 4.2. Continuous-time CS
Fig. 5. Graphical representation of the links between the Let us notice an important fact about the
classes of hybrid systems considered in this paper. complementarity condition C? 3 w ?
2 C. This
332 W.P.M.H. Heemels and B. Brogliato
set of conditions means that w and () have to be Further links exist to convex analysis as the com-
perpendicular, while belonging to C? and C, respec- plementarity conditions are closely related to the
tively. This can be thought of as defining a mapping subdifferentiation of convex functions. Indeed, from
Mcomp that associates to each w a set of possible values basic convex analysis one obtains the equivalence
of (). Let us consider two couples (w1, 1) and
(w2, 2) that satisfy the complementarity conditions. 0 6 ? w > 0 () 2@
R m
w,
42
Then the following holds:
where and w are m-dimensional vectors. The func-
hw1 w2 , 1 2 i > 0,
41 tion K(x) is the indicator function of the closed
convex set K, de®ned as
where h., .i is the scalar product in Rn. Indeed a
straightforward calculation yields hw1 w2 , 1 2 i 0 if x 2 K,
K
x
43
wT1 1 wT2 T2 wT1 2 wT2 1 wT1 2 wT2 1 > 0 1 if x 2 = K,
since the cones C? and C are polar cones. Inequality
(41) states that Mcomp is a monotone multi-valued and given a convex function F : R ! Rn its sub-
mapping [51,91]. The multi-valued feature of Mcomp is differential @F(x) at x is de®ned as
easily deduced by noticing that for a given w, there
@F
x f
2 Rn j hy x, F
y F
xi
may in general be an in®nity of which satisfy the
complementarity conditions. For instance in the > h
, y xi for all y 2 Rn g:
44
scalar case and with C R :
1; 0 and C? R
0, 1, one gets 0 6 w ? > 0. The mapping w 7! Subdifferentiation is a generalization of the notion of
is monotone and its graph is the so-called corner law as a derivative of a function, which applies to convex
depicted in Fig. 6. Another important notion is the functions which may be nondifferentiable (in ordinary
maximality of a monotone operator, which has to be sense), and may even take in®nite values. It can also be
understood in terms of graph inclusion. Roughly extended to set-valued mappings. An example in the
speaking, a monotone operator is maximal if its graph former case is the absolute value function abs : x 7! |x|
is ``completely ®lled in.'' For instance, the graph in that is subdifferentiable at x 0 with
Fig. 7(a) is maximal (note that this is the sign-char- (
1 if x < 0,
acteristic of Eq. (31)), but the graph in Fig. 7(b) is not @abs
x 1, 1 if x 0, sign x,
maximal. 1 if x > 0
and the mapping whose graph is the corner law, is also
subdifferentiable.
Computing the subdifferential of the indicator,
leads to the set-valued function (being a convex cone
for any x) given by
where
Fig. 6. The corner law (a multi-valued monotone mapping). is the normal cone [51,91] to K at x. Outside K one
de®nes Nk(x) @ k(x) : ;.
Convex functions taking infinite values like the
indicator function have been introduced by Moreau in
relation with unilateral constrained mechanics (the
reader may think of K(x) as a potential function
associated with nonpenetration of the ``position'' x
outside the domain K). See [76,91] for details. Exam-
ples in the plane are depicted in Fig. 8. In unilaterally
constrained mechanics, this means that the contact
force should be zero in the interior of K, and should
belong to the normal cone when the system is on the
Fig. 7. A maximal and a nonmaximal graphs. boundary of K.
Hybrid Dynamical Systems 333
PK
x v x
K
x, v lim ,
51
!0
that some kind of PSS was obtained from CS by using 4.2.3. From LCS to PWL systems
a multi-modal or hybrid point of view as given in
Section 2.3. However, one has to notice that often one To show the connection between continuous-time
does not include the possibility of discontinuities in LCS and PWL systems we consider the bimodal LCS
the state trajectory in a PWA or PSS framework.
Moreover, one normally assumes that the smooth x_ Ax b eu,
55a
submodels of a PWA system or a PSS live on a part of
T
the state space which has a non-trivial interior, while w c x d,
55b
smooth phases of CS (5) given by the DAE (7) might 0 6 w ? > 0,
55c
be defined on lower-dimensional subspaces of the
total space due to the presence of the algebraic equa-
tions. For instance, the constrained motion of a robot where A 2 Rnn, b 2 Rn1, c 2 Rn1, d 2 R, and
arm that is in contact with its environment looses two 0 6 e 2 Rn1.
degrees of freedom (the relative distance between We assume that either d > 0 or (d 0 and cTb > 0).
environment and arm and the corresponding velocity This assumption guarantees that for each input u 2 L2
are zero). This is also nicely illustrated by the two- and initial state x0 (with cTx0 > 0 in case d 0), there
carts example below, where the constrained motion exists a unique (absolutely) continuous state trajectory
evolves on a two-dimensional space, while the system x with x(0) x0 and a unique pair
, w 2 L112 such
has a four-dimensional state space. that Eqs. (55) hold almost everywhere. For a treat-
ment of the well-posedness of LCS with external
inputs, we refer to [25,26,49].
The complementarity conditions (55c) imply that
4.2.2. From PSS to CS either or w is zero at (almost) each time instant (see
Section 2.3). As a consequence, this gives a system
Consider the following example with four smooth with two modes, i.e. a bimodal system. Indeed, to see
regimes valid in their own part of the state space the bimodal structure more explicitly, consider first
specified by certain inequalities. the case d > 0. Then, one can rewrite (55) as
8
> f11
x, when h1
x > 0, h2
x > 0, Ax eu if cT x > 0,
>
< x_
56
f10
x, when h1
x > 0, h2
x < 0,
A bd 1 cT x eu if cT x 6 0.
x_
>
> f
x, when h1
x < 0, h2
x > 0,
: 01
f00
x, when h1
x < 0, h2
x < 0: For the case d 0 and cTb > 0, one can rewrite (55) as
53 8
< Ax eu if
cT x, cT Ax cT eu 0,
x_ P
Ax eu if cT x 0 and
This can be rewritten as :
cT Ax cT eu 6 0,
57
x_ 1 2 f11
x 1
1 2 f10
1 1 2 f01
x
1 1
1 2 f00
x,
54 where P I b(cTb) 1cT [92]. Indeed, the mode w
cTx 0 yields by differentiating that cTAx cTb cT
with i 2 12 12sign (hi(x)). Hence, we obtain (again) eu 0. Hence, one can solve as a function of x and u
a differential inclusion. As these type of sign- and by proper substitutions one gets the result above.
characteristics can easily be transformed into com- Note that this illustrates the remark made earlier that
plementarity conditions (see Eq. (32)), this shows how within a mode the state variable x might ``live'' on a
to transform PSS to CS. Note that the right-hand side lower-dimensional subspace (in Eq. (57) on the
of Eq. (54) is a set-valued function at the separating (n 1)-dimensional subspace {x j cTx 0}) and actu-
boundaries as one is allowed to take any convex ally if the linear system has higher relative degree
combination of the vector ®elds in the neighboring (d 0, cTb 0), then also resets of the state variable
regions. In this manner the solution concept can be are necessary as for instance in the two-carts systems
taken as Filippov's convex de®nition as presented in studied below. This is a distinctive feature of CS,
[38]. In a similar way one can also use Utkin's which is hardly studied in the framework of PWL
equivalent control de®nition to de®ne the ``sliding systems. The material in this section provides a pre-
behavior'' at the switching surface. liminary answer to the problem OP 4 in [22].
Hybrid Dynamical Systems 335
x_ 1
t x3
t,
5.1.1. Two-carts System x_ 2
t x4
t,
Let us illustrate the dynamics of CS by an example x_ 3
t 2x1
t x2
t,
consisting of two carts as depicted in Fig. 10 as was used x_ 4
t x1
t x2
t,
also in [45,92]. The carts are connected by a spring and z
t 0:
Constrained
x_ 1
t x3
t,
x_ 2
t x4
t,
x_ 3
t 2x1
t x2
t
t,
x_ 4
t x1
t x2
t,
Fig. 10. Two-carts system. w
t x1
t 0:
336 W.P.M.H. Heemels and B. Brogliato
Remark 5.1. Note that the unconstrained mode is differently, from (0, 1, 0, 0)T smooth continuation in
directly stated as an ordinary differential equation the unconstrained mode is possible.
(ODE) in the state variable x. However, the con-
strained mode is a DAE (of high index) as the variable This last transition is a special one in the sense that,
is not explicit. Since w x1 0 in this mode, also first, the constrained mode is active, causing the cor-
w_ 0 holds, which yields x3 0. Similarly, we must responding state reset. After the reset, no smooth
have that w 0, which leads to 2x1 x2 0. continuation is possible in the constrained mode
Hence, we have x2. By substituting this in the resulting in a second mode change back to the
DAE of the constrained mode, we obtain x1 x3 0 unconstrained mode.
and x2 x2 (note that the constrained mode has a A possible trajectory is given in Fig. 11 for initial
state variable (x2, x4)T of dimension 2). Hence, by state
differentiating the variable w, we can rewrite the DAE
x0 e A
0 1 10T
as an ODE.
0:3202, 0:4335, 0:3716, 1:0915T
When the system is represented by either of these
modes, the triple (, x, w) is given by the corre- on the interval [0, 3].
sponding dynamics as long as the following inequal- Note the complementarity between (denoted by
ities in (59): ``u'' in Fig. 11) and w x1 and the discontinuity in the
derivative of x1 at time t 1.
Unconstrained
w
t x1
t > 0, 5.2. Solution Concepts
Constrained For CS one may develop several solution concepts
t x2
t > 0: (see [47]), which may be similar to the notion of an
execution for hybrid automata [54], or to the solution
are satis®ed. A change of mode is triggered by viola- concept for differential inclusions as used for differ-
tion of one of these inequalities and during a transi- ential equations with discontinuous right-hand sides
tion from the unconstrained to the constrained mode [38]. A solution concept of the first type can for
we assume that the re-initialization rule is inelastic, i.e. instance be formulated as follows.
x3 : 0. The mode transitions that are possible for the
two-carts systems are described below. Definition 5.2. A set E R is called an admissible
Unconstrained ! constrained. The inequality event times set, if it is closed and countable, and 0 2 E.
w(t) > 0 tends to be violated at a time instant t . To each admissible event times set E, we associate a
The left cart hits the stop and stays there. The collection of intervals between events E f
t1 , t2
velocity of the left cart is reduced to zero instanta- R j t1 , t2 2 E [ f1g and
t1 , t2 \ E ;g.
neously at the time of impact: the kinetic energy of
the left cart is totally absorbed by the stop due to a
purely inelastic collision. A state for which this 1.5
omous CS
x_2
1
x_ f
x, ,
w h
x, , 0
06w ? >0 –1
x
some CS in which this phenomenon shows up.
2
1
5
We assume that (E, , x, w) is nonredundant, i.e. there does not
4
An element t of a set E is said to be a left (right) accumulation exist a t 2 E and t0 , t00 with t0 < t < t00 such that (, x, w) is analytic
point if for all t0 > t
t0 < t
t, t0 \ E
t0 , t \ E is not empty. on (t0 , t00 ), i.e. there are no void event times in E.
338 W.P.M.H. Heemels and B. Brogliato
Note that if a left-Zeno free solution concept is to x() (0, 1, 0, 0)T. This reset can be modelled as
used for Filippov's example, there is well-posedness: the result of a (Dirac) pulse exerted by the stop. In
existence and uniqueness of solutions given an fact, results in the state jump x() x()
initial condition. However, if a general hybrid solution (0, 0, 1, 0)T. This motivates the use of distributional
concept (including the possibility of left-accumula- theory as a suitable mathematical framework for
tions in the event times set) is used then the system is describing physical phenomena such as collisions with
not well-posed as we have nondeterminism (non- discontinuities in the state vector as is done for
uniqueness). As one could have expected, the well- instance in [45].
posedness issue depends strongly on the notion of
Let us consider another Zeno example including
solution.
re-initialization maps.
Definition 5.3 requires that the state x of a solution
trajectory is continuous across events. For so-called Example 5.8. (Bouncing ball ). Consider a ball (height
``high-index'' systems (e.g. constrained mechanical of ball is x) with dynamics x g and constraint
systems as the two-carts example), this requirement is x > 0. To complete the model we include Newton's
too strong and, as noted before, one has to add jump _
restitution rule x
ex
_ when x( ) 0 and
rules that connect continuous states before and after _
x
< 0
0 < e < 1. In case x
x
_ 0,
an event has taken place. The solution concept can the dynamics are equal to x 0 due to the constraint
then be generalized by interconnecting the continuous x > 0. The event times { i}i2N are related through (see
state x just before and after the event via these re- [17, p. 346]) 0 0 and
initialization rules. For linear CS a general jump rule
2ei x
0
_
has been proposed in [45]. In this study the issue of ir- i1 i , i2N
regular initial states had to be tackled, i.e. the initial g
states for which there is no solution in the sense assuming that x(0) 0 and x
0 _ > 0. Hence, { i}i2N
defined so far for CS. A distributional framework was _
has a ®nite limit equal to 2x
0=
g ge < 1.
used to obtain a new solution concept for LCS [45]. In _
Since the continuous state
x
t, x
t converges to
principle, this framework is based P on so-called Bohl (0, 0) when t " ? a continuation beyond ? can be
distributions of the form
t li0 i
i reg
t, _
de®ned by
x
t, x
t
0, 0 for t > ?. The physical
where is the delta or Dirac distribution (supported at interpretation is that the ball is at rest within a ®nite
0), (i) is the i-th derivative of and reg is a Bohl time span, but after in®nitely many bounces.
function. These distributions can equivalently be An alternative concept that foregoes explicit men-
characterized by the inverse Laplace transforms of tion of events is the following one, which is closer to
rational functions. A Bohl distribution (, x, w) is the solution concept used for differential inclusions.
called an initial solution for initial state x0, if it satisfies The notion turns out to be convenient for CS that
x_ Ax B x0 ; w Cx D as equalities of dis- satisfy a certain passivity condition.
tributions, there exists an I {1, . . . , m} with wi 0,
i 2 I and i 0, i 2= I and finally, the Laplace trans- Definition 5.9. A triple
, x, w 2 L2mnm [0, T ] is
^ > 0 and w
forms satisfy
^ > 0 for all sufficiently said to be an L2-solution of (60) on the interval [0, T ]
large . In case ((t), x(t), w(t)) is an ordinary function with initial condition x0 if for almost all t 2 [0, T ] the
these conditions mean that the system's equations (6) following conditions hold:
Z t
are satisfied on an interval of the form [0, ") for some
x
t x0 f
x
s,
s ds,
" > 0. In case the initial solution is not a function, the 0
impulsive part ofP (t) will result in a state jump from w
t h
x
t,
t,
x0 to x : x0 iAiB i. Particularly, in [45] it is
shown that the above re-initialization procedure cor- 0 6 w
t ?
t > 0:
responds for linear mechanical systems with unilateral
constraints to the inelastic jump rule as proposed by 5.3. Well-posedness
Moreau [78]. Moreover, in some cases the jump of the
state variable can be made more explicit in terms of Well-posedness roughly means that solutions exist
the linear projection operator onto the consistent and are unique for any given initial condition. If
subspace of the new mode along a jump space [45]. solutions exist and are unique, a given system
description defines a mapping from initial conditions
Example 5.7. Reconsider the two-carts example. From to trajectories. In the theory of smooth dynamical
state x() (0, 1, 1, 0)T, we can enter the con- systems, it is usually taken as part of the definition of
strained mode by starting with an instantaneous reset well-posedness that this mapping is continuous with
Hybrid Dynamical Systems 339
recall a few interesting results just to give you the system (LPCS) [26,48]. Note that this case occurs
flavor of them. typically for electrical circuits containing ideal diodes
Consider an (input-free) LCS given by discussed in Section 3.1.
6
This means that (A, B) is controllable and (C, A) observable.
7
The denotes conjugate transpose of vectors of matrices. 8
It can also be shown that this solution is unique in L2.
Hybrid Dynamical Systems 341
(iii) The re-initialized state x is the unique on an interval of the form [0, ") for some " > 0 for
minimizer of all initial conditions and initial well-posedness mean-
minimize 1
p x0 T K p x0
71a ing the existence and uniqueness of an initial solution
2
(see Section 5.2) given arbitrary initial condition
subject to Cp 2 Q?D :
71b x(0) x0. Loosely, speaking this means that for each
where K is any solution to (68) and thus initial state there is the unique possibility of either a
V
x 12 xT Kx is a storage function for re-initialization or a smooth continuation. In the
(A, B, C, D). terminology of hybrid automata [54], initial well-
(iv) The jump multiplier 0 is the unique minimizer posedness is equivalent to the LCS being nonblocking
of and deterministic.
We present now three results on initial, local and
minimize 12
x0 BvT K
x0 Bv
72a global well-posedness of LCS.
subject to v 2 QD :
72b
5.6.1. Initial Well-posedness
Just as outlined in Section 5.2 these jump rules are For the LCS(A, B, C, D) as in [67] the rational matrices
based on a distributional framework in which Dirac G(s) and Q(s) are defined by C(sI A) 1B D and
pulses can occur in the currents and voltages (repre- Q(s) C(SI A) 1.
senting ``sparks''). For passive LCS without inputs
there are only jumps at the initial time t 0. After that Theorem 5.16. [44]. LCS(A, B, C, D) is initially well-
there is a hybrid solution with a continuous state posed if and only if for all x0 LCP(Q()x0, G()) is
trajectory meaning that Cx
t 2 Q?D for all t > 0. uniquely solvable for sufficiently large 2 R
Note that this gives existence and uniqueness of The strength of this theorem is that dynamical
solutions on [0, 1) for all arbitrary initial conditions, properties of an LCS are coupled to properties of
which is called global well-posedness. families of static LCPs, for which a wealth of existence
There are some nice physical interpretations for and uniqueness are available [31]. For instance, a
the two optimization problems in Theorem 5.15. sufficient condition [31] for initial well-posedness is
Statement (iii) expresses the fact that among the G() being a P-matrix9 for sufficiently large .
admissible re-initialized states p (admissible in the Clearly, initial well-posedness does not imply local
sense that smooth continuation is possible after the existence of solutions as in principle, an infinite
reset, i.e. Cp 2 Q?D ) the nearest one is chosen in the number of re-initializations (jumps) may occur on one
sense of the metric defined by any arbitrary storage time-instance without ``time progressing.'' This phe-
function corresponding to (A, B, C, D). The quadratic nomenon is sometimes called ``live-lock.'' However,
program in (iv) states that the jump multiplier 0 sufficient conditions for local well-posedness have
satisfies the complementarity conditions (i.e. 0 2 QD) been provided for LCS [45,92], as presented next.
and minimizes the internal energy (expressed by the
storage function 12 xT Kx) after the jump. Note that
x0 B0 is the re-initialized state. Under the assump- 5.6.2. Local Well-posedness
tion of x x0 2 im B, it can be shown that the two
optimization problems are actually each other's dual Consider the LCS(A, B, C, D) as in Eq. (67) with
(see e.g. [31, p 117]). Markov parameters H0 D, H1 CB, H2 CAB,
We will see later that these conditions can be used to H3 CA2B, etc. and define the leading row and
establish stability of these types of systems. column indices by
j inffi j Hij
6 0g,
5.6. Initial, Local and Global Well-posedness i
73
j inffi j Hj 6 0g,
In Section 5.5 we derived conditions that imply the so- where j 2 {1, . . . , m} and inf Æ : 1. The leading row
called global well-posedness, i.e. the existence and coef®cient matrix M and leading column coef®cient
uniqueness of a solution on the interval [0, 1) for any matrix N are then given for ®nite leading row and
initial condition. Depending on the interval on which
solutions exist, we can now distinguish between two
other types of well-posedness; local well-posedness as 9
A matrix M 2 Rmm is called a P-matrix, if all its principal minors
the existence and uniqueness of solution trajectories satisfy det MII > 0 for all I {1, . . . , m}.
342 W.P.M.H. Heemels and B. Brogliato
does represent the important physical effect of ``stick'' the nonsmoothness. This means that they may be
that is not taken into account if the model is much more reliable than event-driven schemes, if
regularized. ``hard'' events occur (like events accumulations,
multiple events), because they guarantee that the
global behavior is the right one. Also in the context
6.1. Event-driven Simulation of sampled-data control these methods might be a
promising way to go.
Event-driven algorithms of numerical simulation, are
based on the simple idea of integration between
``events'' and the application of algebraic conditions 6.3. Smoothing
when an event occurs. Hence, this complies directly
with the hybrid point of view as in Section 2.3. In Smoothing is a known technique in mechanics and
complementarity dynamics as in (4) one may distin- optimization. In the framework of (4) it consists of
guish between two types of events: those with a state replacing some complementarity conditions by a reg-
re-initialization, and those involving only a mode ularized version of it that approaches it when a reg-
transition: a change in the vector field f() structure ularization parameter goes to infinity. For instance, in
due to a new (a bounded function of x, u, t computed case of a sign-characteristic x 7! sign x, a suitable
from some complementarity problem). The routine regularization might look like x 7! tanh nx for n large.
consists for (5) of repetitive cycles of: Though this appears to many as a miraculous solution
which solves in one shot all the complicated problems
1. DAE simulation: by standard integration routines
that arise with unilaterality and complementarity,
compute a solution to Eq. (7) for the currently
reality is more subtle as stiff differential equations
active mode I;
have to be solved which might yield unreliable results
2. event detection: determine the event time at which
or are computationally very demanding. To say
the inequalities !i(t) > 0, i 2
= I and i(t) > 0, i 2 I
nothing on physical parameters estimation which may
get violated, say at time and state x( );
be a hard task in practice.
3. re-initialization: calculate the re-initialized state
just after the event x() (note that it is not
excluded that multiple resets are required); 7. Analysis
4. mode selection: determine the new active set I that
will be active on the next time interval starting at For the analysis of controllability, observability and
time with state x() and go back to ``DAE- stability it has to be noted that all the ingredients in (4)
simulation'' again. play a role. The state jumps are important phenom-
ena, but the complementarity conditions and their
6.2. Time-stepping interconnection with the vector field f() are funda-
mental as well.
The time-stepping schemes are based on a time-
discretization of the whole dynamics, i.e. of the differ-
7.1. Controllability
ential equation and of the complementarity and state
jump rules. The first schemes developed along these
The controllability issue is crucial in control systems
lines, have been proposed by Moreau [80] in relation
for understanding to what extend we can influence the
with the numerical simulation of the sweeping process
system and hence, what can be achieved by (feedback)
[62]. Other contributions have been made in [87] and
control. It is important to observe that CS can in
[3]. The advantages of such schemes is that they do not
general not live at the whole state space Rn, but only
require the calculation of the event times. Evidently
on a subset of it. This subset is specified by the
this holds provided that a convergence proof has been
admissible states as introduced next.
derived. However, time-stepping schemes lend them-
To simplify the definition we consider a system (5)
selves for such theoretical proofs, much better than
that is time-invariant (i.e. the time dependence of the
event-driven ones. Other schemes have been devel-
functions f and g is absent).
oped for electrical circuits [26] with convergence
proofs. A drawback of time-stepping schemes may be Definition 7.1. A state x0 is called admissible for
their low order which precludes very accurate results system (5), if there exists an " > 0 and a control input
during free motion phases. However the low order function u 2 [0, ") 7! Rl such that there exists a solu-
seems to be a direct and unavoidable consequence of tion (, x, w) on the interval [0, ") for initial state
344 W.P.M.H. Heemels and B. Brogliato
x(0) x0. The set of all admissible states is denoted or more compactly as x_ x max(0, x u). Note
by . that R. It is easily seen that R R , C R
and thus the system is uncontrollable. However, the
Just as for well-posedness, there is, of course, a
slightly modi®ed system given by
relationship between the nature of the considered
solutions and the controllability properties. Also the x_ x u,
set of admissible inputs should be sufficiently friendly. w x ,
Definition 7.2. (Controllability). The CS (5) is con- 0 6 w ? > 0,
trollable, if for any pair of states x1, x2 2 , there exists or
an admissible input u defined on [0, T ], such that the
x u, when x > 0,
corresponding state trajectory t 7! x (t, 0, x1, u) with x_
2x u, when x 6 0
x(0) x1 satisfies x(T, 0, x1, u) x2.
is controllable, which can be most easily seen by
We assume for the moment that the origin lies in the feedback linearizing the system by applying the swit-
admissible set . ched feedback
Definition 7.3. Consider a CS given by (5). We say that
a state x2 2 is reachable (from the origin), if there v, when x > 0,
u
exists an admissible input u defined on [0, T], such that x v, when x 6 0,
the state trajectory t 7! x(t, 0, 0, u) with x(0) 0 satis-
which leads to the linear system x_ x that is
fies x(T , 0, 0, u) x2. The set of reachable states is
obviously controllable.
denoted by R. Similarly, we call a state x1 2 con-
Note that the feedback linearization of Example 7.5
trollable (to the origin), if there exists an admissible
can always be applied to LC systems of that form and
input u defined on [0, T ], such that the state trajectory
state dimension one, where the complementarity
t 7! (t, 0, x1, u) with x(0) x1 satisfies x(T,
variable w is not influenced by u, but the differential
0, x1, u) 0. The set of controllable states is denoted
equation is. However, for LC systems with state
by C.
dimension two this argument is only valid in parti-
To show the diversity of situations that arise in CS
cular cases. Fortunately, some work has been done for
we start off by some simple linear or affine com-
the case of state dimension 2 in [28]. Consider the LCS
plementarity systems with state dimension 1 to indi-
(55) with n 2, (cT, A) is observable, and d > 0, i.e. we
cate the problems that might occur.
look at the piecewise linear system (56) with n 2.
Example 7.4. Consider the example
Theorem 7.6. Consider the system (55) with n 2,
x_ u ,
74a (cT, A) is observable, and d > 0. It is controllable if
W x,
74b and only if
06W ? >0
74c
f T Ae f T
A bd 1 T
c e > 0
75
of which the admissible set is given by R . Since
holds, where f is such that f Te 0 and f 6 0.
u, when x > 0,
x_
0, when fx 0 and u 6 0g. Another result which applies to planar systems but
it can be easily be seen that the system is controllable. with d 0 in Eq. (55) can be found in [24]. It relies on
This example illustrates the need of the admissible the study of what happens on the boundary @K of the
states set as it would not make sense to require for set K {x j cTx 0}. The complementarity conditions
controllability that there should be a smooth control are shown to play a major role for the controllability
input that steers the state from xa > 0 to xb < 0. properties in this setting. Some other examples are in
order now to show some tricky issues for systems with
Example 7.5. The system impact maps and affine CS.
x_ x , Example 7.7. Consider the following simple mechan-
w x u, ical system
06w ? >0 x_ 1 x2 ,
76
x_ 2 u ,
77
can be rewritten as
W x1 ,
78
x, when x u > 0,
x_
2x u, when x u 6 0, 06W ? >0
79
Hybrid Dynamical Systems 345
with reset map at x1 0 given by x2(t) ex2(t ). linear, it is possible to derive a constrained equation of
A solution concept is used such that x1 is absolutely the form
continuous and discontinuous are possible in x2 A
w, zi zf B
zi 0,
according to the given jump rule. Note that the ,
84
C
w, zi zf D
w; zi > 0
admissible set is equal to {x 2 R2 j x1 > 0}. For e 0
the system is not controllable as it is impossible to whose solvability is equivalent to having the ®nal state
reach states x 2 R2 with x1 0 and x2 > 0 (unless you zf reachable from the initial one zi , and is w an inter-
start initially at these points). In case e 2 (0,1] this mediate input used in the analysis. The matrices, A(),
system is controllable. This immediately shows the B(), C() and D() have a strong structure which
influence of the rest map on these type of questions. allows one to derive some conditions for controll-
ability [23].
Example 7.8. Let " > 0 be given and formulate then the The results in [68] concern the characterization of
following affine complementarity system (ACS) the reachable subspaces for a planar juggler, and a
x u ,
80 control for the stabilization of trajectories is proposed.
W x_ ",
81 Experimental results corroborate the theory.
The examples and theorems above indicate the
06W ? > 0:
82
diversity of the difficulties one might encounter when
_ T j x_ > "g.
The admissible set is given by f
x, x studying controllability for these type of hybrid sys-
tems. This area is widely open and the answers seem to
u, when x > 0,
x_ be quite involved as is also evidenced by the proof of
0, when x 0 and u 6 0g:
Theorem 7.6 in [28]. In the case of discrete-time pie-
This system is accessible (cf. [85]), but not controllable cewise affine systems [9], or piecewise linear con-
as x_ > " prevents you from steering the state to other tinuous-time systems [63,99] some results might be
states with smaller x-values. found, but in general it is far from complete.
design asymptotically stable observers. A two-degree- common or multiple Lyapunov functions [14] and in
of-freedom system is considered in [73] (this is an the field of piecewise linear systems one has approa-
impacting pair that may model dynamic backlash [69], ches based on quadratic and piecewise quadratic
and whose dynamics fits within jugglers dynamics as Lyapunov functions [55]. However, in these approa-
in Eq. (83)). The measured output is assumed to be ches situations including state re-initializations or the
y(t) q2(t). Similarly as for controllability and stabi- fact that the dynamics associated with some modes
lization, observing the state z1 can be achieved only live on lower dimensional subspaces are hardly stu-
through the impacts. We will not elaborate on the died. Here we will present some preliminary work in
observer construction, but let us rather focus on a this direction within the context of complementarity
quite interesting comment in [73]. The dynamics is systems.
given by
and tk generically denotes the impact times. The res- 0 6 ? w > 0,
87c
titution matrix E is function of the restitution coef®-
cient e 2 [0, 1] and of the masses. Consider that u(t) 1, where A 2 Rnn, b 2 Rn1, c 2 Rn1, and d 2 R. As
usual, the system (87) is said to be asymptotically
p data q2
0 q1
0 1, 0 < q_ 1
0
with initial
stable (with equilibrium 0) if all possible state trajec-
q_ 2
0 < 2. Then the following holds
tories x satisfy limt ! 1x(t) 0. A solution (, x, w) of
z
t
k1 A1
A2 z
tk B2 B1 ,
the system is called periodic if all three functions are
86 periodic.
y
t
k Cz
tk ,
Remark 7.10. Normally, one also includes Lyapunov
which can be thought of as a Poincare map stability in the definition of asymptotic stability. Due
with Poincare section f
q, q _ j h
q1 , q2 0; to the structure of the system, we get Lyapunov sta-
rh
q1 , q2 T q_ > 0g (this is called an impact Poincare bility for free in case we have asymptotic stability as
map). The matrices A1 and B1 take into account the defined above. Moreover, in that case we even have
impact conditions, while the matrices A2 and B2 come global exponential stability and asymptotic Lyapunov
from the continuous part of the dynamics. The pair stability (see, e.g. [58] for the exact definitions).
(C, A2) is not observable, but the pair (C, A1A2) is
observable. This nicely indicates that impacts may Note that (87) is replaced by
render the system observable from the output q2(t).
Ax if cT x > 0,
_x
88
A bd c x if cT x 6 0
1 T
7.3. Stability
if d > 0 and by11
Stability issues are of utmost importance for control
Ax if
cT x, cT Ax 0,
applications. It is in fact well-known that a com- x_
89
PAx if cT x 0 and cT Ax 6 0,
plementarity system can be stable (resp. unstable)
while the corresponding unconstrained system is where P I b(cTb) 1cT in case d 0 and cb > 0.
unstable (resp. stable), see e.g. [84, Example 3.2, and Compare this to Eqs (56) and (57)
41]. Hence, as is well-known also for piecewise smooth
systems [14], stability of the submodels (7) does in
general not say anything about stability of the overall 11
The notation denotes lexicographic ordering, i.e. for this case
system. Some studies can be based on the use of either CTx > 0 or CTx 0 and CTAx > 0.
Hybrid Dynamical Systems 347
transfer matrix meaning that (68) leads to the ATK KA < 0 holds with strict inequality in (68),
existence of positive de®nite matrices P PT and then x 0 is the only equilibrium point, which is
Q QT such that PA ATP Q and BTP C asymptotically stable. The jumps as in Theorem 5.15
[67]. Moreover, CA 1B BTA TCT is negative satisfy V(x ) 6 V(x0) with V(x) xTKx an arbitrary
de®nite. storage function obtained by (68).
(b) ' : Rm ! R [ {1} is convex lower semicontin-
Note that the results in this section extend the
uous, so that @' is a maximal monotone multi-
passivity theorems that are formulated for the abso-
valued mapping (see e.g. [15, Example 2.3.4]).
lutely stability problems for Lur'e systems [58]. More
Then the following is true: general versions of the results presented here in the
direction of inclusion of pure switches and external
Lemma 7.13 [20]. Let assumptions (a) and (b) hold. If inputs (voltage and current sources) have been
Cx(0) 2 dom @', then the system in (91) has a unique obtained in [49].
absolutely continuous solution on [0, 1). Assume
also that the graph of @' contains (0, 0). Then: (i) x 0 7.4. Stabilization of Complementarity Lagrangian
is the unique solution of the generalized equilibrium Systems
equation Ax 2 B@'(Cx). (ii) The unique fixed point
x 0 of the system in (91) is exponentially stable. The so-called Lagrange±Dirichlet theorem in
mechanics, states that a mechanical system as in
In the particular case of the system in (90), the proof Eq. (99) whose potential energy is strictly convex, has
can also be found in [26]. In [26] one does not allow an equilibrium point that is Lyapunov stable. This
only continuous solutions, but also allows the possi- result has been used in control by suitably modifying
bility of state re-initializations. The corresponding the potential energy (shaping it) so that a desired
jump rules are derived in Section 5.5 and were based equilibrium is stabilized (see e.g. [67]). The question is:
on a distributional framework. Indeed it is apparent how does this extend to dynamics as in (14)? First of
from Lemma 7.13 that initial states which do not all let us add to (14) an impact law defined as [81]
satisfy the indicated constraint, have to jump instan-
taneously to some consistent value. Moreover, [26] _
q
tk _ k
eq
t
also included the case (67) with D 6 0 (and even allows _ k , V
q
tk ,
1 eproxM
q
tk q
t
external inputs in the study of well-posedness).
93
Lemma 7.14. A state x is an equilibrium point of (67),
if and only if there exist and w
2 Rm satisfying where e 2 [0, 1] is a restitution coef®cient. The set
V(q(tk)) is a tangent cone and it is de®ned as [81]
0 Ax B,
92a
C
w x D,
92b V
x fv 2 Rn j vT rhi
x > 0, 8i 2 J
xg
94
06w ? > 0:
92c with J(x) {i 2 {1, . . . , m} j hi(x) 6 0}, see Fig. 8 for
Moreover, if A is invertible, then we obtain the examples in the plane. The ``prox'' in Eq. (93) means
homogeneous LCP that characterizes all equilibria that the vector
q
t_ _ k =
1 e has to be
k eq
t
chosen as the closest vector to q
t _ k , in V(q(tk)). The
x A 1 B and > 0:
0 6 CA 1 B D? distance is understood in the kinetic metric, i.e. it is
de®ned from the scalar product xTM(q(tk))y for two
Note that these equations comply with the gen-
vectors x and y. The link with quadratic programming
eralized equilibrium equation Ax 2 B@'(Cx) in
is easily done.
Lemma 7.13. From Lemma 7.14 it follows that x 0
It is necessary at this stage, to recall that the solu-
is an equilibrium.
tions of the dynamical system in (14)±(93) have right-
Theorem 7.15. [26]. Consider an LCS given by (67) continuous velocities of local bounded variation
such that (A, B, C, D) is passive, (A, B, C ) is minimal (RCLBV) [7]. This has important consequences, as
and derivatives of RCLBV functions may be complex
objects, but are assured to be measures. In other
B
col
B, D DT : words, the acceleration q is a measure, and quadratic
D DT
functions of q_ are also RCLBV. Thus their derivative
has full column rank. This LCS has only Lyapunov is also a measure. Since measures are signed (a Dirac
stable equilibrium points x. Moreover, if measure is signed ± it may be positive or negative ±), it
Hybrid Dynamical Systems 349
In view of item (iii) let us define the closed-loop The basic idea is to use a positive function V(x, t) such
impact Poincare map that corresponds to the section that the above stability notions can be proved. As a
I fx j hi
q 0, q_ T rhi
q < 0, i 2 I g, which is a consequence the design will be based on the choice of a
hypersurface of codimension card
I . The re- nonlinear tracking controller which is known to
impact velocities are chosen to define PI and the render the closed-loop error system globally asymp-
reason for this choice will be made clear after Claim totically stable when the system is unconstrained.
8.5. We define:
Why choosing a single Lyapunov function (conse-
PI : I ! I quently a single controller structure) and not one
function per phase?
xI
k7!xI
k 1,
103
In fact it is known that for phases
2k and
2k1,
where xI is the state of PI . the same controller structure can be used. Thus using
two different controllers for these phases, is useless.
Definition 8.2. (Strongly stable system). The system is
Moreover this would bring serious complications in
said strongly stable if: (i) it is
-weakly stable, (ii) on
the stability analysis, since the Lyapunov function for
phases Ik, PI is Lyapunov stable, and (iii) the
phases
2k has to be monitored during phases
2k1,
sequence ftk gk2N has a finite accumulation point
and vice versa. To say nothing of the third phase Ik. It
t1 < 1.
therefore seems that there would be very little (if none)
Clearly PI has a fixed point xI 2 @. Now that advantage in using a multiple-Lyapunov functions
we are equipped with a stabilization framework, let us approach for the tracking control of the hybrid
state the main control objective: dynamics (101)±(102).
Given a global asymptotic tracking controller for Let us define the jump function f(t) f(t) f(t )
free-motion tasks, with Lyapunov function V
~ q, q~_ , t, and [.] is the Lebesgue measure. Let V() satisfy
design a feedback control u in (101) such that given a
kXk > V(x, t) > (kxk),
0 (0) 0, () and
desired trajectory qd(), the closed-loop error system
strictly increasing. Let Ik 0k , tkf .
352 W.P.M.H. Heemels and B. Brogliato
Claim 8.3. (
-Weak Stability [16]). Assume that the difficult to master such jumps. The central question
task is as in Eq. (102), and that then is: can the decrease of V(t) during the impactless
phases, compensate for its variation during Ik? In the
(a) [
] 1,
ideal situation, we should manage so that V(t)
(b) for each k 2 N, [Ik] < 1,
decreases at impacts (this is the meaning of the strong
(c) V
x
tkf , tkf 6 V
x
0k , 0k ,
stability concept in Definition 8.2). But this will not
(d) V(x(.), .) uniformly bounded on each Ik.
always be easy to get. On the other hand, it is crucial
to assure that the variation of V() during Ik, does not
_
If on
, V
x
t, t 6 0 and V(tk) 6 0 for all k > 0, then amplify from one cycle to the next when the phases
k
the closed-loop system is
-weakly stable. If are of bounded duration.
_
V
x
t, t 6
k x k,
(0) 0,
() strictly increas-
Claim 8.5. (Strong Stability [16]). The system is
ing, then the system is asymptotically
-weakly stable.
strongly stable if in addition to the conditions in
Claim 8.3 one has:
This accommodates for other types of motions than
the one as in Eq. (102), see [16]. Let us assume that V
tk1 6 V
tk ;
t1 < 1. It is noteworthy that from [7, Proposition V is uniformly bounded and time continuous on
4.11] this precludes elastic impacts (because if there is Ik [ k{tk}.
no loss of kinetic energy atP impacts, impact times
Sufficient conditions are that V(tk) 6 0 and
satisfy tk1 tk > k > 0 with k>0 k unbounded, so
V
tk1 6 V
t k , but this framework permits
that t1 1). On the contrary the results presented
V(tk) > 0 provided V
tk1 < V
t k for some
in Section 8.2 hold only if impacts are elastic. Let us _ needs not be
large enough > 0. Notice also that V
t
now state a result which incorporates the relation
6 0 along the system's trajectories on the whole of
between the speed at which cycles in Eq. (102) are
covered, and the decrease of the Lyapunov function (tk, tk 1). The reason why we have chosen I and not
I in (103) is that it allows us to take into account the
after stabilization on I has been obtained:12
value V
t0 in the stability analysis. Notice that
Claim 8.4. (
-Weak Stability [13]). Let us assume that _
q
t _ 1
1 q
t
(a) and (b) in Claim (8.3) hold and that
(a) outside phase Ik one has V
t _ 6
V
t for some 8.1.4. What is the Difference Compared to the
> 0, Unconstrained Case?
(b) inside phase Ik one has V
tk1 V
t k 6 0,
0
(c) the
P system is initialized on
0 with V
0 6 1, Consider the dynamics in Eq. (99). A globally tracking
k
(d)
t
k>0 V k j 6 KV
0 for some > 0 and controller guarantees that the closed-loop system
some K > 0. possesses the desired trajectory qd() as its unique
Then there exists a constant N < 1 such that isolated invariant set, with a global basin of attrac-
tk1 , tkf 6 N, for all k > 0 (the cycle index), and such tion. The control input is a function of qd(), and the
that: Lyapunov function satisfies V
~ q, q~_ , t 0 when
q() qd(). In fact the closed-loop system that we are
(i) If > 1 and N 1=
ln
1 K= for some going to design, differs a lot. There are some funda-
0 < < 1, then V
0k1 6 V
0k . The system is mental reasons for such discrepancies, which we try
asymptotically weakly stable. now to summarize. Let us start from a simple one
(ii) If < 1, then V
0k 6
, where (
) can be degree-of-freedom example:
made arbitrarily small by increasing
. The
system is practically weakly stable with m
q qd
1
q_ q_ d
2
q qd 2 @
q
R 1((
)).
104
The spirit of Claim 8.4 is really ``hybrid'' since it
for some (possibly time-varying though we dropped
merges the continuous and the discrete-event dyna-
arguments in (104)) signal qd
. The feedback gains
mical features of the system. The upper-bound on the
are
1 > 0 and
2 > 0. Assume now that qd
t > 0 for
sum of the Lyapunov function jumps in (d), is the key
all t > 0 and qd
t 0 during nonzero time intervals.
of Claim 8.4. As said above it is in general quite
Clearly if R then the right-hand side of (104) is
equal to 0, and the unique globally stable invariant
12
In the following we sometimes denote with some abuse of trajectory is q
qd
. If ( 1, a], a > 0, then
notation V(t) V(x(t), t). this trajectory is still the unique invariant one,
Hybrid Dynamical Systems 353
however it may not be globally stable (there may be qd
qd
. On
2k1, qd() 0. In order to converge
impacts with the boundary of ). Consider now toward qic(), the signal qd
has to evolve from cycle k
( 1, a], a > 0. Then clearly qd
can no longer be to cycle k 1 such that, in proportion as tracking
the invariant trajectory, since it ``penetrates'' outside errors decrease on
k, qd
approaches an impactless
. Therefore the invariant and stable trajectory (if trajectory. There will consequently be two distinct
any) must be different from qd
. Let us denote it notions of convergence: one in time t, the other one
as qic(). Now let us think about stability with a Lya- along the cycle index k.
punov-like function V(t). One requirement is that
Remark 8.7. An idea (which will be discussed again in
V 0 when the tracking error is 0, i.e. when the system
Section 8.2) is to design qd() such that its first deri-
evolves on the closed-loop invariant. Consequently let
vative has discontinuities at times tk, i.e. simultaneous
us admit for the moment that V() has to be a function
_
to jumps in q
. This trick would allow one to get
of q qic and q_ q_ ic instead of q_ q_ d and q_ q_ d .
V
t0 0 when perfect tracking is assured. However
Remark 8.6. Notice that the right-hand side of the one has also to keep in mind that the variation of V()
inclusion in (104) can also be written in a closed-loop has to be characterized for nonzero tracking errors as
form as @
t~
~q where ~ q, t > 0, where
~ q~ j h
~ well, and studying such variations when impacts do
~ q, t h
~
h
~ q qd
t.This closed-loop formalism has not match with discontinuities in q_ d
adds useless
not yet been used for tracking design purpose (convex difficulties to the problem. The choice for qd () in
analysis could be at the core of such a design, as a [19,13] is made to simplify as much as possible all
direct extension of the stabilization framework in calculations for the variations of V() during transition
Section 7.4). phases Ik.
Let us think now of the transition phase of motion. In conclusion, asymptotic tracking along a discrete
Let us take R and qd a negative constant. Then trajectory as in Eq. (102) implies in general that [13]:
(104) is the bouncing-ball dynamics. The only invar-
the closed-loop invariant trajectory is impactless,
iant trajectory is
q, q_
0, 0. If one wants that the
but robustness of the stabilization on @ implies
same function V() serves for this impacting phase as
that impacts do occur during transition phases,
well (see Definition 8.2), then during phases Ik one
the signals ``desired trajectory'' which enter the
must have V
q 0, q_ 0 0. In this case it is even
controller and the Lyapunov function, are not
more appropriate to use an impact Poincare mapping
the same.
to represent the dynamics. Once again one sees that it
is not qd which enters the function V(). What do we mean by ``in general?'' In fact there are
A first conclusion is that the signal which enters the special cases where the inertia matrix is such that the
control input (i.e. qd
) is not the signal which enters dynamics tangential to the boundary @ and normal
the Lyapunov function. This is due to the fact that the to @, are decoupled (i.e. in a specific set of general-
system evolves along phases of motion, whose ized coordinates, M(q) is block diagonal). This
underlying dynamics are of different natures. There is decoupling allows one to compensate for the jump
another subtlety in the control design which is a V
t0 V
t0 in a finite time, at each cycle (see [16]).
consequence of asymptotic stability. Indeed assume But as soon as couplings exist, this nice feature van-
that the system is initialized on
0 and with tracking ishes. Let us expand a little on this by showing equa-
errors q qd 0, q_ q_ d 0. Then V(t) 0 for all tions. Assume that h(q) q1 2 R in (101), where q1 is
t > 0, until there is a first impact at time t0. Then if the first component of q. Let us write the inertia
there is a jump in V() at t0, one must have matrix as
V
t 0 V
t0 V
t0 > 0. Since we restrict our-
M11
q M12
q
selves to using only controllers which assure asymp- M
q :
totic convergence of the tracking errors toward zero, it MT12
q M22
q
will generally be impossible to compensate for such Then, choosing for instance Moreau's impact law with
positive jumps. Therefore the invariant closed-loop restitution e 2 [0, 1] [81], at an impact one has
trajectory qic will generally have to be impactless,
otherwise its asymptotic stabilization is impossible. q_ 1
t
k q_ 1
tk
1 eq_ 1
tk ,
Let us recapitulate: V() cannot be a function of q_ 2
t
k q_ 2
tk M221
qMT12
q
1 eq_ 1
tk :
qd
, qic() must be impactless, and during Ik,
105
V
q 0, q_ 0 0. This yields us to define a third
signal qd () that will enter V(). On Ik and still dealing It clearly appears that if M12 0, then the dynamics in
with (104), qd 0 to cope with the third item. On
2k, the (q1, q2) coordinates is decoupled, and there is no
354 W.P.M.H. Heemels and B. Brogliato
jump in the generalized velocity q_ 2 . But if M12 6 0, this reason the Paden and Panja scheme, which is the
then jumps occur in q_ 2 and consequently in the most direct extension of PD control, is chosen in a first
Lyapunov function V(). If V() exactly matches the instance. As we shall see, the closeness of the Lyapu-
kinetic energy at times tk, its variation will auto- nov function to the system's total energy, does not
matically be negative at tk. However making V() allow one to overcome all the obstacles of this control
match T
q, q _ and at the same time assuring the sta- problem. Other parameters, like the ability of char-
bility as in De®nitions 8.1 or 8.2, is not an easy design acterizing the decreasing of V() along continuous
task. When M12 0 it is easy to split the control motion phases t 2
, may be important.
problem into two subproblems with V() V1() In order to cope with the robustness issue (stabili-
V2() where V1() and V2() evolve independently. The zation on I ) and the asymptotic convergence of the
reader is referred to [13] for a description of this tracking error toward zero (which implies an impact-
control problem in terms of invariant trajectories of less closed-loop invariant trajectory), a specific signal
the closed-loop dynamical system. qd
has to be designed during transition phases Ik.
This signal should be such that when perfect tracking
is obtained on
k, then the systems tracks an
8.1.5. Control Design impactless trajectory, whereas for nonzero tracking
errors a bouncing-ball-like motion occurs during Ik.
Let us briefly summarize a solution for tracking con- Such a trajectory is depicted in Fig. 19, where only the
trol, in a general setting (no dynamic decoupling). As normal direction q1 is considered.
said above, the underlying strategy is to use a single The transition phase starts at a time 0k 2
2k (this
controller structure (consequently a unique Lyapunov k is a cycle
2k [ Ik [
2k 1 index), which is chosen
function) which, when applied to an unconstrained by the designer when stabilization on @ is desired. If
system, guarantees asymptotic global tracking. The V
0k 0 (perfect tracking) then there is no impact
input that we are going to present, has some switches and @ is attained tangentially. If V
0k 6 0 then
in it, but the switches are at the level of the desired q1d
t decreases toward V
0k < 0. The time 0k and
trajectory (see Fig. 18). Therefore it is a feedback the value V
0k reflect the convergence of qd
toward
control of the form u u
q, q, _ t that has a fixed qic() as explained in the foregoing subsection. The
_ but is discontinuous
structure with respect to q and q, Paden and Panja scheme has the Lyapunov function
in its second argument.
We do not have an unbounded panoply of non- V
t, q~, q~_ 12 q~_ T M
qq~_ 12
1 q~T q~
q,
linear controllers for tracking control at our disposal.
Among them: feedback linearization (known as the
106
computed torque method in Robotics), the Paden and
Panja scheme, the Slotine and Li scheme (see e.g. [67]). where, following the above discussion, q~ q qd . We
Since impacts dissipate kinetic energy it is natural to can set qd
t qd
t until a ®rst impact occurs at t0.
consider those controllers whose Lyapunov function Then qd(t) 0. Thus for t > t0 the Lyapunov function
is as close as possible the system's total energy. For in Eq. (106) resembles the system's energy, and in
Switching
strategy
Force control
Desired trajectories
generator
Lagrangian
Transition phase trajectory Nonlinear controller complementarity
system
structure
controller such that the trajectory q() converges evolutional variational inequalities, projected dyna-
towards
d in the sense that minp2
d kq(t) pk ! 0 mical systems and so on. However, one has observed
for all t > 0. Some constraints on the velocity should that the class of complementarity systems has its own
also be added to avoid triviality of this stability peculiarities in the sense of mode dynamics living on
concept. lower dimensional subspaces and the possibility of
discontinuities in the state variables. This complicates
8.4. Control of Biped Robots the analysis on one hand, but makes it a huge chal-
lenge on the other. In this paper we have tentatively
The control of biped robots is a very specific and shown that complementarity systems, despite the fact
challenging control problem. It is not our goal here to that they constitute only a subclass of hybrid systems,
survey this wide area, see [52] for references and provide a wealth of potential theoretical studies
discussions on modeling, stability and control. The (controllability, observability, stabilization, etc.) that
stability of a biped robot has itself been the subject of are widely open. Some preliminary results have been
some controversies, and still remains open (see [100] included in the paper that could serve as a starting
where it is shown that the widely used ZMP criterion, point on one hand and indicate the difficulties one has
has severe drawbacks and should not be used for to face on the other. We hope that we stimulated many
walking stability). The complementarity framework is readers in going into this appealing research field and
the suitable one for the design of stable feedback contributing to the analysis and synthesis problems,
controllers. It retains the most important dynamical that are interesting to overcome in view of the broad
features (impacts, Coulomb friction), and at the same range of applications.
time is simple enough.
The stability of a biped robot that walks, can be
seen from two points of view. The first one is that a Acknowledgements
biped is stable, as long as its foot does not slip on the
ground and as long as it does not fall down: this can be The first author would like to thank Kanat CËamllbel,
called the discrete-event stability. The second one is Hans Schumacher, Arjan van der Schaft, Bart De
trajectory tracking: the legs have to track some desired Schutter, Alberto Bemporad, Henk Nijmeijer and
trajectory. This is a low-level, or continuous part Sasha Pogromsky for the fruitful and pleasant coop-
stability. Both are related since the discrete-event eration on complementarity systems.
stability restricts the class of trajectories that can be
tracked. The feedback control of biped robots is, to a
large extent, still an open problem. References
Nonsmooth Mechanics series A 2001; 359 1789: 25. CËamllbel MK. Complementarity methods in the analy-
2327±2346 sis of piecewise linear dynamical systems. Tilburg
8. Bemporad A, Morari M. Control of systems integrating University, The Netherlands, 2001
logic, dynamics, and constraints. Automatica 1999; 26. CËamllbel MK, Heemels WPMH, Schumacher JM. On
35(3): 407±427 linear passive complementarity systems. Eur J Control
9. Bemporad A, Ferrari-Trecate G, Morari M. Observa- 2002; 8(3): 220±237
bility and controllability of piecewise affine and hybrid 27. CËamllbel MK, Heemels WPMH, Schumacher JM.
systems. IEEE Trans Autom Control 2000; 45 10: Consistency of a time-stepping method for a class of
1864±1876 piecewise-linear networks. IEEE Trans circuits and
10. Bemporad A, Morari M, Dua V, Pistikopoulos EN. systems I. Fundamental Theory and Applications 2002;
The explicit linear quadratic regulator for constrained 49(3): 349±357
systems. Automatica 2002; 38(1): 3±20 28. CËamllbel MK, Heemels WPMH, Schumacher JM.
11. Bemporad A, Heemels WPMH, De Schutter, B. On Stability and controllability of planar bimodal linear
hybrid systems and closed-loop MPC systems. IEEE complementarity systems. submitted for publication,
Trans Autom Control 2002; 47(5): 863±869 2003
12. van Bokhoven, WMG. Piecewise linear modelling and 29. Clarke FH, Ledyaev YuS, Sontag ED, Subbotin AI.
analysis. Kluwer, Deventer, The Netherlands, 1981 Asymptotic controllability implies feedback stabilisa-
13. Bourgeot JM, Brogliato B. Tracking control of tion. IEEE Trans Autom Control 1997; 42(10): 1394±
complementarity Lagrangian systems. Submitted to 1407
the International Journal of Bifurcation and Chaos, 30. Clarke FH, Ledyaev YuS, Stern RJ, Wolenski PR.
special issue Non-smooth Dynamical Systems: Recent Nonsmooth analysis and control theory. Springer
Trends and Perspectives. INRIA Research Report, GTM 178, New York, 1998
2003 available at http://www.inria.fr/rrrt/index.en.html 31. Cottle RW, Pang JS, Stone RE. The linear comple-
14. Branicky MS. Stability theory for hybrid dynamical mentarity problem. Academic Press, New York, 1992
systems. IEEE Trans Autom Control 1998; 43(4): 32. Ten Dam AA, Dwarshuis E, Willems JC. The contact
475±482 problem for linear continuous-time dynamical systems:
15. Brezis H. OpeÂrateurs maximaux monotones. Mathe- a geometric approach. IEEE Trans Autom Control
matics Studies, North-Holland, 1973 1997; 42(4): 458±472
16. Brogliato B, Niculescu S, Orhant P. On the control of 33. Deimling K. Multivalued differential equations. De
finite dimensional mechanical systems with unilateral Gruyter series in nonlinear analysis and applications,
constraints. IEEE Trans Autom Control 1997; 42(2): Berlin 1992
200±215 34. De Schutter B, van den Boom T. 2000; Model
17. Brogliato B. Nonsmooth mechanics. 2nd ed. Springer predictive control for max-min-plus-scaling systems.
CCES, London 1999. Erratum and addenda available Technical report of Technical University of Delft,
at http://www.inrialpes.fr/bip/people/brogliato/bro- Electrical Engineering, Control Lab. No. bds:00-03.
gli.html 35. Dupuis P, Nagurney A. Dynamical systems and
18. Brogliato B, Zavala-Rio A. On the control of variational inequalities. Ann Oper Res 1993; 44: 9±42
complementary-slackness juggling mechanical systems. 36. Eaves BC, Lemke CE. Equivalence of LCP and PLS.
IEEE Trans Autom Control 2000; 45(2) 235±246. Math Oper Res 1981; 6(4): 475±484
19. Brogliato B, Niculescu SI, Monteiro Marques MDP. 37. Ferris MC, Pang JS. Engineering and economic
On tracking control of a class of complementary- applications of complementarity problems. SIAM Rev
slackness mechanical systems. Systems Control Lett 1997; 39(4): 669±713
2000; 39(4) 255±266 38. Filippov AF. Differential equations with discontinuous
20. Brogliato B. Absolute stability and the Lagrange- righthand sides. In the series Mathematics and Its
Dirichlet theorem with monotone multivalued map- Applications, Kluwer, Dordrecht, The Netherlands,
pings. 40th IEEE Conference on Decision and Control, 1998
Orlando, USA, December 4±7 2001. Submitted to 39. FreÂmond M. Non-smooth thermomechanics. Springer
Systems and Control Letters. INRIA Research Report Verlag, Berlin-Heidelberg, 2002
4794, available at http://www.inria.fr/rrrt/index.en.html 40. GeÂnot F, Brogliato B. New results on painleve para-
21. Brogliato B, ten Dam AA, Paoli L, GeÂnot F, Abadie M. doxes. Eur J Mechanics A/Solids, 1999; 18(4): 653±678
Numerical simulation of finite dimensional multibody 41. Goeleven D, Brogliato B. 2003; Stability and unstabil-
nonsmooth mechanical systems. ASME Applied ity matrices for linear evolution variational inequalities.
Mechanics Rev 2002; 55(2): 107±150 INRIA Research Report 4768 (http://www.inria.fr/rrrt/
22. Brogliato B. Some perspectives on the analysis and index.en.html). Submitted
control of complementarity systems. IEEE Trans 42. Hartl RF, Sethi SP, Vickson RG. A survey of the
Autom Control June 2003; 48(6) maximum principles for optimal control problems with
23. Brogliato B, Mabrouk M, Zavala-Rio A. On the state constraints. SIAM Rev 1995; 37(2): 181±218
controllability of linear juggling mechanical systems. 43. Heemels WPMH, Schumacher JM, Weiland S. 1999;
INRIA Research Report 4772, 2003. available at Applications of complementarity systems. Proceedings
http://www.inria.fr/rrrt/index.en.html. Submitted to of the European control conference in Karlsruhe
Automatica. (Germany)
24. Brogliato B. Some results on the controllability of 44. Heemels WPMH, Schumacher JM, Weiland S. The
planar variational inequalities. INRIA Report 4788, rational complementarity problem. Linear Algebra and
2003. http://www.inria.fr/rrrt/index.en.html its Appli 1999; 294: 93±135
Hybrid Dynamical Systems 359
45. Heemels WPMH, Schumacher JM, Weiland S. 63. Lee KK, Arapostathis A. On the controllability of
Linear complementarity systems. SIAM J Appl Math piecewise-linear hypersurface systems. Systems &
2000; 60(4): 1234±1269 Control Lett 1987; 9: 89±96
46. Heemels WPMH, Schumacher JM, Weiland S. Pro- 64. Leenaerts DMW, van Bokhoven WMG. Piecewise
jected dynamical systems in a complementarity form- linear modelling and analysis. Kluwer Academic
alism. Oper Res Lett 2000; 27(2): 83±91 Publishers, Dordrecht, The Netherlands, 1998
47. Heemels WPMH, Camlibel MK, van der Schaft AJ, 65. LoÈtstedt P. Mechanical systems of rigid bodies subject
Schumacher JM. Well-posedness of the complementar- to unilateral constraints. SIAM J of Applied Mathe-
ity class of hybrid systems. Proceedings of the IFAC matics, 1998; 42(2): 281±296
World Congress, Barcelona, 2002 66. Lootsma YJ, van der Schaft AJ, Camlibel MK.
48. Heemels WPMH, CËamlibel MK, Schumacher JM. Uniqueness of solutions of linear relay systems.
On the dynamic analysis of piecewise-linear networks. Automatica 1999; 35(3): 467±478
IEEE Trans Circuits Systems ± I 2002; 49(3): 67. Lozano R, Brogliato B, Egeland O, Maschke B.
315±327 Dissipative Systems Analysis and Control. Theory
49. Heemels WPMH, CËamlibel MK, van der Schaft AJ, and Applications. Springer CCES, London, 2000
Schumacher JM. Modelling, well-posedness and stabi- 68. Lynch KM, Black CK. Recurrence, controllability, and
lity of switched electrical networks. In: Proceedings of stabilization of juggling. IEEE Trans Robotics Auto-
Hybrid Systems: Computation and Control, Prague, mation 2001; 17(21): 113±124
2003, 249±266 69. Mata Jimenez M, Brogliato B. Analysis of PD and
50. Heemels, WPMH, De Schutter B, Bemporad A. nonlinear control of mechanical systems with dynamic
Equivalence of hybrid dynamical models. Automatica backlash. J Vibrations Control 2003; 9(1): 119±156
2001; 37(Iss.): 1085±1091 70. McClamroch NH, Wang D. Feedback stabilization and
51. Hiriart-Urruty JB, LemareÂchal C. Fundamentals of tracking of constrained robots. IEEE Trans Autom
Convex Analysis. Springer Grundlehren Text Edition, Control 1988; 33(5): 419±426
Berlin, 2001 71. Menini L, Tornambe A. Asymptotic tracking of
52. Hurmuzlu Y, GeÂnot F, Brogliato B. 2001; Modeling, periodic trajectories for a simple mechanical system
stability and control of biped robots ± a general frame- subject to nonsmooth impacts. IEEE Trans Autom
work. INRIA Research Report 4290, http://www. Control 2001; 46(7): 1122±1126
inria.fr/rrrt/index.en.html. Submitted to Automatica. 72. Menini L, Tornambe A. Velocity observers for non-
53. Janin O, Lamarque CH. Comparison of several numer- linear mechanical systems subject to non-smooth
ical methods for mechanical systems with impacts. Int J impacts. Automatica 2002; 38(12): 2169±2176
Numerical Methods Eng 2001; 51(9): 1101±1132 73. Menini L, Tornambe A. State estimation of (otherwise
54. Johansson KJ, Egerstedt M, Lygeros J, Sastry S. On unobservable) linear mechanical systems through the
the regularization of zeno hybrid automata. Systems use of non-smooth impacts: the case of two mating
Control Lett 1999; 38: 141±150 gears. Automatica 2002; 38(10): 1823±1826
55. Johansson M, Rantzer A. Computation of piecewise 74. Moreau JJ. Les liaisons unilateÂrales et le principe de
quadratic lyapunov functions for hybrid systems. IEEE Gauss. CRAS Paris 1963; 256: 871±874
Trans Autom Control 1998; 43(4): 555±559 75. Moreau JJ. Quadratic programming in mechanics. J
56. John F. Extremum problems with inequalities as SIAM Control 1966; 4(1): 153±158
subsidiary conditions. In: Studies and Essays, Courant 76. Moreau JJ. Fonctionnelles convexes. SeÂminaire sur les
Anniver-sary Volume, Interscience, New York, 1948; eÂquations aux deÂriveÂes partielles, ColleÁge de France,
148±204 Paris, 1966±1967
57. Juloski A, Heemels WPMH, Weiland S. Observer 77. Moreau JJ. Evolution problem associated with a
design for a class of piecewise affine systems. In: moving convex set in a Hilbert space. J Differential
Proceedings of the conference on decision and control Equations 1977; 26: 347±374
2002, Las Vegas, USA, pp. 2606±2611 78. Moreau JJ. Liaisons unilateÂrales sans frottement et
58. Khalil HK. Nonlinear Systems. MacMillan, New York chocs ineÂlastiques. C.R. Acad Sc Paris 1983; t.296: seÂrie
1992 II 1473±1476
59. Klarbring A. Mathematical programming in contact 79. Moreau JJ. Standard inelastic shocks and the dynamics
problems. Computational Mechanics Publications, of unilateral constraints. Springer Verlag, CISM
Southamton, 1993, 233±264 Courses and Lectures 1985; 288: 173±221
60. Krupa M, Khun A, Poth W, Schagerl M, Steiner W, 80. Moreau JJ. Dynamique des systeÁmes a liaisons uni-
Steindl A, Troger H, Wiedermann G. Tethered lateÂrales avec frottement sec eÂventuel; essais numeÂr-
satellite systems: A new concept of space flight. iques. Technical note 85-1, LMGC, University of
Euro J Mechanics A/Solids 2000; 19(special issue): Montpellier 2, France 1986
149±164 81. Moreau JJ. Unilateral contact and dry friction in finite
61. Kuhn HW, Tucker AW. Nonlinear programming. freedom dynamics. In Nonsmooth Mechanics and
In: Proceedings of 2nd Berkeley Symposium on Applications, CISM Courses and Lectures no 302,
Mathematical Statistics and Probability, 1951, Moreau JJ, P.D. Panagiotopoulos (eds). International
481±492 Centre for Mechanical Sciences, Springer-Verlag, 1988;
62. Kunze M, Monteiro Marques M. An introduction to pp. 1±82
Moreau's sweeping process. In: Brogliato B (ed). 82. Moreau JJ. An introduction to unilateral dynamics. In:
Impacts in Mechanical Systems, Springer Lecture FreÂmond M, Macceri F (eds). Novel approaches in civil
Notes in Physics LNP 551, 2000, pp. 1±60. engineering, Springer Verlag, 2003; pp. 1±45
360 W.P.M.H. Heemels and B. Brogliato
83. Murty KG. Linear complementarity, linear and non- 93. van der Schaft AJ, Schumacher JM. Complementarity
linear programming. 1997 available at http://www- Modelling of Hybrid Systems. IEEE Trans Autom
personal. engin.umich.edu/Ämurty/book/LCPbook/ Control 1998; 43(4): 483±490
84. Nagurney A, Zhang D. Projected dynamical systems 94. van der Schaft AJ, Schumacher JM. An introduction
and variational inequalities with applications. Kluwer's to hybrid dynamical systems. Springer, London, Lect-
International Series in Operations research and ure Notes in Control and Information Sci 1999; 251
Management Science, Boston, 1996 95. Schumacher JM. Complementarity systems in optimi-
85. Nijmeijer H, van der Shaft AJ. Nonlinear dynamical zation. submitted for publication 2003
control systems. Springer Verlag, Berlin, 1990 96. Sontag ED. Nonlinear regulation: the piecewise linear
86. Paoli L. 2002 A numerical scheme for impact problems approach. IEEE Trans Automatic Control 1981;
with inelastic shocks: a convergence result in the multi- 26(2): 346±357
constraint case. In: Proc. of the international con- 97. Stewart DE. Rigid-body dynamics with friction and
ference on nonsmooth/nonconvex mechanics with impacts. SIAM Rev 2000; 42(1): 3±39
applications in Engineering, 5±6 July, Thessaloniki, 98. Stronge WJ. Impact Mechanics. Cambridge Univer-
Greece, C.C. Baniotopoulos (ed), Ziti editions, pp. 269± sity Press, Cambridge 2000
274 99. Veliov VM, Krastanov MI. Controllability of piece-
87. Paoli L, Schatzman M. A numerical scheme for impact wise linear systems. Systems & Control Lett 1986; 7:
problems. I The one-dimensional case; II The multi- 335±341
dimensional case. SIAM J Numerical Analysis 2002; 100. Wieber PB. On the Stability of walking systems. 3rd
40(2): 702±768 IARP international workshop on Humanoid
88. Pfeiffer F, Glocker C, Multibody Dynamics with and Human Friendly Robotics, December 11±12,
Unilateral Contacts. Wiley interscience, series in non- Tsukuba Research Center, Ibaraki, Japan, 2002
linear Dynamics 1996 101. Willems JC. Dissipative dynamical systems. Arch
89. Pontryagin LS, Boltyanskii VG, Gamkrelidze RV, Rational Mech Anal 1972; 45: 321±393
Mishchenko EF. The mathematical theory of optimal 102. Yoshikawa T, Dynamic hybrid position/force control
pro-cesses. John Wiley & Sons, 1962 of robot manipulatros ± Description of hand con-
90. Yu Pogromsky A, Heemels WPMH, Nijmeijer H. On straints and calculation of joint driving force. IEEE
well-posedness of relay systems. Proc. NOLCOS 2001, Trans Robotics and Autom 1987; 3(5): 386±392
St. Petersburg (Russia) 2001 103. Zavala-Rio A, Brogliato B. On the control of a one
91. Rockafellar RT. Convex Analysis. Princeton Univer- degree-of-freedom juggling robot. Dynamics and
sity Press, Princeton, NJ, 1972 Control 1999; 9: 67±90
92. van der Schaft AJ, Schumacher JM. The complemen- 104. Zavala-Rio A, Brogliato B. Direct adaptive control
tary-slackness class of hybrid systems. Mathematics of design for one-degree-of-freedom complementary-
Control, Signals and Systems 1996; 9: 266±301 slackness jugglers. Automatica, 2001; 37: 1117±1123