ML Book
ML Book
Thomas Duriez
Steven L. Brunton
Bernd R. Noack
Machine Learning
Control –
Taming Nonlinear
Dynamics and
Turbulence
Fluid Mechanics and Its Applications
Volume 116
Series editor
André Thess, German Aerospace Center, Institute of Engineering
Thermodynamics, Stuttgart, Germany
Founding Editor
René Moreau, Ecole Nationale Supérieure d’Hydraulique de Grenoble,
Saint Martin d’Hères Cedex, France
Aims and Scope of the Series
The purpose of this series is to focus on subjects in which fluid mechanics plays a
fundamental role.
As well as the more traditional applications of aeronautics, hydraulics, heat and
mass transfer etc., books will be published dealing with topics which are currently
in a state of rapid development, such as turbulence, suspensions and multiphase
fluids, super and hypersonic flows and numerical modeling techniques.
It is a widely held view that it is the interdisciplinary subjects that will receive
intense scientific attention, bringing them to the forefront of technological
advancement. Fluids have the ability to transport matter and its properties as well
as to transmit force, therefore fluid mechanics is a subject that is particularly open to
cross fertilization with other sciences and disciplines of engineering. The subject of
fluid mechanics will be highly relevant in domains such as chemical, metallurgical,
biological and ecological engineering. This series is particularly open to such new
multidisciplinary domains.
The median level of presentation is the first year graduate student. Some texts are
monographs defining the current state of a field; others are accessible to final year
undergraduates; but essentially the emphasis is on readability and clarity.
Bernd R. Noack
Machine Learning
Control –
Taming Nonlinear
Dynamics and
Turbulence
123
Thomas Duriez Bernd R. Noack
Laboratorio de Fluido Dinámica Département Mécanique-Energétique
CONICET - Universidad de Buenos Aires LIMSI-CNRS, UPR 3251
Buenos Aires Orsay
Argentina France
and
Steven L. Brunton
Mechanical Engineering Department Institut für Strömungsmechanik
University of Washington Technische Universität Braunschweig
Seattle, WA Braunschweig
USA Germany
vii
viii Preface
organized as follows. In Chap. 1, the reader is introduced to feedback control and its
challenges for complex real-world problems. Chapter 2 constitutes the core of the
book. This chapter formulates feedback control as a regression problem and
employs genetic programming as a powerful regression technique to identify the
best feedback law. Chapter 3 reviews classical methods of control theory against
which MLC is benchmarked in Chap. 4 for linear and weakly nonlinear dynamics.
These chapters provide context for feedback control, but they are not required to
implement the MLC methods in Chap. 2. The hurried reader may jump to Chap. 5
if she/he is interested in strongly nonlinear dynamics applications or to Chap. 6
if she/he is interested in experimental implementations of feedback flow control.
Chapter 7 distills good practices for real-world experiments that need to be taken
into account in any MLC implementation. In Chap. 8 we provide an outlook on
future methodological advances, which are expected to drastically amplify the
applicability and performance of MLC. In addition, we list a number of future MLC
applications with epic proportions.
We have profited tremendously from interactions with many colleagues on
machine learning control. First, we highly appreciate André Thess for his continual
encouragement to write a book about turbulence control for this Springer series. He
has nurtured the idea for years before we decided to write this book. We highly
appreciate the insightful and inspiring interviews with leading scholars of the field:
Shervin Bagheri, Belinda Batten, Mark Glauser, Marc Schoenauer, and David
Williams. These additions provide valuable perspectives for past progress and
future work. Eurika Kaiser has provided continual exquisite feedback on our
chapters and also contributed with her illuminating visualizations in Chap. 7
showing the performance of MLC.
We have also benefited greatly from our mentors throughout our careers. BRN is
deeply indebted to his turbulence control mentors Andrzej Banaszuk, Andreas
Dillmann, Helmut Eckelmann, Rudibert King, and William K. George, who shared
and fueled the passion for the field. SLB would like to gratefully acknowledge and
thank Nathan Kutz, Naomi Leonard, Richard Murray, Clancy Rowley, and Rob
Stengel, who each found unique ways to make dynamics and control theory come
to life. TD would like to acknowledge Eduardo Jose Wesfreid, Jean-Luc Aider,
Guillermo Artana, Luc Pastur, François Lusseyran, and Bernd R. Noack, who each
have had a profound (and most beneficial) impact on his perception of the different
fields he has been in contact with. This book would not have been possible without
our many colleagues, collaborators, and co-authors who have shared our early
enthusiasm for MLC and have dedicated significant energy to developing it:
Markus Abel, Jean-Luc Aider, Zhe Bai, Diogo Barros, Jean-Paul Bonnet, Jacques
Borée, Bing Brunton, Juan Martin Cabaleiro, Camila Chevot, Tom Daniel, Antoine
Debien, Laurent Cordier, Christophe Cuvier, Joël Delville (d), Caroline Fourment
(d), Hiroaki Fukumoto, Nicolas Gautier, Fabien Harambat, Eurika Kaiser, Laurent
Keirsbulck, Azeddine Kourta, Kai von Krbek, Nathan Kutz, Jean-Charles
Laurentie, Ruiying (Cecile) Li, François Lusseyran, Robert Martinuzzi, Lionel
Mathelin, Nicolas Mazellier, Marek Morzyński, Christian Nayeri, Robert Niven,
Akira Oyama, Vladimir Parezanović, Oliver Paschereit, Luc Pastur, Brian Polagye,
x Preface
Josh Proctor, Bartosz Protas, Rolf Radespiel, Cedric Raibaudo, Jim Riley,
Tony Ruiz, Michael Schlegel, Peter Scholz, Marc Segond, Richard Semaan, Tamir
Shaqarin, Andreas Spohn, Michel Stanislas, Ben Strom, and Sam Taira. Many of
our co-authors have applied the nascent MLC methodology in their own experi-
ments early on, when success was far from certain. We thank our students for
visiting our courses in Argentina, France, Germany, and the USA and contributing
with many good questions, new ideas and encouraging project results. Anneke Pot
from Springer Publisher has dependably supported us in critical decisions about
book contents and the production procedure.
1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
1.1 Feedback in Engineering and Living Systems . . . . . . . . . . . . . . . 1
1.2 Benefits of Feedback Control . . . . . . . . . . . . . . . . . . . . . . . . . . 3
1.3 Challenges of Feedback Control . . . . . . . . . . . . . . . . . . . . . . . . 6
1.4 Feedback Turbulence Control is a Grand Challenge Problem . . . . 7
1.5 Nature Teaches Us the Control Design. . . . . . . . . . . . . . . . . . . . 8
1.6 Outline of the Book . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9
1.7 Exercises . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9
2 Machine Learning Control (MLC) . . . . . . . . . . . . . . . . . . . . . . . . . 11
2.1 Methods of Machine Learning. . . . . . . . . . . . . . . . . . . . . . . . . . 12
2.1.1 System Identification as Machine Learning . . . . . . . . . . . . 13
2.1.2 Genetic Algorithms . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14
2.1.3 Genetic Programming . . . . . . . . . . . . . . . . . . . . . . . . . . 16
2.1.4 Additional Machine Learning Methods . . . . . . . . . . . . . . 18
2.2 MLC with Genetic Programming. . . . . . . . . . . . . . . . . . . . . . . . 19
2.2.1 Control Problem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 19
2.2.2 Parameterization of the Control Law . . . . . . . . . . . . . . . . 20
2.2.3 Genetic Programming as a Search Algorithm . . . . . . . . . . 21
2.2.4 Initializing a Generation . . . . . . . . . . . . . . . . . . . . . . . . . 23
2.2.5 Evaluating a Generation . . . . . . . . . . . . . . . . . . . . . . . . . 24
2.2.6 Selecting Individuals for Genetic Operations. . . . . . . . . . . 26
2.2.7 Selecting Genetic Operations . . . . . . . . . . . . . . . . . . . . . 27
2.2.8 Advancing Generations and Stopping Criteria . . . . . . . . . . 30
2.3 Examples . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 33
2.3.1 Fitting a Function Through Data Points . . . . . . . . . . . . . . 33
2.3.2 MLC Applied to Control a Dynamical System . . . . . . . . . 36
2.4 Exercises . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 44
2.5 Suggested Reading . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 45
2.6 Interview with Professor Marc Schoenauer . . . . . . . . . . . . . . . . . 46
xi
xii Contents
xv
xvi Abbreviations
xvii
xviii Symbols
U Characteristic velocity
u; us , uΔ , u , u Velocity (vector field; steady solution; deviation due to
Reynolds stresses; contribution of frequency ω ; contribution
of frequency ω )
u Slow varying mean flow
u
0
Flow fluctuations
u Streamwise velocity component
V; Vr Right singular vectors of SVD (complete reduced)
Vd Disturbance variance
Vn Noise variance
VJet Characteristic velocity of jets
v Velocity vector initial condition
v Transverse velocity component
WdC Discrete-time controllability Gramian
WdO Discrete-time observability Gramian
W Mixing layer width
w Disturbance array
wr External reference signal
wd External disturbance, process noise
wn Measurement noise
w Spanwise velocity component
X Solution to the Riccati equation for LQR
x Space vector
x Streamwise coordinate
Y Solution to the Riccati equation for Kalman filter
y Transverse coordinate
z System output
z Spanwise coordinate
β , β , β , β Parameter for growth rate change in oscillators of a general-
ized mean-field model (Table 5.1)
γ Penalization coefficient
γ , γ , γ , γ Parameter for frequency change in oscillators for a generalized
mean-field model (Table 5.1)
δðÞ Dirac delta function
ε Nonlinearity strength coefficient or state stabilization error
κ Gain of the generalized mean-field model
ν Kinematic viscosity
ρ
P P Fluid density
; r Singular values matrix of SVD, (complete; reduced)
σ Oscillator growth rate
σ ; σ H ; σ ; σ H Growth rate of oscillators of a generalized mean-field model
(Table 5.1)
τ; τa ; τu Period of time (with actuated system; with unactuated system)
xx Symbols