Linear Quadratic Regulator
- n-dimensional state space: the number of states grows exponentially in n (assuming some fixed number of discretization levels per coordinate).
- In practice: discretization is considered only computationally feasible up to 5 or 6 dimensional state spaces, even when using …
This Lecture
- Optimal Control for Linear Dynamical Systems and Quadratic Cost (aka the LQ setting, or LQR setting).
- Very special case: we can solve the continuous state-space optimal control problem exactly, and it only requires performing linear algebra operations.
- Great reference: [optional] Anderson and Moore, Linear Quadratic Methods --- the standard reference for the LQ setting.
- Note: strong similarity with Kalman filtering, which is able to compute the Bayes filter updates exactly even though, in general, there are no closed-form solutions and numerical solutions scale poorly with dimensionality.
- While the LQ assumptions might (at first) seem very restrictive, we will see the method can be made applicable to non-linear systems, e.g., a helicopter.
Value Iteration
LQR:
- In summary: J_1(x) is quadratic, just like J_0(x). The value iteration update is the same for all times and can be done in closed form for this particular continuous state-space system and cost!
- Fact: guaranteed to converge to the infinite-horizon optimal policy iff the system (A, B) is stabilizable, i.e., there exists a feedback policy that achieves finite total cost.
- LQR = a method for keeping a linear system at the all-zeros state while preferring to keep the control input small.
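To make the closed-form backup concrete, here is the standard finite-horizon LQR recursion in common notation, where J_i(x) = x^T P_i x is the optimal cost-to-go with i steps remaining (the symbols A, B, Q, R, P_i, K_i are my choice and may not match the slides'):

\[
\begin{aligned}
&x_{t+1} = A x_t + B u_t, \qquad g(x, u) = x^\top Q x + u^\top R u, \qquad Q \succeq 0, \; R \succ 0, \qquad P_0 = Q,\\
&K_{i+1} = -(R + B^\top P_i B)^{-1} B^\top P_i A, \qquad \pi_{i+1}(x) = K_{i+1} x,\\
&P_{i+1} = Q + A^\top P_i A - A^\top P_i B \, (R + B^\top P_i B)^{-1} B^\top P_i A .
\end{aligned}
\]

Each backup only involves matrix multiplications and one matrix inverse, which is why value iteration can be carried out exactly here despite the continuous state space.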
LQR extensions:
- Affine systems
- Systems with stochasticity
- Regulation around a non-zero fixed point for non-linear systems
- Penalization for change in control inputs
- Linear time-varying (LTV) systems
- Trajectory following for non-linear systems
Affine systems:
- The optimal control policy remains linear, and the optimal cost-to-go function remains quadratic.
- Two avenues to do the derivation:
  1. Re-derive the update, which is very similar to what we did for the standard setting.
  2. Re-define the state as z_t = [x_t; 1]; then we are back to a linear system in z_t (see the sketch below).
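A minimal sketch of avenue 2, assuming the affine dynamics are written as x_{t+1} = A x_t + B u_t + b (my notation):

\[
z_t = \begin{bmatrix} x_t \\ 1 \end{bmatrix}, \qquad
z_{t+1} = \begin{bmatrix} A & b \\ 0 & 1 \end{bmatrix} z_t + \begin{bmatrix} B \\ 0 \end{bmatrix} u_t ,
\]

so z_{t+1} is a linear function of (z_t, u_t) and the standard LQR backup applies directly to the augmented system.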
Systems with stochasticity:
- Exercise: work through a similar derivation as we did for the deterministic case. Result:
- Same optimal control policy.
- The cost-to-go function is almost identical: it has one additional term, which depends on the variance of the noise (and which cannot be influenced by the choice of control inputs).
- Equivalently: the noise does not change the optimal policy, so we can compute it as in the deterministic (noise-free) case.
Standard LQR:
- When run in this form on real systems, high-frequency control inputs often get generated. This is typically highly undesirable and results in poor control performance. Why?
- Solution: frequency shaping of the cost function. This can be done by augmenting the system with a filter and then using the filter output in the quadratic cost function. (See, e.g., Anderson and Moore.)
- Simple special case which works well in practice: penalize for change in control inputs. How?
Standard LQR:
- How do we incorporate the change in controls into the cost/reward function?
- Solution method A: explicitly incorporate it into the state, by augmenting the state with the past control input vector and the difference between the last two control input vectors.
- Solution method B: change of variables to fit into the standard LQR setting (see the sketch below).
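A minimal sketch of method B, assuming the extra cost term penalizes u_t - u_{t-1} (my notation):

\[
z_t = \begin{bmatrix} x_t \\ u_{t-1} \end{bmatrix}, \qquad v_t = u_t - u_{t-1}, \qquad
z_{t+1} = \begin{bmatrix} A & B \\ 0 & I \end{bmatrix} z_t + \begin{bmatrix} B \\ I \end{bmatrix} v_t ,
\]

and the cost x_t^T Q x_t + u_{t-1}^T R u_{t-1} + v_t^T R' v_t is quadratic in (z_t, v_t), so the standard LQR machinery applies with state z_t and input v_t.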
Trajectory following for non-linear systems:
- Problem statement: follow a target trajectory with a non-linear system; linearizing the dynamics around the target trajectory gives a linear time-varying system with matrices A_t and B_t.
- Now we can run the standard LQR back-up iterations. The resulting policy at i time-steps from the end is a time-varying linear feedback law on the deviation from the target trajectory.
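A minimal numpy sketch of the time-varying back-up just described (function and variable names are mine; x and u here denote deviations from the target trajectory):

import numpy as np

def ltv_lqr_backup(A_list, B_list, Q_list, R_list, Q_final):
    """Finite-horizon LQR backup for time-varying linear dynamics.

    A_list[t], B_list[t]: dynamics x_{t+1} = A_t x_t + B_t u_t, t = 0..H-1
    Q_list[t], R_list[t]: stage cost x_t^T Q_t x_t + u_t^T R_t u_t
    Q_final:              terminal cost x_H^T Q_final x_H
    Returns feedback gains K_list with u_t = K_list[t] @ x_t.
    """
    H = len(A_list)
    P = Q_final                        # cost-to-go matrix at the end of the horizon
    K_list = [None] * H
    for t in reversed(range(H)):       # backward pass: value iteration in closed form
        A, B, Q, R = A_list[t], B_list[t], Q_list[t], R_list[t]
        # Minimizing the quadratic in u gives a linear feedback gain.
        K = -np.linalg.solve(R + B.T @ P @ B, B.T @ P @ A)
        K_list[t] = K
        # Riccati update for the cost-to-go matrix.
        P = Q + A.T @ P @ A + A.T @ P @ B @ K
        P = 0.5 * (P + P.T)            # symmetrize for numerical robustness
    return K_list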
- The target trajectory need not be feasible to apply this technique; however, if it is infeasible, then the linearizations are not around the (state, input) pairs that will actually be visited.

Iterative LQR: solve the non-linear problem by iteratively approximating it and leveraging the fact that the linear quadratic formulation is easy to solve.
- Potential issue: the optimal policy for the LQ approximation might end up not staying close to the sequence of points around which the LQ approximation was computed by Taylor expansion.
- Solution: in each iteration, adjust the cost function so this is the case, i.e., use the cost function (1 - α) g(x_t, u_t) + α (||x_t - x_t^(i)||^2 + ||u_t - u_t^(i)||^2). Assuming g is bounded, for α close enough to one, the 2nd term will dominate and ensure the linearizations are good approximations around the solution trajectory found by LQR.
- f is non-linear, hence this is a non-convex optimization problem. It can get stuck in local optima! Good initialization matters.
- g could be non-convex: then the LQ approximation fails to have positive-definite cost matrices.
- Practical fix: if Q_t or R_t are not positive definite, increase the penalty for deviating from the current state and input (x_t^(i), u_t^(i)) until the resulting Q_t and R_t are positive definite.
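A small numpy sketch of that practical fix, assuming the deviation penalty enters (in deviation coordinates) as a multiple of the identity added to the symmetric cost matrix (names are mine):

import numpy as np

def make_positive_definite(M, eps=1e-6):
    """Increase the deviation penalty (a multiple of the identity) until symmetric M is positive definite."""
    n = M.shape[0]
    reg = 0.0
    while np.min(np.linalg.eigvalsh(M + reg * np.eye(n))) <= eps:
        reg = 1e-8 if reg == 0.0 else 10.0 * reg   # grow the penalty until it dominates
    return M + reg * np.eye(n)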
Differential dynamic programming (DDP):
- The term is often loosely used to refer to the iterative LQR procedure. More precisely: directly perform a 2nd-order Taylor expansion of the Bellman back-up equation [rather than linearizing the dynamics and 2nd-order approximating the cost].
- Turns out this retains a term in the back-up equation which is discarded in the iterative LQR approach. [It's a quadratic term in the dynamics model though, so even if the cost is convex, the resulting LQ problem could be non-convex.]
- To keep the entire expression 2nd order: use Taylor expansions of f and then remove all resulting terms which are higher than 2nd order. Turns out this keeps one additional term compared to iterative LQR.
Yes!
- At convergence of iLQR and DDP, we end up with linearizations around the (state, input) trajectory the algorithm converged to.
- In practice: the system might not end up on this trajectory due to perturbations, the initial state being off, the dynamics model being off, etc.
- Solution: at time t, when asked to generate control input u_t, we could re-solve the control problem using iLQR or DDP over the time steps t through H.
- Replanning the entire trajectory is often impractical; in practice, replan over a horizon h. This is receding horizon control.
- This requires providing a cost-to-go J^(t+h) which accounts for all future costs. This could be taken from the offline iLQR or DDP run.
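A schematic of the receding-horizon loop described above; plan_ilqr and cost_to_go stand in for an iLQR/DDP planner and the offline cost-to-go, and are hypothetical names rather than part of the lecture:

def run_receding_horizon(x0, H, h, plan_ilqr, cost_to_go, step):
    """Receding horizon control sketch.

    plan_ilqr(x, horizon, terminal_cost): returns a list of planned controls
    cost_to_go(t): terminal cost-to-go J_t (e.g., taken from an offline iLQR/DDP run)
    step(x, u): advances the (real or simulated) system one time step
    """
    x = x0
    states, controls = [x0], []
    for t in range(H):
        horizon = min(h, H - t)                        # never plan past the task horizon
        u_plan = plan_ilqr(x, horizon, cost_to_go(t + horizon))
        u = u_plan[0]                                  # execute only the first planned control
        x = step(x, u)                                 # the true state may drift off the plan
        states.append(x)
        controls.append(u)
    return states, controls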
Multiplicative noise
In many systems of interest, there is noise entering the system which is multiplicative in the control inputs, i.e.:

x_{t+1} = A x_t + (B + B_w w_t) u_t
Cart-pole
[See, e.g., Slotine and Li, or Boyd lecture notes (pointers available on course website) if you want to find out more.]
Controllability
- A system is t-time-steps controllable if from any start state x_0 we can reach any target state x* at time t.
- For a linear time-invariant system we have

  x_t = A^t x_0 + [B  AB  ...  A^(t-1) B] [u_{t-1}; u_{t-2}; ...; u_0],

  hence the system is t-time-steps controllable if and only if the above linear system of equations in u_0, ..., u_{t-1} has a solution for all choices of x_0 and x_t. This is the case if and only if rank([B  AB  ...  A^(t-1) B]) = n.
- The Cayley-Hamilton theorem from linear algebra says that for all A and for all t >= n, A^t can be written as a linear combination of I, A, ..., A^(n-1), so the columns A^k B with k >= n do not add to the span.
- Hence we obtain that the system (A, B) is controllable for all times t >= n if and only if rank([B  AB  ...  A^(n-1) B]) = n.
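A small numpy check of this rank condition (the function name and the double-integrator example are mine):

import numpy as np

def is_controllable(A, B):
    """Return True iff rank([B, AB, ..., A^(n-1) B]) = n, i.e. (A, B) is controllable."""
    n = A.shape[0]
    blocks, M = [], B.copy()
    for _ in range(n):
        blocks.append(M)
        M = A @ M
    C = np.hstack(blocks)                 # controllability matrix
    return np.linalg.matrix_rank(C) == n

# Example: a double integrator (position and velocity, force input) is controllable.
A = np.array([[1.0, 1.0],
              [0.0, 1.0]])
B = np.array([[0.0],
              [1.0]])
print(is_controllable(A, B))              # True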
Feedback linearization
ẋ = f(x) + g(x) u
"!This condition can be checked by applying the chain rule and examining the rank of certain matrices! "! The proof is actually semi-constructive: it constructs a set of partial differential equations to which the state transformation is the solution.
Further readings:
- Slotine and Li, Chapter 6: Example 6.10 shows state-input linearization in action.
- Isidori, Nonlinear Control Systems, 1989.
Car
For your reference: standard (kinematic) car models (from LaValle, Planning Algorithms, 2006, Chapter 13):
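One such model, the simple kinematic car, written in common notation (state (x, y, θ), forward speed u_s, steering angle u_φ, wheelbase L; the exact symbols are my choice, not necessarily the slides'):

\[
\dot{x} = u_s \cos\theta, \qquad
\dot{y} = u_s \sin\theta, \qquad
\dot{\theta} = \frac{u_s}{L} \tan u_\phi .
\]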
Cart-pole
Acrobot
Lagrangian dynamics
Newton: F = ma
- Quite generally applicable.
- Its application can become a bit cumbersome in multibody systems with constraints/internal forces.
- The Lagrangian dynamics method eliminates the internal forces from the outset and expresses the dynamics w.r.t. the degrees of freedom of the system.
Lagrangian dynamics
- r_i: generalized coordinates
- T: total kinetic energy
- U: total potential energy
- Q_i: generalized forces
- Lagrangian: L = T - U
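With these quantities, the equations of motion take the standard Euler-Lagrange form (stated here for completeness):

\[
\frac{d}{dt}\left(\frac{\partial L}{\partial \dot{r}_i}\right) - \frac{\partial L}{\partial r_i} = Q_i .
\]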