Author:
Description:
Optimal control theory has a long history and broad applications. Motivated by the goal of obtaining insights through unification, and by the abundant capability to generate data and perform online simulation, this thesis studies discrete-time infinite-horizon optimal control problems and introduces approximate solution methods via abstract dynamic programming (DP) models. The proposed methods involve approximation in value space through the use of data and a simulator, apply to a broad class of problems, and strike a good balance between satisfactory performance and computational expenditure. First, we consider deterministic problems with nonnegative stage costs. We derive sufficient conditions under which a local controllability condition holds for constrained nonlinear systems, and apply the results to establish the convergence of classical algorithms, including value iteration, policy iteration (PI), and optimistic PI. These results provide a starting point for the design of suboptimal schemes. Then we propose algorithms that take advantage of system trajectories or the presence of parallel computing units to approximate the optimal costs. These algorithms can be viewed as variants of model predictive control (MPC) or rollout, and can be applied to deterministic problems with arbitrary state and control spaces and arbitrary dynamics. They admit extensions to problems with trajectory constraints and to problems with a multiagent structure. From the viewpoint provided by the abstract DP models, we also derive performance bounds for MPC applied to unconstrained and constrained linear quadratic problems, as well as their nonlinear counterparts. These insights suggest new MPC designs, which are likely to yield larger feasible regions with hardly any loss of performance, as measured by the costs accumulated over infinite stages. Moreover, we derive algorithms to address problems with a fixed discount factor on future costs.
We apply abstract DP models to analyze $\lambda$-PI with ...
Publisher:
KTH, Reglerteknik ; Stockholm, Sweden
Year of Publication:
2023
Document Type:
Doctoral thesis, monograph ; info:eu-repo/semantics/doctoralThesis ; text ; [Doctoral and postdoctoral thesis]
Language:
eng
Subjects:
Optimal Control ; Dynamic Programming ; Rollout ; Model Predictive Control ; Control Engineering ; Reglerteknik
DDC:
004 Data processing & computer science (computed)
Rights:
info:eu-repo/semantics/openAccess
Content Provider:
Kungliga Tekniska Högskolan, Stockholm: KTHs Publikationsdatabas DiVA
Further name: Royal Institute of Technology, Stockholm: KTHs Publication Database DiVA
- URL: http://kth.diva-portal.org/
- Research Organization Registry (ROR): KTH Royal Institute of Technology
- Continent: Europe
- Country: se
- Latitude / Longitude: 59.332720 / 18.064450
- Number of documents: 53,797
- Open Access: 53,795 (99%)
- Type: Academic publications
- System: Diva
- Content provider indexed in BASE since:
- BASE URL: https://www.base-search.net/Search/Results?q=coll:ftkthstockholm