Predictive Mechanisms for Safe Learning in Control Systems

Wabersich, Kim P.

doi:10.3929/ethz-b-000534919

dc.contributor.author

Wabersich, Kim P.

dc.contributor.supervisor

Zeilinger, Melanie N.

dc.contributor.supervisor

Borrelli, Francesco

dc.contributor.supervisor

Trimpe, Sebastian

dc.date.accessioned

2022-03-02T13:08:06Z

dc.date.available

2022-03-02T11:32:33Z

dc.date.available

2022-03-02T13:08:06Z

dc.date.issued

2021

dc.identifier.uri

http://hdl.handle.net/20.500.11850/534919

dc.identifier.doi

10.3929/ethz-b-000534919

dc.description.abstract

The increasing impact of data-driven technologies across various industries has sparked renewed interest in using learning-based approaches to automatically design and optimize control systems. While recent success stories from the field of reinforcement learning (RL) suggest that such approaches have immense potential, missing safety certificates still confine learning-based methods to simulation environments or fail-safe laboratory conditions. To address this limitation, Part A of this dissertation introduces a predictive safety filter that makes it possible to enhance existing, potentially unsafe learning-based controllers with safety guarantees. The underlying method is based on model predictive control (MPC) theory and ensures constraint satisfaction through an optimization-based safety mechanism that provides a safe backup control law at all times. To enable the efficient design of the proposed predictive safety filter from system data, this thesis extends available robustification methods from MPC to support diverse system classes through different model assumptions. This part of the thesis specifically introduces the core concepts for closed-loop chance constraint satisfaction using simple linear system models with data-driven uncertainties and learning-based linear model estimates with unbounded process noise. Moreover, uncertain system models with significant nonlinear effects are efficiently supported through a prediction mechanism that exploits confident subsets of the state and input space. Further developments of these techniques are outlined in this thesis; they additionally cover distributed systems and illustrate the predictive safety filter in a miniature racing application. Compared with existing safety frameworks based on control barrier function theory, predictive safety filters avoid the computationally difficult task of deriving a control barrier function and thereby provide favorable scalability properties toward large-scale and distributed systems.
Despite the seemingly different concepts of predictive safety filters and control barrier functions, this thesis establishes and formalizes the theoretical relations between the two approaches through a so-called ‘predictive control barrier function’, which further enables the recovery of infeasible nonlinear predictive control problems in an asymptotically stable fashion. While predictive safety filters offer a high degree of modularity in terms of safety and task-specific objectives, this separation can make a rigorous performance analysis difficult. For this reason, Part B introduces specialized learning-based MPC controllers for accelerated learning toward a distinct goal. Even if the objective function is explicitly available, the design of an MPC controller requires an accurate prediction model, often in combination with a terminal constraint and a terminal objective function to compensate for short prediction horizons. Part B tackles the difficult task of designing these components from three different angles. It first introduces a learning-based improvement of established and safe MPC controllers for asymptotic stabilization tasks through a stochastic tube-based MPC mechanism that supports probabilistic regression models. While this makes it possible to take advantage of available system data for accurate predictions, insufficient prior knowledge or a deficient initial database requires additional mechanisms to efficiently acquire new data. To automate this identification process, the contributions of Part B continue with the question of how a controller can efficiently explore the system and when it should transition from exploration to exploitation of the available information. The proposed solution to these questions is based on posterior sampling theory and results in a computationally efficient active learning MPC formulation that provides finite-time performance guarantees.
The last contribution of this part addresses the performance degradation of an MPC controller caused by short prediction horizons, which is present even in the case of perfectly known prediction models. To overcome this limitation, Part B develops a data-driven mechanism to iteratively improve the terminal cost and terminal set of an MPC problem by leveraging system trajectories. During training, the proposed method efficiently handles model uncertainties and constraint violations to support learning-based prediction models and poorly performing initial controllers. This is achieved through a soft-constrained MPC formulation supporting polytopic state constraints.
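
The predictive safety filter idea from Part A can be illustrated with a small toy sketch. It is not the formulation from the thesis: the double-integrator model, the maximal-braking backup law, the terminal set (velocity at or below zero inside the position limit), and all names and parameter values are assumptions made for illustration, and a grid search over candidate inputs stands in for the optimization problem that an MPC-based filter would solve.

```python
TOL = 1e-9  # numerical tolerance for treating the velocity as zero


def simulate(p, v, a, dt):
    """One step of a discrete-time double integrator (assumed toy model)."""
    return p + dt * v, v + dt * a


def backup_is_safe(p, v, p_max, a_max, dt, horizon):
    """Check a backup trajectory: brake as hard as allowed and verify that
    the position constraint p <= p_max holds until the velocity is <= 0."""
    for _ in range(horizon):
        if p > p_max:
            return False  # state constraint violated along the prediction
        if v <= TOL:
            return True  # assumed terminal safe set reached
        a = max(-a_max, -v / dt)  # brake without overshooting below v = 0
        p, v = simulate(p, v, a, dt)
    return p <= p_max and v <= TOL


def safety_filter(p, v, a_learn, p_max=1.0, a_max=1.0, dt=0.1,
                  horizon=50, grid=41):
    """Return the admissible input closest to a_learn for which a safe
    backup trajectory exists after applying it for one step."""
    clipped = max(-a_max, min(a_max, a_learn))
    candidates = [-a_max + 2.0 * a_max * i / (grid - 1) for i in range(grid)]
    candidates.append(clipped)
    # Try candidates in order of increasing distance to the proposed input.
    for a in sorted(candidates, key=lambda c: abs(c - a_learn)):
        p_next, v_next = simulate(p, v, a, dt)
        if backup_is_safe(p_next, v_next, p_max, a_max, dt, horizon):
            return a
    return -a_max  # no certified candidate: fall back to full braking


if __name__ == "__main__":
    # Far from the limit the proposed input passes through unchanged.
    print(safety_filter(0.0, 0.0, 1.0))
    # Close to the limit the filter overrides the proposed acceleration.
    print(safety_filter(0.8, 0.5, 1.0))
```

The learning-based controller only proposes `a_learn`; the filter modifies it when no safe backup trajectory can be certified, which mirrors the modular separation of safety and task objectives that the abstract describes.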

en_US

dc.format

application/pdf

en_US

dc.language.iso

en

en_US

dc.publisher

ETH Zurich

en_US

dc.rights.uri

http://rightsstatements.org/page/InC-NC/1.0/

dc.subject

model predictive control (MPC)

en_US

dc.subject

stochastic control

en_US

dc.subject

Safe learning-based control

en_US

dc.subject

Constrained control

en_US

dc.title

Predictive Mechanisms for Safe Learning in Control Systems

en_US

dc.type

Doctoral Thesis

dc.rights.license

In Copyright - Non-Commercial Use Permitted

dc.date.published

2022-03-02

ethz.size

295 p.

en_US

ethz.code.ddc

DDC - DDC::6 - Technology, medicine and applied sciences::621.3 - Electric engineering

en_US

ethz.identifier.diss

27922

en_US

ethz.publication.place

Zurich

en_US

ethz.publication.status

published

en_US

ethz.leitzahl

ETH Zürich::00002 - ETH Zürich::00012 - Lehre und Forschung::00007 - Departemente::02130 - Dep. Maschinenbau und Verfahrenstechnik / Dep. of Mechanical and Process Eng.::02619 - Inst. Dynam. Syst. u. Regelungstechnik / Inst. Dynamic Systems and Control::09563 - Zeilinger, Melanie / Zeilinger, Melanie

en_US

ethz.leitzahl.certified

ETH Zürich::00002 - ETH Zürich::00012 - Lehre und Forschung::00007 - Departemente::02130 - Dep. Maschinenbau und Verfahrenstechnik / Dep. of Mechanical and Process Eng.::02619 - Inst. Dynam. Syst. u. Regelungstechnik / Inst. Dynamic Systems and Control::09563 - Zeilinger, Melanie / Zeilinger, Melanie

en_US

ethz.date.deposited

2022-03-02T11:32:39Z

ethz.source

FORM

ethz.eth

yes

en_US

ethz.availability

Open access

en_US

ethz.rosetta.installDate

2022-03-02T13:08:13Z

ethz.rosetta.lastUpdated

2023-02-07T00:17:58Z

ethz.rosetta.versionExported

true

ethz.COinS

ctx_ver=Z39.88-2004&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.atitle=Predictive%20Mechanisms%20for%20Safe%20Learning%20in%20Control%20Systems&rft.date=2021&rft.au=Wabersich,%20Kim%20P.&rft.genre=unknown&rft.btitle=Predictive%20Mechanisms%20for%20Safe%20Learning%20in%20Control%20Systems

Files in this item

Name:: ThesisKimWabersich_ResearchCol ...
Size:: 5.856Mb
Format:: Adobe PDF
Label:: Full text
