From self-tuning regulators to reinforcement learning and back again

Nikolai Matni, Alexandre Proutiere, Anders Rantzer, Stephen Tu

Research output: Chapter in Book/Report/Conference proceedingPaper in conference proceedingpeer-review

Abstract

Machine and reinforcement learning (RL) are increasingly being applied to plan and control the behavior of autonomous systems interacting with the physical world. Examples include self-driving vehicles, distributed sensor networks, and agile robots. However, when machine learning is to be applied in these new settings, the algorithms had better come with the same type of reliability, robustness, and safety bounds that are hallmarks of control theory, or failures could be catastrophic. Thus, as learning algorithms are increasingly and more aggressively deployed in safety critical settings, it is imperative that control theorists join the conversation. The goal of this tutorial paper is to provide a starting point for control theorists wishing to work on learning related problems, by covering recent advances bridging learning and control theory, and by placing these results within an appropriate historical context of system identification and adaptive control.

Original languageEnglish
Title of host publication2019 IEEE 58th Conference on Decision and Control, CDC 2019
PublisherIEEE - Institute of Electrical and Electronics Engineers Inc.
Pages3724-3740
Number of pages17
ISBN (Electronic)9781728113982
ISBN (Print)978-1-7281-1399-9
DOIs
Publication statusPublished - 2020 Mar 12
Event58th IEEE Conference on Decision and Control, CDC 2019 - Nice, France
Duration: 2019 Dec 112019 Dec 13

Publication series

NameProceedings of the IEEE Conference on Decision and Control
PublisherIEEE
Volume2019-December
ISSN (Print)0743-1546
ISSN (Electronic)2576-2370

Conference

Conference58th IEEE Conference on Decision and Control, CDC 2019
Country/TerritoryFrance
CityNice
Period2019/12/112019/12/13

Subject classification (UKÄ)

  • Control Engineering

Fingerprint

Dive into the research topics of 'From self-tuning regulators to reinforcement learning and back again'. Together they form a unique fingerprint.

Cite this