Algorithmic Foundations for Safe and Efficient Reinforcement Learning from Human Feedback
Open access
Author
Date
2023
Type
- Doctoral Thesis
ETH Bibliography
yes
Abstract
Reinforcement learning (RL) has shown remarkable success in applications with well-defined reward functions, such as maximizing the score in a video game or optimizing an algorithm’s run-time. However, in many real-world applications, there is no well-defined reward function. Instead, Reinforcement Learning from Human Feedback (RLHF) allows RL agents to learn from human-provided data, such as evaluations or rankings of trajectories. In many applications, human feedback is expensive to collect; therefore, learning robust policies from limited data is crucial. In this dissertation, we propose novel algorithms to enhance the sample efficiency and robustness of RLHF.
First, we propose active learning algorithms to improve the sample efficiency of RLHF by selecting the most informative data points for the user to label and by exploring the environment guided by uncertainty about the user’s preferences. Our approach provides conceptual clarity about active learning for RLHF and theoretical sample complexity results, drawing inspiration from multi-armed bandits and Bayesian optimization. Moreover, we provide extensive empirical evaluations in simulations that demonstrate the benefit of active learning for RLHF.
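To make the idea of uncertainty-guided query selection concrete, here is a minimal sketch, not the thesis's algorithm: it assumes linear reward features, a pairwise Bradley-Terry preference model, and a small ensemble of reward models, and it queries the human on the trajectory pair where the ensemble's preference predictions disagree most. All function names and modeling choices below are illustrative assumptions.

```python
# Illustrative sketch of uncertainty-based query selection for preference-based
# reward learning (assumed linear rewards and Bradley-Terry preferences).
import numpy as np

rng = np.random.default_rng(0)

def ensemble_preference_probs(ensemble, traj_a, traj_b):
    """P(traj_a preferred over traj_b) under each reward model in the ensemble."""
    probs = []
    for w in ensemble:
        r_a, r_b = traj_a @ w, traj_b @ w          # linear trajectory-feature rewards (assumption)
        probs.append(1.0 / (1.0 + np.exp(-(r_a - r_b))))
    return np.array(probs)

def most_informative_pair(ensemble, candidate_pairs):
    """Pick the pair whose preference label the ensemble is most uncertain about."""
    best_pair, best_score = None, -np.inf
    for traj_a, traj_b in candidate_pairs:
        p = ensemble_preference_probs(ensemble, traj_a, traj_b)
        score = p.std()                             # ensemble disagreement as an uncertainty proxy
        if score > best_score:
            best_pair, best_score = (traj_a, traj_b), score
    return best_pair

# Toy usage: 5 reward models over 4-dimensional trajectory features, 20 candidate pairs.
ensemble = [rng.normal(size=4) for _ in range(5)]
pairs = [(rng.normal(size=4), rng.normal(size=4)) for _ in range(20)]
query = most_informative_pair(ensemble, pairs)
```

The disagreement criterion stands in for the information-theoretic acquisition rules discussed in the dissertation; the point is only that the learner, not the annotator, decides which comparison is worth a label.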
Second, we extend RLHF to learning constraints from human preferences instead of, or in addition to, rewards. We argue that constraints are a particularly natural representation of human preferences, especially in safety-critical applications. We develop algorithms that learn constraints effectively from demonstrations with unknown rewards and that actively learn constraints from human feedback. Our results suggest that representing human preferences as constraints can lead to safer policies and extend the potential applications of RLHF.
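As a rough illustration of constraint inference from demonstrations with unknown rewards, the sketch below (an illustrative assumption, not the thesis's method) flags states that a nominal reward-maximizing policy visits often but the human demonstrations never visit as candidate constraints.

```python
# Illustrative sketch: flag candidate constrained states by comparing state
# visitation frequencies of human demonstrations against a nominal policy.
import numpy as np

def candidate_constraints(demo_visits, nominal_visits, threshold=0.05):
    """demo_visits, nominal_visits: per-state visitation frequencies (same shape).
    Returns indices of states flagged as likely constrained."""
    demo_visits = np.asarray(demo_visits, dtype=float)
    nominal_visits = np.asarray(nominal_visits, dtype=float)
    # A state the demonstrator never enters, but the nominal policy relies on,
    # is a candidate constraint (threshold is an illustrative hyperparameter).
    flagged = (demo_visits == 0) & (nominal_visits > threshold)
    return np.flatnonzero(flagged)

# Toy usage: 6 states; the nominal policy cuts through state 2, demonstrations avoid it.
demo = [0.4, 0.3, 0.0, 0.2, 0.1, 0.0]
nominal = [0.2, 0.1, 0.5, 0.1, 0.1, 0.0]
print(candidate_constraints(demo, nominal))   # -> [2]
```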
The proposed algorithms for reward and constraint learning serve as a foundation for future research to enhance the efficiency, safety, and applicability of RLHF.
Permanent link
https://doi.org/10.3929/ethz-b-000635156
Publication status
published
External links
Search print copy at ETH Library
Publisher
ETH Zurich
Subject
reinforcement learning; inverse reinforcement learning; preference learning; reinforcement learning from human feedback
Organisational unit
03908 - Krause, Andreas / Krause, Andreas