Bayesian Optimization in the wild: Risk-averse and Computationally-effective Decision-making

Makarova, Anastasiia

doi:10.3929/ethz-b-000653602

Show simple item record

dc.contributor.author

Makarova, Anastasiia

dc.contributor.supervisor

Krause, Andreas

dc.contributor.supervisor

Bogunovic, Ilija

dc.contributor.supervisor

Perez-Cruz, Fernando

dc.date.accessioned

2024-01-18T11:49:05Z

dc.date.available

2024-01-17T18:49:54Z

dc.date.available

2024-01-18T11:49:05Z

dc.date.issued

2023

dc.identifier.uri

http://hdl.handle.net/20.500.11850/653602

dc.identifier.doi

10.3929/ethz-b-000653602

dc.description.abstract

Sequential decision-making in some complex and uncertain environments can be formalized as optimizing a black-box function. For example, in drug design, the aim is to maximize compound efficiency by sequentially searching through the vast space of molecules, or, in tuning a particle accelerator, the aim is to maximize the particle beam energy. Interactions with these environments can be costly or time-consuming, demanding decision-making systems to work in small-data regimes. Bayesian optimization (BO) is a powerful data- driven framework for global optimization that adaptively chooses actions to evaluate. Its potential for impactful real-world applications is vast, from those mentioned above to widespread automated machine learning (ML) services. There are, however, practical challenges that impede its implementation or result in sub-optimal decisions. This dissertation aims to advance the applicability of BO by addressing three cru- cial points: risk-aversion, which ensures performance beyond the average only; query- effectiveness, which aims at smart use of budgets covering evaluation costs; and problem- adaptiveness, which leverages the structure of the decision space. To this end, we propose novel approaches and examine them theoretically and empirically in real scenarios. First, we tackle balancing high utility and low risk – a serious issue in high-stakes applica- tions. For example, the accelerator’s maximum pulse energy must be stable to observe chem- ical reactions; or in drug discovery, the drug must succeed for all individuals. Our approach trades off mean and input-dependent aleatoric uncertainty (both learned on the fly during optimization) and provides theoretical sample complexity results. Moreover, our empirical study on tuning a particle accelerator and ML models shows the benefit of the approach. Second, we aim at efficient budget allocation, enriching the practical performance heavily dependent on the interaction cost and the available budget covering these costs. For example, the success of a timely drug discovery depends on the smart use of the resources at hand. We tackle two crucial questions of query efficiency in BO: (1) how to use cheaper but less accurate evaluations, and (2) when to stop the optimization. We leverage multi-fidelity optimization and incorporate the evaluation costs in a natural information-theoretic manner. Moreover, our automatic termination approach for the BO loop determines the minimal budget required for obtaining a high-quality solution. Finally, the complex and non-continuous nature of the decision variables can make applying Bayesian optimization in areas such as drug design or automated ML difficult. Our approach for mixed-variable BO exploits the structure and interconnections within the discrete subdomain, learning them on the fly during the optimization. Our approaches make BO applicable to a wider range of critical applications while combining practical simplicity with theoretically grounded reasoning.

en_US

dc.format

application/pdf

en_US

dc.language.iso

en

en_US

dc.publisher

ETH Zurich

en_US

dc.rights.uri

http://rightsstatements.org/page/InC-NC/1.0/

dc.subject

Bayesian optimization (BO)

en_US

dc.title

Bayesian Optimization in the wild: Risk-averse and Computationally-effective Decision-making

en_US

dc.type

Doctoral Thesis

dc.rights.license

In Copyright - Non-Commercial Use Permitted

dc.date.published

2024-01-18

ethz.size

183 p.

en_US

ethz.code.ddc

DDC - DDC::0 - Computer science, information & general works::004 - Data processing, computer science

en_US

ethz.identifier.diss

29641

en_US

ethz.publication.place

Zurich

en_US

ethz.publication.status

published

en_US

ethz.leitzahl

ETH Zürich::00002 - ETH Zürich::00012 - Lehre und Forschung::00007 - Departemente::02150 - Dep. Informatik / Dep. of Computer Science::02661 - Institut für Maschinelles Lernen / Institute for Machine Learning::03908 - Krause, Andreas / Krause, Andreas

en_US

ethz.leitzahl.certified

ETH Zürich::00002 - ETH Zürich::00012 - Lehre und Forschung::00007 - Departemente::02150 - Dep. Informatik / Dep. of Computer Science::02661 - Institut für Maschinelles Lernen / Institute for Machine Learning::03908 - Krause, Andreas / Krause, Andreas

en_US

ethz.date.deposited

2024-01-17T18:49:55Z

ethz.source

FORM

ethz.eth

yes

en_US

ethz.availability

Open access

en_US

ethz.rosetta.installDate

2024-01-18T11:49:06Z

ethz.rosetta.lastUpdated

2024-02-03T08:52:38Z

ethz.rosetta.versionExported

true

ethz.COinS

ctx_ver=Z39.88-2004&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.atitle=Bayesian%20Optimization%20in%20the%20wild:%20Risk-averse%20and%20Computationally-effective%20Decision-making&rft.date=2023&rft.au=Makarova,%20Anastasiia&rft.genre=unknown&rft.btitle=Bayesian%20Optimization%20in%20the%20wild:%20Risk-averse%20and%20Computationally-effective%20Decision-making

Search print copy at ETH Library

Files in this item

Name:: Thesis_Makarova_A.pdf
Size:: 11.95Mb
Format:: Adobe PDF
Label:: Full text

Download

Publication type

Doctoral Thesis [30301]

Show simple item record

Research Collection

Search

Bayesian Optimization in the wild: Risk-averse and Computationally-effective Decision-making Mendeley CSV RIS BibTeX

Files in this item

Publication type

Bayesian Optimization in the wild: Risk-averse and Computationally-effective Decision-making

Mendeley

CSV

RIS

BibTeX