Bayesian Optimization in the wild: Risk-averse and Computationally-effective Decision-making
dc.contributor.author
Makarova, Anastasiia
dc.contributor.supervisor
Krause, Andreas
dc.contributor.supervisor
Bogunovic, Ilija
dc.contributor.supervisor
Perez-Cruz, Fernando
dc.date.accessioned
2024-01-18T11:49:05Z
dc.date.available
2024-01-17T18:49:54Z
dc.date.available
2024-01-18T11:49:05Z
dc.date.issued
2023
dc.identifier.uri
http://hdl.handle.net/20.500.11850/653602
dc.identifier.doi
10.3929/ethz-b-000653602
dc.description.abstract
Sequential decision-making in some complex and uncertain environments can be formalized as optimizing a black-box function. For example, in drug design, the aim is to maximize compound efficiency by sequentially searching through the vast space of molecules, or, in tuning a particle accelerator, the aim is to maximize the particle beam energy. Interactions with these environments can be costly or time-consuming, demanding decision-making systems to work in small-data regimes. Bayesian optimization (BO) is a powerful data- driven framework for global optimization that adaptively chooses actions to evaluate. Its potential for impactful real-world applications is vast, from those mentioned above to widespread automated machine learning (ML) services. There are, however, practical challenges that impede its implementation or result in sub-optimal decisions.
This dissertation aims to advance the applicability of BO by addressing three cru- cial points: risk-aversion, which ensures performance beyond the average only; query- effectiveness, which aims at smart use of budgets covering evaluation costs; and problem- adaptiveness, which leverages the structure of the decision space. To this end, we propose novel approaches and examine them theoretically and empirically in real scenarios.
First, we tackle balancing high utility and low risk – a serious issue in high-stakes applica- tions. For example, the accelerator’s maximum pulse energy must be stable to observe chem- ical reactions; or in drug discovery, the drug must succeed for all individuals. Our approach trades off mean and input-dependent aleatoric uncertainty (both learned on the fly during optimization) and provides theoretical sample complexity results. Moreover, our empirical study on tuning a particle accelerator and ML models shows the benefit of the approach.
Second, we aim at efficient budget allocation, enriching the practical performance heavily dependent on the interaction cost and the available budget covering these costs. For example, the success of a timely drug discovery depends on the smart use of the resources at hand. We tackle two crucial questions of query efficiency in BO: (1) how to use cheaper but less accurate evaluations, and (2) when to stop the optimization. We leverage multi-fidelity optimization and incorporate the evaluation costs in a natural information-theoretic manner. Moreover, our automatic termination approach for the BO loop determines the minimal budget required for obtaining a high-quality solution.
Finally, the complex and non-continuous nature of the decision variables can make applying Bayesian optimization in areas such as drug design or automated ML difficult. Our approach for mixed-variable BO exploits the structure and interconnections within the discrete subdomain, learning them on the fly during the optimization.
Our approaches make BO applicable to a wider range of critical applications while combining practical simplicity with theoretically grounded reasoning.
en_US
dc.format
application/pdf
en_US
dc.language.iso
en
en_US
dc.publisher
ETH Zurich
en_US
dc.rights.uri
http://rightsstatements.org/page/InC-NC/1.0/
dc.subject
Bayesian optimization (BO)
en_US
dc.title
Bayesian Optimization in the wild: Risk-averse and Computationally-effective Decision-making
en_US
dc.type
Doctoral Thesis
dc.rights.license
In Copyright - Non-Commercial Use Permitted
dc.date.published
2024-01-18
ethz.size
183 p.
en_US
ethz.code.ddc
DDC - DDC::0 - Computer science, information & general works::004 - Data processing, computer science
en_US
ethz.identifier.diss
29641
en_US
ethz.publication.place
Zurich
en_US
ethz.publication.status
published
en_US
ethz.leitzahl
ETH Zürich::00002 - ETH Zürich::00012 - Lehre und Forschung::00007 - Departemente::02150 - Dep. Informatik / Dep. of Computer Science::02661 - Institut für Maschinelles Lernen / Institute for Machine Learning::03908 - Krause, Andreas / Krause, Andreas
en_US
ethz.leitzahl.certified
ETH Zürich::00002 - ETH Zürich::00012 - Lehre und Forschung::00007 - Departemente::02150 - Dep. Informatik / Dep. of Computer Science::02661 - Institut für Maschinelles Lernen / Institute for Machine Learning::03908 - Krause, Andreas / Krause, Andreas
en_US
ethz.date.deposited
2024-01-17T18:49:55Z
ethz.source
FORM
ethz.eth
yes
en_US
ethz.availability
Open access
en_US
ethz.rosetta.installDate
2024-01-18T11:49:06Z
ethz.rosetta.lastUpdated
2024-02-03T08:52:38Z
ethz.rosetta.versionExported
true
ethz.COinS
ctx_ver=Z39.88-2004&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.atitle=Bayesian%20Optimization%20in%20the%20wild:%20Risk-averse%20and%20Computationally-effective%20Decision-making&rft.date=2023&rft.au=Makarova,%20Anastasiia&rft.genre=unknown&rft.btitle=Bayesian%20Optimization%20in%20the%20wild:%20Risk-averse%20and%20Computationally-effective%20Decision-making
Files in this item
Publication type
-
Doctoral Thesis [30301]