Show simple item record

dc.contributor.author
Karimi Jaghargh, Mohammad R.
dc.contributor.supervisor
Krause, R. Andreas
dc.contributor.supervisor
Bartlett, Peter L.
dc.contributor.supervisor
Hassani, Hamed S.
dc.contributor.supervisor
Mertikopoulos, Panayotis
dc.date.accessioned
2024-09-26T06:58:38Z
dc.date.available
2024-09-25T11:54:55Z
dc.date.available
2024-09-26T06:58:38Z
dc.date.issued
2024
dc.identifier.uri
http://hdl.handle.net/20.500.11850/696217
dc.identifier.doi
10.3929/ethz-b-000696217
dc.description.abstract
Stochastic approximation methods are a class of iterative algorithms that play an essential role in applications involving noisy and incomplete observations. Rooted in the seminal works of Robbins and Monro (1951) and Kiefer and Wolfowitz (1952), this class of iterative processes drive a system towards a specified objective despite noise and bias. Stochastic approximation methods have become increasingly important in fields like statistics and machine learning due to their resilience to noise and low computational costs. They are especially useful in training neural networks and adaptive learning systems. The rise of big data has further heightened the significance of stochastic approximation, as it necessitates efficient and scalable algorithms for real-time decision-making. This thesis embarks on a renewed exploration of stochastic approximation through a contemporary lens, focusing on its dynamics and long-term behavior in non-Euclidean spaces. Inspired by the dynamical systems approach introduced by Benaïm and Hirsch in the 1990s, this work tackles crucial questions surrounding the convergence of the iterates of a stochastic approximation algorithm, the characteristics of its limits, and the desirability of these limits. The thesis aims to provide profound theoretical insights into the stability and convergence of these algorithms amidst the challenges posed by their non-Euclidean structure and real-world conditions marked by noise and incomplete information. This work examines stochastic approximation within three unconventional contexts where traditional methods are inadequate. Firstly, we delve into stochastic approximation on Riemannian manifolds, analyzing problems related to Riemannian optimization and strategic games. This non-Euclidean setting introduces complexities requiring novel approaches for proving convergence. Secondly, the thesis investigates the discretization of stochastic differential equations by lifting the problem to the Wasserstein space. This is particularly relevant to sampling algorithms based on the Langevin diffusion, which are essential in Bayesian learning and generative modeling. We demonstrate how our findings can enhance the reliability and effectiveness of these sampling algorithms, ultimately supporting more robust Bayesian inference and generative processes. Lastly, the thesis explores algorithms lacking a step-size parameter, focusing on the Sinkhorn algorithm for solving the entropic optimal transport and Schrödinger bridge problems. By proposing a variant with a step-size parameter and examining its continuous-time limit, we offer a stochastic approximation analysis that ensures convergence of this variant of the Sinkhorn algorithm even under noise and bias. This thesis represents a harmonious blend of classical stochastic approximation and its extended applications in non-Euclidean setups. By integrating different areas of mathematics, we offer a thorough analysis that enriches the theoretical foundation of stochastic approximation and guarantees the robustness of these algorithms across a diverse array of scenarios.
en_US
dc.format
application/pdf
en_US
dc.language.iso
en
en_US
dc.publisher
ETH Zurich
en_US
dc.rights.uri
http://rightsstatements.org/page/InC-NC/1.0/
dc.subject
Stochastic Approximation;Dynamical Systems;Riemannian Geometry;Wasserstein Spaces;Entropic Optimal Transport;Sinkhorn Algorithm;Schrödinger Bridges
en_US
dc.title
Stochastic Approximation on Riemannian Manifolds and the Space of Measures
en_US
dc.type
Doctoral Thesis
dc.rights.license
In Copyright - Non-Commercial Use Permitted
ethz.size
257 p.
en_US
ethz.code.ddc
DDC - DDC::5 - Science::510 - Mathematics
en_US
ethz.code.ddc
DDC - DDC::0 - Computer science, information & general works::004 - Data processing, computer science
en_US
ethz.identifier.diss
30588
en_US
ethz.publication.place
Zurich
en_US
ethz.publication.status
published
en_US
ethz.leitzahl
ETH Zürich::00002 - ETH Zürich::00012 - Lehre und Forschung::00007 - Departemente::02150 - Dep. Informatik / Dep. of Computer Science::02661 - Institut für Maschinelles Lernen / Institute for Machine Learning::03908 - Krause, Andreas / Krause, Andreas
en_US
ethz.date.deposited
2024-09-25T11:54:55Z
ethz.source
FORM
ethz.eth
yes
en_US
ethz.availability
Open access
en_US
ethz.rosetta.installDate
2024-09-26T06:58:39Z
ethz.rosetta.lastUpdated
2024-09-26T06:58:39Z
ethz.rosetta.versionExported
true
ethz.COinS
ctx_ver=Z39.88-2004&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.atitle=Stochastic%20Approximation%20on%20Riemannian%20Manifolds%20and%20the%20Space%20of%20Measures&rft.date=2024&rft.au=Karimi%20Jaghargh,%20Mohammad%20R.&rft.genre=unknown&rft.btitle=Stochastic%20Approximation%20on%20Riemannian%20Manifolds%20and%20the%20Space%20of%20Measures
 Search print copy at ETH Library

Files in this item

Thumbnail

Publication type

Show simple item record