Show simple item record

dc.contributor.author
Bhat, Goutam
dc.contributor.supervisor
Van Gool, Luc
dc.contributor.supervisor
Matas, Jiri
dc.contributor.supervisor
Favaro, Paolo
dc.contributor.supervisor
Danelljan, Martin
dc.date.accessioned
2023-08-21T13:55:40Z
dc.date.available
2023-08-20T21:16:31Z
dc.date.available
2023-08-21T13:55:40Z
dc.date.issued
2023
dc.identifier.uri
http://hdl.handle.net/20.500.11850/627399
dc.identifier.doi
10.3929/ethz-b-000627399
dc.description.abstract
The rise of mobile cameras has led to fundamental changes in photog- raphy, owing to their low cost, small size, and ease of use. However, the hardware constraints imposed on mobile cameras significantly limit the quality of their photos. Consequently, modern cameras rely on software technologies in order to improve the image quality. A promising direction is to combine information from multiple images to generate a higher quality image. We tackle this multi-frame image restoration problem in this thesis. First, we introduce a novel architecture for the RAW burst super- resolution task. Our network takes multiple noisy RAW images as input, and generates a denoised, demosaicked, and super-resolved RGB image as output. In order to enable training and evaluation on real world data, we additionally collect the first burst super-resolution dataset, consisting of smartphone bursts and high-resolution DSLR reference. We demonstrate promising super-resolution performance on real world bursts, despite the presence of spatial and color mis- alignments in our training pairs. Next, we propose a deep reparametrization of the maximum a posteriori (MAP) formulation commonly employed in multi-frame image restoration tasks. Our approach is derived by introducing a learned error metric and a latent representation of the target image, which transforms the MAP objective to a deep feature space. The deep reparametrization allows us to directly model the image formation process in the latent space, and to integrate learned image priors into the prediction. Thirdly, we introduce a self-supervised training strategy for RAW burst super-resolution. Our approach utilizes only noisy low-resolution bursts for training, thereby eliminating the need to use sophisticated methods for collecting paired training data, or manually tuning synthetic pipelines. This is achieved by developing a novel self- iii supervised objective which can exploit the aliased high-frequency information present within a burst for training supervision. Finally, we introduce a method to generate per-pixel segmentation masks for an object in a burst or a video. Our approach is not limited to segment a set of known object classes. Instead, it can learn to segment novel objects in a few-shot manner, given a single segmentation mask or a bounding box defining the object. We believe that such segmentation masks can serve as useful cues to improve restoration performance, specially in case of dynamic objects.
en_US
dc.format
application/pdf
en_US
dc.language.iso
en
en_US
dc.publisher
ETH Zurich
en_US
dc.rights.uri
http://rightsstatements.org/page/InC-NC/1.0/
dc.title
Multi-Frame Image Restoration
en_US
dc.type
Doctoral Thesis
dc.rights.license
In Copyright - Non-Commercial Use Permitted
ethz.size
197 p.
en_US
ethz.code.ddc
DDC - DDC::0 - Computer science, information & general works::004 - Data processing, computer science
en_US
ethz.identifier.diss
29198
en_US
ethz.publication.place
Zurich
en_US
ethz.publication.status
published
en_US
ethz.leitzahl
ETH Zürich::00002 - ETH Zürich::00012 - Lehre und Forschung::00007 - Departemente::02140 - Dep. Inf.technologie und Elektrotechnik / Dep. of Inform.Technol. Electrical Eng.::02652 - Institut für Bildverarbeitung / Computer Vision Laboratory::03514 - Van Gool, Luc (emeritus) / Van Gool, Luc (emeritus)
en_US
ethz.leitzahl.certified
ETH Zürich::00002 - ETH Zürich::00012 - Lehre und Forschung::00007 - Departemente::02140 - Dep. Inf.technologie und Elektrotechnik / Dep. of Inform.Technol. Electrical Eng.::02652 - Institut für Bildverarbeitung / Computer Vision Laboratory::03514 - Van Gool, Luc (emeritus) / Van Gool, Luc (emeritus)
en_US
ethz.date.deposited
2023-08-20T21:16:32Z
ethz.source
FORM
ethz.eth
yes
en_US
ethz.availability
Open access
en_US
ethz.rosetta.installDate
2023-08-21T13:55:42Z
ethz.rosetta.lastUpdated
2024-02-03T02:37:11Z
ethz.rosetta.exportRequired
true
ethz.rosetta.versionExported
true
ethz.COinS
ctx_ver=Z39.88-2004&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.atitle=Multi-Frame%20Image%20Restoration&rft.date=2023&rft.au=Bhat,%20Goutam&rft.genre=unknown&rft.btitle=Multi-Frame%20Image%20Restoration
 Search print copy at ETH Library

Files in this item

Thumbnail

Publication type

Show simple item record