Meta-Learning via Hypernetworks

Zhao, Dominic; Kobayashi, Seijin; Sacramento, João; von Oswald, Johannes

doi:10.3929/ethz-b-000465883

dc.contributor.author

Zhao, Dominic

dc.contributor.author

Kobayashi, Seijin

dc.contributor.author

Sacramento, João

dc.contributor.author

von Oswald, Johannes

dc.date.accessioned

2022-04-12T14:09:06Z

dc.date.available

2021-01-27T08:08:37Z

dc.date.available

2021-01-27T08:28:16Z

dc.date.available

2021-01-27T08:38:53Z

dc.date.available

2022-04-11T13:44:40Z

dc.date.available

2022-04-12T14:09:06Z

dc.date.issued

2020-12

dc.identifier.uri

http://hdl.handle.net/20.500.11850/465883

dc.identifier.doi

10.3929/ethz-b-000465883

dc.description.abstract

Recent developments in few-shot learning have shown that during fast adaptation, gradient-based meta-learners mostly rely on embedding features of powerful pretrained networks. This leads us to research ways to effectively adapt features and utilize the meta-learner's full potential. Here, we demonstrate the effectiveness of hypernetworks in this context. We propose a soft row-sharing hypernetwork architecture and show that training the hypernetwork with a variant of MAML is tightly linked to meta-learning a curvature matrix used to condition gradients during fast adaptation. We achieve results similar to state-of-the-art model-agnostic methods in the overparametrized case, while outperforming many MAML variants without using different optimization schemes in the compressive regime. Furthermore, we empirically show that hypernetworks do leverage the inner loop optimization for better adaptation, and analyse how they naturally try to learn the shared curvature of constructed tasks on a toy problem when using our proposed training algorithm.

en_US

dc.format

application/pdf

en_US

dc.language.iso

en

en_US

dc.publisher

NeurIPS

en_US

dc.rights.uri

http://rightsstatements.org/page/InC-NC/1.0/

dc.title

Meta-Learning via Hypernetworks

en_US

dc.type

Conference Paper

dc.rights.license

In Copyright - Non-Commercial Use Permitted

ethz.size

11 p.

en_US

ethz.version.deposit

publishedVersion

en_US

ethz.event

4th Workshop on Meta-Learning at NeurIPS 2020 (MetaLearn 2020)

en_US

ethz.event.location

Online

en_US

ethz.event.date

December 11, 2020

en_US

ethz.notes

Due to the Coronavirus (COVID-19), the conference was conducted virtually. Accepted version replaced with published version. Number of authors and author order have been changed.

en_US

ethz.grant

Probabilistic learning in deep cortical networks

en_US

ethz.publication.place

s.l.

en_US

ethz.publication.status

published

en_US

ethz.leitzahl

ETH Zürich::00002 - ETH Zürich::00012 - Lehre und Forschung::00007 - Departemente::02140 - Dep. Inf.technologie und Elektrotechnik / Dep. of Inform.Technol. Electrical Eng.::02533 - Institut für Neuroinformatik / Institute of Neuroinformatics::09479 - Grewe, Benjamin / Grewe, Benjamin

en_US

ethz.leitzahl.certified

ETH Zürich::00002 - ETH Zürich::00012 - Lehre und Forschung::00007 - Departemente::02140 - Dep. Inf.technologie und Elektrotechnik / Dep. of Inform.Technol. Electrical Eng.::02533 - Institut für Neuroinformatik / Institute of Neuroinformatics::09479 - Grewe, Benjamin / Grewe, Benjamin

en_US

ethz.grant.agreementno

186027

ethz.grant.fundername

SNF

ethz.grant.funderDoi

10.13039/501100001711

ethz.grant.program

Ambizione

ethz.date.deposited

2021-01-27T08:08:45Z

ethz.source

FORM

ethz.eth

yes

en_US

ethz.availability

Open access

en_US

ethz.rosetta.installDate

2021-01-27T08:28:25Z

ethz.rosetta.lastUpdated

2023-02-07T00:47:37Z

ethz.rosetta.exportRequired

true

ethz.rosetta.versionExported

true

ethz.COinS

ctx_ver=Z39.88-2004&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.atitle=Meta-Learning%20via%20Hypernetworks&rft.date=2020-12&rft.au=Zhao,%20Dominic&Kobayashi,%20Seijin&Sacramento,%20Jo%C3%A3o&von%20Oswald,%20Johannes&rft.genre=proceeding&rft.btitle=Meta-Learning%20via%20Hypernetworks

Files in this item

Name: Meta-LearningviaHypernetworks.pdf
Size: 321.6Kb
Format: Adobe PDF
Label: Full text (published version)
