Metadata only
Date
2021-04-13
Type
- Working Paper
ETH Bibliography
yes
Altmetrics
Abstract
Fitting a model into GPU memory during training is a growing concern as models continue to increase in size. Parameter sharing can reduce memory requirements, but existing methods only share parameters between identical layers, limiting their impact. This paper removes these restrictions with a novel task called Neural Parameter Allocation Search (NPAS), where the goal is to generate weights for a network using a given parameter budget. NPAS requires new techniques to morph the available parameters to fit any architecture. To address this new task we introduce Shapeshifter Networks (SSNs), which automatically learn where and how to share parameters between all layers in a network, even between layers of varying sizes and operations. SSNs do not require any loss function or architecture modifications, making them easy to use. We evaluate SSNs in key NPAS settings using seven network architectures across diverse tasks including image classification, bidirectional image-sentence retrieval, and phrase grounding, creating high-performing models even when using as little as 1% of the parameters.
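The core idea the abstract describes — generating each layer's weights from a single shared parameter budget, even when layers differ in shape — can be sketched as follows. This is an illustrative reconstruction, not the paper's method: the bank size, tiling scheme, and layer shapes are assumptions, and SSNs learn where and how to share rather than tiling naively.

```python
import numpy as np

def weights_from_bank(bank: np.ndarray, shape: tuple) -> np.ndarray:
    """Morph a shared 1-D parameter bank into a weight tensor of the
    requested shape, tiling the bank when a layer needs more values
    than the budget holds (a simple fixed sharing scheme; assumed
    here for illustration only)."""
    needed = int(np.prod(shape))
    reps = -(-needed // bank.size)            # ceil division
    return np.tile(bank, reps)[:needed].reshape(shape)

# A tiny "network" whose layers of differing sizes all draw weights
# from one small shared budget.
rng = np.random.default_rng(0)
bank = rng.standard_normal(100)               # shared parameter budget
layer_shapes = [(64, 32), (32, 32), (32, 10)]
layers = [weights_from_bank(bank, s) for s in layer_shapes]

total_virtual = sum(w.size for w in layers)
print(bank.size, total_virtual)               # 100 real vs. 3392 virtual weights
```

The point of the sketch is only that 100 stored parameters can back 3,392 weight slots across layers of unequal shape; the paper's contribution is learning the mapping instead of fixing it.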
Publication status
published
Journal / series
arXiv
Publisher
Cornell University
Organisational unit
03950 - Hoefler, Torsten / Hoefler, Torsten