dc.contributor.author
Li, Yawei
dc.contributor.supervisor
Van Gool, Luc
dc.contributor.supervisor
Brox, Thomas
dc.contributor.supervisor
Yang, Ming-Hsuan
dc.contributor.supervisor
Timofte, Radu
dc.date.accessioned
2022-04-01T11:50:44Z
dc.date.available
2022-04-01T10:58:03Z
dc.date.available
2022-04-01T11:50:44Z
dc.date.issued
2022
dc.identifier.uri
http://hdl.handle.net/20.500.11850/540498
dc.identifier.doi
10.3929/ethz-b-000540498
dc.description.abstract
Computational efficiency is an essential factor that influences the applicability of computer vision algorithms. Although deep neural networks have achieved state-of-the-art performance on a variety of computer vision tasks, deep learning based solutions suffer from several efficiency-related problems. First, the over-parameterization of deep neural networks results in models with millions of parameters, which lowers the parameter efficiency of the designed networks; storing the parameters and intermediate feature maps during computation requires a large device memory footprint. Secondly, the massive computation in deep neural networks slows down their training and inference, which limits their application in latency-critical scenarios and on low-end devices. Thirdly, the massive computation consumes a significant amount of energy, leaving deep learning models with a large carbon footprint. The aim of this thesis is to improve the computational efficiency of current deep neural networks. The problem is tackled from three perspectives: neural network compression, neural architecture optimization, and computational procedure optimization.
In the first part of the thesis, we reduce the model complexity of neural networks with network compression techniques, namely filter decomposition and filter pruning. The basic assumption behind filter decomposition is that the ensemble of filters in a deep neural network constitutes an overcomplete set. Instead of using the original filters directly during computation, they can be approximated by linear combinations of a set of basis filters. The contribution of this thesis is a unified analysis of previous filter decomposition methods. In addition, a differentiable filter pruning method is proposed. To achieve differentiability, the layers of the neural network are reparameterized by a meta network, and sparsity regularization is applied to the inputs of the meta network, i.e., the latent vectors. Optimizing with the introduced regularization leads to an automatic network pruning method. Additionally, a joint analysis of filter decomposition and filter pruning is presented from the perspective of compact tensor approximation. The link between the two techniques is the introduced sparsity-inducing matrix: by simply changing how group sparsity regularization is enforced on this matrix, either technique can be derived.
Secondly, we improve a baseline network with a fine-grained neural architecture optimization method. Different from network compression methods, the aim here is to improve the prediction accuracy of neural networks while reducing their model complexity at the same time; achieving both targets simultaneously makes the problem more challenging. In addition, a nearly cost-free constraint is enforced during the architecture optimization, which distinguishes this method from current neural architecture search methods with their bulky computation and can be regarded as another efficiency-improving technique.
Thirdly, we optimize the computational procedure of graph neural networks. By mathematically analyzing the operations in graph neural networks, two methods are proposed to improve computational efficiency: the first simplifies neighbor querying in graph neural networks, while the second swaps the order of the graph feature gathering and feature extraction operations.
To summarize, this thesis contributes to multiple aspects of improving the computational efficiency of neural networks during the optimization, training, and test phases.
en_US
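
The filter decomposition idea mentioned in the abstract (approximating a layer's filters by linear combinations of a smaller set of basis filters) can be illustrated with a minimal PyTorch sketch. This is not the thesis's code; the class name, sizes, and the choice to realize the coefficients as a 1x1 convolution are assumptions made for illustration only.

```python
# Illustrative sketch only: replace one k x k convolution with a k x k
# convolution onto `num_basis` shared basis responses, followed by a 1 x 1
# convolution whose weights are the linear-combination coefficients.
import torch
import torch.nn as nn

class BasisDecomposedConv(nn.Module):
    def __init__(self, in_channels, out_channels, kernel_size, num_basis):
        super().__init__()
        # Shared basis filters over the input channels.
        self.basis = nn.Conv2d(in_channels, num_basis, kernel_size,
                               padding=kernel_size // 2, bias=False)
        # Each original filter is recovered as a linear combination of the
        # basis responses, which is exactly a 1 x 1 convolution.
        self.coeff = nn.Conv2d(num_basis, out_channels, kernel_size=1, bias=False)

    def forward(self, x):
        return self.coeff(self.basis(x))

# Parameters drop from out*in*k*k to num_basis*(in*k*k + out) when num_basis is small.
conv = BasisDecomposedConv(in_channels=64, out_channels=64, kernel_size=3, num_basis=16)
y = conv(torch.randn(1, 64, 32, 32))
print(y.shape)  # torch.Size([1, 64, 32, 32])
```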
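
The differentiable pruning idea (a meta network generates layer weights from latent vectors, and sparsity regularization on the latent vectors drives filters to zero) can likewise be sketched loosely. This is a simplified, hypothetical rendering of the concept, not the thesis's exact formulation; the module name, latent size, and explicit per-filter scaling are assumptions.

```python
# Illustrative sketch only: a tiny meta network (hypernetwork) maps a per-filter
# latent vector to the layer's weights; an L1 penalty on the latent vector pushes
# entries toward zero, and zeroed entries correspond to prunable filters.
import torch
import torch.nn as nn
import torch.nn.functional as F

class HyperConv(nn.Module):
    def __init__(self, in_channels, out_channels, kernel_size):
        super().__init__()
        self.shape = (out_channels, in_channels, kernel_size, kernel_size)
        # One latent entry per output filter.
        self.latent = nn.Parameter(torch.ones(out_channels))
        # Meta network generating the full weight tensor from the latent vector.
        self.meta = nn.Linear(out_channels,
                              out_channels * in_channels * kernel_size * kernel_size)

    def forward(self, x):
        weight = self.meta(self.latent).view(self.shape)
        # Scale each generated filter by its latent entry so that a zero latent
        # entry removes the filter's contribution (hence the filter is prunable).
        weight = weight * self.latent.view(-1, 1, 1, 1)
        return F.conv2d(x, weight, padding=self.shape[-1] // 2)

layer = HyperConv(16, 32, 3)
out = layer(torch.randn(1, 16, 8, 8))
# Sparsity regularization on the latent vector, added to the task loss.
reg = 1e-3 * layer.latent.abs().sum()
```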
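
The third part's reordering of graph feature gathering and feature extraction rests on the fact that a shared linear transform commutes with per-edge gathering, so transforming node features once before gathering avoids repeating the transform for every neighbor. The sketch below uses assumed shapes and variable names purely for illustration.

```python
# Illustrative sketch only: gathering then transforming (per-edge work) versus
# transforming then gathering (per-node work) gives the same messages for a
# shared linear feature extraction, but the second ordering does less compute
# when there are more edges than nodes.
import torch
import torch.nn as nn

num_nodes, num_edges, d_in, d_out = 1000, 5000, 64, 32
x = torch.randn(num_nodes, d_in)
# edge_index[0]: source node of each edge, edge_index[1]: target node.
edge_index = torch.randint(0, num_nodes, (2, num_edges))
lin = nn.Linear(d_in, d_out, bias=False)

# Order A: gather per-edge features first, then extract features per edge.
msg_a = lin(x[edge_index[0]])          # transform applied num_edges times

# Order B: extract features once per node, then gather.
msg_b = lin(x)[edge_index[0]]          # transform applied num_nodes times

print(torch.allclose(msg_a, msg_b, atol=1e-5))  # True: identical messages
```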
dc.format
application/pdf
en_US
dc.language.iso
en
en_US
dc.publisher
ETH Zurich
en_US
dc.rights.uri
http://rightsstatements.org/page/InC-NC/1.0/
dc.subject
Deep neural networks (DNNs)
en_US
dc.subject
Network pruning
en_US
dc.subject
Neural architecture search
en_US
dc.subject
Low-rank approximation
en_US
dc.subject
Graph Neural Networks (GNNs)
en_US
dc.subject
Hypernetworks
en_US
dc.subject
Network Acceleration
en_US
dc.subject
Image classification
en_US
dc.subject
Image restoration
en_US
dc.subject
Point cloud processing
en_US
dc.title
Towards Efficient Deep Neural Networks
en_US
dc.type
Doctoral Thesis
dc.rights.license
In Copyright - Non-Commercial Use Permitted
dc.date.published
2022-04-01
ethz.size
183 p.
en_US
ethz.code.ddc
DDC - DDC::0 - Computer science, information & general works::004 - Data processing, computer science
en_US
ethz.identifier.diss
28256
en_US
ethz.publication.place
Zurich
en_US
ethz.publication.status
published
en_US
ethz.leitzahl
ETH Zürich::00002 - ETH Zürich::00012 - Lehre und Forschung::00007 - Departemente::02140 - Dep. Inf.technologie und Elektrotechnik / Dep. of Inform.Technol. Electrical Eng.::02652 - Institut für Bildverarbeitung / Computer Vision Laboratory
en_US
ethz.leitzahl
ETH Zürich::00002 - ETH Zürich::00012 - Lehre und Forschung::00007 - Departemente::02140 - Dep. Inf.technologie und Elektrotechnik / Dep. of Inform.Technol. Electrical Eng.::02652 - Institut für Bildverarbeitung / Computer Vision Laboratory::03514 - Van Gool, Luc (emeritus) / Van Gool, Luc (emeritus)
en_US
ethz.leitzahl.certified
ETH Zürich::00002 - ETH Zürich::00012 - Lehre und Forschung::00007 - Departemente::02140 - Dep. Inf.technologie und Elektrotechnik / Dep. of Inform.Technol. Electrical Eng.::02652 - Institut für Bildverarbeitung / Computer Vision Laboratory::03514 - Van Gool, Luc (emeritus) / Van Gool, Luc (emeritus)
en_US
ethz.date.deposited
2022-04-01T10:58:11Z
ethz.source
FORM
ethz.eth
yes
en_US
ethz.availability
Open access
en_US
ethz.rosetta.installDate
2022-04-01T11:50:53Z
ethz.rosetta.lastUpdated
2023-02-07T00:39:57Z
ethz.rosetta.exportRequired
true
ethz.rosetta.versionExported
true
ethz.COinS
ctx_ver=Z39.88-2004&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.atitle=Towards%20Efficient%20Deep%20Neural%20Networks&rft.date=2022&rft.au=Li,%20Yawei&rft.genre=unknown&rft.btitle=Towards%20Efficient%20Deep%20Neural%20Networks