Stitching Weight-Shared Deep Neural Networks for Efficient Multitask Inference on GPU

Intelligent personal and home applications demand multiple deep neural networks (DNNs) running on resource-constrained platforms for compound inference tasks, known as multitask inference. To fit multiple DNNs into low-resource devices, emerging techniques resort to weight sharing among DNNs to red Show more

Publication status

published

External links

https://doi.org/10.1109/SECON55815.2022.9918563

Book title

2022 19th Annual IEEE International Conference on Sensing, Communication, and Networking (SECON)

Pages / Article No.

145 - 153

Publisher

IEEE

Event

19th IEEE International Conference on Sensing, Communication, and Networking (SECON 2022), September 20-23, 2022

Subject

Deep Neural Networks; Multitask Inference; Model Acceleration

Organisational unit

03429 - Thiele, Lothar (emeritus) / Thiele, Lothar (emeritus)

More

Show all metadata

ETH Bibliography

yes

Altmetrics

Research Collection

Search

Stitching Weight-Shared Deep Neural Networks for Efficient Multitask Inference on GPU Mendeley CSV RIS BibTeX

Stitching Weight-Shared Deep Neural Networks for Efficient Multitask Inference on GPU

Mendeley

CSV

RIS

BibTeX