Optimizing Layer-Fused Scheduling of Transformer Networks on Multi-accelerator Platforms

The impact of transformer networks is booming, yet, they come with significant computational complexity. It is therefore essential to understand how to optimally map and execute these networks on modern neural processor hardware. So far, literature on transformer scheduling optimization has been fo Show more

Publication status

published

External links

https://doi.org/10.1109/ISQED60706.2024.10528689

Book title

2024 25th International Symposium on Quality Electronic Design (ISQED)

Pages / Article No.

10528689

Publisher

IEEE

Event

25th International Symposium on Quality Electronic Design (ISQED 2023), San Francisco, CA, USA, April 3-5, 2024

Subject

CNN; transformer networks; cross-layer; scheduling; hardware modeling and optimization

More

Show all metadata

ETH Bibliography

yes

Altmetrics

Research Collection

Search

Optimizing Layer-Fused Scheduling of Transformer Networks on Multi-accelerator Platforms Mendeley CSV RIS BibTeX

Optimizing Layer-Fused Scheduling of Transformer Networks on Multi-accelerator Platforms

Mendeley

CSV

RIS

BibTeX