Metadata only
Date
2020-10Type
- Conference Paper
Abstract
Data processing and distributed querying workloads often involve a "scatter-gather"or "partition-aggregate"architectural pattern, whereby one application queries hundreds or even thousands of workers. Network communication is often a bottleneck in this pattern, especially when the compute task at each worker is small, such as for Web queries and interactive analytics. The network bottleneck can result in low throughput, high CPU utilization, and cause job completion time to increase by orders of magnitude. To overcome these inefficiencies, we explore hardware-offload of the scatter-gather primitive, whereby a smart NIC takes on the responsibility of sending out queries and collecting responses. We show that this approach not only virtually eliminates CPU usage, but with suitable scheduling of responses, it also speeds up scatter by allowing parallel queries, and gather by preventing throughput collapse due to excessive congestion. Besides response scheduling, we use a careful design at the NIC to limit FPGA resource usage: our approach uses about 25% of on-chip logic and 33% of on-chip memory on a mid-sized FPGA, leaving enough room for implementing other functions on the smart NIC. Show more
Publication status
publishedExternal links
Book title
Proceedings of the 11th ACM Symposium on Cloud Computing (SoCC '20)Pages / Article No.
Publisher
Association for Computing MachineryEvent
Subject
Data center networks; Hardware offload; FPGA; Scatter gather; Partition aggregateOrganisational unit
09484 - Singla, Ankit (ehemalig) / Singla, Ankit (former)
03506 - Alonso, Gustavo / Alonso, Gustavo
03506 - Alonso, Gustavo / Alonso, Gustavo
Notes
Due to the Coronavirus (COVID-19) the conference was conducted virtually.More
Show all metadata