Review of state-of-the-art FPGA applications in IoT Networks

A Magyari, Y Chen - Sensors, 2022 - mdpi.com
Modern networks used for integrating custom Internet of Things (IoT) systems and devices
have restrictions and requirements unique to their individual applications. These application …

FpgaNIC: An FPGA-based versatile 100Gb SmartNIC for GPUs

Z Wang, H Huang, J Zhang, F Wu… - 2022 USENIX Annual …, 2022 - usenix.org
Given that the increasing rate of network bandwidth is far ahead of that of the compute
capacity of host CPU, which by default processes network packets, SmartNIC has been …

Co-design hardware and algorithm for vector search

W Jiang, S Li, Y Zhu, J de Fine Licht, Z He… - Proceedings of the …, 2023 - dl.acm.org
Vector search has emerged as the foundation for large-scale information retrieval and
machine learning systems, with search engines like Google and Bing processing tens of …

Fleetrec: Large-scale recommendation inference on hybrid gpu-fpga clusters

W Jiang, Z He, S Zhang, K Zeng, L Feng… - Proceedings of the 27th …, 2021 - dl.acm.org
We present FleetRec, a high-performance and scalable recommendation inference system
within tight latency constraints. FleetRec takes advantage of heterogeneous hardware …

Smartfuse: Reconfigurable smart switches to accelerate fused collectives in hpc applications

P Haghi, C Tan, A Guo, C Wu, D Liu, A Li… - Proceedings of the 38th …, 2024 - dl.acm.org
Communication switches have sometimes been augmented to process collectives, e.g., in the
IBM BlueGene and Mellanox SHArP switches. In this work, we find that there is a great …

ACCL+: an FPGA-Based Collective Engine for Distributed Applications

Z He, D Korolija, Y Zhu, B Ramhorst, T Laan… - … USENIX Symposium on …, 2024 - usenix.org
FPGAs are increasingly prevalent in cloud deployments, serving as Smart-NICs or network-
attached accelerators. To facilitate the development of distributed applications with FPGAs …

Skt: A one-pass multi-sketch data analytics accelerator

M Chiosa, TB Preußer… - Proceedings of the …, 2021 - research-collection.ethz.ch
Data analysts often need to characterize a data stream as a first step to its further
processing. Some of the initial insights to be gained include, e.g., the cardinality of the data …

Flagger: Cooperative acceleration for large-scale cross-silo federated learning aggregation

X Pan, Y An, S Liang, B Mao, M Zhang… - 2024 ACM/IEEE 51st …, 2024 - ieeexplore.ieee.org
Cross-silo federated learning (FL) leverages homomorphic encryption (HE) to obscure the
model updates from the clients. However, HE poses the challenges of complex …

Data processing with fpgas on modern architectures

W Jiang, D Korolija, G Alonso - … of the 2023 International Conference on …, 2023 - dl.acm.org
Trends in hardware, the prevalence of the cloud, and the rise of highly demanding
applications have ushered an era of specialization that is quickly changing the way data is …

Distributed recommendation inference on fpga clusters

Y Zhu, Z He, W Jiang, K Zeng, J Zhou… - 2021 31st International …, 2021 - ieeexplore.ieee.org
Deep neural networks are widely used in personalized recommendation systems. Such
models involve two major components: the memory-bound embedding layer and the …