Software-hardware co-design of heterogeneous SmartNIC system for recommendation models inference and training

A Guo, Y Hao, C Wu, P Haghi, Z Pan, M Si… - Proceedings of the 37th …, 2023 - dl.acm.org
Deep Learning Recommendation Models (DLRMs) are important applications in various
domains and have evolved into one of the largest and most important machine learning …

ACiS: smart switches with application-level acceleration

P Haghi - 2023 - search.proquest.com
Network performance has contributed fundamentally to the growth of supercomputing over
the past decades. In parallel, High Performance Computing (HPC) peak performance has …

ACiS: Complex Processing in the Switch Fabric

P Haghi, A Guo, T Geng, A Skjellum… - arXiv preprint arXiv …, 2025 - arxiv.org
For the last three decades a core use of FPGAs has been for processing communication:
FPGA-based SmartNICs are in widespread use from the datacenter to IoT. Augmenting …

Innovative Approaches for Network Analysis and Optimization: Leveraging Deep Learning and Programmable Hardware

AG de Castro, CE Rothenberg - 2024 IEEE 10th International …, 2024 - ieeexplore.ieee.org
Network demand for real-time applications like self-driving cars and cloud gaming strains
existing networks. Latency and congestion hurt user experience. Realistic testing is vital to …

Flexible communication primitives for diverse deployment scenarios of hardware operating systems for FPGAs

Z Tahir - 2025 - search.proquest.com
Communication capabilities of FPGAs, combined with programmability in hardware
(reconfigurable logic) and software (soft-processors), often provide FPGAs a competitive …

Component design for application-directed FPGA system generation frameworks

SL Bandara - 2024 - search.proquest.com
Field Programmable Gate Arrays (FPGAs) can fulfill many critical and contrasting
roles in modern computing due to their combination of powerful computing and …

Optimizing the optimizer: increasing performance efficiency of modern compilers

H Shahzad - 2025 - search.proquest.com
A long-standing goal, which is increasingly important in the post-Moore era, is to augment
system performance by building more intelligent compilers. One of our motivating …

Software and hardware codesign of SmartNIC-based heterogeneous HPC clusters with machine learning case studies

A Guo - 2024 - search.proquest.com
Machine learning has evolved significantly recently and has penetrated every
aspect of science, technology, and daily life. As application prediction demands higher …

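Industry development and trend analysis of SmartNICs in the context of cloud-network convergence

赵静, 陈元谋 - Information and Communications Technology and Policy, 2022 - ictp.caict.ac.cn
Cloud-network convergence relies on the aggregation capability of the cloud to coordinate, integrate, and innovate across multiple technical elements, making it the best choice for the scenario-specific needs
of vertical industries. Massive data growth, frequent data exchange, and the slowing of Moore's Law all impose on infrastructure performance and cost higher …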

FPGA-based range-limited molecular dynamics acceleration

C Wu - 2023 - search.proquest.com
Molecular Dynamics (MD) is a computer simulation technique that executes iteratively over
discrete, infinitesimal time intervals. It has been a widely utilized application in the fields of …