AWB-GCN: A graph convolutional network accelerator with runtime workload rebalancing T Geng, A Li, R Shi, C Wu, T Wang, Y Li, P Haghi, A Tumeo, S Che, ... 2020 53rd Annual IEEE/ACM International Symposium on Microarchitecture …, 2020 | 306 | 2020 |
FPGAs in the network and novel communicator support accelerate MPI collectives P Haghi, A Guo, Q Xiong, R Patel, C Yang, T Geng, JT Broaddus, ... 2020 IEEE High Performance Extreme Computing Conference (HPEC), 1-10, 2020 | 29 | 2020 |
FP-AMG: FPGA-based acceleration framework for algebraic multigrid solvers P Haghi, T Geng, A Guo, T Wang, M Herbordt 2020 IEEE 28th Annual International Symposium on Field-Programmable Custom …, 2020 | 28 | 2020 |
A reconfigurable compute-in-the-network fpga assistant for high-level collective support with distributed matrix multiply case study P Haghi, A Guo, T Geng, J Broaddus, D Schafer, A Skjellum, M Herbordt 2020 International Conference on Field-Programmable Technology (ICFPT), 159-164, 2020 | 22 | 2020 |
Software-hardware co-design of heterogeneous SmartNIC system for recommendation models inference and training A Guo, Y Hao, C Wu, P Haghi, Z Pan, M Si, D Tao, A Li, M Herbordt, ... Proceedings of the 37th International Conference on Supercomputing, 336-347, 2023 | 21 | 2023 |
Accelerating MPI collectives with FPGAs in the network and novel communicator support Q Xiong, C Yang, P Haghi, A Skjellum, M Herbordt 2020 IEEE 28th Annual International Symposium on Field-Programmable Custom …, 2020 | 21 | 2020 |
Reconfigurable switches for high performance and flexible MPI collectives P Haghi, A Guo, Q Xiong, C Yang, T Geng, JT Broaddus, R Marshall, ... Concurrency and Computation: Practice and Experience 34 (6), e6769, 2022 | 19 | 2022 |
A framework for neural network inference on fpga-centric smartnics A Guo, T Geng, Y Zhang, P Haghi, C Wu, C Tan, Y Lin, A Li, M Herbordt 2022 32nd International Conference on Field-Programmable Logic and …, 2022 | 18 | 2022 |
Workload imbalance in hpc applications: Effect on performance of in-network processing P Haghi, A Guo, T Geng, A Skjellum, MC Herbordt 2021 IEEE High Performance Extreme Computing Conference (HPEC), 1-8, 2021 | 18 | 2021 |
A survey: Handling irregularities in neural network acceleration with fpgas T Geng, C Wu, C Tan, C Xie, A Guo, P Haghi, SY He, J Li, M Herbordt, ... 2021 IEEE High Performance Extreme Computing Conference (HPEC), 1-8, 2021 | 15 | 2021 |
FASDA: An FPGA-aided, scalable and distributed accelerator for range-limited molecular dynamics C Wu, T Geng, A Guo, S Bandara, P Haghi, C Liu, A Li, M Herbordt Proceedings of the International Conference for High Performance Computing …, 2023 | 14 | 2023 |
FLASH: FPGA-accelerated smart switches with GCN case study P Haghi, W Krska, C Tan, T Geng, PH Chen, C Greenwood, A Guo, ... Proceedings of the 37th International Conference on Supercomputing, 450-462, 2023 | 13 | 2023 |
& Geng, T.(2023, June). Software-hardware co-design of heterogeneous SmartNIC system for recommendation models inference and training A Guo, Y Hao, C Wu, P Haghi, Z Pan, M Si Proceedings of the 37th International Conference on Supercomputing, 336-347, 0 | 12 | |
FCsN: A FPGA-Centric SmartNIC Framework for Neural Networks A Guo, T Geng, Y Zhang, P Haghi, C Wu, C Tan, Y Lin, A Li, M Herbordt 2022 IEEE 30th Annual International Symposium on Field-Programmable Custom …, 2022 | 10 | 2022 |
Optimized mappings for symmetric range-limited molecular force calculations on FPGAs C Wu, S Bandara, T Geng, A Guo, P Haghi, V Sachdeva, W Sherman, ... 2022 32nd International Conference on Field-Programmable Logic and …, 2022 | 9 | 2022 |
Copa use case: Distributed secure joint computation R Patel, P Haghi, S Jain, A Kot, V Krishnan, M Varia, M Herbordt 2022 IEEE 30th Annual International Symposium on Field-Programmable Custom …, 2022 | 7 | 2022 |
SmartFuse: Reconfigurable Smart Switches to Accelerate Fused Collectives in HPC Applications P Haghi, C Tan, A Guo, C Wu, D Liu, A Li, A Skjellum, T Geng, M Herbordt Proceedings of the 38th ACM International Conference on Supercomputing, 413-425, 2024 | 6 | 2024 |
FPGA-Accelerated Range-Limited Molecular Dynamics C Wu, C Yang, S Bandara, T Geng, A Guo, P Haghi, A Li, M Herbordt IEEE Transactions on Computers, 2024 | 6 | 2024 |
Distributed hardware accelerated secure joint computation on the copa framework R Patel, P Haghi, S Jain, A Kot, V Krishnan, M Varia, M Herbord 2022 IEEE High Performance Extreme Computing Conference (HPEC), 1-7, 2022 | 6 | 2022 |
O⁴-DNN: A Hybrid DSP-LUT-Based Processing Unit With Operation Packing and Out-of-Order Execution for Efficient Realization of Convolutional Neural Networks on FPGA Devices P Haghi, M Kamal, A Afzali-Kusha, M Pedram IEEE Transactions on Circuits and Systems I: Regular Papers 67 (9), 3056-3069, 2020 | 6 | 2020 |