AWB-GCN: A graph convolutional network accelerator with runtime workload rebalancing T Geng, A Li, R Shi, C Wu, T Wang, Y Li, P Haghi, A Tumeo, S Che, ... 2020 53rd Annual IEEE/ACM International Symposium on Microarchitecture …, 2020 | 305 | 2020 |
I-GCN: A graph convolutional network accelerator with runtime locality enhancement through islandization T Geng, C Wu, Y Zhang, C Tan, C Xie, H You, M Herbordt, Y Lin, A Li MICRO-54: 54th Annual IEEE/ACM International Symposium on Microarchitecture …, 2021 | 125 | 2021 |
Fully integrated FPGA molecular dynamics simulations C Yang, T Geng, T Wang, R Patel, Q Xiong, A Sanaullah, C Wu, J Sheng, ... Proceedings of the International Conference for High Performance Computing …, 2019 | 64 | 2019 |
LP-BNN: Ultra-low-latency BNN inference with layer parallelism T Geng, T Wang, C Wu, C Yang, SL Song, A Li, M Herbordt 2019 IEEE 30th International Conference on Application-specific Systems …, 2019 | 49 | 2019 |
O3BNN-R: An out-of-order architecture for high-performance and regularized BNN inference T Geng, A Li, T Wang, C Wu, Y Li, R Shi, W Wu, M Herbordt IEEE Transactions on parallel and distributed systems 32 (1), 199-213, 2020 | 47 | 2020 |
O3BNN: An out-of-order architecture for high-performance binarized neural network inference with fine-grained pruning T Geng, T Wang, C Wu, C Yang, W Wu, A Li, MC Herbordt Proceedings of the ACM International Conference on Supercomputing, 461-472, 2019 | 31 | 2019 |
Software-hardware co-design of heterogeneous SmartNIC system for recommendation models inference and training A Guo, Y Hao, C Wu, P Haghi, Z Pan, M Si, D Tao, A Li, M Herbordt, ... Proceedings of the 37th International Conference on Supercomputing, 336-347, 2023 | 25 | 2023 |
A framework for neural network inference on FPGA-centric SmartNICs A Guo, T Geng, Y Zhang, P Haghi, C Wu, C Tan, Y Lin, A Li, M Herbordt 2022 32nd International Conference on Field-Programmable Logic and …, 2022 | 19 | 2022 |
CQNN: a CGRA-based QNN framework T Geng, C Wu, C Tan, B Fang, A Li, M Herbordt 2020 IEEE High Performance Extreme Computing Conference (HPEC), 1-7, 2020 | 19 | 2020 |
Upgrade of FPGA range-limited molecular dynamics to handle hundreds of processors C Wu, T Geng, S Bandara, C Yang, V Sachdeva, W Sherman, M Herbordt 2021 IEEE 29th Annual International Symposium on Field-Programmable Custom …, 2021 | 17 | 2021 |
A survey: Handling irregularities in neural network acceleration with FPGAs T Geng, C Wu, C Tan, C Xie, A Guo, P Haghi, SY He, J Li, M Herbordt, ... 2021 IEEE High Performance Extreme Computing Conference (HPEC), 1-8, 2021 | 16 | 2021 |
FASDA: An FPGA-aided, scalable and distributed accelerator for range-limited molecular dynamics C Wu, T Geng, A Guo, S Bandara, P Haghi, C Liu, A Li, M Herbordt Proceedings of the International Conference for High Performance Computing …, 2023 | 15 | 2023 |
FLASH: FPGA-accelerated smart switches with GCN case study P Haghi, W Krska, C Tan, T Geng, PH Chen, C Greenwood, A Guo, ... Proceedings of the 37th International Conference on Supercomputing, 450-462, 2023 | 14 | 2023 |
A communication-efficient multi-chip design for range-limited molecular dynamics C Wu, T Geng, C Yang, V Sachdeva, W Sherman, M Herbordt 2020 IEEE High Performance Extreme Computing Conference (HPEC), 1-8, 2020 | 14 | 2020 |
System-level modeling of GPU/FPGA clusters for molecular dynamics simulations C Wu, S Bandara, T Geng, V Sachdeva, W Sherman, M Herbordt 2021 IEEE High Performance Extreme Computing Conference (HPEC), 1-8, 2021 | 13 | 2021 |
FCsN: A FPGA-Centric SmartNIC Framework for Neural Networks A Guo, T Geng, Y Zhang, P Haghi, C Wu, C Tan, Y Lin, A Li, M Herbordt 2022 IEEE 30th Annual International Symposium on Field-Programmable Custom …, 2022 | 11 | 2022 |
UWB-GCN: Hardware acceleration of graph-convolution-network through runtime workload rebalancing T Geng, A Li, T Wang, C Wu, Y Li, A Tumeo, M Herbordt arXiv preprint arXiv:1908.10834, 2019 | 11 | 2019 |
Optimized mappings for symmetric range-limited molecular force calculations on FPGAs C Wu, S Bandara, T Geng, A Guo, P Haghi, V Sachdeva, W Sherman, ... 2022 32nd International Conference on Field-Programmable Logic and …, 2022 | 10 | 2022 |
Software-hardware co-design of heterogeneous SmartNIC system for recommendation models inference and training A Guo, Y Hao, C Wu, P Haghi, Z Pan, M Si, ..., T Geng Proceedings of the 37th International Conference on Supercomputing, 336-347, 2023 | 10 | 2023 |
SmartFuse: Reconfigurable smart switches to accelerate fused collectives in HPC applications P Haghi, C Tan, A Guo, C Wu, D Liu, A Li, A Skjellum, T Geng, M Herbordt Proceedings of the 38th ACM International Conference on Supercomputing, 413-425, 2024 | 6 | 2024 |