Toward a holistic performance evaluation of large language models across diverse ai accelerators M Emani, S Foreman, V Sastry, Z Xie, S Raskar, W Arnold, R Thakur, ... 2024 IEEE International Parallel and Distributed Processing Symposium …, 2024 | 3 | 2024 |
Impacts of floating-point non-associativity on reproducibility for HPC and deep learning applications S Shanmugavelu, M Taillefumier, C Culver, O Hernandez, M Coletti, ... SC24-W: Workshops of the International Conference for High Performance …, 2024 | 1 | 2024 |
Scientific Computing with Large Language Models C Culver, P Hicks, M Milenkovic, S Shanmugavelu, T Becker arXiv preprint arXiv:2406.07259, 2024 | 1 | 2024 |
Exploring the Use of Dataflow Architectures for Graph Neural Network Workloads R Hosseini, F Simini, V Vishwanath, R Sivakumar, S Shanmugavelu, ... International Conference on High Performance Computing, 648-661, 2023 | 1 | 2023 |
WActiGrad: Structured Pruning for Efficient Finetuning and Inference of Large Language Models on AI Accelerators KT Chitty-Venkata, VK Sastry, M Emani, V Vishwanath, S Shanmugavelu, ... European Conference on Parallel Processing, 317-331, 2024 | | 2024 |