A comprehensive evaluation of novel AI accelerators for deep learning workloads M Emani, Z Xie, S Raskar, V Sastry, W Arnold, B Wilson, R Thakur, ... 2022 IEEE/ACM international workshop on performance modeling, benchmarking …, 2022 | 19 | 2022 |
A Comprehensive Performance Study of Large Language Models on Novel AI Accelerators M Emani, S Foreman, V Sastry, Z Xie, S Raskar, W Arnold, R Thakur, ... arXiv preprint arXiv:2310.04607, 2023 | 11 | 2023 |
Cross-Feature Transfer Learning for Efficient Tensor Program Generation G Verma, S Raskar, M Emani, B Chapman Applied Sciences 14 (2), 513, 2024 | 5 | 2024 |
DEMAC: A Modular Platform for HW-SW Co-Design DAR Perdomo, R Kabrick, JMM Diaz, S Raskar, D Fox, GR Gao 2020 IEEE/ACM Fourth Annual Workshop on Emerging Parallel and Distributed …, 2020 | 5 | 2020 |
Toward A High-Performance Emulation Platform for Brain-Inspired Intelligent Systems Exploring Dataflow-Based Execution Model and Beyond S Zeng, JMM Diaz, S Raskar 2019 IEEE 43rd Annual Computer Software and Applications Conference (COMPSAC …, 2019 | 5 | 2019 |
Position paper: Extending codelet model for dataflow software pipelining using software-hardware co-design S Raskar, T Applencourt, K Kumaran, G Gao 2019 IEEE 43rd Annual Computer Software and Applications Conference (COMPSAC …, 2019 | 5 | 2019 |
Toward a holistic performance evaluation of large language models across diverse ai accelerators M Emani, S Foreman, V Sastry, Z Xie, S Raskar, W Arnold, R Thakur, ... 2024 IEEE International Parallel and Distributed Processing Symposium …, 2024 | 3 | 2024 |
Thorough Characterization and Analysis of Large Transformer Model Training At-Scale S Cheng, JL Lin, M Emani, S Raskar, S Foreman, Z Xie, V Vishwanath, ... Proceedings of the ACM on Measurement and Analysis of Computing Systems 8 (1 …, 2024 | 2 | 2024 |
Implementation of Dataflow Software Pipelining for Codelet Model S Raskar, JM Monsalve Diaz, T Applencourt, K Kumaran, G Gao Proceedings of the 2023 ACM/SPEC International Conference on Performance …, 2023 | 2 | 2023 |
Transfer Learning Across Heterogeneous Features For Efficient Tensor Program Generation G Verma, S Raskar, Z Xie, AM Malik, M Emani, B Chapman Proceedings of the 2nd International Workshop on Extreme Heterogeneity …, 2023 | 2 | 2023 |
Throughput-oriented and Accuracy-aware DNN Training with BFloat16 on GPU Z Xie, S Raskar, M Emani 2022 IEEE International Parallel and Distributed Processing Symposium …, 2022 | 2 | 2022 |
Dataflow software pipelining for codelet model using hardware-software co-design S Raskar University of Delaware, 2021 | 2 | 2021 |
CODIR: towards an MLIR codelet model dialect R Kabrick, DAR Perdomo, S Raskar, JMM Diaz, D Fox, GR Gao 2020 IEEE/ACM Fourth Annual Workshop on Emerging Parallel and Distributed …, 2020 | 2 | 2020 |
Characterizing the Performance of Triangle Counting on Graphcore's IPU Architecture R Barik, S Raskar, M Emani, V Vishwanath Proceedings of the SC'23 Workshops of The International Conference on High …, 2023 | 1 | 2023 |
TrainBF: High-Performance DNN Training Engine Using BFloat16 on AI Accelerators Z Xie, S Raskar, M Emani, V Vishwanath European Conference on Parallel Processing, 458-473, 2023 | 1 | 2023 |
DEMAC and CODIR: A whole stack solution for a HW/SW co-design using an MLIR Codelet model dialect R Kabrick, D Roa, S Raskar, JMM Diaz, G Gao Univ. Delaware, Newark, DE, USA, Tech. Rep. CAPSL Technical Memo 136, 2020 | 1 | 2020 |
LLM-Inference-Bench: Inference Benchmarking of Large Language Models on AI Accelerators KT Chitty-Venkata, S Raskar, B Kale, F Ferdaus, A Tanikanti, K Raffenetti, ... SC24-W: Workshops of the International Conference for High Performance …, 2024 | | 2024 |
2023 AI Testbed Expeditions Report V Vishwanath, M Emani, V Sastry, W Arnold, R Thakur, V Taylor, I Foster, ... Argonne National Laboratory (ANL), Argonne, IL (United States). Argonne …, 2023 | | 2023 |
Towards Fault Tolerance and Resilience in the Sequential Codelet Model DAR Perdomo, RAH Guaitero, D Fox, H Yviquel, S Raskar, X Li, ... Latin American High Performance Computing Conference, 77-94, 2023 | | 2023 |
Codelet Pipe: Realization of Dataflow Software Pipelining for Extended Codelet Model S Raskar, T Applencourt, K Kumaran, GR Gao Proceedings of the 52nd International Conference on Parallel Processing …, 2023 | | 2023 |