Suchita Pati

Cited by

	All	Since 2020
Citations	203	193
h-index	6	6
i10-index	6	6

201820192020202120222023202420251 9 18 19 27 43 81 5

Co-authors

Matthew D. SinclairUniversity of Wisconsin-MadisonVerified email at cs.wisc.edu
Shaizeen AgaAMD ResearchVerified email at amd.com
Mahzabeen IslamAMD ResearchVerified email at my.unt.edu
Negar GoliSenior ML/HPC architect Eng. @ AMD - M.Sc. from UBCVerified email at ece.ubc.ca
Timothy G. RogersPurdue UniversityVerified email at purdue.edu
Mengchi ZhangResearch Scientist, MetaVerified email at meta.com
Tor M. AamodtProfessor, Electrical and Computer Engineering, University of British ColumbiaVerified email at ece.ubc.ca
Amruth SandhupatlaUniversity of British ColumbiaVerified email at ece.ubc.ca
Mohamed Assem IbrahimAMD Research and Advanced Development (RAD)Verified email at amd.com
Kanishka LahiriAdvanced Micro DevicesVerified email at amd.com

Suchita Pati

AMD Research, University of Wisconsin, Madison

Verified email at cs.wisc.edu - Homepage

Systems for ML Computer Architecture


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Analyzing machine learning workloads using a detailed GPU simulator J Lew, DA Shah, S Pati, S Cattell, M Zhang, A Sandhupatla, C Ng, N Goli, ... 2019 IEEE international symposium on performance analysis of systems and …, 2019	92	2019
Demystifying bert: System design implications S Pati, S Aga, N Jayasena, MD Sinclair 2022 IEEE International Symposium on Workload Characterization (IISWC), 296-309, 2022	29	2022
SeqPoint: Identifying representative iterations of sequence-based neural networks S Pati, S Aga, MD Sinclair, N Jayasena 2020 IEEE International Symposium on Performance Analysis of Systems and …, 2020	17	2020
Demystifying bert: Implications for accelerator design S Pati, S Aga, N Jayasena, MD Sinclair arXiv preprint arXiv:2104.08335, 2021	15	2021
T3: Transparent Tracking & Triggering for Fine-grained Overlap of Compute & Collectives S Pati, S Aga, M Islam, N Jayasena, MD Sinclair Proceedings of the 29th ACM International Conference on Architectural …, 2024	12	2024
Tale of Two Cs: Computation vs. Communication Scaling for Future Transformers on Future Hardware S Pati, S Aga, M Islam, N Jayasena, MD Sinclair 2023 IEEE International Symposium on Workload Characterization (IISWC), 140-153, 2023	12	2023
JIT-Q: Just-in-time Quantization with Processing-In-Memory for Efficient ML Training M Ibrahim, S Aga, A Li, S Pati, M Islam Proceedings of Machine Learning and Systems 6, 46-59, 2024	6*	2024
Computation vs. communication scaling for future transformers on future hardware S Pati, S Aga, M Islam, N Jayasena, MD Sinclair arXiv preprint arXiv:2302.02825, 2023	6	2023
Improving GPU Utilization in ML Workloads Through Finer-Grained Synchronization R Kuper, S Pati, MD Sinclair 3rd Young Architects Workshop, 2021	5	2021
Darts: Performance-counter driven sampling using binary translators R Kumar, S Pati, K Lahiri 2017 IEEE International Symposium on Performance Analysis of Systems and …, 2017	4	2017
Analyzing Machine Learning Workloads Using a Detailed GPU Simulator. CoRR abs/1811.08933 (2018) J Lew, D Shah, S Pati, S Cattell, M Zhang, A Sandhupatla, C Ng, N Goli, ... arXiv preprint arXiv:1811.08933, 2018	3	2018
Global Optimizations & Lightweight Dynamic Logic for Concurrency S Pati, S Aga, N Jayasena, M Sinclair https://arxiv.org/pdf/2409.02227, 2024	1	2024
Exploring GPU Architectural Optimizations for RNNs S Pati Young Architect Workshop (YArch), in conjunction with HPCA'19, 2019	1	2019
Optimizing ML Concurrent Computation and Communication with GPU DMA Engines A Agrawal, S Aga, S Pati, M Islam arXiv preprint arXiv:2412.14335, 2024		2024
Dynamic control of work scheduling S Pati, AGA Shaizeen, N Jayasena, MD Sinclair US Patent App. 18/091,443, 2024		2024
Fused Data Generation and Associated Communication SD Aga, S Pati, NS Jayasena US Patent App. 18/190,620, 2024		2024
Cross-Stack Optimizations for Sequence-Based Models on GPUs S Pati https://www.proquest.com/docview/3054333891, 2024		2024
IISWC 2024 A Jog, A Hankin, A Samajdar, A Putnam, A Shriraman, A Mishra, B Asgari, ...
Effective Prefetching for Multicore/Multiprocessor Systems S Pati, P Mahapatra
Transparent Compression for Flash SSDs S Pati, Y Trivedi

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors