Amey Agrawal

Zitiert von

	Alle	Seit 2020
Zitate	274	272
h-index	6	6
i10-index	5	5

200

100

150

20192020202120222023202420252 2 11 7 28 196 28

Koautoren

Ramachandran RamjeeMicrosoft Research IndiaBestätigte E-Mail-Adresse bei microsoft.com
Alexey TumanovGeorgia Institute of TechnologyBestätigte E-Mail-Adresse bei gatech.edu
Jayashree MohanMicrosoft Research IndiaBestätigte E-Mail-Adresse bei microsoft.com
Nipun KwatraComputer Science, Stanford UniversityBestätigte E-Mail-Adresse bei graphics.stanford.edu
Bhargav GulavaniMicrosoftBestätigte E-Mail-Adresse bei microsoft.com
Ashish PanwarSenior Researcher, Microsoft Research IndiaBestätigte E-Mail-Adresse bei microsoft.com
Nitin KediaResearch Fellow, Microsoft Research IndiaBestätigte E-Mail-Adresse bei microsoft.com
Anmol AgarwalGeorgia Institute of TechnologyBestätigte E-Mail-Adresse bei gatech.edu
Satwik BhattamishraUniversity of OxfordBestätigte E-Mail-Adresse bei cs.ox.ac.uk
Kexin RongSchool of Computer Science, Georgia Institute of TechnologyBestätigte E-Mail-Adresse bei gatech.edu
Vidushi VashishthGeorgia Institute of TechnologyBestätigte E-Mail-Adresse bei gatech.edu
Sameer ReddyBestätigte E-Mail-Adresse bei cisco.com
Íñigo GoiriResearch Software Developer, Azure Systems ResearchBestätigte E-Mail-Adresse bei microsoft.com
Chaojie ZhangMicrosoftBestätigte E-Mail-Adresse bei microsoft.com
Esha ChoukseMicrosoft ResearchBestätigte E-Mail-Adresse bei utexas.edu
Muthian SivathanuMicrosoft Research IndiaBestätigte E-Mail-Adresse bei cs.wisc.edu
Srinidhi ViswanathaBestätigte E-Mail-Adresse bei microsoft.com

Folgen

Amey Agrawal

PhD Student at Georgia Tech

Bestätigte E-Mail-Adresse bei gatech.edu - Startseite

Systems for AI


Titel Nach Zitationen sortieren Nach Jahr sortieren Nach Titel sortieren	Zitiert von Zitiert von	Jahr
Taming {Throughput-Latency} Tradeoff in {LLM} Inference with {Sarathi-Serve} A Agrawal, N Kedia, A Panwar, J Mohan, N Kwatra, B Gulavani, ... 18th USENIX Symposium on Operating Systems Design and Implementation (OSDI …, 2024	91	2024
Sarathi: Efficient llm inference by piggybacking decodes with chunked prefills A Agrawal, A Panwar, J Mohan, N Kwatra, BS Gulavani, R Ramjee arXiv preprint arXiv:2308.16369, 2023	79	2023
Singularity: Planet-scale, preemptive and elastic scheduling of AI workloads D Shukla, M Sivathanu, S Viswanatha, B Gulavani, R Nehme, A Agrawal, ... arXiv preprint arXiv:2202.07848, 2022	33	2022
Logan: A distributed online log parser A Agrawal, R Karlupia, R Gupta 2019 IEEE 35th International Conference on Data Engineering (ICDE), 1946-1951, 2019	30	2019
Vidur: A Large-Scale Simulation Framework For LLM Inference A Agrawal, N Kedia, J Mohan, A Panwar, N Kwatra, B Gulavani, ... Proceedings of Machine Learning and Systems 6, 351-366, 2024	22	2024
Delog: A high-performance privacy preserving log filtering framework A Agrawal, A Dixit, NA Shettar, D Kapadia, V Agrawal, R Gupta, ... 2019 IEEE International Conference on Big Data (Big Data), 1739-1748, 2019	8*	2019
Etalon: Holistic Performance Evaluation Framework for LLM Inference Systems A Agrawal, A Agarwal, N Kedia, J Mohan, S Kundu, N Kwatra, R Ramjee, ... arXiv preprint arXiv:2407.07000, 2024	5	2024
Inshrinkerator: Compressing Deep Learning Training Checkpoints via Dynamic Quantization A Agrawal, S Reddy, S Bhattamishra, VPS Nookala, V Vashishth, K Rong, ... Proceedings of the 2024 ACM Symposium on Cloud Computing, 1012-1031, 2024	3*	2024
Mnemosyne: Parallelization strategies for efficiently serving multi-million context length llm inference requests without approximations A Agrawal, J Chen, Í Goiri, R Ramjee, C Zhang, A Tumanov, E Choukse arXiv preprint arXiv:2409.17264, 2024	3	2024
Elastically managing workers of multi-worker workloads on accelerator devices M Sivathanu, S Viswanatha, B Gulavani, DK Shukla, RV Nehme, ... US Patent App. 17/855,722, 2023		2023
Learning Digital Circuits: A Journey Through Weight Invariant Self-Pruning Neural Networks A Agrawal, R Karlupia arXiv preprint arXiv:1909.00052, 2019		2019

Das System kann den Vorgang jetzt nicht ausführen. Versuchen Sie es später erneut.

Artikel 1–11

Zitate pro Jahr

Doppelte Zitate

Zusammengeführte Zitate

Koautor hinzufügenKoautoren

Folgen

Zitiert von

Koautoren