Keshav Santhanam

Zitiert von

	Alle	Seit 2020
Zitate	7359	7326
h-index	14	13
i10-index	15	13

4000

2000

1000

3000

20202021202220232024202520 133 692 2173 3981 294

Öffentlicher Zugriff

Alle anzeigen

6 Artikel

0 Artikel

verfügbar

nicht verfügbar

Basierend auf Fördermandaten

Folgen

Keshav Santhanam

Stanford University

Bestätigte E-Mail-Adresse bei stanford.edu - Startseite


Titel Nach Zitationen sortieren Nach Jahr sortieren Nach Titel sortieren	Zitiert von Zitiert von	Jahr
On the opportunities and risks of foundation models R Bommasani, DA Hudson, E Adeli, R Altman, S Arora, S von Arx, ... arXiv preprint arXiv:2108.07258, 2021	4676	2021
Holistic evaluation of language models P Liang, R Bommasani, T Lee, D Tsipras, D Soylu, M Yasunaga, Y Zhang, ... arXiv preprint arXiv:2211.09110, 2022	1181	2022
Colbertv2: Effective and efficient retrieval via lightweight late interaction K Santhanam, O Khattab, J Saad-Falcon, C Potts, M Zaharia arXiv preprint arXiv:2112.01488, 2021	396	2021
{Heterogeneity-Aware} cluster scheduling policies for deep learning workloads D Narayanan, K Santhanam, F Kazhamiaka, A Phanishayee, M Zaharia 14th USENIX Symposium on Operating Systems Design and Implementation (OSDI …, 2020	257	2020
Demonstrate-search-predict: Composing retrieval and language models for knowledge-intensive nlp O Khattab, K Santhanam, XL Li, D Hall, P Liang, C Potts, M Zaharia arXiv preprint arXiv:2212.14024, 2022	219	2022
Dspy: Compiling declarative language model calls into self-improving pipelines O Khattab, A Singhvi, P Maheshwari, Z Zhang, K Santhanam, ... arXiv preprint arXiv:2310.03714, 2023	176	2023
On the opportunities and risks of foundation models. arXiv 2021 R Bommasani, DA Hudson, E Adeli, R Altman, S Arora, S von Arx, ... arXiv preprint arXiv:2108.07258, 2023	107	2023
On the opportunities and risks of foundation models (2021) R Bommasani, DA Hudson, E Adeli, R Altman, S Arora, S von Arx, ... arXiv preprint arXiv:2108.07258, 2022	74	2022
PLAID: an efficient engine for late interaction retrieval K Santhanam, O Khattab, C Potts, M Zaharia Proceedings of the 31st ACM International Conference on Information …, 2022	69	2022
Accelerating deep learning workloads through efficient multi-model execution D Narayanan, K Santhanam, A Phanishayee, M Zaharia NeurIPS Workshop on Systems for Machine Learning 20, 2018	64	2018
Analysis and exploitation of dynamic pricing in the public cloud for ml training D Narayanan, K Santhanam, F Kazhamiaka, A Phanishayee, M Zaharia VLDB DISPA Workshop 2020, 2020	38	2020
DSPy: Compiling Declarative Language Model Calls into State-of-the-Art Pipelines O Khattab, A Singhvi, P Maheshwari, Z Zhang, K Santhanam, S Haq, ... The Twelfth International Conference on Learning Representations, 2024	20	2024
ROLA: A New Distributed Transaction Protocol and Its Formal Analysis. S Liu, PC Ölveczky, K Santhanam, Q Wang, I Gupta, J Meseguer FASE, 77-93, 2018	19	2018
UDAPDR: unsupervised domain adaptation via LLM prompting and distillation of rerankers J Saad-Falcon, O Khattab, K Santhanam, R Florian, M Franz, S Roukos, ... arXiv preprint arXiv:2303.00807, 2023	15	2023
Accelerating model search with model batching D Narayanan, K Santhanam, M Zaharia 1st Conference on Systems and Machine Learning (SysML), SysML 18, 2018	11	2018
DistIR: An intermediate representation for optimizing distributed neural networks K Santhanam, S Krishna, R Tomioka, A Fitzgibbon, T Harris Proceedings of the 1st Workshop on Machine Learning and Systems, 15-23, 2021	9	2021
Cheaply estimating inference efficiency metrics for autoregressive transformer models D Narayanan, K Santhanam, P Henderson, R Bommasani, T Lee, ... Advances in Neural Information Processing Systems 36, 66518-66538, 2023	7	2023
Moving beyond downstream task accuracy for information retrieval benchmarking K Santhanam, J Saad-Falcon, M Franz, O Khattab, A Sil, R Florian, ... arXiv preprint arXiv:2212.01340, 2022	7	2022
ALTO: An Efficient Network Orchestrator for Compound AI Systems K Santhanam, D Raghavan, MS Rahman, T Venkatesh, N Kunjal, ... Proceedings of the 4th Workshop on Machine Learning and Systems, 117-125, 2024	6	2024
Cheaply evaluating inference efficiency metrics for autoregressive transformer APIs D Narayanan, K Santhanam, P Henderson, R Bommasani, T Lee, P Liang arXiv preprint arXiv:2305.02440, 2023	4	2023

Das System kann den Vorgang jetzt nicht ausführen. Versuchen Sie es später erneut.

Artikel 1–20

Zitate pro Jahr

Doppelte Zitate

Zusammengeführte Zitate

Koautor hinzufügenKoautoren

Folgen

Zitiert von