Sobhan Miryoosefi

Cited by

	All	Since 2020
Citations	567	566
h-index	7	7
i10-index	7	7

Citations per year: 2020: 13, 2021: 58, 2022: 140, 2023: 139, 2024: 211, 2025: 4

Public access

1 article available, 0 articles not available (based on funding mandates)

Co-authors

Chi Jin, Assistant Professor, Princeton University (verified email at princeton.edu)
Sanjiv Kumar, Google Fellow, VP, Google Research (verified email at google.com)
Miroslav Dudik, Microsoft Research (verified email at microsoft.com)
Kianté Brantley, Assistant Professor, Harvard University (verified email at g.harvard.edu)
Qinghua Liu, Microsoft Research (verified email at princeton.edu)
Hal Daumé III, Associate Professor of Computer Science, University of Maryland (verified email at umiacs.umd.edu)
Robert Schapire, Microsoft Research (verified email at microsoft.com)
Sashank J. Reddi, Research Scientist, Google Research (verified email at cs.cmu.edu)
Wen Sun, Assistant Professor, Cornell University (verified email at cornell.edu)
Thodoris Lykouris, MIT (verified email at mit.edu)
Aleksandrs Slivkins, Senior Principal Researcher, Microsoft Research NYC (verified email at microsoft.com)
Stefani Karp, Carnegie Mellon University (verified email at cs.cmu.edu)
Yonathan Efroni, Meta, New York (verified email at fb.com)
Akshay Krishnamurthy, University of Massachusetts Amherst (verified email at cs.umass.edu)
Satyen Kale, Research Scientist, Apple (verified email at satyenkale.com)
Daliang Li, Anthropic (verified email at anthropic.com)
Manzil Zaheer, Google Research (verified email at cmu.edu)
Felix Xinnan Yu, Sr. Staff Research Scientist, Google New York (verified email at google.com)
Renat Aksitov, Google DeepMind (verified email at google.com)
Nikunj Saunshi, Research Scientist, Google (verified email at google.com)

Sobhan Miryoosefi

Princeton University | Google Research

Verified email at google.com

Machine Learning, Theoretical Machine Learning, Reinforcement Learning, Natural Language Processing


Title	Cited by	Year
Bellman Eluder dimension: New rich classes of RL problems, and sample-efficient algorithms. C Jin, Q Liu, S Miryoosefi. Advances in Neural Information Processing Systems 34, 13406-13418, 2021	268	2021
Reinforcement learning with convex constraints. S Miryoosefi, K Brantley, H Daumé III, M Dudík, R Schapire. Advances in Neural Information Processing Systems 32, 14093-14102, 2019	108	2019
Constrained episodic reinforcement learning in concave-convex and knapsack settings. K Brantley, M Dudik, T Lykouris, S Miryoosefi, M Simchowitz, A Slivkins, ... Advances in Neural Information Processing Systems 33, 16315-16326, 2020	61	2020
Provable reinforcement learning with a short-term memory. Y Efroni, C Jin, A Krishnamurthy, S Miryoosefi. International Conference on Machine Learning, 5832-5850, 2022	44	2022
A simple reward-free approach to constrained reinforcement learning. S Miryoosefi, C Jin. International Conference on Machine Learning, 15666-15698, 2022	41	2022
ReST meets ReAct: Self-improvement for multi-step reasoning LLM agent. R Aksitov, S Miryoosefi, Z Li, D Li, S Babayan, K Kopparapu, Z Fisher, ... arXiv preprint arXiv:2312.10003, 2023	29	2023
Efficient training of language models using few-shot learning. SJ Reddi, S Miryoosefi, S Karp, S Krishnan, S Kale, S Kim, S Kumar. International Conference on Machine Learning, 14553-14568, 2023	11	2023
Efficient Stagewise Pretraining via Progressive Subnetworks. A Panigrahi, N Saunshi, K Lyu, S Miryoosefi, S Reddi, S Kale, S Kumar. arXiv preprint arXiv:2402.05913, 2024	4	2024
Landscape-Aware Growing: The Power of a Little LAG. S Karp, N Saunshi, S Miryoosefi, SJ Reddi, S Kumar. arXiv preprint arXiv:2406.02469, 2024	1	2024
On the Inductive Bias of Stacking Towards Improving Reasoning. N Saunshi, S Karp, S Krishnan, S Miryoosefi, SJ Reddi, S Kumar. arXiv preprint arXiv:2409.19044, 2024		2024
Provable Reinforcement Learning with Constraints and Function Approximation. SSM Yoosefi. Princeton University, 2022		2022
