Rémi Munos

عدد مرات الاقتباسات

	الكل	قبل 2020
اقتباسات	45565	33756
h-index	91	78
i10-index	199	160

9000

4500

2250

6750

2007200820092010201120122013201420152016201720182019202020212022202320242025170 247 221 356 471 532 585 769 802 893 1124 2007 3065 4112 5708 6961 7795 8349 805

عدد المنشورات المتاحة للجميع

عرض المجموعة جميعها

20 مقالة

0 مقالة

المقالات البحثية المتاحة للجميع

المقالات البحثية غير المتاحة للجميع

تمّ اختيار المعلومات استنادًا إلى تفويضات التمويل

المؤلفون المشاركون

Michal ValkoChief Models Officer @ Stealth Startup, Inria & MVA - Ex: Llama at Meta; Gemini and BYOL @ Deepmindبريد إلكتروني تم التحقق منه على meta.com
Mohammad Gheshlaghi AzarCohereبريد إلكتروني تم التحقق منه على cohere.com
Will DabneyDeepMindبريد إلكتروني تم التحقق منه على google.com
Bilal PiotGoogle Deepmindبريد إلكتروني تم التحقق منه على google.com
Marc G. BellemareReliant AIبريد إلكتروني تم التحقق منه على reliant.ai
Csaba SzepesvariDeepMind & University of Albertaبريد إلكتروني تم التحقق منه على cs.ualberta.ca
Zhaohan Daniel GuoDeepMindبريد إلكتروني تم التحقق منه على google.com
Mark RowlandResearch Scientist, Google DeepMindبريد إلكتروني تم التحقق منه على google.com
Jean-bastien Grillبريد إلكتروني تم التحقق منه على google.com
Florent AltchéResearch Engineer, DeepMindبريد إلكتروني تم التحقق منه على google.com
Corentin TallecDeepMindبريد إلكتروني تم التحقق منه على google.com
Alessandro LazaricResearch Scientist, Facebook Artificial Intelligence Researchبريد إلكتروني تم التحقق منه على inria.fr
Yunhao TangResearch Scientist, Llama research team; Previously, DeepMindبريد إلكتروني تم التحقق منه على columbia.edu
koray kavukcuogluDeepMindبريد إلكتروني تم التحقق منه على kavukcuoglu.org
Gilles StoltzCNRS / Université Paris-Saclay / HEC Parisبريد إلكتروني تم التحقق منه على hec.fr
Odalric-Ambrym MaillardInria Lille - Nord Europeبريد إلكتروني تم التحقق منه على inria.fr
Mohammad GhavamzadehAmazon AGIبريد إلكتروني تم التحقق منه على amazon.com
Pierre RichemondGoogle DeepMindبريد إلكتروني تم التحقق منه على deepmind.com
Sebastien BubeckOpenAIبريد إلكتروني تم التحقق منه على openai.com
Daniele CalandrielloResearch Scientist, DeepMindبريد إلكتروني تم التحقق منه على google.com

متابعة

Rémi Munos

FAIR, Meta

بريد إلكتروني تم التحقق منه على inria.fr - الصفحة الرئيسية

deepRL RLHF MCTS bandit theory statistical learning


عنوان ترتيب حسب الاقتباسات ترتيب حسب السنة الترتيب حسب العنوان	عدد مرات الاقتباسات عدد مرات الاقتباسات	السنة
Bootstrap your own latent-a new approach to self-supervised learning‏ JB Grill, F Strub, F Altché, C Tallec, P Richemond, E Buchatskaya, ...‏ Advances in neural information processing systems 33, 21271-21284, 2020‏	7392	2020
A distributional perspective on reinforcement learning‏ MG Bellemare, W Dabney, R Munos‏ International conference on machine learning, 449-458, 2017‏	1981	2017
Unifying count-based exploration and intrinsic motivation‏ M Bellemare, S Srinivasan, G Ostrovski, T Schaul, D Saxton, R Munos‏ Advances in neural information processing systems 29, 2016‏	1817	2016
Impala: Scalable distributed deep-rl with importance weighted actor-learner architectures‏ L Espeholt, H Soyer, R Munos, K Simonyan, V Mnih, T Ward, Y Doron, ...‏ International conference on machine learning, 1407-1416, 2018‏	1775	2018
Learning to reinforcement learn‏ JX Wang, Z Kurth-Nelson, D Tirumala, H Soyer, JZ Leibo, R Munos, ...‏ arXiv preprint arXiv:1611.05763, 2016‏	1119	2016
Sample efficient actor-critic with experience replay‏ Z Wang, V Bapst, N Heess, V Mnih, R Munos, K Kavukcuoglu, ...‏ arXiv preprint arXiv:1611.01224, 2016‏	1064	2016
Best arm identification in multi-armed bandits‏ JY Audibert, S Bubeck‏ COLT-23th Conference on learning theory-2010, 13 p., 2010‏	1005	2010
Distributional reinforcement learning with quantile regression‏ W Dabney, M Rowland, M Bellemare, R Munos‏ Proceedings of the AAAI conference on artificial intelligence 32 (1), 2018‏	944	2018
Minimax regret bounds for reinforcement learning‏ MG Azar, I Osband, R Munos‏ International conference on machine learning, 263-272, 2017‏	899	2017
Exploration–exploitation tradeoff using variance estimates in multi-armed bandits‏ JY Audibert, R Munos, C Szepesvári‏ Theoretical Computer Science 410 (19), 1876-1902, 2009‏	814	2009
Thompson sampling: An asymptotically optimal finite-time analysis‏ E Kaufmann, N Korda, R Munos‏ International conference on algorithmic learning theory, 199-213, 2012‏	806	2012
Count-based exploration with neural density models‏ G Ostrovski, MG Bellemare, A Oord, R Munos‏ International conference on machine learning, 2721-2730, 2017‏	772	2017
Safe and efficient off-policy reinforcement learning‏ R Munos, T Stepleton, A Harutyunyan, M Bellemare‏ Advances in neural information processing systems 29, 2016‏	750	2016
Successor features for transfer in reinforcement learning‏ A Barreto, W Dabney, R Munos, JJ Hunt, T Schaul, HP van Hasselt, ...‏ Advances in neural information processing systems 30, 2017‏	695	2017
Finite-Time Bounds for Fitted Value Iteration.‏ R Munos, C Szepesvári‏ Journal of Machine Learning Research 9 (5), 2008‏	672	2008
Implicit quantile networks for distributional reinforcement learning‏ W Dabney, G Ostrovski, D Silver, R Munos‏ International conference on machine learning, 1096-1105, 2018‏	668	2018
Automated curriculum learning for neural networks‏ A Graves, MG Bellemare, J Menick, R Munos, K Kavukcuoglu‏ international conference on machine learning, 1311-1320, 2017‏	660	2017
Pure exploration in multi-armed bandits problems‏ S Bubeck, R Munos, G Stoltz‏ Algorithmic Learning Theory: 20th International Conference, ALT 2009, Porto …, 2009‏	643	2009
Recurrent experience replay in distributed reinforcement learning‏ S Kapturowski, G Ostrovski, J Quan, R Munos, W Dabney‏ International conference on learning representations, 2018‏	600	2018
Maximum a posteriori policy optimisation‏ A Abdolmaleki, JT Springenberg, Y Tassa, R Munos, N Heess, ...‏ arXiv preprint arXiv:1806.06920, 2018‏	554	2018

يتعذر على النظام إجراء العملية في الوقت الحالي. عاود المحاولة لاحقًا.

مقالات 1–20

عدد الاقتباسات في العام

اقتباسات مكررة

الاقتباسات المدمجة

إضافة مؤلفين مشاركينالمؤلفون المشاركون

متابعة

عدد مرات الاقتباسات

المؤلفون المشاركون