Alessandro Lazaric

Посилання

	Усі	З 2020
Цитування	7878	5615
h-індекс	48	39
i10-індекс	108	96

1400

700

350

1050

20082009201020112012201320142015201620172018201920202021202220232024202525 23 52 88 132 137 190 182 257 285 366 480 680 1000 1176 1399 1111 248

Доступні для всіх

Переглянути всі

18 статей

0 статей

доступні

недоступні

За умовами фінансування

Співавтори

Matteo PirottaResearch Scientist, Meta (FAIR)Підтверджена електронна адреса в fb.com
Mohammad GhavamzadehAmazon AGIПідтверджена електронна адреса в amazon.com
Marcello RestelliAssociate Professor, Politecnico di MilanoПідтверджена електронна адреса в polimi.it
Michal ValkoChief Models Officer @ Stealth Startup, Inria & MVA - Ex: Llama at Meta; Gemini and BYOL @ DeepmindПідтверджена електронна адреса в meta.com
Rémi MunosFAIR, MetaПідтверджена електронна адреса в inria.fr
Andrea BonariniFull Professor, Politecnico di Milano, Dipartimento di Eletronica, Informazione e Biongegneria, AIПідтверджена електронна адреса в polimi.it
Emma BrunskillAssociate Professor of Computer Science, Stanford UniversityПідтверджена електронна адреса в cs.stanford.edu
Daniele CalandrielloResearch Scientist, DeepMindПідтверджена електронна адреса в google.com
Jean TarbouriechGoogle DeepMindПідтверджена електронна адреса в google.com
Marc AbeilleCriteoПідтверджена електронна адреса в ens-cachan.fr
Andrea TirinzoniMetaПідтверджена електронна адреса в fb.com
Ronan FruitPhD candidate, Inria Lille, SequeL teamПідтверджена електронна адреса в inria.fr
Evrard GarcelonFacebook AI ResearchПідтверджена електронна адреса в fb.com
Andrea ZanetteAssistant Professor, Carnegie Mellon UniversityПідтверджена електронна адреса в andrew.cmu.edu
Denis YaratsCofounder and CTO, Perplexity AIПідтверджена електронна адреса в perplexity.ai
Lerrel PintoNew York UniversityПідтверджена електронна адреса в cs.nyu.edu
Marta SoareUniversité d'OrléansПідтверджена електронна адреса в univ-orleans.fr
Anima AnandkumarCalifornia Institute of Technology and NVIDIAПідтверджена електронна адреса в caltech.edu
Kamyar AzizzadenesheliNvidiaПідтверджена електронна адреса в nvidia.com
Amir SaniSilentroПідтверджена електронна адреса в amirsani.com

Підписатись

Alessandro Lazaric

Research Scientist, Facebook Artificial Intelligence Research

Підтверджена електронна адреса в inria.fr - Домашня сторінка

Machine Learning


Назва Сортувати за цитуваннями Сортувати за роком Сортувати за назвою	Посилання Посилання	Рік
Transfer in reinforcement learning: a framework and a survey A Lazaric Reinforcement Learning: State-of-the-Art, 143-173, 2012	387	2012
Best arm identification: A unified approach to fixed budget and fixed confidence V Gabillon, M Ghavamzadeh, A Lazaric Advances in neural information processing systems 25, 2012	378	2012
Mastering visual continuous control: Improved data-augmented reinforcement learning D Yarats, R Fergus, A Lazaric, L Pinto arXiv preprint arXiv:2107.09645, 2021	361	2021
Linear thompson sampling revisited M Abeille, A Lazaric Artificial Intelligence and Statistics, 176-184, 2017	306	2017
Reinforcement learning with prototypical representations D Yarats, R Fergus, A Lazaric, L Pinto International Conference on Machine Learning, 11920-11931, 2021	259	2021
Learning near optimal policies with low inherent bellman error A Zanette, A Lazaric, M Kochenderfer, E Brunskill International Conference on Machine Learning, 10978-10989, 2020	259	2020
Best-arm identification in linear bandits M Soare, A Lazaric, R Munos Advances in neural information processing systems 27, 2014	240	2014
Transfer of samples in batch reinforcement learning A Lazaric, M Restelli, A Bonarini Proceedings of the 25th international conference on Machine learning, 544-551, 2008	212	2008
Reinforcement learning in continuous action spaces through sequential monte carlo methods A Lazaric, M Restelli, A Bonarini Advances in neural information processing systems 20, 2007	203	2007
Risk-aversion in multi-armed bandits A Sani, A Lazaric, R Munos Advances in neural information processing systems 25, 2012	198	2012
Frequentist regret bounds for randomized least-squares value iteration A Zanette, D Brandfonbrener, E Brunskill, M Pirotta, A Lazaric International Conference on Artificial Intelligence and Statistics, 1954-1964, 2020	158	2020
Reinforcement learning of pomdps using spectral methods K Azizzadenesheli, A Lazaric, A Anandkumar Conference on Learning Theory, 193-256, 2016	156	2016
Upper-confidence-bound algorithms for active learning in multi-armed bandits A Carpentier, A Lazaric, M Ghavamzadeh, R Munos, P Auer International Conference on Algorithmic Learning Theory, 189-203, 2011	150	2011
Bayesian multi-task reinforcement learning A Lazaric, M Ghavamzadeh ICML-27th international conference on machine learning, 599-606, 2010	146	2010
Finite-sample analysis of least-squares policy iteration A Lazaric, M Ghavamzadeh, R Munos The Journal of Machine Learning Research 13 (1), 3041-3074, 2012	138	2012
Multi-bandit best arm identification V Gabillon, M Ghavamzadeh, A Lazaric, S Bubeck Advances in Neural Information Processing Systems 24, 2011	131	2011
Efficient bias-span-constrained exploration-exploitation in reinforcement learning R Fruit, M Pirotta, A Lazaric, R Ortner International Conference on Machine Learning, 1578-1586, 2018	126	2018
Sequential transfer in multi-armed bandit with finite set of models A Lazaric, E Brunskill Advances in Neural Information Processing Systems 26, 2013	126	2013
Improved regret bounds for thompson sampling in linear quadratic control problems M Abeille, A Lazaric International Conference on Machine Learning, 1-9, 2018	113	2018
Don't change the algorithm, change the data: Exploratory data for offline reinforcement learning D Yarats, D Brandfonbrener, H Liu, M Laskin, P Abbeel, A Lazaric, L Pinto arXiv preprint arXiv:2201.13425, 2022	105	2022

У даний момент система не може виконати операцію. Спробуйте пізніше.

Статті 1–20

Кількість бібліографічних посилань на рік

Повторювані посилання

Об’єднані посилання

Додати співавторівСпівавтори

Підписатись

Посилання

Співавтори