Підписатись
Alessandro Lazaric
Alessandro Lazaric
Research Scientist, Facebook Artificial Intelligence Research
Підтверджена електронна адреса в inria.fr - Домашня сторінка
Назва
Посилання
Посилання
Рік
Transfer in reinforcement learning: a framework and a survey
A Lazaric
Reinforcement Learning: State-of-the-Art, 143-173, 2012
3872012
Best arm identification: A unified approach to fixed budget and fixed confidence
V Gabillon, M Ghavamzadeh, A Lazaric
Advances in neural information processing systems 25, 2012
3782012
Mastering visual continuous control: Improved data-augmented reinforcement learning
D Yarats, R Fergus, A Lazaric, L Pinto
arXiv preprint arXiv:2107.09645, 2021
3612021
Linear thompson sampling revisited
M Abeille, A Lazaric
Artificial Intelligence and Statistics, 176-184, 2017
3062017
Reinforcement learning with prototypical representations
D Yarats, R Fergus, A Lazaric, L Pinto
International Conference on Machine Learning, 11920-11931, 2021
2592021
Learning near optimal policies with low inherent bellman error
A Zanette, A Lazaric, M Kochenderfer, E Brunskill
International Conference on Machine Learning, 10978-10989, 2020
2592020
Best-arm identification in linear bandits
M Soare, A Lazaric, R Munos
Advances in neural information processing systems 27, 2014
2402014
Transfer of samples in batch reinforcement learning
A Lazaric, M Restelli, A Bonarini
Proceedings of the 25th international conference on Machine learning, 544-551, 2008
2122008
Reinforcement learning in continuous action spaces through sequential monte carlo methods
A Lazaric, M Restelli, A Bonarini
Advances in neural information processing systems 20, 2007
2032007
Risk-aversion in multi-armed bandits
A Sani, A Lazaric, R Munos
Advances in neural information processing systems 25, 2012
1982012
Frequentist regret bounds for randomized least-squares value iteration
A Zanette, D Brandfonbrener, E Brunskill, M Pirotta, A Lazaric
International Conference on Artificial Intelligence and Statistics, 1954-1964, 2020
1582020
Reinforcement learning of pomdps using spectral methods
K Azizzadenesheli, A Lazaric, A Anandkumar
Conference on Learning Theory, 193-256, 2016
1562016
Upper-confidence-bound algorithms for active learning in multi-armed bandits
A Carpentier, A Lazaric, M Ghavamzadeh, R Munos, P Auer
International Conference on Algorithmic Learning Theory, 189-203, 2011
1502011
Bayesian multi-task reinforcement learning
A Lazaric, M Ghavamzadeh
ICML-27th international conference on machine learning, 599-606, 2010
1462010
Finite-sample analysis of least-squares policy iteration
A Lazaric, M Ghavamzadeh, R Munos
The Journal of Machine Learning Research 13 (1), 3041-3074, 2012
1382012
Multi-bandit best arm identification
V Gabillon, M Ghavamzadeh, A Lazaric, S Bubeck
Advances in Neural Information Processing Systems 24, 2011
1312011
Efficient bias-span-constrained exploration-exploitation in reinforcement learning
R Fruit, M Pirotta, A Lazaric, R Ortner
International Conference on Machine Learning, 1578-1586, 2018
1262018
Sequential transfer in multi-armed bandit with finite set of models
A Lazaric, E Brunskill
Advances in Neural Information Processing Systems 26, 2013
1262013
Improved regret bounds for thompson sampling in linear quadratic control problems
M Abeille, A Lazaric
International Conference on Machine Learning, 1-9, 2018
1132018
Don't change the algorithm, change the data: Exploratory data for offline reinforcement learning
D Yarats, D Brandfonbrener, H Liu, M Laskin, P Abbeel, A Lazaric, L Pinto
arXiv preprint arXiv:2201.13425, 2022
1052022
У даний момент система не може виконати операцію. Спробуйте пізніше.
Статті 1–20