Підписатись
Victor Gabillon
Victor Gabillon
Невідома організація
Немає підтвердженої електронної адреси - Домашня сторінка
Назва
Посилання
Посилання
Рік
Best arm identification: A unified approach to fixed budget and fixed confidence
V Gabillon, M Ghavamzadeh, A Lazaric
NIPS, Neural Information Processing Systems, 2012
3762012
Approximate modified policy iteration and its application to the game of Tetris.
B Scherrer, M Ghavamzadeh, V Gabillon, B Lesner, M Geist
JMLR, Journal of Machine Learning Research 16, 2015
1582015
Multi-bandit best arm identification
V Gabillon, M Ghavamzadeh, A Lazaric, S Bubeck
NIPS, Neural Information Processing Systems, 2011
1312011
Approximate dynamic programming finally performs well in the game of Tetris
V Gabillon, M Ghavamzadeh, B Scherrer
NIPS, Neural Information Processing systems, 2013
822013
Adaptive submodular maximization in bandit setting
V Gabillon, B Kveton, Z Wen, B Eriksson, S Muthukrishnan
NIPS, Neural Information Processing Systems, 2013
682013
Approximate modified policy iteration
B Scherrer, V Gabillon, M Ghavamzadeh, M Geist
ICML, International Conference on Machine Learning, 2012
622012
Best of both worlds: Stochastic & adversarial best-arm identification
Y Abbasi-Yadkori, P Bartlett, V Gabillon, A Malek, M Valko
Conference on learning theory, 918-949, 2018
562018
Improved learning complexity in combinatorial pure exploration bandits
V Gabillon, A Lazaric, M Ghavamzadeh, R Ortner, P Bartlett
AISTATS, Artificial Intelligence and Statistics, 2016
482016
A simple parameter-free and adaptive approach to optimization under a minimal local smoothness assumption
PL Bartlett, V Gabillon, M Valko
ALT, Algorithmic Learning Theory, 2019
412019
MANAS: Multi-agent neural architecture search
V Lopes, FM Carlucci, PM Esperança, M Singh, A Yang, V Gabillon, H Xu, ...
Machine Learning 113 (1), 73-96, 2024
352024
Classification-based policy iteration with a critic
V Gabillon, A Lazaric, M Ghavamzadeh, B Scherrer
ICML, International Conference on Machine Learning, 2011
302011
Hit-and-Run for Sampling and Planning in Non-Convex Spaces
Y Abbasi-Yadkori, PL Bartlett, V Gabillon, A Malek
AISTATS, Artificial Intelligence and Statistics, 2017
262017
Near Minimax Optimal Players for the Finite-Time 3-Expert Prediction Problem
Y Abbasi-Yadkori, PL Bartlett, V Gabillon
NIPS, Neural Information Processing Systems, 2017
172017
Large-Scale Optimistic Adaptive Submodularity.
V Gabillon, B Kveton, Z Wen, B Eriksson, S Muthukrishnan
AAAI, Association for the Advancement of Artificial Intelligence, 2014
172014
Rollout allocation strategies for classification-based policy iteration
V Gabillon, A Lazaric, M Ghavamzadeh
Workshop on Reinforcement Learning and Search in Very Large Spaces, 2010
142010
Adaptive multi-fidelity optimization with fast learning rates
C Fiegel, V Gabillon, M Valko
International Conference on Artificial Intelligence and Statistics, 3493-3502, 2020
82020
Derivative-Free & Order-Robust Optimisation
V Gabillon, R Tutunov, M Valko, HB Ammar
AISTATS, Artificial Intelligence and Statistics, 2020
8*2020
Scale-free adaptive planning for deterministic dynamics & discounted rewards
P Bartlett, V Gabillon, J Healey, M Valko
ICML, International Conference on Machine Learning, 495-504, 2019
72019
Multi-media content-recommender system that learns how to elicit user preferences
VF Gabillon, B Kveton, B Eriksson
US Patent App. 14/489,703, 2016
52016
Machine learning tools for online advertisement
V Gabillon
Technical report, INRIA Lille, France, 2009
52009
У даний момент система не може виконати операцію. Спробуйте пізніше.
Статті 1–20