עקוב אחר
Doina Precup
Doina Precup
DeepMind and McGill University
כתובת אימייל מאומתת בדומיין cs.mcgill.ca
כותרת
צוטט על ידי
צוטט על ידי
שנה
The multimodal brain tumor image segmentation benchmark (BRATS)
BH Menze, A Jakab, S Bauer, J Kalpathy-Cramer, K Farahani, J Kirby, ...
IEEE transactions on medical imaging 34 (10), 1993-2024, 2014
65432014
Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning
RS Sutton, D Precup, S Singh
Artificial intelligence 112 (1-2), 181-211, 1999
47291999
Deep reinforcement learning that matters
P Henderson, R Islam, P Bachman, J Pineau, D Precup, D Meger
Proceedings of the AAAI conference on artificial intelligence 32 (1), 2018
26142018
Off-policy deep reinforcement learning without exploration
S Fujimoto, D Meger, D Precup
International conference on machine learning, 2052-2062, 2019
18482019
The option-critic architecture
PL Bacon, J Harb, D Precup
Proceedings of the AAAI conference on artificial intelligence 31 (1), 2017
13732017
Eligibility traces for off-policy policy evaluation.
D Precup, RS Sutton, S Singh
ICML 2000, 759-766, 2000
10222000
Fast gradient-descent methods for temporal-difference learning with linear function approximation
RS Sutton, HR Maei, D Precup, S Bhatnagar, D Silver, C Szepesvári, ...
Proceedings of the 26th annual international conference on machine learning …, 2009
7512009
Learning with pseudo-ensembles
P Bachman, O Alsharif, D Precup
Advances in neural information processing systems 27, 2014
7352014
Reward is enough
D Silver, S Singh, D Precup, RS Sutton
Artificial intelligence 299, 103535, 2021
6692021
Horde: A scalable real-time architecture for learning knowledge from unsupervised sensorimotor interaction
RS Sutton, J Modayil, M Delp, T Degris, PM Pilarski, A White, D Precup
The 10th International Conference on Autonomous Agents and Multiagent …, 2011
6332011
Algorithms for multi-armed bandit problems
V Kuleshov, D Precup
arXiv preprint arXiv:1402.6028, 2014
6142014
Exploring uncertainty measures in deep networks for multiple sclerosis lesion detection and segmentation
T Nair, D Precup, DL Arnold, T Arbel
Medical image analysis 59, 101557, 2020
5232020
Learning options in reinforcement learning
M Stolle, D Precup
Abstraction, Reformulation, and Approximation: 5th International Symposium …, 2002
4992002
Off-policy temporal-difference learning with function approximation
D Precup, RS Sutton, S Dasgupta
ICML, 417-424, 2001
4862001
Temporal abstraction in reinforcement learning
D Precup
University of Massachusetts Amherst, 2000
4162000
Metrics for Finite Markov Decision Processes.
N Ferns, P Panangaden, D Precup
UAI 4, 162-169, 2004
3912004
Conditional computation in neural networks for faster models
E Bengio, PL Bacon, J Pineau, D Precup
arXiv preprint arXiv:1511.06297, 2015
3842015
Deep learning, reinforcement learning, and world models
Y Matsuo, Y LeCun, M Sahani, D Precup, D Silver, M Sugiyama, E Uchibe, ...
Neural Networks 152, 267-275, 2022
3752022
Convergent temporal-difference learning with arbitrary smooth function approximation
H Maei, C Szepesvari, S Bhatnagar, D Precup, D Silver, RS Sutton
Advances in neural information processing systems 22, 2009
3672009
Towards continual reinforcement learning: A review and perspectives
K Khetarpal, M Riemer, I Rish, D Precup
Journal of Artificial Intelligence Research 75, 1401-1476, 2022
3562022
המערכת אינה יכולה לבצע את הפעולה כעת. נסה שוב מאוחר יותר.
מאמרים 1–20