Ioannis Antonoglou
DeepMind, UCL
Verified email at reflection.ai
Title
Cited by
Year
Human-level control through deep reinforcement learning
V Mnih, K Kavukcuoglu, D Silver, AA Rusu, J Veness, MG Bellemare, ...
Nature 518 (7540), 529-533, 2015
Cited by: 33925 | Year: 2015
Mastering the game of Go with deep neural networks and tree search
D Silver, A Huang, CJ Maddison, A Guez, L Sifre, G Van Den Driessche, ...
Nature 529 (7587), 484-489, 2016
Cited by: 20992 | Year: 2016
Playing Atari with deep reinforcement learning
V Mnih
arXiv preprint arXiv:1312.5602, 2013
Cited by: 16926 | Year: 2013
Mastering the game of Go without human knowledge
D Silver, J Schrittwieser, K Simonyan, I Antonoglou, A Huang, A Guez, ...
Nature 550 (7676), 354-359, 2017
Cited by: 11927 | Year: 2017
Prioritized Experience Replay
T Schaul
arXiv preprint arXiv:1511.05952, 2015
Cited by: 5414 | Year: 2015
A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play
D Silver, T Hubert, J Schrittwieser, I Antonoglou, M Lai, A Guez, M Lanctot, ...
Science 362 (6419), 1140-1144, 2018
Cited by: 5112 | Year: 2018
Mastering Atari, Go, chess and shogi by planning with a learned model
J Schrittwieser, I Antonoglou, T Hubert, K Simonyan, L Sifre, S Schmitt, ...
Nature 588 (7839), 604-609, 2020
Cited by: 2720 | Year: 2020
Mastering chess and shogi by self-play with a general reinforcement learning algorithm
D Silver, T Hubert, J Schrittwieser, I Antonoglou, M Lai, A Guez, M Lanctot, ...
arXiv preprint arXiv:1712.01815, 2017
Cited by: 2505 | Year: 2017
Gemini: a family of highly capable multimodal models
G Team, R Anil, S Borgeaud, JB Alayrac, J Yu, R Soricut, J Schalkwyk, ...
arXiv preprint arXiv:2312.11805, 2023
Cited by: 2494 | Year: 2023
Playing Atari with deep reinforcement learning. arXiv 2013
V Mnih, K Kavukcuoglu, D Silver, A Graves, I Antonoglou, D Wierstra, ...
arXiv preprint arXiv:1312.5602, 2013
Cited by: 1105 | Year: 2013
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context
G Team, P Georgiev, VI Lei, R Burnell, L Bai, A Gulati, G Tanzer, ...
arXiv preprint arXiv:2403.05530, 2024
Cited by: 966 | Year: 2024
Gemini: A family of highly capable multimodal models
R Anil, S Borgeaud, Y Wu, JB Alayrac, J Yu, R Soricut, J Schalkwyk, ...
arXiv preprint arXiv:2312.11805, 2023
Cited by: 305 | Year: 2023
Mastering chess and shogi by self-play with a general reinforcement learning algorithm. arXiv 2017
D Silver, T Hubert, J Schrittwieser, I Antonoglou, M Lai, A Guez, M Lanctot, ...
arXiv preprint arXiv:1712.01815, 2017
Cited by: 170 | Year: 2017
Bayesian optimization in AlphaGo
Y Chen, A Huang, Z Wang, I Antonoglou, J Schrittwieser, D Silver, ...
arXiv preprint arXiv:1812.06855, 2018
Cited by: 164 | Year: 2018
Prioritized experience replay. arXiv 2015
T Schaul, J Quan, I Antonoglou, D Silver
arXiv preprint arXiv:1511.05952, 2016
Cited by: 159 | Year: 2016
Unit tests for stochastic optimization
T Schaul, I Antonoglou, D Silver
arXiv preprint arXiv:1312.6055, 2013
Cited by: 142 | Year: 2013
Online and offline reinforcement learning by planning with a learned model
J Schrittwieser, T Hubert, A Mandhane, M Barekatain, I Antonoglou, ...
Advances in Neural Information Processing Systems 34, 27580-27591, 2021
Cited by: 132 | Year: 2021
Learning to search with MCTSnets
A Guez, T Weber, I Antonoglou, K Simonyan, O Vinyals, D Wierstra, ...
International Conference on Machine Learning, 1822-1831, 2018
Cited by: 100 | Year: 2018
Learning and planning in complex action spaces
T Hubert, J Schrittwieser, I Antonoglou, M Barekatain, S Schmitt, D Silver
International Conference on Machine Learning, 4476-4486, 2021
Cited by: 97 | Year: 2021
A test of relative similarity for model selection in generative models
W Bounliphone, E Belilovsky, MB Blaschko, I Antonoglou, A Gretton
arXiv preprint arXiv:1511.04581, 2015
Cited by: 90 | Year: 2015
Articles 1–20