Human-level control through deep reinforcement learning V Mnih, K Kavukcuoglu, D Silver, AA Rusu, J Veness, MG Bellemare, ... nature 518 (7540), 529-533, 2015 | 34114 | 2015 |
Mastering the game of Go with deep neural networks and tree search D Silver, A Huang, CJ Maddison, A Guez, L Sifre, G Van Den Driessche, ... nature 529 (7587), 484-489, 2016 | 20953 | 2016 |
Playing atari with deep reinforcement learning V Mnih, K Kavukcuoglu, D Silver, A Graves, I Antonoglou, D Wierstra, ... arXiv preprint arXiv:1312.5602, 2013 | 16868 | 2013 |
Mastering the game of go without human knowledge D Silver, J Schrittwieser, K Simonyan, I Antonoglou, A Huang, A Guez, ... nature 550 (7676), 354-359, 2017 | 11928 | 2017 |
Prioritized experience replay T Schaul, J Quan, I Antonoglou, D Silver arXiv preprint arXiv:1511.05952, 2015 | 6023 | 2015 |
A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play D Silver, T Hubert, J Schrittwieser, I Antonoglou, M Lai, A Guez, M Lanctot, ... Science 362 (6419), 1140-1144, 2018 | 5125 | 2018 |
Gemini: a family of highly capable multimodal models G Team, R Anil, S Borgeaud, JB Alayrac, J Yu, R Soricut, J Schalkwyk, ... arXiv preprint arXiv:2312.11805, 2023 | 3196 | 2023 |
Mastering atari, go, chess and shogi by planning with a learned model J Schrittwieser, I Antonoglou, T Hubert, K Simonyan, L Sifre, S Schmitt, ... Nature 588 (7839), 604-609, 2020 | 2742 | 2020 |
Mastering chess and shogi by self-play with a general reinforcement learning algorithm D Silver, T Hubert, J Schrittwieser, I Antonoglou, M Lai, A Guez, M Lanctot, ... arXiv preprint arXiv:1712.01815, 2017 | 2672 | 2017 |
Playing atari with deep reinforcement learning. arXiv 2013 V Mnih, K Kavukcuoglu, D Silver, A Graves, I Antonoglou, D Wierstra, ... arXiv preprint arXiv:1312.5602 10, 2013 | 1295 | 2013 |
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context G Team, P Georgiev, VI Lei, R Burnell, L Bai, A Gulati, G Tanzer, ... arXiv preprint arXiv:2403.05530, 2024 | 1165 | 2024 |
& Hassabis, D.(2016). Mastering the game of Go with deep neural networks and tree search D Silver, A Huang, CJ Maddison, A Guez, L Sifre, G Van Den Driessche Nature 529 (7587), 484-489, 0 | 204 | |
Thore Graepel, Thore and Demis Hassabis D Silver, A Huang, CJ Maddison, A Guez, L Sifre, G Van Den Driessche, ... Mastering the game of Go with deep neural networks and tree search.” nature …, 2016 | 194 | 2016 |
Bayesian optimization in alphago Y Chen, A Huang, Z Wang, I Antonoglou, J Schrittwieser, D Silver, ... arXiv preprint arXiv:1812.06855, 2018 | 167 | 2018 |
Prioritized experience replay. arXiv 2015 T Schaul, J Quan, I Antonoglou, D Silver arXiv preprint arXiv:1511.05952 5952, 2016 | 162 | 2016 |
Unit tests for stochastic optimization T Schaul, I Antonoglou, D Silver arXiv preprint arXiv:1312.6055, 2013 | 143 | 2013 |
Online and offline reinforcement learning by planning with a learned model J Schrittwieser, T Hubert, A Mandhane, M Barekatain, I Antonoglou, ... Advances in Neural Information Processing Systems 34, 27580-27591, 2021 | 134 | 2021 |
Yutian chen, Timothy P. Lillicrap, Fan Hui, Laurent Sifre, George van den Driessche, Thore Graepel, and Demis Hassabis.“Mastering the game of Go without human knowledge” D Silver, J Schrittwieser, K Simonyan, I Antonoglou, A Huang, A Guez, ... Nature 550, 354-359, 2017 | 108 | 2017 |
PlayingAtariwithdeeprein forcementlearning V Mnih, K Kavukcuoglu, D Silver, A Graves, I Antonoglou, D Wierstra, ... 2013 12 19). https://arxiv. org/abs/1312.5602, 2013 | 105 | 2013 |
Playing atari with deep reinforcement learning. CoRR abs/1312.5602 (2013) V Mnih, K Kavukcuoglu, D Silver, A Graves, I Antonoglou, D Wierstra, ... arXiv preprint arXiv:1312.5602, 2013 | 104 | 2013 |