Gpt-4 technical report J Achiam, S Adler, S Agarwal, L Ahmad, I Akkaya, FL Aleman, D Almeida, ... arXiv preprint arXiv:2303.08774, 2023 | 7932 | 2023 |
Neural combinatorial optimization with reinforcement learning I Bello, H Pham, QV Le, M Norouzi, S Bengio arXiv preprint arXiv:1611.09940, 2016 | 2070 | 2016 |
Stand-alone self-attention in vision models P Ramachandran, N Parmar, A Vaswani, I Bello, A Levskaya, J Shlens Advances in neural information processing systems 32, 2019 | 1568* | 2019 |
Attention augmented convolutional networks I Bello, B Zoph, A Vaswani, J Shlens, QV Le Proceedings of the IEEE/CVF international conference on computer vision …, 2019 | 1431 | 2019 |
Neural optimizer search with reinforcement learning I Bello, B Zoph, V Vasudevan, QV Le International Conference on Machine Learning, 459-468, 2017 | 476 | 2017 |
Revisiting resnets: Improved training and scaling strategies I Bello, W Fedus, X Du, ED Cubuk, A Srinivas, TY Lin, J Shlens, B Zoph Advances in Neural Information Processing Systems 34, 22614-22627, 2021 | 370 | 2021 |
Lambdanetworks: Modeling long-range interactions without attention I Bello arXiv preprint arXiv:2102.08602, 2021 | 220 | 2021 |
St-moe: Designing stable and transferable sparse expert models B Zoph, I Bello, S Kumar, N Du, Y Huang, J Dean, N Shazeer, W Fedus arXiv preprint arXiv:2202.08906, 2022 | 161 | 2022 |
Designing effective sparse expert models B Zoph, I Bello, S Kumar, N Du, Y Huang, J Dean, N Shazeer, W Fedus arXiv preprint arXiv:2202.08906 2 (3), 17, 2022 | 100 | 2022 |
Seq2Slate: Re-ranking and slate optimization with RNNs I Bello, S Kulkarni, S Jain, C Boutilier, E Chi, E Eban, X Luo, A Mackey, ... arXiv preprint arXiv:1810.02019, 2018 | 89 | 2018 |
Gpt-4 technical report, 2024 JA OpenAI, S Adler, S Agarwal, L Ahmad, I Akkaya, FL Aleman, ... URL https://arxiv. org/abs/2303.08774 2, 6, 2024 | 81 | 2024 |
GPT-4 technical report (2023) J Achiam, S Adler, S Agarwal, L Ahmad, I Akkaya, FL Aleman, D Almeida, ... URL https://api. semanticscholar. org/CorpusID 257532815, 2023 | 60 | 2023 |
Global self-attention networks for image recognition Z Shen, I Bello, R Vemulapalli, X Jia, CH Chen arXiv preprint arXiv:2010.03019, 2020 | 42 | 2020 |
Revisiting 3d resnets for video recognition X Du, Y Li, Y Cui, R Qian, J Li, I Bello arXiv preprint arXiv:2109.01696, 2021 | 25 | 2021 |
Backprop evolution M Alber, I Bello, B Zoph, PJ Kindermans, P Ramachandran, Q Le arXiv preprint arXiv:1808.02822, 2018 | 19 | 2018 |
Neural network optimizer search I Bello, B Zoph, V Vasudevan, QV Le US Patent App. 17/145,524, 2021 | 6 | 2021 |
Systems and Methods for Slate Optimization with Recurrent Neural Networks OP Meshi, I Bello, S Kulkarni, S Jain US Patent App. 16/415,854, 2019 | 5 | 2019 |
Fully attentional computer vision J Shlens, AT Vaswani, NJ Parmar, P Ramachandran, AC Levskaya, ... US Patent App. 17/606,976, 2022 | 3 | 2022 |
Neural network optimizer search I Bello, B Zoph, V Vasudevan, QV Le US Patent 10,922,611, 2021 | 2 | 2021 |
Learning Control Policies from High-Dimensional Visual Inputs I Bello, Y Tkachenko Stanford CS231N, 2015 | 1 | 2015 |