Gemini: a family of highly capable multimodal models G Team, R Anil, S Borgeaud, JB Alayrac, J Yu, R Soricut, J Schalkwyk, ... arXiv preprint arXiv:2312.11805, 2023 | 3308 | 2023 |
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context G Team, P Georgiev, VI Lei, R Burnell, L Bai, A Gulati, G Tanzer, ... arXiv preprint arXiv:2403.05530, 2024 | 1234 | 2024 |
Stagewise safe bayesian optimization with gaussian processes Y Sui, V Zhuang, J Burdick, Y Yue International Conference on Machine Learning, 4781-4789, 2018 | 181 | 2018 |
Multi-dueling bandits with dependent arms Y Sui, V Zhuang, JW Burdick, Y Yue arXiv preprint arXiv:1705.00253, 2017 | 95 | 2017 |
Training language models to self-correct via reinforcement learning A Kumar, V Zhuang, R Agarwal, Y Su, JD Co-Reyes, A Singh, K Baumli, ... arXiv preprint arXiv:2409.12917, 2024 | 63 | 2024 |
Barkour: Benchmarking animal-level agility with quadruped robots K Caluwaerts, A Iscen, JC Kew, W Yu, T Zhang, D Freeman, KH Lee, ... arXiv preprint arXiv:2305.14654, 2023 | 53 | 2023 |
Learning to learn faster from human feedback with language model predictive control J Liang, F Xia, W Yu, A Zeng, MG Arenas, M Attarian, M Bauza, M Bennice, ... arXiv preprint arXiv:2402.11450, 2024 | 28 | 2024 |
Kepler: robust learning for parametric query optimization L Doshi, V Zhuang, G Jain, R Marcus, H Huang, D Altinbüken, E Brevdo, ... Proceedings of the ACM on Management of Data 1 (1), 1-25, 2023 | 22 | 2023 |
No-regret reinforcement learning with heavy-tailed rewards V Zhuang, Y Sui International Conference on Artificial Intelligence and Statistics, 3385-3393, 2021 | 17 | 2021 |
Training language models to self-correct via reinforcement learning, 2024 A Kumar, V Zhuang, R Agarwal, Y Su, JD Co-Reyes, A Singh, K Baumli, ... URL https://arxiv. org/abs/2409.12917, 0 | 14 | |
Inference-aware fine-tuning for best-of-n sampling in large language models Y Chow, G Tennenholtz, I Gur, V Zhuang, B Dai, S Thiagarajan, ... arXiv preprint arXiv:2412.15287, 2024 | 6 | 2024 |
Scalable Bayesian Optimization via Focalized Sparse Gaussian Processes Y Wei, V Zhuang, S Soedarmadji, Y Sui Advances in Neural Information Processing Systems 37, 120443-120467, 2025 | | 2025 |
The Design of the Barkour Benchmark for Robot Agility W Yu, K Caluwaerts, A Iscen, JC Kew, T Zhang, D Freeman, L Lee, ... 2024 IEEE/RSJ International Conference on Intelligent Robots and Systems …, 2024 | | 2024 |
Workload-Driven Index Selections H Huang, V Zhuang, S Idicula, G Jain US Patent App. 18/183,925, 2024 | | 2024 |
UNCERTAINTY IN ARTIFICIAL INTELLIGENCE-PROCEEDINGS OF THE 33RD CONFERENCE, UAI 2017 C Zhang, S Mandt, H Kjellström, J Suzuki, J Kawahara, Y Sui, V Zhuang, ... | | 2017 |
Motion Control of High-Dimensional Musculoskeletal System with Hierarchical Model-Based Planning Y Wei, S Zhuang, V Zhuang, Y Sui The Thirteenth International Conference on Learning Representations, 0 | | |