Urmăriți
Vincent Zhuang
Vincent Zhuang
Google DeepMind
Adresă de e-mail confirmată pe google.com
Titlu
Citat de
Citat de
Anul
Gemini: a family of highly capable multimodal models
G Team, R Anil, S Borgeaud, JB Alayrac, J Yu, R Soricut, J Schalkwyk, ...
arXiv preprint arXiv:2312.11805, 2023
33082023
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context
G Team, P Georgiev, VI Lei, R Burnell, L Bai, A Gulati, G Tanzer, ...
arXiv preprint arXiv:2403.05530, 2024
12342024
Stagewise safe bayesian optimization with gaussian processes
Y Sui, V Zhuang, J Burdick, Y Yue
International Conference on Machine Learning, 4781-4789, 2018
1812018
Multi-dueling bandits with dependent arms
Y Sui, V Zhuang, JW Burdick, Y Yue
arXiv preprint arXiv:1705.00253, 2017
952017
Training language models to self-correct via reinforcement learning
A Kumar, V Zhuang, R Agarwal, Y Su, JD Co-Reyes, A Singh, K Baumli, ...
arXiv preprint arXiv:2409.12917, 2024
632024
Barkour: Benchmarking animal-level agility with quadruped robots
K Caluwaerts, A Iscen, JC Kew, W Yu, T Zhang, D Freeman, KH Lee, ...
arXiv preprint arXiv:2305.14654, 2023
532023
Learning to learn faster from human feedback with language model predictive control
J Liang, F Xia, W Yu, A Zeng, MG Arenas, M Attarian, M Bauza, M Bennice, ...
arXiv preprint arXiv:2402.11450, 2024
282024
Kepler: robust learning for parametric query optimization
L Doshi, V Zhuang, G Jain, R Marcus, H Huang, D Altinbüken, E Brevdo, ...
Proceedings of the ACM on Management of Data 1 (1), 1-25, 2023
222023
No-regret reinforcement learning with heavy-tailed rewards
V Zhuang, Y Sui
International Conference on Artificial Intelligence and Statistics, 3385-3393, 2021
172021
Training language models to self-correct via reinforcement learning, 2024
A Kumar, V Zhuang, R Agarwal, Y Su, JD Co-Reyes, A Singh, K Baumli, ...
URL https://arxiv. org/abs/2409.12917, 0
14
Inference-aware fine-tuning for best-of-n sampling in large language models
Y Chow, G Tennenholtz, I Gur, V Zhuang, B Dai, S Thiagarajan, ...
arXiv preprint arXiv:2412.15287, 2024
62024
Scalable Bayesian Optimization via Focalized Sparse Gaussian Processes
Y Wei, V Zhuang, S Soedarmadji, Y Sui
Advances in Neural Information Processing Systems 37, 120443-120467, 2025
2025
The Design of the Barkour Benchmark for Robot Agility
W Yu, K Caluwaerts, A Iscen, JC Kew, T Zhang, D Freeman, L Lee, ...
2024 IEEE/RSJ International Conference on Intelligent Robots and Systems …, 2024
2024
Workload-Driven Index Selections
H Huang, V Zhuang, S Idicula, G Jain
US Patent App. 18/183,925, 2024
2024
UNCERTAINTY IN ARTIFICIAL INTELLIGENCE-PROCEEDINGS OF THE 33RD CONFERENCE, UAI 2017
C Zhang, S Mandt, H Kjellström, J Suzuki, J Kawahara, Y Sui, V Zhuang, ...
2017
Motion Control of High-Dimensional Musculoskeletal System with Hierarchical Model-Based Planning
Y Wei, S Zhuang, V Zhuang, Y Sui
The Thirteenth International Conference on Learning Representations, 0
Sistemul nu poate realiza operația în acest moment. Încercați din nou mai târziu.
Articole 1–16