Vincent Zhuang

Cited by

	All	Since 2020
Citations	5021	4996
h-index	10	10
i10-index	10	10

Citations per year: 2019: 15; 2020: 31; 2021: 52; 2022: 61; 2023: 129; 2024: 3738; 2025: 976

Public access (based on funding mandates): 1 article available, 0 articles not available


Google DeepMind

Verified email at google.com


Title	Authors	Venue	Cited by	Year
Gemini: a family of highly capable multimodal models	G Team, R Anil, S Borgeaud, JB Alayrac, J Yu, R Soricut, J Schalkwyk, ...	arXiv preprint arXiv:2312.11805, 2023	3308	2023
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context	G Team, P Georgiev, VI Lei, R Burnell, L Bai, A Gulati, G Tanzer, ...	arXiv preprint arXiv:2403.05530, 2024	1234	2024
Stagewise safe Bayesian optimization with Gaussian processes	Y Sui, V Zhuang, J Burdick, Y Yue	International Conference on Machine Learning, 4781-4789, 2018	181	2018
Multi-dueling bandits with dependent arms	Y Sui, V Zhuang, JW Burdick, Y Yue	arXiv preprint arXiv:1705.00253, 2017	95	2017
Training language models to self-correct via reinforcement learning	A Kumar, V Zhuang, R Agarwal, Y Su, JD Co-Reyes, A Singh, K Baumli, ...	arXiv preprint arXiv:2409.12917, 2024	63	2024
Barkour: Benchmarking animal-level agility with quadruped robots	K Caluwaerts, A Iscen, JC Kew, W Yu, T Zhang, D Freeman, KH Lee, ...	arXiv preprint arXiv:2305.14654, 2023	53	2023
Learning to learn faster from human feedback with language model predictive control	J Liang, F Xia, W Yu, A Zeng, MG Arenas, M Attarian, M Bauza, M Bennice, ...	arXiv preprint arXiv:2402.11450, 2024	28	2024
Kepler: robust learning for parametric query optimization	L Doshi, V Zhuang, G Jain, R Marcus, H Huang, D Altinbüken, E Brevdo, ...	Proceedings of the ACM on Management of Data 1 (1), 1-25, 2023	22	2023
No-regret reinforcement learning with heavy-tailed rewards	V Zhuang, Y Sui	International Conference on Artificial Intelligence and Statistics, 3385-3393, 2021	17	2021
Training language models to self-correct via reinforcement learning, 2024	A Kumar, V Zhuang, R Agarwal, Y Su, JD Co-Reyes, A Singh, K Baumli, ...	URL https://arxiv.org/abs/2409.12917	14	
Inference-aware fine-tuning for best-of-n sampling in large language models	Y Chow, G Tennenholtz, I Gur, V Zhuang, B Dai, S Thiagarajan, ...	arXiv preprint arXiv:2412.15287, 2024	6	2024
Scalable Bayesian Optimization via Focalized Sparse Gaussian Processes	Y Wei, V Zhuang, S Soedarmadji, Y Sui	Advances in Neural Information Processing Systems 37, 120443-120467, 2025		2025
The Design of the Barkour Benchmark for Robot Agility	W Yu, K Caluwaerts, A Iscen, JC Kew, T Zhang, D Freeman, L Lee, ...	2024 IEEE/RSJ International Conference on Intelligent Robots and Systems …, 2024		2024
Workload-Driven Index Selections	H Huang, V Zhuang, S Idicula, G Jain	US Patent App. 18/183,925, 2024		2024
Uncertainty in Artificial Intelligence: Proceedings of the 33rd Conference, UAI 2017	C Zhang, S Mandt, H Kjellström, J Suzuki, J Kawahara, Y Sui, V Zhuang, ...			2017
Motion Control of High-Dimensional Musculoskeletal System with Hierarchical Model-Based Planning	Y Wei, S Zhuang, V Zhuang, Y Sui	The Thirteenth International Conference on Learning Representations		


Articles 1–16
