Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context G Team, P Georgiev, VI Lei, R Burnell, L Bai, A Gulati, G Tanzer, ... arXiv preprint arXiv:2403.05530, 2024 | 972 | 2024 |
Value-Incentivized Preference Optimization: A Unified Approach to Online and Offline RLHF S Cen, J Mei, K Goshvadi, H Dai, T Yang, S Yang, D Schuurmans, Y Chi, ... arXiv preprint arXiv:2405.19320, 2024 | 15 | 2024 |
Revisiting sampling for combinatorial optimization H Sun, K Goshvadi, A Nova, D Schuurmans, H Dai International Conference on Machine Learning, 32859-32874, 2023 | 9 | 2023 |
Azade Nova, Dale Schuurmans, and Hanjun Dai. Revisiting sampling for combinatorial optimization H Sun, K Goshvadi International Conference on Machine Learning, 32859-32874, 2023 | 8 | 2023 |
DISCS: a benchmark for discrete sampling K Goshvadi, H Sun, X Liu, A Nova, R Zhang, W Grathwohl, D Schuurmans, ... Advances in Neural Information Processing Systems 36, 2024 | 1 | 2024 |
Exploring and Benchmarking the Planning Capabilities of Large Language Models B Bohnet, A Nova, AT Parisi, K Swersky, K Goshvadi, H Dai, ... arXiv preprint arXiv:2406.13094, 2024 | | 2024 |
Multi-Objective Body Stabilization of a Legged Robot via Distributed Reinforcement Learning G Sartoretti, K Goshvadi, H Choset International Conference on Robotics and Automation (ICRA) - Learning Legged …, 2019 | | 2019 |
Stochastic Gradient Discrete Langevin Dynamics H Sun, BY Wang, K Goshvadi, Y Xue, D Schuurmans, H Dai | | |