Aligning text-to-image models using human feedback K Lee, H Liu, M Ryu, O Watkins, Y Du, C Boutilier, P Abbeel, ... arXiv preprint arXiv:2302.12192, 2023 | 227 | 2023 |
Guiding Pretraining in Reinforcement Learning with Large Language Models Y Du*, O Watkins*, Z Wang, C Colas, T Darrell, P Abbeel, A Gupta, ... International Conference on Machine Learning (ICML) 2023, 2023 | 207 | 2023 |
Reinforcement Learning for Fine-tuning Text-to-Image Diffusion Models Y Fan, O Watkins, Y Du, H Liu, M Ryu, C Boutilier, P Abbeel, ... Thirty-seventh Conference on Neural Information Processing Systems (NeurIPS …, 2023 | 178 | 2023 |
Robust Reinforcement Learning using Adversarial Populations E Vinitsky*, Y Du*, K Parvate*, K Jang, P Abbeel, A Bayen arXiv preprint arXiv:2008.01825, 2020 | 101 | 2020 |
Auto-tuned sim-to-real transfer Y Du*, O Watkins*, T Darrell, P Abbeel, D Pathak 2021 IEEE International Conference on Robotics and Automation (ICRA), 1290-1296, 2021 | 88 | 2021 |
Vision-Language Models as Success Detectors Y Du, K Konyushkova, M Denil, A Raju, J Landon, F Hill, N de Freitas, ... Conference on Lifelong Learning Agents (CoLLAs) 2023, 2023 | 70 | 2023 |
Learning to model the world with language J Lin, Y Du, O Watkins, D Hafner, P Abbeel, D Klein, A Dragan arXiv preprint arXiv:2308.01399, 2023 | 46 | 2023 |
AvE: Assistance via Empowerment Y Du, S Tiomkin, E Kiciman, D Polani, P Abbeel, A Dragan Advances in Neural Information Processing Systems 33, 4560-4571, 2020 | 46 | 2020 |
Teaching large language models to reason with reinforcement learning A Havrilla, Y Du, SC Raparthy, C Nalmpantis, J Dwivedi-Yu, ... arXiv preprint arXiv:2403.04642, 2024 | 41 | 2024 |
Group surfing: A pedestrian-based approach to sidewalk robot navigation Y Du, NJ Hetherington, CL Oon, WP Chan, CP Quintero, E Croft, ... 2019 International Conference on Robotics and Automation (ICRA), 6518-6524, 2019 | 41 | 2019 |
It Takes Four to Tango: Multiagent Selfplay for Automatic Curriculum Generation Y Du, P Abbeel, A Grover International Conference on Learning Representations (ICLR) 2022, 2022 | 21 | 2022 |
Imagen 3 J Baldridge, J Bauer, M Bhutani, N Brichtova, A Bunner, K Chan, Y Chen, ... arXiv preprint arXiv:2408.07009, 2024 | 16 | 2024 |
Bayesian Imitation Learning for End-to-End Mobile Manipulation Y Du, D Ho, AA Alemi, E Jang, M Khansari International Conference on Machine Learning (ICML) 2022, 2022 | 10 | 2022 |
Practical Visual Deep Imitation Learning via Task-Level Domain Consistency M Khansari, D Ho, Y Du, A Fuentes, M Bennice, N Sievers, S Kirmani, ... 2023 IEEE International Conference on Robotics and Automation (ICRA), 1837-1844, 2023 | 8 | 2023 |
What can AI Learn from Human Exploration? Intrinsically-Motivated Humans and Agents in Open-World Exploration Y Du, E Kosoy, A Dayan, M Rufova, P Abbeel, A Gopnik NeurIPS 2023 Workshop: Information-Theoretic Principles in Cognitive Systems, 2023 | 7 | 2023 |
Sidewalk delivery robot navigation: a pedestrian-based approach Y Du, NJ Hetherington, CL Oon, WP Chan, CP Quintero, E Croft, ... Human-Aiding Robotics: Open Issues and Future Direction 2018, 2018 | 2 | 2018 |
Semi-Supervised One-Shot Imitation Learning P Wu, K Hakhamaneshi, Y Du, I Mordatch, A Rajeswaran, P Abbeel arXiv preprint arXiv:2408.05285, 2024 | 1 | 2024 |
Using embeddings, generated using robot action models, in controlling robot to perform robotic task D Ho, E Jang, M Khansari, YQ Du, AA Alemi US Patent App. 18/102,053, 2024 | 1 | 2024 |
A Study on Improving Reasoning in Language Models Y Du, A Havrilla, S Sukhbaatar, P Abbeel, R Raileanu I Can’t Believe It’s Not Better! (ICBINB) Workshop @ NeurIPS 2023, 2023 | 1 | 2023 |
Mitigating reality gap through feature-level domain adaptation in training of vision-based robot action model M Khansari, D Ho, E Jang, YQ Du US Patent App. 17/986,428, 2023 | | 2023 |