Jiaxuan Gao
Institute for Interdisciplinary Information Sciences, Tsinghua University
Verified email at mails.tsinghua.edu.cn
Title
Cited by
Year
The surprising effectiveness of PPO in cooperative multi-agent games
C Yu, A Velu, E Vinitsky, J Gao, Y Wang, A Bayen, Y Wu
Advances in neural information processing systems 35, 24611-24624, 2022
Cited by: 1610; Year: 2022
Is DPO superior to PPO for LLM alignment? A comprehensive study
S Xu, W Fu, J Gao, W Ye, W Liu, Z Mei, G Wang, C Yu, Y Wu
arXiv preprint arXiv:2404.10719, 2024
Cited by: 86; Year: 2024
Asynchronous multi-agent reinforcement learning for efficient real-time multi-robot cooperative exploration
C Yu, X Yang, J Gao, J Chen, Y Li, J Liu, Y Xiang, R Huang, H Yang, ...
arXiv preprint arXiv:2301.03398, 2023
Cited by: 44; Year: 2023
Learning zero-shot cooperation with humans, assuming humans are biased
C Yu, J Gao, W Liu, B Xu, H Tang, J Yang, Y Wang, Y Wu
arXiv preprint arXiv:2302.01605, 2023
Cited by: 37; Year: 2023
LLM-powered hierarchical language agent for real-time human-AI coordination
J Liu, C Yu, J Gao, Y Xie, Q Liao, Y Wu, Y Wang
arXiv preprint arXiv:2312.15224, 2023
Cited by: 35; Year: 2023
The surprising effectiveness of PPO in cooperative, multi-agent games (2021)
C Yu, A Velu, E Vinitsky, J Gao, Y Wang, A Bayen, Y Wu
arXiv preprint arXiv:2103.01955, 2021
Cited by: 31; Year: 2021
Learning efficient multi-agent cooperative visual exploration
C Yu, X Yang, J Gao, H Yang, Y Wang, Y Wu
European Conference on Computer Vision, 497-515, 2022
Cited by: 30; Year: 2022
SRL: Scaling distributed reinforcement learning to over ten thousand cores
Z Mei, W Fu, J Gao, G Wang, H Zhang, Y Wu
arXiv preprint arXiv:2306.16688, 2023
Cited by: 5; Year: 2023
SAVE: Spatial-attention visual exploration
X Yang, C Yu, J Gao, Y Wang, H Yang
2022 IEEE International Conference on Image Processing (ICIP), 1356-1360, 2022
Cited by: 5; Year: 2022
On designing effective RL reward at training time for LLM reasoning
J Gao, S Xu, W Ye, W Liu, C He, W Fu, Z Mei, G Wang, Y Wu
arXiv preprint arXiv:2410.15115, 2024
Cited by: 4; Year: 2024
LAGOON: Language-Guided Motion Control
S Xu, H Wang, Y Ouyang, J Gao, Z Mei, C Yu, Y Wu
2024 IEEE International Conference on Robotics and Automation (ICRA), 9743-9750, 2024
Cited by: 4; Year: 2024
Few-shot in-context preference learning using large language models
C Yu, H Lu, J Gao, Q Tan, X Yang, Y Wang, Y Wu, E Vinitsky
arXiv preprint arXiv:2410.17233, 2024
Cited by: 1; Year: 2024
Robot Generating Data for Learning Generalizable Visual Robotic Manipulation
Y Li, Y Yuan, J Cui, H Huan, W Fu, J Gao, Z Xu, Y Wu
2024 IEEE/RSJ International Conference on Intelligent Robots and Systems …, 2024
Year: 2024
A Benchmark of Planning-based Exploration Methods in Photo-Realistic 3D Simulator
X Du, X Yang, C Yu, J Gao, H Yang, Y Wang, Q Liao
2022 IEEE International Conference on Robotics and Biomimetics (ROBIO), 1562 …, 2022
Year: 2022
Sharing Minds during MARL Training for Enhanced Cooperative LLM Agents
J Gao, Y Wen, C Yu, Y Wu
Language Gamification - NeurIPS 2024 Workshop
Supplementary Materials of Learning Efficient Multi-Agent Cooperative Visual Exploration
C Yu, X Yang, J Gao, H Yang, Y Wang, Y Wu
Articles 1–16